Email Processing Worker - POC

A Cloudflare Worker that receives inbound emails, stores them in R2, and processes them asynchronously using Queues and D1.

How it works?

Producer receives the email and parses all the required metadata from the raw email. This is just a worker exposed to the internet, so cloudflare can scale it when required.
The entire raw mail is stored in R2 by the producer.
Producer saves the email metadata and internal data (status, emailId, storageKey).
Producer sends storageKey and emailId to a queue
A consumer receives the event
Consumer looks for the email data in D1 by the emailID
Consumer donwloads the email from R2
Consumer calculates the hash from the content of the raw email.
Consumer asks drive for a signed URL
Consumer uploads email content to the signed URL
Consumer notifies drive about the file upload finished
Consumer makes a request to email API
Consumer marks the email as 'processed'

Architecture

Flow: Incoming Email → Webhook → R2 + D1 → Queue → Process → External Drive (TODO)

Webhook (/webhook/inbound): Receives emails, parses with postal-mime, stores in R2, saves metadata to D1
R2 Storage: Temporary storage for raw email files (emails/{date}/{id}.eml)
D1 Database: Tracks status and metadata (from, to, subject, etc.)
Queue: Async processing with retries and dead letter queue
Consumer (src/queue.ts): Fetches email from R2, ready to send to external storage. This scales horizontally https://developers.cloudflare.com/queues/configuration/consumer-concurrency/

Setup

1. Install dependencies

npm install

2. Create resources

# Create R2 bucket
npx wrangler r2 bucket create email-storage

# Create D1 database (save the database_id from output)
npx wrangler d1 create email-inbox

# Create queues
npx wrangler queues create email-process-queue
npx wrangler queues create email-dlq

3. Configure wrangler.jsonc

Update database_id in wrangler.jsonc with the ID from step 2.

4. Apply migrations

npx wrangler d1 migrations apply email-inbox

5. Modify DRIVE_CONFIG in queue.ts

API_URL: drive's bridge URL
BUCKET_ID: User's bucket ID in Drive
AUTH_TOKEN: The value we use as header when retrieving files from drive.

6. Run locally

npx wrangler dev

Database Schema

email_inbox table:

id - Message ID or UUID
storage_key - R2 path
status - pending/processing/completed/failed
received_at - Timestamp
processed_at - Timestamp
metadata - JSON (from, to, subject, date, attachments)

TODO

Add authentication (according to the third party email provider)
Integrate with Internxt mail's API
We need to get the user credentials from somewhere to be able to ask for a drive signed URL.

Fallbacks

What should we do when any of the external request fails in the consumer? Should we make sure they are idempotent? We should not add duplicated emails if one of them fails.
How are we going to monitor DLQ? Should we add a consumer to the DLQ or search for a notification system to monitor this?
How are we going to notify the senders if the target mail does not exist? There are some email servers (or providers) that returns a response according to the webhook, but if we choose any external provider that does not check the webhook before accepting the email, we should not be able to notify senders about email size (unless the server rejects it directly) or not existent mails.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
migrations		migrations
src		src
.dev.vars.example		.dev.vars.example
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
vite.config.ts		vite.config.ts
wrangler.jsonc		wrangler.jsonc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Email Processing Worker - POC

How it works?

Architecture

Setup

1. Install dependencies

2. Create resources

3. Configure wrangler.jsonc

4. Apply migrations

5. Modify DRIVE_CONFIG in queue.ts

6. Run locally

Database Schema

TODO

Fallbacks

About

Uh oh!

Releases

Packages

Languages

apsantiso/worker-poc

Folders and files

Latest commit

History

Repository files navigation

Email Processing Worker - POC

How it works?

Architecture

Setup

1. Install dependencies

2. Create resources

3. Configure wrangler.jsonc

4. Apply migrations

5. Modify DRIVE_CONFIG in queue.ts

6. Run locally

Database Schema

TODO

Fallbacks

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages