feat: add AI-powered "For You" feed personalization by phatpham9 · Pull Request #2 · boringcode-dev/feedreader-edge

phatpham9 · 2026-06-23T11:31:10Z

Summary

Adds an opt-in "For You" personalization mode: users describe their interests in free text (Settings → AI personalization), and matching items are reranked instead of shown chronologically.
Each feed item is embedded once at ingestion time (Workers AI bge-base-en-v1.5) and stored in D1. At request time only the interests string is embedded; the ~500-item candidate pool is ranked by cosine similarity in-worker — no per-request LLM call needed for the core ranking step.
The LLM ranker (CloudflareLlmRanker, 4-model fallback chain: Llama 3.1 8B → 3.3 70B → Mistral 7B → Llama 3.2 3B) now runs only as an optional "polish" pass over the top FEEDREADER_PERSONALIZE_POLISH_POOL_SIZE similarity hits (default 30; 0 disables it).
/api/personalize reports personalization: "llm" | "similarity" | "none" instead of a flat degraded boolean, so a skipped/failed LLM polish step still serves a similarity-personalized order rather than falling all the way back to chronological.
Ranked order is cached via the Cloudflare Cache API per (interests, source filter, source freshness) for 6h, same invalidation pattern as /api/items.

Why retrieve-then-rerank instead of a pure per-request LLM rerank

Sending the full candidate pool through an LLM on every request scales cost with requests, not data, and gives zero cache benefit across paraphrased interests strings ("rust, AI" vs "AI and rust"). Embedding once at ingestion amortizes that cost over the item's lifetime in the pool, and similarity ranking is naturally robust to paraphrasing.

Test plan

npm run typecheck && npm test
npm run db:migrate:local applies 0002_add_item_embeddings.sql cleanly
wrangler dev --local: /api/personalize degrades gracefully to personalization: "none" without AI credentials; malformed body / missing interests / wrong method all return clean errors
/internal/refresh/<source> succeeds even when embedding generation fails (ingestion never blocked on embed)
With real Workers AI credentials: verify two paraphrased interests strings produce comparable rankings, and the Settings dialog's "Enable personalized ranking" toggle + interests textarea work end-to-end in a browser

github-actions · 2026-06-23T11:37:04Z

🔎 Cloudflare preview: https://426e3711-feedreader.phatpham9.workers.dev

Uploaded from d757cf9c0d4ff2fcf6228d26dc017f6383d7bc89. This is a Worker version preview — it shares the production D1 database, so it reads/writes real data; it does not receive cron-triggered refreshes.

Reranks the feed by free-text reader interests using Cloudflare Workers AI, with graceful degradation to chronological order on any model or ranking failure. Settings live in the existing Reader settings dialog (disabled by default); personalization is opt-in per browser. The backend ranks once per (interests, source filter, freshness) via the Cache API and re-projects the cached order onto a freshly-fetched item page each request, so pagination behaves exactly like /api/items (true offset/limit, real has_next) without re-invoking the LLM per page. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Resolves conflicts between the For You personalization endpoint (/api/personalize) and main's edge caching for /api/items plus the new /api/version and install-prompt-hide features. Kept both sets of functionality, sharing the latestSuccessAt helper and cardsToJson mapping between the items and personalize handlers.

…mbed-at-ingestion + similarity-at-query Embeds each item once at ingestion time (Workers AI bge-base-en-v1.5) instead of sending the full candidate pool through an LLM on every personalize request. At query time, only the interests string is embedded and the pool is ranked by cosine similarity in-worker; the existing LLM ranker now runs only as an optional polish pass over the top similarity hits (FEEDREADER_PERSONALIZE_POLISH_POOL_SIZE, 0 disables it). This also fixes the old all-or-nothing degraded fallback: the response now reports personalization: "llm" | "similarity" | "none" instead of a boolean, so a failed/disabled LLM polish step still serves a similarity-personalized order instead of falling all the way back to chronological.

Resolves conflicts between the embedding-retrieval personalization rewrite and main's weekly item-retention prune (which landed via a separate PR while this branch was in progress, and changed design mid-flight from a second cron trigger to a single hourly trigger with a wall-clock window check). Kept both features; core/test/fakeFeedRepository.ts needed a pruneOldItems stub added since FeedRepository requires it again.

Both files gained real content changes in this branch (personalization toggle/interests UI, personalization field handling) without bumping their ?v= query strings, so a CDN/browser cache could keep serving the pre-change script/styles after deploy.

phatpham9 force-pushed the feature/for-you-personalization branch from 33db4f5 to c972fc0 Compare June 23, 2026 12:01

phatpham9 added 4 commits June 23, 2026 22:46

phatpham9 merged commit 6e3332a into main Jun 23, 2026
2 checks passed

phatpham9 deleted the feature/for-you-personalization branch June 23, 2026 18:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add AI-powered "For You" feed personalization#2

feat: add AI-powered "For You" feed personalization#2
phatpham9 merged 5 commits into
mainfrom
feature/for-you-personalization

phatpham9 commented Jun 23, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 23, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

phatpham9 commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why retrieve-then-rerank instead of a pure per-request LLM rerank

Test plan

Uh oh!

github-actions Bot commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

phatpham9 commented Jun 23, 2026 •

edited

Loading

github-actions Bot commented Jun 23, 2026 •

edited

Loading