Document the 5.1 AI & Models feature set#523
Conversation
New reference section (models/): overview + configuration, embed/generate/ generateStream API, tool calling and the toolMode 'auto' agent loop, the four bundled backends (ollama, openai, anthropic, bedrock), and model-call analytics. Adds the @embed directive and 5.1 vector-indexing additions (int8 quantization, per-query ef, auto-scaled search ef, dotProduct distance) to the schema reference, corrects the HNSW search parameter name (efConstructionSearch, previously documented as efSearchConstruction), and starts the 5.1 release notes. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
|
Warning You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again! |
error_code can also be pending_unsupported (Models.ts pending-status path is reachable); Bedrock family dispatch also handles mistral., and unknown prefixes are rejected. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
Config-file (Joi) validation of the models block is boot-blocking for
structurally invalid entries; only registration-time errors are
warn-and-skip. Also note ${VAR} indirection is string-fields-only.
Surfaced by the models-subsystem deep review.
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
Only credential fields reject unresolved ${VAR} placeholders at
startup; host/model/region carry them into requests literally.
Surfaced by the models-subsystem deep review.
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
Current reference docs are served at /reference/v5/ (versions config maps current -> path 'v5'); the PR preview's broken-link check caught the unversioned /reference/ links. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
🚀 Preview DeploymentYour preview deployment is ready! 🔗 Preview URL: https://preview.harper-documentation.harperfabric.com/pr-523 This preview will update automatically when you push new commits. |
VersionBadge tagging for minor-version availability (new vs changed), version derivation from the core release, release-notes-per-minor placement, versioned /reference/v5/ link paths from other content trees, and feature/docs PR cross-linking. Gives the engineering guidelines a single place to point at for docs mechanics. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
🚀 Preview DeploymentYour preview deployment is ready! 🔗 Preview URL: https://preview.harper-documentation.harperfabric.com/pr-523 This preview will update automatically when you push new commits. |
|
Warning You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again! |
| type Document @table { | ||
| id: Long @primaryKey | ||
| text: String | ||
| embedding: [Float] @embed(source: "text", model: "default") |
There was a problem hiding this comment.
Is there an equivalent to automatically embed on the search side of the equation? When we Document.search({. I'm not seeing an equivalent in the docs, but maybe I missed it.
There was a problem hiding this comment.
Sorry Dawson. Missed this before merging. No, there's not but the need is now tracked at HarperFast/harper#1277. Great callout.
Release notes will be authored separately (Kris owns them); this PR stays scoped to the reference documentation. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
🚀 Preview DeploymentYour preview deployment is ready! 🔗 Preview URL: https://preview.harper-documentation.harperfabric.com/pr-523 This preview will update automatically when you push new commits. |
🧹 Preview CleanupThe preview deployment for this PR has been removed. |
* docs(v5.1): release notes, deployment tracking ops, deploy_component updates - Add 5.1.md release notes covering: models/AI, @embed directive, MCP server, deployment tracking, HNSW int8 quantization, and replication improvements - Update deploy_component docs: urlPath, install_allow_scripts params, deployment_id response - Document new deployment operations: list_deployments, get_deployment, get_deployment_payload, delete_deployment_payload - Document hdb_deployment record schema (fields, phases, peer_results) Note: models/AI detail, MCP reference, and HNSW quantization have separate PRs (#523, #507/#516, #508) — this PR adds the release notes overview and the deployment tracking operations which had no coverage. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * style: run prettier on changed files Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: remove cross-plugin MCP link that breaks Docusaurus build The release-notes and reference doc plugins are separate; relative .md links between them resolve incorrectly. Removing until PR #507 (MCP reference section) merges and can be linked with an absolute path. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * docs(5.1): expand release notes — middleware/routing, caching, LOCAL_ONLY, HARPER_CONFIG, RocksDB, migrateOnStart, upgrade improvements --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Summary
First documentation for the models/AI surface shipping in 5.1 — the docs currently have zero coverage of it. Everything is tagged
<VersionBadge version="v5.1.0" />(all feature commits are first contained in core'sv5.1.0-beta.1).reference/models/section (sidebar: "AI & Models"): overview +modelsconfiguration block,embed()/generate()/generateStream()API, tool calling incl. thetoolMode: 'auto'agent loop (budgets, parallelism, error modes, trace, conversation hook), the four bundled backends (Ollama, OpenAI incl. OpenAI-compatible endpoints viabaseUrl, Anthropic, Bedrock), andhdb_model_calls/ aggregate analytics.@embeddirective section (write semantics, validation, auto-HNSW); vector-indexing additions — int8 quantization with full-precision rerank, per-queryef, auto-scaled search ef (type="changed"),dotProductdistance.efSearchConstruction; the option Harper reads isefConstructionSearch. Fixed with a note.Deliberately not documented
openaiStream()SSE formatter — unmerged (branch pinned).agent:config,agent_*operations) — merged as scaffold, operations-API-only,maxCostUsdadvertised but unenforced; holding docs until #839 lands.GenerateOpts(toolArgValidationstrict/lenient,maxCostUsd,conversationId) — called out in a short "Reserved options" note as non-functional in 5.1 so TypeScript autocomplete doesn't mislead.Where to look
resources/models/types.ts, defaults fromagentLoop.ts/analyticsTable.ts/backend components, HNSW behavior fromresources/indexes/HierarchicalNavigableSmallWorld.ts). A follow-up docs-accuracy audit is planned; flagging two spots reviewers should sanity-check: the per-queryefexample (plumbed through the sort descriptor; no existing unit test exercises it end-to-end) and the Azure/Gemini OpenAI-compatibility claims (bearer-auth-only constraint stated).npm run buildpasses (links/MDX verified);format:writeapplied.Generated by Claude (Fable 5).
🤖 Generated with Claude Code