token-optimization

Here are 794 public repositories matching this topic...

rtk-ai / rtk

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

rust cli productivity open-source developer-tools command-line-tool llm cost-reduction anthropic ai-coding claude-code token-optimization agentic-coding

Updated Jun 23, 2026
Rust

headroomlabs-ai / headroom

Star

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

Updated Jun 24, 2026
Python

Control what your AI can see. LeanCTX (Lean Context) is the context intelligence layer for AI agents — one local Rust binary that decides what they read, remembers what they learn, guards what they touch, and proves what they save. 60–90% fewer tokens as the receipt. 76 MCP tools, 30+ agents, local-first.

rust ai mcp developer-tools cursor copilot ai-agents llm gemini-cli ai-coding mcp-server claude-code token-optimization agentic-coding context-engineering context-layer reduce-token-costs lean-context context-intelligence

Updated Jun 24, 2026
Rust

cytostack / openwolf

Star

Sharper context. Fewer tokens. Open-source middleware for Claude Code.

cli open-source middleware developer-tools anthropic claude-ai claude-code token-optimization

Updated Mar 20, 2026
TypeScript

alexgreensh / token-optimizer

Sponsor

Star

Find the ghost tokens. Fix them. Survive compaction. Avoid context quality decay.

token-usage context-window claude-code token-optimization context-engineering claude-plugin claude-code-skill token-optimizer agentskills ghost-tokens

Updated Jun 23, 2026
Python

lucasrosati / claude-code-memory-setup

Star

Up to 71.5x fewer tokens per session on Claude Code with Obsidian + Graphify. Persistent memory, codebase knowledge graphs, and chat import pipeline. 🇧🇷 PT-BR included.

knowledge-graph obsidian zettelkasten developer-productivity second-brain ai-tools graphify claude-code token-optimization coding-agent

Updated Jun 1, 2026
Python

zdk / lowfat

Star

lowfat - slim your command output. strips noise, saves tokens.

rust cli open-source developer-tools shell-script llm cost-reduction token-optimization agentic-coding-tool token-savings token-saving

Updated Jun 19, 2026
Rust

nadimtuhin / claude-token-optimizer

Star

Reusable setup prompts for optimizing Claude Code documentation. Achieve 90% token savings on any project in 5 minutes.

documentation automation developer-tools ai-assistant claude-code token-optimization setup-template

Updated Jun 22, 2026
JavaScript

GMaN1911 / claude-cognitive

Star

Working memory for Claude Code - persistent context and multi-instance coordination

productivity developer-tools claude-ai context-management claude-code token-optimization

Updated Jan 17, 2026
Python

juyterman1000 / entroly

Star

Cut your Claude / OpenAI / Gemini bill 70–95% on AI coding. Local proxy that compresses context, keeps provider caches hot, and verifies LLM output ($0 hallucination guard). Drop-in for Cursor, Claude Code, Codex, Aider + 34 more and custom providers — 30s, no code changes

rust productivity open-source ai mcp cursor ai-agents claude rag llm chatgpt anthropic hallucination-detection context-compression mcp-server claude-code token-optimization llm-grounding ai-hallucination

Updated Jun 22, 2026
Python

ooples / token-optimizer-mcp

Sponsor

Star

Intelligent token optimization for Claude Code - achieving 95%+ token reduction through caching, compression, and smart tool intelligence

caching compression ai mcp claude llm mcp-server token-optimization

Updated Jun 23, 2026
TypeScript

IyadhKhalfallah / clauditor

Star

Stop Claude Code from burning through your quota in 20 minutes. Auto-rotates oversized sessions and preserves context.

cli hooks claude-code token-optimization

Updated Apr 16, 2026
TypeScript

ojuschugh1 / sqz

Star

Compress LLM context to save tokens and reduce costs

javascript python api rust cli open-source ai extensions context tokens developer-tools token cost-optimization llms agentic-ai token-optimization

Updated Jun 21, 2026
Rust

edouard-claude / snip

Star

CLI proxy that reduces LLM token usage by 60-90%. Declarative YAML filters for Claude Code, Cursor, Copilot, Gemini. rtk alternative in Go.

Updated Jun 22, 2026
Go

Lap-Platform / LAP

Star

Your agents are guessing at APIs. Give them the actual Agent-Native spec. 1500+ API's Ready To-Use skills, Compile any API spec into a lean, agent-native format. 10× smaller. OpenAPI, GraphQL, AsyncAPI, Protobuf, Postman.

Updated Mar 26, 2026
Python

elusznik / mcp-server-code-execution-mode

Star

An MCP server that executes Python code in isolated rootless containers with optional MCP server proxying. Implementation of Anthropic's and Cloudflare's ideas for reducing MCP tool definitions context bloat.

python docker mcp orchestration agents code-execution claude podman anthropic agentic-ai model-context-protocol claude-code token-optimization

Updated Dec 5, 2025
Python

oxygen-fragment / claude-modular

Star

Production-ready modular Claude Code framework with 30+ commands, token optimization, and MCP server integration. Achieves 2-10x productivity gains through systematic command organization and hierarchical configuration.

productivity development-workflow modular-framework template-repository ai-development claude-code token-optimization adhd-friendly mpc-servers