Feature(LiteLLM): Wire LiteLLM Docker profile with application config overrides (provider and embedder) by marazik · Pull Request #533 · AsyncFuncAI/deepwiki-open

marazik · 2026-06-03T22:01:59Z

Summary

This PR is part of a 4-PR integration effort to add LiteLLM support to DeepWiki-Open while maintaining full backward compatibility.

PR Series

New Feature: Add optional LiteLLM Docker Compose setup for local LLM gateway #526 — Add optional LiteLLM Docker Compose setup for local LLM gateway
Feature: Introduce LiteLLM client for multi-provider model routing #529 — Introduce LiteLLM client for multi-provider model routing
This PR — Wire Docker Compose with LiteLLM provider and configuration layer
Next — Documentation and usage guide updates

Summary (this PR)

This PR completes the deployment-layer integration for LiteLLM by wiring the Docker Compose setup to application-level provider and embedding configurations.

It ensures that when LiteLLM is enabled via Docker, the correct provider configuration and model registry are automatically applied.

🔧 Changes

Configuration Layer

Added generator.litellm.json with full provider registry for:
- OpenAI, OpenRouter, Dashscope, Google, Azure, Bedrock, Ollama, LiteLLM
Added embedder.litellm.json for LiteLLM-based embedding configuration
Ensured LiteLLM is the default provider when selected

Docker Integration

Mounted LiteLLM-specific config overrides in docker-compose-litellm.yml
- generator.litellm.json → generator.json
- embedder.litellm.json → embedder.json

Provider Alignment

Ensured consistency between:
- Provider routing (PR2)
- Docker deployment (PR1)
- Configuration system (this PR)

🧠 Design Notes

LiteLLM remains fully optional and opt-in
No changes to default runtime behavior unless explicitly enabled
All existing providers (OpenAI, OpenRouter, AWS, Google, etc.) remain unchanged
Configuration overrides are isolated to LiteLLM Docker profile only

🔄 Compatibility

This change is fully backward compatible:

Existing deployments are unaffected
Default provider behavior remains unchanged
LiteLLM is only activated via Docker profile selection
No breaking API or configuration changes

🧪 Testing

Tested with:

Docker Compose LiteLLM profile activation
Provider selection switching (OpenAI ↔ LiteLLM)
Embedding pipeline with LiteLLM embedder config
Multi-provider model routing behavior
End-to-end WebSocket query flow

📝 Notes for Reviewers

This PR focuses strictly on deployment + configuration wiring.

PR Feature: Introduce LiteLLM client for multi-provider model routing #529 introduced the provider abstraction
PR New Feature: Add optional LiteLLM Docker Compose setup for local LLM gateway #526 introduced the Docker gateway
This PR connects both via config + compose overrides

A documentation PR will follow to explain usage and setup flows.

gemini-code-assist

Code Review

This pull request introduces LiteLLM-specific configuration files (embedder.litellm.json and generator.litellm.json) and mounts them as overrides in docker-compose-litellm.yml. Feedback is provided to address two critical configuration issues: first, the num_ctx parameter should be removed from the litellm provider models in generator.litellm.json to prevent a runtime TypeError when calling the OpenAI-based client; second, the initialize_kwargs block in embedder.litellm.json should be removed to avoid passing literal placeholder strings when environment variables are unset, allowing the client to safely fall back to its native environment lookups.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-03T22:03:37Z

+        "qwen3:1.7b": {
+            "temperature": 0.7,
+            "top_p": 0.8,
+            "num_ctx": 32000
+        },
+        "llama3:8b": {
+            "temperature": 0.7,
+            "top_p": 0.8,
+            "num_ctx": 8000
+        },
+        "qwen3:8b": {
+            "temperature": 0.7,
+            "top_p": 0.8,
+            "num_ctx": 32000
+        }


The litellm provider models include the num_ctx parameter. Since LiteLLMClient inherits from OpenAIClient and utilizes the standard openai Python SDK under the hood, passing non-standard parameters like num_ctx directly in model_kwargs will cause a TypeError (unexpected keyword argument) client-side when calling chat.completions.create.

Additionally, the context window size (num_ctx) is typically configured on the LiteLLM server side rather than per-request. Removing num_ctx and correcting the indentation to match the rest of the file (10 spaces for properties) resolves this runtime risk and keeps the configuration clean.

"qwen3:1.7b": { "temperature": 0.7, "top_p": 0.8 }, "llama3:8b": { "temperature": 0.7, "top_p": 0.8 }, "qwen3:8b": { "temperature": 0.7, "top_p": 0.8 }

gemini-code-assist · 2026-06-03T22:03:37Z

+    "client_class": "LiteLLMClient",
+    "initialize_kwargs": {
+      "api_key": "${LITELLM_API_KEY}",
+      "base_url": "${LITELLM_BASE_URL}"
+    },
+    "batch_size": 10,


The initialize_kwargs block explicitly passes api_key and base_url using environment variable placeholders. However, if these environment variables are not set, the configuration loader (replace_env_placeholders in api/config.py) will leave the literal placeholder strings "${LITELLM_API_KEY}" and "${LITELLM_BASE_URL}" intact. This will cause LiteLLMClient to initialize with these invalid literal strings instead of falling back to its built-in defaults or environment variable lookups.

Since LiteLLMClient already natively handles retrieving LITELLM_API_KEY and LITELLM_BASE_URL from the environment (with sensible fallbacks like "dummy" and "http://localhost:4000"), you can safely remove the initialize_kwargs block entirely to make the configuration more robust and less redundant.

"client_class": "LiteLLMClient", "batch_size": 10,

feature(litellm): integrate docker profile and config overrides

7cc2425

gemini-code-assist Bot reviewed Jun 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature(LiteLLM): Wire LiteLLM Docker profile with application config overrides (provider and embedder)#533

Feature(LiteLLM): Wire LiteLLM Docker profile with application config overrides (provider and embedder)#533
marazik wants to merge 1 commit into
AsyncFuncAI:mainfrom
marazik:feature/litellm-docker-config-wiring

marazik commented Jun 3, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 3, 2026

Uh oh!

gemini-code-assist Bot Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

marazik commented Jun 3, 2026

Summary

PR Series

Summary (this PR)

🔧 Changes

Configuration Layer

Docker Integration

Provider Alignment

🧠 Design Notes

🔄 Compatibility

🧪 Testing

📝 Notes for Reviewers

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant