You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: forward sampling provider_opts (top_k, repetition_penalty, etc.) to provider APIs
Add support for passing sampling parameters via provider_opts to all
provider backends. This enables custom OpenAI-compatible providers
(vLLM, Ollama, llama.cpp) to receive parameters like top_k,
repetition_penalty, min_p, and seed that they support but the native
OpenAI API does not.
Provider support:
- OpenAI/custom: top_k, repetition_penalty, min_p, typical_p via
SetExtraFields; seed via native field
- Anthropic: top_k via native TopK field
- Gemini: top_k via native TopK field
- Bedrock: top_k via AdditionalModelRequestFields
Also refactors Bedrock buildAdditionalModelRequestFields to avoid
early returns that would discard top_k when thinking budget is
invalid, and extracts isThinkingEnabled() to decouple thinking
detection from additional fields presence.
Assisted-By: docker-agent
Copy file name to clipboardExpand all lines: agent-schema.json
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -552,7 +552,7 @@
552
552
},
553
553
"provider_opts": {
554
554
"type": "object",
555
-
"description": "Provider-specific options. dmr: runtime_flags. anthropic/amazon-bedrock (Claude): interleaved_thinking (boolean, default true). openai: transport ('sse' or 'websocket') to choose between SSE and WebSocket streaming for the Responses API. openai/anthropic/google: rerank_prompt (string) to fully override the system prompt used for RAG reranking (advanced - prefer using results.reranking.criteria for domain-specific guidance).",
555
+
"description": "Provider-specific options. Sampling parameters: top_k (integer, supported by anthropic, google, amazon-bedrock, and custom OpenAI-compatible providers like vLLM/Ollama), repetition_penalty (float, forwarded to custom OpenAI-compatible providers), min_p (float, forwarded to custom providers), seed (integer, forwarded to OpenAI). Infrastructure options: dmr: runtime_flags. anthropic/amazon-bedrock (Claude): interleaved_thinking (boolean, default true). openai: transport ('sse' or 'websocket') to choose between SSE and WebSocket streaming for the Responses API. openai/anthropic/google: rerank_prompt (string) to fully override the system prompt used for RAG reranking (advanced - prefer using results.reranking.criteria for domain-specific guidance).",
0 commit comments