Auto Reasoning Mode
Auto Reasoning mode is a custom configuration of auto that draws on all of the reasoning models. auto-reasoning takes in your prompt and routes it to the most appropriate reasoning model based on complexity, task type, domain, and language.
Models
Based on the model router's classifications, the request is routed to one of the language models behind Arcee Conductor: auto-reasoning. The models it can currently route to are:
Arcee Maestro
Arcee AI's 32B-parameter reasoning SLM, which offers advanced reasoning capabilities without sacrificing performance. Maestro excels at simple to medium-complexity reasoning tasks and costs substantially less than the other options.
DeepSeek R1
An open-source reasoning model from DeepSeek that excels at math, coding, and logic tasks.
o3 mini-high
A closed-source reasoning model from OpenAI optimized for STEM applications, excelling in science, math, and coding benchmarks.
Claude Sonnet 3.7 Extended Thinking
A closed-source LLM from Anthropic with extended reasoning capabilities that performs well on complex problem-solving and coding tasks.
API
Request Syntax
curl -X POST https://models.arcee.ai/v1/chat/completions \
  -H "Authorization: Bearer $ARCEE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "auto-reasoning",
    "messages": [{"role": "user", "content": "hi"}]
  }'
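The same request can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the endpoint, headers, and body mirror the curl example above, while the helper names `build_request` and `send` are hypothetical and introduced here for clarity.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
API_URL = "https://models.arcee.ai/v1/chat/completions"


def build_request(prompt: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a chat completion request for auto-reasoning.

    `build_request` is a hypothetical helper, not part of any Arcee SDK.
    """
    payload = {
        "model": "auto-reasoning",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

To issue the call, read the token from the environment and pass the built request to `send`, for example `send(build_request("hi", os.environ["ARCEE_TOKEN"]))` after `import os`. Keeping construction separate from sending makes the payload easy to inspect without touching the network.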
Response
{
  "id": "npd1kq7-z1gNr-92e7e7dee8d66bf7",
  "object": "chat.completion",
  "created": 1744347473,
  "model": "arcee-ai/maestro-reasoning",
  "prompt": [],
  "choices": [
    {
      "finish_reason": "stop",
      "seed": 1338453356742372900,
      "logprobs": null,
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Okay, the user said \"hi\". I should respond in a friendly way. Let me think of a simple greeting. Maybe \"Hello! How can I assist you today?\" That sounds good. It's polite and opens the conversation for them to explain what they need help with. I'll go with that.\n</think>\n\nHello! How can I assist you today?",
        "tool_calls": []
      }
    }
  ],
  "usage": {
    "prompt_tokens": 11,
    "completion_tokens": 74,
    "total_tokens": 85
  }
}
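In the sample response, the `model` field names the model the router selected, and the `content` string carries the reasoning trace followed by a `</think>` marker and then the final answer. The sketch below separates those pieces. It assumes the `</think>` marker appears as it does in this sample; other routed models may format their reasoning differently, so the helper falls back to treating the whole string as the answer. The function names `split_reasoning` and `summarize` are hypothetical.

```python
def split_reasoning(content: str, marker: str = "</think>") -> tuple[str, str]:
    """Return (reasoning, answer); reasoning is empty if the marker is absent.

    Assumes the reasoning trace precedes the marker, as in the sample response.
    """
    reasoning, found, answer = content.partition(marker)
    if not found:
        return "", content.strip()
    return reasoning.strip(), answer.strip()


def summarize(response: dict) -> dict:
    """Pull the routed model, reasoning, answer, and token count from a response."""
    message = response["choices"][0]["message"]
    reasoning, answer = split_reasoning(message["content"])
    return {
        "routed_model": response["model"],
        "reasoning": reasoning,
        "answer": answer,
        "total_tokens": response["usage"]["total_tokens"],
    }
```

Applied to the sample response, `summarize` would report `arcee-ai/maestro-reasoning` as the routed model and `Hello! How can I assist you today?` as the answer.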