LogoLogo
  • 👋Welcome to Arcee AI Docs
  • Arcee Orchestra
    • Introduction to Arcee Orchestra
    • Getting Started
    • Workflows
      • Workflow Components
        • Model Node
        • Code Node
        • Integrations
        • Knowledge Retrieval
        • Conditional Node
      • Passing Variables
      • API Invocation
        • List Available Workflows API
        • Workflow Execution API
        • Workflow Execution Steps API
        • Execution History API
        • Workflow Diagram API
        • API Code Examples
        • Upload Workflow JSON API
        • Workflow Runs API
    • Workflow Library
      • Research Automation
      • Real Time Financial Analysis
      • Blog Writer
      • Code Improvement
      • Energy Domain Assistant
    • Chat Interface
    • FAQ
  • ARCEE CONDUCTOR
    • Introduction to Arcee Conductor
    • Getting Started
    • Features & Functionality
      • Auto Mode
      • Auto Reasoning Mode
      • Auto Tools Mode
      • Compare
      • Direct Model Invocation
      • Usage
      • API
    • Arcee Small Language Models
      • Model Selection
      • Model Performance
    • Pricing
Powered by GitBook
On this page
  • Router
  • Models
  • API
  1. ARCEE CONDUCTOR
  2. Features & Functionality

Auto Mode

The auto model mode utilizes Arcee AI's intelligent model router to route prompts to the most optimal and efficient language model.

Router

The power behind Arcee Conductor comes in the form of an ultra-lightweight model router which classifies and routes prompts to the most cost-effective model which can accurately complete the request.

The router is 150M parameter, custom architecture model which classifies based on four primary categories: task type, domain, complexity, and language. The categories for each class consist of:

  • Task Type: Analytical, Classification, Code Generation, Extraction, Math, QA, Rewrite, Summarization, Text Generation, Other

  • Domain: Adult, Arts and Entertainment, Autos and Vehicles, Beauty and Fitness, Books and Literature, Business and Industrial, Computers and Electronics, Finance, Food and Drink, Games, Health, Hobbies and Leisure, Home and Garden, Internet and Telecom, Jobs and Eduction, Law and Government, News, Online Communities, People and Society, Pets and Animals, Real Estate, Science, Sensitive Subjects, Shopping, Sports, Travel and Transportation

  • Complexity: 1-10

  • Language: Arabic, Bengali, Cantonese, Filipino, French, German, Hindi, Italian, Korean, Mandarin, Marathi, Portuguese, Russian, Spanish, Tamil, Telugu

Due to the exceptionally small parameter count and custom architecture, the router executes in roughly 150ms resulting in negligible impact to user experience, when considering time to first token.

Models

Based on the classifications from the model router, the request is routed to one of the language models behind Arcee Conductor. The models which can currently be routed to include:

Model
Description

Blitz

A 24B parameter SLM from Arcee AI, distilled from Deepseek V-3 Blitz offers blazing fast response times and exceptionally low costs with strong general knowledge. Ideal for simple and creative tasks.

Virtuoso Medium

A 32B parameter SLM from Arcee AI, which was distilled from Deepseek V-3 giving it an impressive knowledge distribution.

Virtuoso Large

Arcee AI's premier 72B parameter SLM which competes with the leading LLMs on complex and analytical tasks.

GPT-4.1

A closed-source LLM from Open AI with impressive analytical and complex problem solving capabilities.

Claude Sonnet 3.7

A closed-source LLM from Anthropic with strong performance on coding and complex tasks.

API

Request Syntax

curl -X POST https://models.arcee.ai/v1/chat/completions \
  -H "Authorization: Bearer $ARCEE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
        "model": "auto",
        "messages": [{"role": "user", "content": "hi"}]
      }'

Response

{
  "id": "npyCrRJ-z1gNr-92fdb25bff75cf74",
  "object": "chat.completion",
  "created": 1744575968,
  "model": "arcee-ai/arcee-blitz",
  "prompt": [],
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I assist you today?",
        "tool_calls": []
      },
      "logprobs": null,
      "finish_reason": "stop",
      "seed": null
    }
  ],
  "usage": {
    "prompt_tokens": 170,
    "total_tokens": 180,
    "completion_tokens": 10
  }
}
PreviousFeatures & FunctionalityNextAuto Reasoning Mode

Last updated 27 days ago

Page cover image