Models Overview
Arcee AI offers models in several sizes to suit different deployment scenarios. Choosing the right model helps you complete tasks more efficiently, accurately, and cost-effectively.
| Attribute | | | |
| --- | --- | --- | --- |
| Strength | Lightweight, ultra-low latency model. | Fast and cost-efficient model for well-defined tasks. | Coming Soon |
| Ideal Deployment | Fully local on consumer GPUs, edge servers, and mobile devices. Tuned for offline operation. | Serve customer-facing apps, agent backends, and high-throughput services in cloud or VPC. | Coming Soon |
| Active Parameters | 1B per token | 3B per token | Coming Soon |
| Context Window | 128k tokens | 128k tokens | Coming Soon |
| Knowledge Cutoff | 2024 | 2024 | Coming Soon |
| Speed | ⚡⚡⚡⚡⚡ Instant | ⚡⚡⚡ Very Fast | Coming Soon |
| API Model Name | Coming Soon | trinity-mini | Coming Soon |

Download: Coming Soon
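
The comparison above can be encoded as data so that an application picks a model programmatically. The following is a minimal sketch, not an official client: `ModelSpec`, `CATALOG`, and `pick_model` are hypothetical names introduced here, the labels are descriptive rather than official model names, "128k tokens" is assumed to mean 128,000 tokens, and the "local" / "cloud" tags are a simplification of the Ideal Deployment row. Only the two models with published details are included.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelSpec:
    label: str                     # descriptive label, not an official model name
    api_model_name: Optional[str]  # None while the API name is listed as "Coming Soon"
    active_params_billions: int    # active parameters per token, in billions
    context_window_tokens: int     # assumption: "128k" is read as 128,000 tokens
    ideal_deployment: str          # simplified tag: "local" or "cloud"


# Hypothetical catalog transcribed from the comparison table.
CATALOG = [
    ModelSpec("lightweight, ultra-low latency", None, 1, 128_000, "local"),
    ModelSpec("fast and cost-efficient", "trinity-mini", 3, 128_000, "cloud"),
]


def pick_model(deployment: str, prompt_tokens: int) -> ModelSpec:
    """Return the first catalog entry that matches the deployment tag
    and whose context window can hold the prompt."""
    for spec in CATALOG:
        if (spec.ideal_deployment == deployment
                and prompt_tokens <= spec.context_window_tokens):
            return spec
    raise ValueError(
        f"no model for deployment={deployment!r} with {prompt_tokens} prompt tokens"
    )
```

For example, `pick_model("cloud", 10_000).api_model_name` evaluates to `"trinity-mini"`, while the entry returned for `"local"` has `api_model_name` set to `None` because its API name has not been published yet.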


