Page cover

Models Overview

Arcee AI offers models at various sizes to meet different deployment scenarios. Choosing the right model can help you complete tasks more efficiently, accurately, and cost effectively.

Trinity Mini is currently the only model available by api.

Model
Trinity-Nano-6B
Trinity-Mini-26B
Trinity [Coming Soon]

Strength

Lightweight, ultra-low latency model.

Fast and cost-efficient model for well-defined tasks.

Coming Soon

Ideal Deployment

Fully local on consumer GPUs, edge servers, and mobile devices. Tuned for offline operation.

Serve customer-facing apps, agent backends, and high-throughput services in cloud or VPC.

Coming Soon

Active Parameters

1B per token

3B per token

Coming Soon

Context Window

128k tokens

128k tokens

Coming Soon

Knowledge Cutoff

2024

2024

Coming Soon

Speed

Instant

Very Fast

Coming Soon

API Model Name

Coming Soon

trinity-mini

Coming Soon

Download

Coming Soon

Last updated