Page cover

Pipecat

Pipecat is an open-source, real-time voice AI orchestration framework built for developers who want to create deeply interactive, multimodal conversational agents. It provides a flexible pipeline architecture for integrating speech recognition (STT), language models, and speech synthesis (TTS) components into low-latency, event-driven workflows. Designed for modularity and extensibility, Pipecat enables developers to compose custom voice agent stacks that handle audio streaming, turn-taking, and response timing with precision.

This tutorial will guide you through integrating Arcee AI models as the LLM backbone for your Pipecat voice agent. We'll integrate Arcee's models using OpenRouter.


Prerequisites

  • Python: >=3.10

  • OpenRouter API Key

    • If you don't have an account, set one up here.

  • uv package manager installed

Quickstart

Install the Pipecat CLI globally using uv

Run the Pipecat interactive setup wizard to create the scaffolding for a pipecat agent

You will be prompted with a few questions which determine how the project is setup. Use the following configuration:

Setup the client

Setup the server, install dependencies, and create a .env file

In the .env file, populate your Deepgram and OpenRouter API Key and provide a Deepgram Voice ID and the Arcee AI LLM you want to use. For example:

A full list of Deepgram Voice IDs can be found here.

Edit the Voice Agent Kickoff System prompt

  1. Navigate to arcee-pipecat/server/bot.py

  2. Replace the content of the Kickoff system prompt (on line 139) with the following message:

The full line should now look like the following:

Run your Voice Agent, access at http://localhost:5173/, and click connect to start the conversation!

Last updated