Page cover

Pipecat

Pipecat is an open-source, real-time voice AI orchestration framework built for developers who want to create deeply interactive, multimodal conversational agents. It provides a flexible pipeline architecture for integrating speech recognition (STT), language models, and speech synthesis (TTS) components into low-latency, event-driven workflows. Designed for modularity and extensibility, Pipecat enables developers to compose custom voice agent stacks that handle audio streaming, turn-taking, and response timing with precision.

This tutorial will guide you through integrating Arcee AI models as the LLM backbone for your Pipecat voice agent. We'll integrate Arcee's models using OpenRouter.


Prerequisites

  • Python: >=3.10

  • OpenRouter API Key

    • If you don't have an account, set one up here.

  • uv package manager installed

Integration Steps

  1. Install the Pipecat CLI globally using uv

  1. Run the Pipecat interactive setup wizard to create the scaffolding for a pipecat agent

  1. You will be prompted with a few questions which determine how the project is setup. Use the following configuration:

  1. Once your project has been created, setup the client

  1. Next, setup the server. Install dependencies

  1. Create .env file

  1. In the .env file, fill in your Deepgram and OpenRouter API Key and provide a Deepgram Voice ID and the Arcee AI LLM you want to use. For example:

A full list of Deepgram Voice IDs can be found here.

  1. Edit the Voice Agent Kickoff System prompt

    1. Navigate to arcee-pipecat/server/bot.py

    2. Replace the content of the Kickoff system prompt (on line 139) with the following message:

The full line should now look like the following:

  1. Run your Voice Agent

  1. Access your Voice Agent at http://localhost:5173/

  2. Click connect and start conversating with your agent!

Last updated