Voice Agent

Learn how to build a natural realtime voice experience, ready for production use. Key concepts demonstrated:

Speech-to-text (STT) - for understanding speech inputs
LLM - for generating the agent text response
Text-to-speech (TTS) - for generating agent speech audio

Architecture

Backend: Inworld Agent Runtime + Express.js
Frontend: Vite + React
Communication: WebSocket

Prerequisites

Node.js v20 or higher: Download here
Assembly.AI API key (required for speech-to-text functionality): Get your API key
Inworld API key (required): Sign up here or see quickstart guide

Run the Template

Start the Server

Clone the Voice Agent GitHub repo:

git clone https://github.com/inworld-ai/voice-agent-node
cd voice-agent-node

Navigate to the server directory:
```
cd server
```
Copy the .env-sample file to .env:
```
cp .env-sample .env
```

Configure your .env file with required API keys:

.env

# Required, Inworld Agent Runtime Base64 API key
INWORLD_API_KEY=<your_api_key_here>

# Required, get your Assembly.AI API key from https://www.assemblyai.com/
ASSEMBLY_AI_API_KEY=<your_assemblyai_api_key_here>

Get your Assembly.AI API key for speech-to-text functionality.

Install dependencies:
```
npm install
```
Start the server:
```
npm start
```
The server will start on port 4000.

Start the Client

Open a new terminal window.
Navigate to the client directory:
```
cd client
```

(Optional) Create a .env file to customize client behavior:

.env

# Optional: Enable latency reporting in the UI
VITE_ENABLE_LATENCY_REPORTING=true

# Optional: Server port (default: 4000)
VITE_APP_PORT=4000

Install dependencies:
```
npm install
```
Start the client:
```
npm start
```
The client will start on port 3000 (or the next available port if 3000 is in use) and should automatically open in your default browser.

Chat with Your Agent

Configure the agent:
- Enter the agent system prompt
- Click “Create Agent”
Start chatting:
- Voice input: Click the microphone icon to unmute yourself, speak, then click again to mute
- Text input: Type in the input field and press Enter to send
Monitor performance:
- View dashboards, traces, and logs in the Inworld Portal
- Enable VITE_ENABLE_LATENCY_REPORTING=true in client .env to see latency metrics in the UI

Prerequisites

Run the Template

Start the Server

Start the Client

Chat with Your Agent

Next steps

Explore templates

Vibe Code Your App

​Prerequisites

​Run the Template

​Start the Server

​Start the Client

​Chat with Your Agent

​Next steps

Explore templates

Vibe Code Your App

Prerequisites

Run the Template

Start the Server

Start the Client

Chat with Your Agent

Next steps