Daily AI — Examples

Collection of self-contained real-time voice and video AI demo applications built with dailyai.

Quickstart

Each project has its own set of dependencies and configuration variables. This repo intentionally avoids shared code across projects — you can grab whichever demo folder you want to work with as a starting point.

We recommend you start with a virtual environment:

python -m venv venv

source venv/bin/activate

cd simple-chatbot

pip install -r requirements.txt

Next, follow the steps in the README for each demo.

ℹ️ Make sure you pip install -r requirements.txt for each demo project, so you can be sure to have the necessary service dependencies that extend the functionality of Daily AI. You can read more about the framework architecture here.

Projects:

Project	Description	Services
Simple Chatbot	Basic voice-driven conversational bot. A good starting point for learning the flow of the framework.	Deepgram, OpenAI, Daily Prebuilt UI
Storytelling Chatbot	Stitches together multiple third-party services to create a collaborative storytime experience.	Deepgram, ElevenLabs, Open AI, Fal, Custom UI
Translation Chatbot	Listens for user speech, then translates that speech to Spanish and speaks the translation back. Demonstrates multi-participant use-cases.	Deepgram, Azure, OpenAI, Daily Prebuilt UI
Moondream Chatbot	Demonstrates how to add vision capabilities to GPT4. Note: works best with a GPU	Deepgram, OpenAI, Moondream, Daily Prebuilt UI
Function-calling Chatbot (WIP)	A chatbot that can call functions in response to user input	Deepgram, OpenAI, Fireworks, Daily Prebuilt UI

Important

Daily Prebuilt is a hosted video calling UI. Any real-time session using Daily as a WebRTC transport can be joined using Daily Prebuilt. It provides a quick way to join a real-time session with your bot and test your ideas without building any frontend code.

Other demos:

The Daily AI repo has a wide array of feature-specific demos and foundational examples, you can find them here.

FAQ

Deployment

For each of these demos we've included a Dockerfile. Out of the box, this should provide everything needed to get the respective demo running on a VM:

docker build username/app:tag .

docker run -p 7860:7860 --env-file ./.env username/app:tag

docker push ...

SSL

If you're working with a custom UI (such as with the Storytelling Chatbot), it's important to ensure your deployment platform supports HTTPS, as accessing user devices such as mics and webcams requires SSL.

If you try to run a custom UI without SSL, you may see an error in the console telling you that navigator is undefined, or no devices are available.

Are these examples production ready?

Yes, kind of.

These demos attempt to keep things simple and are unopinionated regarding environment or scalability.

We're using FastAPI to spawn a subprocess for the bots / agents — useful for small tests, but not so great for production grade apps with many concurrent users. You can see how this works in each project's start endpoint in server.py.

Creating virtualized worker pools and on-demand instances is out of scope for these examples, but we have shared various implementation ideas here.

For projects that have CUDA as a requirement, such as Moondream Chatbot, be sure to deploy to a GPU-powered platform (such as fly.io or Runpod.)

What is VAD?

Voice Activity Detection — very important for knowing when a user has finished speaking to your bot. If you are not using press-to-talk, and want Daily AI to detect when the user has finished talking, VAD is an essential component for a natural feeling conversation.

Daily AI makes use of WebRTC VAD by default when using the Daily transport layer. Optionally, you can use Silero VAD for improved accuracy at the cost of higher CPU usage.

You can enable Silero like so:

pip install dailyai[silero]

Installing Silero will override the default VAD implementation, which can be manually toggled on and off like so:

transport = DailyTransport(
    room_url,
    token,
    "Bot Name",
    ...
    vad_enabled=True #Note: True by default
)

The first time your run your bot with Silero, startup may take a while whilst it downloads and caches the model in the background. You can check the progress of this in the console.

Getting help

➡️ Join our Discord

➡️ Reach us on Twitter

martinkallstrom / realtime-voice-server Goto Github PK

realtime-voice-server's Introduction

Daily AI — Examples

Quickstart

Projects:

Other demos:

FAQ

Deployment

SSL

Are these examples production ready?

What is VAD?

Getting help

realtime-voice-server's People

Contributors

Stargazers

Forkers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent