Utilize ElevenLabs, Google Speech Recognition, and OpenAI APIs to enable interactive conversations with a Language Model (LLM) in various voices, including cloned voices.
- Python 3.x
- pip package manager
- Clone this repository to your local machine.
git clone https://github.com/your/repository.git
- Navigate to the project directory.
cd repository
- Install the required Python packages using pip.
pip install -r requirements.txt
- Create a new file named
.env
in the project directory. - Open the
.env
file in a text editor and add the following lines:Replace the valuesELEVENLABS_API_KEY='...' OPENAI_API_KEY='sk-...'
'...'
and'sk-...'
with your actual API keys.
To start a new conversation, specify a voice and then execute the following,
python GO.py
- Snowboy Hotword Detection
- ElevenLabs TTS true streaming
- local inference with WizardLM based
- upgraded escape sequences (Weather, Time, etc.)