ai-austin / gpt4all-voice-assistant


License: MIT License


gpt4all-voice-assistant's Introduction

GPT4ALL-Voice-Assistant

This is a 100% offline GPT4ALL Voice Assistant. Completely open source and privacy-friendly. Use any language model on GPT4ALL. Background process voice detection. Watch the full YouTube tutorial for the setup guide: https://youtu.be/6zAk0KHmiGw

Setup

I highly advise watching the YouTube tutorial before using this code. You will need to modify the OpenAI Whisper library to work offline; I walk through that in the video, along with setting up all the other dependencies so everything functions properly.

If you're planning to install it on an Arch-based distro, you need to install the espeak and python-espeak packages from the AUR. You can install them with the yay AUR helper by running:

yay -S espeak python-espeak

Improvements to consider adding to yours

Give a system prompt. These open-source models perform far better when you send a system prompt as specified in the GPT4ALL documentation: https://docs.gpt4all.io/gpt4all_python.html#introspection
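One lightweight way to act on this suggestion (a sketch only; the GPT4All Python bindings also offer chat_session(system_prompt=...) for the same purpose) is to prepend a system prompt to every transcribed prompt before calling model.generate(). The wording and the with_system_prompt helper below are hypothetical, not part of this repo:

```python
SYSTEM_PROMPT = (
    "You are a helpful voice assistant. Answer briefly and conversationally, "
    "since your replies are read aloud."
)

def with_system_prompt(user_text: str, system_prompt: str = SYSTEM_PROMPT) -> str:
    """Prepend a system prompt to the transcribed user prompt."""
    return f"{system_prompt}\n\nUser: {user_text.strip()}\nAssistant:"

# In prompt_gpt() you would then call, for example:
#   output = model.generate(with_system_prompt(prompt_text), max_tokens=200)
print(with_system_prompt("what's the weather like?"))
```

The linked GPT4All documentation describes the bindings' own mechanism; this plain-string variant is just the simplest drop-in for the existing generate() call.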

gpt4all-voice-assistant's People

Contributors

ai-austin, mihajlopi


gpt4all-voice-assistant's Issues

WebUI

It would be amazing if you could add a web UI to this awesome AI voice assistant using one of the readily available Python libraries.
Thanks for creating and sharing this good work!

I need your help

Dear Austin, thank you for the YouTube video tutorial.
Nevertheless, I have been struggling for two days to get gpt4all_voice running offline on my PC (Win10), with no success.
The Python script runs without an error message, records the wake word into wake_detect.wav, and Jarvis tells me 'listening', but then nothing happens.
My voice question is not recorded into prompt.wav and the program hangs. It would be much appreciated if you could take a look at the code. I followed the steps in your video and have even solved the issue of the missing vocab.bpe. Still nothing. :( Here is my main.py

from os import system
import speech_recognition as sr
from playsound import playsound
from gpt4all import GPT4All
import sys
import whisper
import warnings
import time
import os

wake_word = 'jarvis'
model = GPT4All("/users/apache33/appdata/local/nomic.ai/GPT4All/gpt4all-falcon-newbpe-q4_0.gguf", allow_download=False)
r = sr.Recognizer()
tiny_model_path = os.path.expanduser('/users/apache33/.cache/whisper/tiny.pt')
base_model_path = os.path.expanduser('/users/apache33/.cache/whisper/base.pt')
tiny_model = whisper.load_model(tiny_model_path)
base_model = whisper.load_model(base_model_path)
listening_for_wake_word = True
source = sr.Microphone()
warnings.filterwarnings("ignore", category=UserWarning, module='whisper.transcribe')

if sys.platform != 'darwin':
    import pyttsx3
    engine = pyttsx3.init()

def speak(text):
    if sys.platform == 'darwin':
        ALLOWED_CHARS = set("abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789.,?!-_$:+-/ ")
        clean_text = ''.join(c for c in text if c in ALLOWED_CHARS)
        system(f"say '{clean_text}'")
    else:
        engine.say(text)
        engine.runAndWait()

def listen_for_wake_word(audio):
    global listening_for_wake_word
    with open("wake_detect.wav", "wb") as f:
        f.write(audio.get_wav_data())
    result = tiny_model.transcribe('wake_detect.wav')
    text_input = result['text']
    if wake_word in text_input.lower().strip():
        print("Wake word detected. Please speak your prompt to GPT4All.")
        speak('Listening')
        listening_for_wake_word = False

def prompt_gpt(audio):
    global listening_for_wake_word
    try:
        with open("prompt.wav", "wb") as f:
            f.write(audio.get_wav_data())
        result = base_model.transcribe('prompt.wav')
        prompt_text = result['text']
        if len(prompt_text.strip()) == 0:
            print("Empty prompt. Please speak again.")
            speak("Empty prompt. Please speak again.")
            listening_for_wake_word = True
        else:
            print('User: ' + prompt_text)
            output = model.generate(prompt_text, max_tokens=200)
            print('GPT4All: ', output)
            speak(output)
            print('\nSay', wake_word, 'to wake me up. \n')
            listening_for_wake_word = True
    except Exception as e:
        print("Prompt error: ", e)

def callback(recognizer, audio):
    global listening_for_wake_word
    if listening_for_wake_word:
        listen_for_wake_word(audio)
    else:
        prompt_gpt(audio)

def start_listening():
    with source as s:
        r.adjust_for_ambient_noise(s, duration=2)
    print('\nSay', wake_word, 'to wake me up. \n')
    r.listen_in_background(source, callback)
    while True:
        time.sleep(1)

if __name__ == '__main__':
    start_listening()

Windows users: program detects wake word but not voice prompt

We are currently diagnosing this in my Discord server. It seems to be an issue with the listen_in_background() function from SpeechRecognition, an issue with pyttsx3, or something caused by the two being used in parallel. The repo will be updated ASAP. If any Windows users find a solution, please feel free to share below while this issue is still open.
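One workaround worth trying while this is diagnosed (a sketch under stated assumptions, not a confirmed fix): listen_in_background() invokes its callback on a worker thread, and pyttsx3 is known to misbehave off the main thread on some platforms, so the callback can enqueue text and leave the actual speaking to the main thread. The speak_queue and speak_later names are hypothetical:

```python
import queue

# Filled by the recognizer's worker thread, drained by the main thread.
speak_queue: "queue.Queue[str]" = queue.Queue()

def speak_later(text: str) -> None:
    """Called from the recognizer's worker thread: just enqueue the text."""
    speak_queue.put(text)

def main_loop(speak) -> None:
    """Runs on the main thread; drains the queue and calls the real TTS there."""
    while True:
        try:
            text = speak_queue.get(timeout=1)
        except queue.Empty:
            continue  # nothing to say yet; keep polling
        speak(text)

# In the script above, callback() would call speak_later(output) instead of
# speak(output), and start_listening() would end with main_loop(speak)
# instead of the bare time.sleep(1) loop.
```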

Readme inaccurate

The Arch section has not worked for a while now: the python-espeak package demands python-distutils in its PKGBUILD file, but distutils was deprecated and has been fully removed as of Python 3.12.0; Arch and its derivatives are on Python 3.12.3 at the time of writing.
I have contacted the maintainer of the AUR PKGBUILD about this.

In addition, that AUR PKGBUILD was building python-espeak 0.5, while the current version (according to PyPI) is 0.6.3. It should be clarified how this affects the overall process of getting the voice assistant to run.

ERROR: No matching distribution found for whisper==1.9.2

I get an error when trying to install the requirements:

ERROR: No matching distribution found for whisper==1.9.2

It looks like the newest version on PyPI is 1.1.10, so I replaced the line with that in requirements.txt and now get no warnings, so it should be fine, but I will let you know if I run into any other issues. I know it's minor, but once confirmed this should be updated (and it's also useful to document for anyone else who hits the issue). Note that the whisper package on PyPI appears to be an unrelated project; OpenAI's Whisper is published as openai-whisper.

I had to make changes for not getting stuck at engine.runAndWait()

On a Windows computer, engine.runAndWait() did not continue after speaking. I moved engine = pyttsx3.init() in front of the current line 31, added engine = None and rate = None at the current lines 23/24, and now it works.

Additionally, getting ffmpeg to work required an installation that is described in the openai-whisper documentation.
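The reordering described in this issue can be sketched as a lazy, one-time initializer, so the engine is created exactly once before the listen loop starts. The get_engine() helper is hypothetical; the factory parameter stands in for pyttsx3.init so the pattern can be shown without importing pyttsx3 here:

```python
engine = None  # created on first use, per the fix described above
rate = None

def get_engine(factory):
    """Return the TTS engine, creating it on the first call only.

    `factory` is whatever constructs the engine (pyttsx3.init in the
    real script); repeat calls reuse the existing engine.
    """
    global engine
    if engine is None:
        engine = factory()
    return engine

# Usage in the real script (assumption):
#   import pyttsx3
#   engine = get_engine(pyttsx3.init)   # once, before listen_in_background()
```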

add requirements.txt

Add a file called requirements.txt with the following content:

gpt4all
openai-whisper
SpeechRecognition
playsound
PyAudio
soundfile
pyttsx3

It turns out you also need espeak installed on your system.
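As a quick sanity check that everything in that list is importable, note that several pip names differ from the module you actually import (SpeechRecognition -> speech_recognition, PyAudio -> pyaudio, and openai-whisper installs a module named whisper); the mapping below reflects those assumptions:

```python
import importlib.util

# pip package name -> module you actually import
REQUIRED = {
    "gpt4all": "gpt4all",
    "openai-whisper": "whisper",
    "SpeechRecognition": "speech_recognition",
    "playsound": "playsound",
    "PyAudio": "pyaudio",
    "soundfile": "soundfile",
    "pyttsx3": "pyttsx3",
}

def missing_packages() -> list:
    """Return the pip names whose import module cannot be found."""
    return [pip for pip, mod in REQUIRED.items()
            if importlib.util.find_spec(mod) is None]

if __name__ == "__main__":
    gone = missing_packages()
    if gone:
        print("Missing:", ", ".join(gone))
    else:
        print("All Python dependencies found (espeak must be checked separately).")
```

This only verifies the Python side; espeak (and ffmpeg for Whisper) are system packages and must be checked with your distro's package manager.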
