Giter VIP home page Giter VIP logo

wingmanai's Introduction

WingmanAI

WingmanAI is a powerful tool for interacting with real-time transcription of both system and microphone audio. Powered by ChatGPT, this tool lets you interact in real-time with the transcripts as an extensive memory base for the bot, providing a unique communication platform.

Demo

demo.mp4

As you can see, the bot can answer questions about past conversations when you load the transcripts for a designated person.

Features

  • Real-time Transcription: WingmanAI can transcribe both system output and microphone input audio, allowing you to view the live transcription in an easy-to-read format.

  • ChatGPT Integration: You can chat with a ChatGPT powered bot that reads your transcripts in real-time.

  • Efficient Memory Management: The bot maintains a record of the conversation but in a token-efficient manner, as only the current chunk of transcript is passed to the bot.

  • Save and Load Transcripts: WingmanAI allows you to save transcripts for future use. You can load them up anytime later, and any query made to the bot will be cross-referenced with a vector database of the saved transcript, providing the bot with a richer context.

  • Append Conversations: You can keep appending to the saved transcripts, building a vast database over time for the bot to pull from.

Installation

  1. Clone the repository.
  2. Install the requirements: pip install -r requirements.txt
  3. If you wish to use CUDA for Whisper (which is highly recommended), uninstall (pip uninstall torch) torch and run: pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117

Note: This application is currently compatible only with Windows.

Prerequisites

Ensure you have ffmpeg installed on your system. Have a working OpenAI API key. Works best using CUDA! CPU transcription is not real-time. The model currently being used is the "base" model - if your hardware can't run it, change it to "tiny". Language is currently set to English.

Getting Started

  1. Add your OpenAI API key to the keys.env file.
  2. Run main.py.

For any queries or issues, feel free to open a new issue in the repository.

Contributions are always welcomed to improve the project.

Acknowledgements

This project uses a modified version of SevaSk's "Ecoute" project for the transcriptions - check it out here!

wingmanai's People

Contributors

e-johnstonn avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.