Giter VIP home page Giter VIP logo

whispercppgui's Introduction

whispercppGUI

GUI for whispercpp, a high performance C++ port of OpenAI's whisper.

For this GUI to work properly, all of whispercpp files (from the official releases) should be copied to the same location of "whisperGUI.exe". Also, you need to copy a binary of ffmpeg to the same location of "whisperGUI.exe". In this way, you could update the program yourself manually in case the development on this repository stops.

FFMPEG is also used to implement a variable-rate speedup option for the audio (whispercpp no longer has this option).

Not all options are implemented yet, only those that are of common use. For using options not present in this GUI, now you can use the textbox included in the bottom of the main window. For example you can type "-t 8" (without quotes) for using 8 threads when processing the audio.

Requirements

If you want to run whispercppGUI from the python source code, the Gooey library is required. You can install Gooey with the command:

pip install Gooey

The included executable for windows x64 was created using pyinstaller, as described in the documentation of Gooey library. If you don't trust the executable on this repository, you can easily recreate your own executable file.

whispercppgui's People

Contributors

topping1 avatar versedwildcat avatar

Stargazers

robonxt.ai avatar  avatar  avatar Tyler Gordon avatar  avatar Laurent CESSAT avatar  avatar 石林行者 avatar  avatar  avatar  avatar Mr Harrison avatar Graham avatar Ossi Rytkönen avatar Semjon Kravtšenko avatar  avatar Phương Nguyễn avatar Doug avatar kevinmhy avatar Roohullah Afzali avatar  avatar Cantikaputri Febrianti avatar hetao avatar JAGGA avatar  avatar  avatar Xiaoran Yang avatar  avatar famaTV avatar Sean Song avatar Jason avatar  avatar  avatar  avatar  avatar Doug Holton avatar  avatar Bryan Guffey avatar Legion avatar Tim Kersey avatar Christina Warren avatar  avatar Orin Zebest avatar Erik Ylipää avatar Aliaksei avatar Indiscipline avatar  avatar cat-huh avatar Otelo2 avatar Omar MHAIMDAT avatar Daniel Sczepansky avatar Georgi Gerganov avatar Makoto Shimabukuro avatar  avatar fly2sky  avatar  avatar  avatar

Watchers

Kostas Georgiou avatar  avatar cat-huh avatar

whispercppgui's Issues

support inputs other than .wav

Do you think you might automate the conversion to .wav and input into whisper.cpp? Since ffmpeg can also be installed you could take the step off the user to do the conversion separately and then feed your GUI the .wav to reduce errors.

FR: Update to support BLAS/Llama/q4 models?

Thank you for your work on this! It's been the best GUI by miles.

This only works with the early stuff and doesn't allow me to use the q4 ggml models for example. It would be nice to try out the BLAS stuff through it too.

Keep getting error in line 180 - 'utf-8'

Downloaded latest release and running on CPU on Windows machine. Keep getting this error:

Traceback (most recent call last):
File "whisperGUI.py", line 182, in
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf8 in position 50: invalid start byte

Am I doing anything wrong? I have tried transcribing different audio files, but I keep getting this error. However, it seems that the program outputs the files correctly, but just keeps spinning?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.