cefixim001 / whishper

This project forked from pluja/whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Home Page: https://whishper.net

License: GNU Affero General Public License v3.0

Shell 3.82% JavaScript 7.33% Python 9.84% Go 27.69% TypeScript 0.48% CSS 0.05% HTML 0.35% Batchfile 1.67% Dockerfile 3.82% Svelte 44.95%

whishper's Introduction

🚧 I am working on a complete rewrite of the project. Progress can be followed on the v4 branch. Consequently, this branch will not receive any new releases or updates.

Whishper is an open-source, 100% local audio transcription and subtitling suite with a full-featured web UI.

Features

  • 🗣️ Transcribe any media to text: audio, video, etc.
    • Transcribe from URLs (any source supported by yt-dlp).
    • Upload a file to transcribe.
  • 📥 Download transcriptions in many formats: TXT, JSON, VTT, SRT or copy the raw text to your clipboard.
  • 🌐 Translate your transcriptions to any language supported by Libretranslate.
  • ✍️ Powerful subtitle editor so you don't need to leave the UI!
    • Transcription highlighting based on media position.
    • CPS (Characters per second) warnings.
    • Segment splitting.
    • Segment insertion.
    • Subtitle language selection.
  • 🏠 100% Local: transcription, translation and subtitle editing happen 100% on your machine (can even work offline!).
  • 🚀 Fast: uses FasterWhisper as the Whisper backend: get much faster transcription times on CPU!
  • 👍 Quick and easy setup: use the quick start script, or run through a few steps!
  • 🔥 GPU support: use your NVIDIA GPU to get even faster transcription times!
  • 🐎 CPU support: no GPU? No problem! Whishper can run on CPU too.
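
To make the subtitle editor features above more concrete, here is a minimal Python sketch of a CPS check, a segment split, and an SRT export. It is not Whishper's actual code: the `Segment` class, the function names, the 20 CPS threshold, and the proportional-time split rule are all assumptions chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical warning threshold; Whishper's real limit may differ.
MAX_CPS = 20.0


@dataclass
class Segment:
    start: float  # seconds
    end: float    # seconds
    text: str


def cps(seg: Segment) -> float:
    """Characters per second for one segment (0.0 if the duration is not positive)."""
    duration = seg.end - seg.start
    return len(seg.text) / duration if duration > 0 else 0.0


def needs_cps_warning(seg: Segment, limit: float = MAX_CPS) -> bool:
    """True when the segment is denser than the assumed reading-speed limit."""
    return cps(seg) > limit


def split_segment(seg: Segment, char_index: int) -> Tuple[Segment, Segment]:
    """Split at a character index, dividing the time range in proportion to text length."""
    if not 0 < char_index < len(seg.text):
        raise ValueError("char_index must fall strictly inside the text")
    ratio = char_index / len(seg.text)
    cut = seg.start + (seg.end - seg.start) * ratio
    left = Segment(seg.start, cut, seg.text[:char_index].strip())
    right = Segment(cut, seg.end, seg.text[char_index:].strip())
    return left, right


def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments: List[Segment]) -> str:
    """Render segments as SRT: index line, time range line, text, blank separator."""
    blocks = []
    for i, seg in enumerate(segments, start=1):
        time_range = f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}"
        blocks.append(f"{i}\n{time_range}\n{seg.text}\n")
    return "\n".join(blocks)
```

Splitting by character ratio is only a rough heuristic; an editor with word-level timestamps could place the cut more accurately.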

Roadmap

  • Local folder as media input (#15).
  • Full-text search all transcriptions.
  • User authentication.
  • Audio recording from the browser.
  • Add insanely-fast-whisper as an optional backend (#53).
  • Support for GPU acceleration.
    • Non-NVIDIA GPU support. Is it possible with faster-whisper?
  • Can we do something with seamless_communication?

Project structure

Whishper is a collection of pieces that work together. The three main pieces, followed by the three third-party services they rely on, are:

  • Transcription-API: This is the API that enables running Faster-Whisper. You can find it in the transcription-api folder.
  • Whishper-Backend: This is the backend that coordinates frontend calls, database, and tasks. You can find it in the backend folder.
  • Whishper-Frontend: This is the frontend (web UI) of the application. You can find it in the frontend folder.
  • Translation (3rd party): This is the libretranslate container that is used for translating subtitles.
  • MongoDB (3rd party): This is the database that stores all the information about your transcriptions.
  • Nginx (3rd party): This is the proxy that allows running everything from a single domain.
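
As a rough illustration of what the Transcription-API piece wraps, the sketch below calls the faster-whisper library directly and converts its output into JSON-friendly dictionaries. It is not the project's implementation: the function names, the `small` model size, the `int8` compute type, and the output shape are assumptions, and the `faster_whisper` import is deferred so the helper can be tried without the library installed.

```python
from types import SimpleNamespace
from typing import Any, Dict, Iterable, List


def segments_to_json(segments: Iterable[Any]) -> List[Dict[str, Any]]:
    """Convert objects exposing .start, .end and .text into plain dictionaries."""
    return [
        {"start": round(s.start, 3), "end": round(s.end, 3), "text": s.text.strip()}
        for s in segments
    ]


def transcribe(path: str, model_size: str = "small", device: str = "cpu") -> Dict[str, Any]:
    """Transcribe one media file with faster-whisper (requires `pip install faster-whisper`)."""
    from faster_whisper import WhisperModel  # deferred: only needed for real transcription

    # Assumed settings; use device="cuda" to run on an NVIDIA GPU.
    model = WhisperModel(model_size, device=device, compute_type="int8")
    # transcribe() returns a lazy generator of segments plus detection info.
    segments, info = model.transcribe(path, beam_size=5)
    return {"language": info.language, "segments": segments_to_json(segments)}


# Stand-in segments so the conversion can be checked without a model download.
demo = [
    SimpleNamespace(start=0.0, end=1.2345, text=" Hello world "),
    SimpleNamespace(start=1.2345, end=2.5, text=" Second line"),
]
demo_json = segments_to_json(demo)
```

In the real stack the backend would persist a result like this in MongoDB and the frontend would fetch it through Nginx; that wiring is left out here.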

Contributing

Contributions are welcome! Feel free to open a PR with your changes, or take a look at the issues to see if there is something you can help with.

Development setup

Check out the development documentation here.

Screenshots

Screenshots are available on the official website, https://whishper.net.

Contributors

pluja, jonascz, dependabot[bot], hundehausen, ftchd, miketwo