voice-cue's Introduction

Introducing VoiceCue 💿

Forget about listening to long, boring voice recordings and processing them manually.

About 👀

VoiceCue lets you find sentiments, tags, entities, and actions in your voice recordings like a DJ.

Preview

The app workflow is simple: upload your voice recording, select which type of analysis to perform, and click any generated cue in the list to jump instantly to its exact position in the recording.
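As a rough illustration of the cue-to-position step, the sketch below looks up a word in a word-level transcript and converts each match into a playback position. It assumes the transcript provides per-word `word`, `start`, and `end` fields (in seconds), which is how Deepgram reports word timings. The `TranscriptWord` and `Cue` types and the `findCues` helper are hypothetical names chosen for this example and are not taken from the VoiceCue source.

```typescript
// One word of a word-level transcript. The field names follow the per-word
// timing fields in Deepgram responses; the type itself is hypothetical.
interface TranscriptWord {
  word: string;
  start: number; // seconds from the beginning of the recording
  end: number;   // seconds from the beginning of the recording
}

// A cue the user can click on (hypothetical shape).
interface Cue {
  word: string;
  start: number;    // position in seconds
  progress: number; // position as a fraction of the recording, 0..1
}

// Find every occurrence of `query` (case-insensitive, whole word) and
// compute where in the recording each occurrence starts.
function findCues(
  words: TranscriptWord[],
  query: string,
  duration: number
): Cue[] {
  const needle = query.trim().toLowerCase();
  if (needle === "" || duration <= 0) return [];
  return words
    .filter((w) => w.word.toLowerCase() === needle)
    .map((w) => ({
      word: w.word,
      start: w.start,
      // Clamp so that slightly out-of-range timings still yield a valid fraction.
      progress: Math.min(1, Math.max(0, w.start / duration)),
    }));
}

// Made-up sample data for a 10-second recording.
const sampleWords: TranscriptWord[] = [
  { word: "great", start: 0.5, end: 0.9 },
  { word: "meeting", start: 1.0, end: 1.6 },
  { word: "great", start: 5.0, end: 5.4 },
];

const cues = findCues(sampleWords, "Great", 10);
console.log(cues.map((c) => c.start)); // [ 0.5, 5 ]
```

A click handler could then pass `progress` to the audio player. wavesurfer.js, for instance, exposes `seekTo(progress)`, which takes a value between 0 and 1. How VoiceCue itself wires this up is not shown here.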

Features ✨

  1. Voice recognition - AI-based solution by Deepgram
  2. General stats - an overview of the voice recording
  3. Sentiment analysis - positive and negative word detection
  4. Word cloud generation - most used word classification
  5. Entity name recognition - categories such as person, place, etc.
  6. Activity tracking - find actions in past, present or future
  7. Interactive transcript - see progress or click to control it
  8. Speaker detection - total number of speakers in recording
  9. Cue word usage - transcript samples for better context
  10. Custom search - extended ability to query for cues
  11. Waveform preview - see the dynamics of voice, identify silences
  12. Audio controls - play, pause, fast forward and backward
  13. Drag and drop support - drop audio in the file select area
  14. Upload MP3 files - most commonly used audio format
  15. Progress loaders - improved UX for loading transcripts
  16. Fully responsive - works fine on mobile and tablets
  17. Colorful UI - for easier interaction and highlighting
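To make the word cloud feature more concrete, here is a minimal sketch of how most-used words could be counted from a transcript. It is an assumption about one possible approach, not the project's actual implementation. The `buildTags` helper and the small stop-word list are hypothetical, and the `{ value, count }` output shape is chosen to match the tag objects that react-tagcloud accepts.

```typescript
// Tag shape with `value` and `count`, matching what react-tagcloud expects.
interface Tag {
  value: string;
  count: number;
}

// A tiny, illustrative stop-word list; a real app would likely use a fuller one.
const STOP_WORDS = new Set([
  "the", "a", "an", "and", "or", "to", "of", "in", "is", "it",
]);

// Count word frequencies in a transcript and return the `limit` most used words.
function buildTags(transcript: string, limit: number = 20): Tag[] {
  const counts = new Map<string, number>();
  // Lowercase the text and split it into simple alphabetic tokens.
  const tokens = transcript.toLowerCase().match(/[a-z']+/g) ?? [];
  for (const token of tokens) {
    if (STOP_WORDS.has(token)) continue;
    counts.set(token, (counts.get(token) ?? 0) + 1);
  }
  const tags: Tag[] = [];
  counts.forEach((count, value) => tags.push({ value, count }));
  // Most frequent first; ties are broken alphabetically for stable output.
  tags.sort((a, b) => b.count - a.count || a.value.localeCompare(b.value));
  return tags.slice(0, limit);
}

const tags = buildTags(
  "The demo went well and the demo audience loved the product demo",
  3
);
console.log(tags.map((t) => `${t.value}:${t.count}`)); // [ 'demo:3', 'audience:1', 'loved:1' ]
```

The resulting array could be handed to a tag cloud component as its `tags` prop. The stop-word filtering and the limit are design choices made for this example only.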

Tech stack 🛠️

NextJS - React application framework

Deepgram - for AI-based speech recognition

compromise, sentiment - for text processing

react-tagcloud - to generate word cloud

react-tabs - for navigation panels

react-drag-drop-files - for drag and drop support

wavesurfer.js - to generate the audio waveform

GitHub - to host the code

Vercel - to deploy the project

ESLint, prettier - for linting and code formatting

Namecheap - for custom subdomain

License 📚

VoiceCue is an open source project. Feature requests are welcome.

The project is released under the terms of the MIT license.

voice-cue's People

Contributors

madzadev
