research-arvind_arnav

Developing an Automated Algorithm for Transcribing and Summarizing Videos

Overview

Motivation

  • Issue: billions of gigabytes of video are produced daily
  • Audience: everyday digital citizens
  • Need: concise, accurate summaries
  • Purpose: quickly gauging content’s value

Proposed Solution

We will develop an algorithm that automates the transcription and summarization of speech in TED talks, using the Google Cloud Speech-to-Text API for transcription and natural language processing (NLP) for summarization.
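The summarization method is not described in this README. As a rough illustration only, the sketch below shows one common approach, frequency-based extractive summarization, which keeps the sentences whose words occur most often in the transcript. It uses only the standard library so that it runs anywhere; the names `STOPWORDS`, `split_sentences`, and `summarize` are hypothetical and are not taken from the project code, which lists NLTK among its requirements.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption; NLTK ships a fuller one).
STOPWORDS = {
    "a", "an", "and", "the", "of", "to", "in", "is", "it", "that", "for",
    "on", "as", "with", "was", "are", "be", "this", "by", "or", "at", "from",
}


def split_sentences(text):
    """Naively split text after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [part for part in parts if part]


def content_words(text):
    """Lowercase the text and return its words, minus stopwords."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def summarize(text, max_sentences=2):
    """Return the highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    freq = Counter(content_words(text))

    def score(sentence):
        tokens = content_words(sentence)
        if not tokens:
            return 0.0
        # Average word frequency, so long sentences are not favored by length alone.
        return sum(freq[w] for w in tokens) / len(tokens)

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in chosen)
```

Calling `summarize(transcript, max_sentences=4)` on a transcript string would return four sentences in their original order. Note that the naive splitter depends on punctuation, which raw speech-to-text output may lack.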

Requirements

  • Google Cloud Speech-to-Text API
  • Pydub
  • NLTK
  • NumPy

Installation Instructions

  • Google Cloud Speech-to-Text API - Create an account using your Google account, download the account information as a JSON file, and install the packages using Homebrew
  • Pydub - Install using Homebrew
  • NumPy - Install using Homebrew
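Once the JSON file is downloaded, Google Cloud client libraries typically locate it through the `GOOGLE_APPLICATION_CREDENTIALS` environment variable. The sketch below assumes the downloaded file is a service account key (the README does not say which kind of JSON file it is); the helper name `use_credentials` and the file name in the usage note are hypothetical.

```python
import json
import os


def use_credentials(json_path):
    """Check a downloaded key file and point Google client libraries at it."""
    with open(json_path, encoding="utf-8") as handle:
        info = json.load(handle)

    # Assumption: the file is a service account key, whose "type" field
    # is set to "service_account".
    if info.get("type") != "service_account":
        raise ValueError(f"{json_path} does not look like a service account key")

    # Google Cloud client libraries read this variable to find credentials.
    os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = os.path.abspath(json_path)
    return info.get("project_id")
```

For example, `use_credentials("account_info.json")` would return the project ID stored in the file and make the key visible to any client created later in the same process.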

Run Instructions

  • Run google_stt.py in Terminal with an audio file stored in the audio folder as the input (e.g., python3 google_stt.py speech.mp3). Note: the audio file must have only ONE audio channel.
  • Run test.py in Terminal (e.g., python3 test.py)
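Because the input must have a single audio channel, it can help to check a file before running the script. The sketch below is a minimal check using Python's standard `wave` module, so it only handles WAV files; an MP3 such as the one in the example above would need to be decoded first. The helper names `channel_count` and `is_mono` are hypothetical and are not part of the project.

```python
import wave


def channel_count(wav_path):
    """Return the number of audio channels in a WAV file."""
    with wave.open(wav_path, "rb") as wav_file:
        return wav_file.getnchannels()


def is_mono(wav_path):
    """True when the WAV file has exactly one audio channel."""
    return channel_count(wav_path) == 1
```

If a file turns out to be stereo, pydub can downmix it: `AudioSegment.from_file(path).set_channels(1)` returns a mono segment, which can then be written out with its `export` method. pydub relies on ffmpeg to read MP3 files.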

Sample Output

From summary.txt:

Like other predominantly black churches across the country Trinity embodies the black community in its entirety the doctor and the welfare mom the model student and the former gangbanger. The press has scoured every exit poll for the latest evidence of racial polarization not just in terms of white and black but black and brown as well. I am the son of a black man from Kenya and a white woman from Kansas. A lack of economic opportunity among black men and the shame and frustration that came from not being able to provide for ones family contributed to the erosion of black families a problem that welfare policies for many years may have worsened.

Acknowledgments

The authors would like to acknowledge TJHSST Computer Systems Lab Director Dr. Patrick White for his guidance and support as our research mentor.

Authors

Repository contributors:

  • aravipati12
  • arnavbansal1
