Giter VIP home page Giter VIP logo

summarize_audio_video's Introduction

title emoji colorFrom colorTo sdk python_version sdk_version app_file pinned
Audio and Video Content Analyzer
๐ŸŽฅ
blue
green
streamlit
3.8
1.27.2
app.py
false

Summarize Audio and Video with Semantic Retrieval, Chatbot, and LLM

Alt Text

Description

This project is an all-in-one solution for audio and video content analysis:

  1. Summarization: Generates concise summaries using advanced Natural Language Processing, powered by HuggingFace Transformers.
  2. Semantic Retrieval: Enables you to find specific words, phrases, or segments, thanks to Whisper Timestamped by Linto.
  3. Chatbot Interface: Features a chatbot that can answer queries about the audio or video content, leveraging Language Models for Machines (LLM).

The project is built using Python and integrates various libraries including Whisper Timestamped, HuggingFace Transformers, and Streamlit for a seamless user experience.

Project Structure

.
โ”œโ”€โ”€ Dockerfile                   # Dockerfile for setting up the environment
โ”œโ”€โ”€ LICENSE                      # License file
โ”œโ”€โ”€ README.md                    # This README file
โ”œโ”€โ”€ YoutubeAudios                # Directory containing YouTube audio files
โ”œโ”€โ”€ _requirements.txt            # Requirements file
โ”œโ”€โ”€ app.py                       # Streamlit application file
โ”œโ”€โ”€ config.py                    # Configuration file
โ”œโ”€โ”€ keyword_retriever            # Module for keyword retrieval
โ”‚   โ””โ”€โ”€ keyword_retreiver.py     # Keyword retriever script
โ”œโ”€โ”€ logger.py                    # Logging utility
โ”œโ”€โ”€ notebooks                    # Jupyter notebooks for development and testing
โ”œโ”€โ”€ pdf_test.py                  # PDF testing script
โ”œโ”€โ”€ query_service                # Query service module
โ”‚   โ””โ”€โ”€ query_engine.py          # Query engine script
โ”œโ”€โ”€ requirements.txt             # Requirements file
โ”œโ”€โ”€ resource_loader              # Resource loader module
โ”‚   โ”œโ”€โ”€ json_loader.py           # JSON loader script
โ”‚   โ”œโ”€โ”€ linkedin_loader.py       # LinkedIn loader script
โ”‚   โ”œโ”€โ”€ uploaded_video_loader.py # Uploaded video loader script
โ”‚   โ”œโ”€โ”€ video_loader_interface.py# Video loader interface script
โ”‚   โ””โ”€โ”€ youtube_loader.py        # YouTube loader script
โ”œโ”€โ”€ summarization_service        # Summarization service module
โ”‚   โ””โ”€โ”€ summarizer.py            # Summarizer script
โ”œโ”€โ”€ transcription_service        # Transcription service module
โ”‚   โ””โ”€โ”€ transcriber.py           # Transcriber script
โ””โ”€โ”€ utils.py                     # Utility functions

Prerequisites

  • Python 3.8+
  • Docker (optional)

Built With

Setup and Installation

Setup and Installation

There are two methods to get the project up and running:

Method 1: Using Docker

  1. Clone the repository.
  2. Navigate to the project directory.
  3. Build the Docker image:
    docker build -t summarizer .
  4. Run the Docker container:
    docker run -p 8501:8501 summarizer
  5. Open your web browser and go to http://localhost:8501.

Method 2: Using Python Environment

  1. Clone the repository.
  2. Navigate to the project directory.
  3. Install the requirements:
    pip install -r requirements.txt
  4. Run the Streamlit app:
    streamlit run app.py
  5. Open your web browser and go to http://localhost:8501.

Using Python Environment

  1. Clone the repository.
  2. Navigate to the project directory.
  3. Install the requirements:
    pip install -r requirements.txt
  4. Run the Streamlit app:
    streamlit run app.py

Usage

  1. Open the Streamlit app in your web browser.
  2. Follow the instructions on the screen to upload or specify your audio/video content.
  3. Click "Submit" to generate a summary, perform semantic retrieval, or interact with the chatbot.

Contributing

Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests.

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

summarize_audio_video's People

Contributors

aazizisoufiane avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.