Giter VIP home page Giter VIP logo

ai_lecturenote_v1's Introduction

AI_LectureNote_v1

Transform Your Medical Lectures Into Smart Scripts: Accuracy Meets Efficiency

An automation tool that converts lecture recordings into scripts and summaries. Unlike similar services like Naver Clova Note and Daglo, our software offers unique features tailored for medical education. It translates all medical terminology into English and eliminates filler words and stutters.

For example:

  • Original: "3번은 닉몬이야. 그래서 폐렴이라고 하는 거는"
    Processed: "3위는 Pneumonia입니다. 그래서 Pneumonia이"

  • Original: "이 암이라는 것은 mmr살프로리퍼레이션 서브와이블 때문에 생기는 질환이에요."
    Processed: "Cancer는 주로 MMR (Mismatch Repair) 때문에 발생하는 disease입니다."

The summaries generated are succinct and accurate, and even include a content review quiz.

Table of Contents

Overview

This Python script automatically processes raw text files by correcting errors, translating to English, summarizing the content, and converting it to a Word document.

Prerequisites

  • Python 3.x
  • OpenAI API Key
  • Following Python packages:
    • python-dotenv
    • openai
    • docx

Run the following commands to install the necessary packages:

pip install python-dotenv openai docx

Configuration

  1. Create a .env file in the root directory and add your OpenAI API Key:
    OPEN_API_KEY=your-api-key
    
  2. Update the raw_root_dir variable with your actual root directory path for raw text files.

How it Works

File Scanning

The function get_txt_files scans the raw_root_dir for all .txt files.

Text Processing

raw_text_to_product performs multiple operations on each text file:

  • Correction
  • Translation to English
  • Summarization

File Movement

After processing, the raw text file is moved to the used_raw_root_dir.

Error Handling

The script retries up to 5 times if any errors are encountered.

Usage

Run the txt_to_product.ipynb to start the automated process. It will process all raw text files in the raw_root_dir.

Customization Options

  • Location of Raw Files: The location specified by raw_root_dir in txt_to_product.ipynb can be changed according to your needs.
  • File Organization: After the script and summary are generated, they can be organized using OrganizeFiles.ipynb.

Directory Structure

  • Raw files: Placed in raw_root_dir
  • Processed files: Moved to used_raw_root_dir
  • Intermediate and final products: Stored in intermediate, englished, and summary directories

File Naming Convention

Raw text files must follow this naming convention: raw_subject_week_day_numb.txt

Contributors

  • LEE KYEONGEON
  • KIM TAEHONG

Contact

For any questions or further information, feel free to reach out at [email protected].

ai_lecturenote_v1's People

Contributors

boyskier avatar

Stargazers

Chihaya avatar  avatar  avatar saneo avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.