Giter VIP home page Giter VIP logo

styletransfer_speechgen's Introduction

Repository for Language Generation and Summarization Project:

Description: This project aims to build a constrained generative model along the axes of politeness using exisitng politeness strategy models.

Installation and setup

  1. Install requirements.txt
pip install -r requirements.txt
pip install git+https://github.com/huggingface/transformers
  1. Create a wandb account for recording model results and performance issues.

Use

The main APIs to gather and preprocess tweets are contianed in "Experimentation/api/". A walk through of this process is started in "Experimentation" starting with "Scrapping_Cleaned_Unconstrained.ipynb". It uses snscrape to gather data and then processes the raw data for suitable use. Full_model_processing.ipynb completes the whole processing step post scrapping and cleaning. Once it is run, GPT2_model_new.ipynb is used to fine tune gpt-2 models for the constrained case and "GPT2_model_old.ipynb" is used to fine tune gpt-2 models that are unconstrained. The file "Generate_Human_Evaluation_Unconstrained.ipynb" generates csv files that can be used for human evaluation of output generation. The scoring for this output was done manually by human annotators located at https://docs.google.com/spreadsheets/d/1p1CCXQranUoIe5MO4VuEUfSCtboTBzyQIKcRcezktQ0/edit#gid=1528582561.

The full order to understand this project is in the "Experimentaion" folder: ** Scrapping_Cleaned_Unconstrained.ipynb --> Full_Model_Processing_Constrained --> GPT2_model_old --> GPT2_model_new --> Generate_Human_Evaluations_Unconstrained ** Baselines for politness are available through runing baselines_politeness which is the demo file from https://github.com/CornellNLP/ConvoKit/blob/master/examples/politeness-strategies/politeness_demo.ipynb to check that the API is functional for this instance (please note all credit for that notebook goes to https://convokit.cornell.edu/documentation/index.html).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.