Giter VIP home page Giter VIP logo

hsaurabh0919 / ctransformers Goto Github PK

View Code? Open in Web Editor NEW
3.0 3.0 0.0 21.25 MB

Implementing wide variety of transformers, fine tuning as well as trying architectural variants from various research papers and blogs.

Jupyter Notebook 99.98% Python 0.02%
transformer bert pytorch huggingface-transformers sentiment-analysis weak-supervision snorkel bert-model nlp roberta sentence-embeddings sentence-transformers chatgpt gpt-3 gpt-4 bart fine-tuning pretraining

ctransformers's Introduction

Implementing wide variety of transformers (improvised version),finetuning as well as trying architectural variants from various research papers and blogs.

  1. Roberta Pretraining : Basic Implementation , More detail about the model can be found here.
  2. Implementing Fast Transformers: Failed Attempt, FastTransformer Algorithm can be found here.
  3. Adaptive Transformers for multimodal Representations: Research paper can be found here.
  4. Learning Cross-Modality Encoder Representation from transformers: Paper can be found here.

My text generation model trained on Bhagvad_Gita (Holy Hindu Scripture English version) is now available on huggingface and can be found here.

The easiest way to use them is as below:


from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("epsil/bhagvad_gita")

model = AutoModelForCausalLM.from_pretrained("epsil/bhagvad_gita")

Weak Supervision model build wirh Snorkel and with Finance dataset taken from here. Majority Vote Modelling with some heuristics(rules) and currently work on 3 labelling function. Overall accuracy reached is about 61% and can be enhanced further by dealing with mixed sentence heuristics.

  1. Finetuning of Sentence Transformer for customized dataset.
  2. Improvements in Comparison Methods and Advanced Comparison Techniques.
  1. Performance evaluation of llama-7B Model
  2. Performance evaluation of Redpajama-3B Model
  3. Performance evaluation of Bloke Syntara-7B Model

Stay Tuned !!!

ctransformers's People

Contributors

hsaurabh0919 avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.