Giter VIP home page Giter VIP logo

Manuel 's Projects

albert icon albert

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

bert icon bert

TensorFlow code and pre-trained models for BERT

botpercent icon botpercent

implementation of "BotPercent: Estimating Twitter Bot Populations from Groups to Crowds"

covid-berts icon covid-berts

BERT models pretrained on the CORD-19 Kaggle dataset

de-wiki-text-corpus-tools icon de-wiki-text-corpus-tools

Python scripts to process german wiki dump. This is to generate a german text corpus for supervised word representation learning. Especially for training an BILM.

ekphrasis icon ekphrasis

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

electra-1 icon electra-1

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

elus icon elus

Bayesian ideal points of French politicians, based on Twitter data.

embedded-topic-model icon embedded-topic-model

A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM

emoji icon emoji

emoji terminal output for Python

fast-bert icon fast-bert

Super easy library for BERT based NLP models

flaubert icon flaubert

Unsupervised Language Model Pre-training for French

get-tweets icon get-tweets

Single Python script to get tweet JSON objects from a list of tweet IDs

gpt-2-simple icon gpt-2-simple

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

gpt2 icon gpt2

An implementation of training for GPT2, supports TPUs

gpt2-ml icon gpt2-ml

GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型

hactproject_julia icon hactproject_julia

Describes and solves some simple HACT models in Julia. The notes and code is modified and translated from Benjamin Moll's notes and codes: http://www.princeton.edu/~moll/notes.htm and http://www.princeton.edu/~moll/HACTproject.htm).

hs-survey-cultural-bias icon hs-survey-cultural-bias

Resources for WOAH 2024 paper: "From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets"

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.