Giter VIP home page Giter VIP logo

Hi

I'm Vladimir Gurevich, ML/NLP Engineer (IR tasks, such as Semantic Search, Information Extraction tasks, such as NER, Relation Extraction, etc.).

I am also interested in Speech Recognition and in LLMs.

Works:

  • jupyter-notebook-viewer - Jupyter Notebook Viewer for local files *.ipynb in browser without Jupyter Notebook installation.
  • wav2vec2-hebrew - package for speech recognition in Hebrew language using wav2vec2 models that were trained on Hebrew datasets (check out the datasets below).
  • distiller - distillation TextClassification and TokenClassification models using transformers library with different distillation methods.
  • spacy-trankit - spacy wrapper for Trankit (NLP pipeline for tokenization+dependency parsing+lemmatization, etc.)

Models:

Speech Recognition:

Datasets:

Contacts

LinkedIn

Vladimir Gurevich's Projects

abydos icon abydos

Abydos NLP/IR library for Python [imvladikon] made some changes

campus-dl icon campus-dl

A simple tool to download video lectures from campus.gov.il (based on edx-dl)

cdatasketch icon cdatasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble [imvladikon] added cython implementations

character-bert icon character-bert

Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"

charbert icon charbert

CharBERT: Character-aware Pre-trained Language Model (COLING2020)

datasets icon datasets

🤗 Fast, efficient, open-access datasets and evaluation metrics in PyTorch, TensorFlow, NumPy and Pandas

distil-whisper icon distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

distiller icon distiller

knowledge distillations for bert (classification, token classification models)

easyocr icon easyocr

Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai

evaluate icon evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

kdbush icon kdbush

Naive implementation of Java KD-Bush(KD-tree) algorithm

knesset-2004-2005 icon knesset-2004-2005

A corpus of transcriptions of Knesset (Israeli parliament) meetings between January 2004 and November 2005.

nbpreview icon nbpreview

Render IPython/Jupyter notebooks without running a notebook server.

news_scrapers icon news_scrapers

This repository contains scripts for scraping news from different sources

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.