Topic: language-identification
Something interesting about language-identification
language-identification,Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
User: adbar
Home Page: https://adrien.barbaresi.eu/blog/simple-multilingual-lemmatizer-python.html
language-identification,Efficient Text Localization Algorithm, Image Inversion Detection of Scanned Documents & Language Identification based on Shape Context and Traditional Computer Vision.
User: adroitanandai
Home Page: https://ieeexplore.ieee.org/document/9550858
language-identification,Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.
User: aparnadutta
language-identification,Simple embedding based text classifier inspired by fastText, implemented in tensorflow
User: apcode
language-identification,GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages -- under review
Organization: cisnlp
Home Page: https://huggingface.co/datasets/cis-lmu/GlotCC-V1
language-identification,GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
Organization: cisnlp
Home Page: https://arxiv.org/abs/2310.16248
language-identification,GlotScript: A Resource and Tool for Low Resource Writing System Identification -- LREC 2024
Organization: cisnlp
Home Page: https://arxiv.org/abs/2309.13320
language-identification,Natural language detection, Java bindings for CLD2
Organization: commoncrawl
language-identification,fastlangid, the only language identification package that supports Cantonese (zh-yue), Simplified Chinese (zh-hans), and Traditional Chinese (zh-hant)
Organization: currentslab
Home Page: https://pypi.org/project/fastlangid/
language-identification,Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼
Organization: dataiku
Home Page: https://www.dataiku.com/product/plugins/nlp-preparation/
language-identification,✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and langua
User: doodlebears
Home Page: https://pypi.org/project/split-lang/
language-identification,Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Organization: echogarden-project
language-identification,Detect the languages from short pieces of text
Organization: floydhub
language-identification,A collection of sample apps to demonstrate how to use Google's ML Kit APIs on Android and iOS
Organization: googlesamples
language-identification,Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek
User: hb20007
language-identification,Multi-Language Identification
Organization: hiredscorelabs
language-identification,Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks
Organization: hpi-deeplearning
language-identification,
User: igorsitdikov
language-identification,Easy language identification of 380 languages
User: jonsafari
language-identification,Demo: Elasticsearch Language Identification
User: joshdevins
language-identification,Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
User: krishnadn
language-identification,Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
User: krishnadn
language-identification,⚡️ 80x faster language detection with Fasttext | Split text by language for TTS
Organization: llmkira
language-identification,FastText Pytorch version
User: loretoparisi
language-identification,A program that predicts the language of a given text (like Google Translate ^^)
Organization: ltkk
Home Page: https://nguyenvanhieu.vn/chuong-trinh-du-doan-ngon-ngu/
language-identification,Language Identification Toolkit
User: martinthoma
language-identification,A word-level language identification tool that identifies the language of individual words in code-mixed text, e.g. text that mixes words from two languages, such as Hindi written in Roman script and English.
Organization: microsoft
language-identification,A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Organization: modelscope
language-identification,Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features
User: nipunmanral
language-identification,Fast and accurate natural language detection. Detector written in PHP. Nito-ELD, ELD.
User: nitotm
language-identification,Fast and accurate natural language detection. Detector written in Javascript. Nito-ELD, ELD.
User: nitotm
language-identification,Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.
User: nitotm
language-identification,The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
User: pemistahl
language-identification,The most accurate natural language detection library for Go, suitable for short text and mixed-language text
User: pemistahl
language-identification,The most accurate natural language detection library for Python, suitable for short text and mixed-language text
User: pemistahl
language-identification,The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
User: pemistahl
language-identification,End-to-end spoken language identification out of the box.
Organization: py-lidbox
language-identification,Rosette API Client Library for Python
Organization: rosette-api
language-identification,CodeSwitch is an NLP tool that can be used for language identification, POS tagging, named entity recognition, and sentiment analysis of code-mixed data.
User: sagorbrur
Home Page: https://codeswitch.readthedocs.io
language-identification,Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024)
User: sinaahmadi
Home Page: https://arxiv.org/ftp/arxiv/papers/2403/2403.01983.pdf
language-identification,PALI: Language identification for Perso-Arabic Scripts
User: sinaahmadi
Home Page: https://aclanthology.org/2023.vardial-1.8/
language-identification,The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at ICASSP-2023)
Organization: skit-ai
language-identification,Dataset for programming language identification.
User: smola
language-identification,TensorFlow-based spoken language identification
Organization: speechflow-io
language-identification,End to End Dialect Identification using Convolutional Neural Network
User: swshon
language-identification,Textpipe: clean and extract metadata from text
Organization: textpipe
language-identification,AfroLID, a powerful neural toolkit for African language identification that covers 517 African languages.
Organization: ubc-nlp
Home Page: https://demos.dlnlp.ai/afrolid/
language-identification,Vietnamese NLP Toolkit for Node
User: vunb
language-identification,Spoken Language Identification on Common Voice and AudioSet using Deep Learning
Organization: zkmkarlsruhe