Giter VIP home page Giter VIP logo

crazydreamer's Projects

language-identification icon language-identification

The Tatoeba corpus is a large, open source collection of short sentences and their translations in different languages. Due to the open source nature of the work, sentences are sometimes filed under the wrong language which is obviously undesirable. This project aims to present candidates of incorrect sentence translations of the corpus to the user. The most common form of language identification is the n-gram method which is not optimal for short sentences, such as the ones in the corpus, so this project will instead use an extended dictionary method as described by Řehůřek and Kolkus (2009).

languagedetection icon languagedetection

Multi language detection using nltk. (Detection of English, French, German, Dutch, Swedish)

languageidentification icon languageidentification

A new attempt at finding the best language identification algorithm (markov model, cross entropy, graph based, ngramdisplacement and SVM) in python using k-fold cross validation.

lantern icon lantern

:izakaya_lantern: Open Internet for everyone. Lantern is a free application that delivers fast, reliable and secure access to the open Internet for users in censored regions. It uses a variety of techniques to stay unblocked, including domain fronting, p2p, and pluggable transports.

ldig icon ldig

Language Detection with Infinity-gram

makehuman icon makehuman

This is now a near viable port of the current MakeHuman 1.1.1 stable branch to a Python 3 dependency. The port includes support for the pyside binding to QT4. The intention is ultimately to move to QT5 support as final bugs are fixed.

malvo icon malvo

A programming contest platform

mecab-chinese icon mecab-chinese

Chinese morphological analysis with Word Segment and POS Tagging data for MeCab

memo icon memo

Memo is an open-source, programming-oriented spaced repetition software (SRS) written in Flutter.

mermaid icon mermaid

Generation of diagram and flowchart from text in a similar manner as markdown

nmt icon nmt

TensorFlow Neural Machine Translation Tutorial

nof5 icon nof5

No longer wear out your F5 key! This simple PHP script will let you avoid refreshing the browser each time you change a file!

personalization-vocabulary icon personalization-vocabulary

根据个人的英语水平、兴趣和经常阅读的文章, 自动创建一个自定义的生词库,供背单词使用。

player icon player

html5版本音乐播放器,支持iOS设备

player-1 icon player-1

一个简单的音乐播放器,支持歌词同步,调整音量,调整播放进度。SoundManage强力驱动。

polyglot icon polyglot

Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the languages therein.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.