Unsupervised learning for predicting casing information
Contains scripts for creating a bigram language model using the Wikipedia english dump using Map Reduce for [calculating tf-idf scores] (https://github.com/Vatshank/nlp-casing/blob/master/blm_mapreduce.py) and using it for [predicting casing information] (https://github.com/Vatshank/nlp-casing/blob/master/blm_predict.py).