code base to Predict whether a comment posted during a public discussion is considered insulting to one of the participants.
- Kaggle (2k): https://www.kaggle.com/c/detecting-insults-in-social-commentary
- Twitter dataset (16K) :
Ensemble of classifiers : Logistic regression, SVM, Kmeans Classifiers, Multiple bag of words features such as : TFIDF, DeltaTFIDF, Count, Dictionary Ensemble Classifier using : -- Majority vote -- Blending
Best Acheved score : 0.862 roc_auc on kaggledataset and 0.93 on twitterdataset