
politeness's Introduction

Stanford Politeness API

A new home: ConvoKit (July 2019)

The functionality of this API is now integrated into ConvoKit. This new implementation more directly exposes politeness strategies, which can be used in downstream tasks (see the example scripts in ConvoKit). As such, we are discontinuing support for this API.

Version 2.00 (released March 2017)

A Python 3 version is available here: https://github.com/sudhof/politeness/tree/python3 (refactored from Version 1.01 through the kindness of Benjamin Meyers).

Note: This Python 3 version has not yet been tested by us, nor compared against the results from our paper (listed below). The code used in the paper is still here in the master branch of this repository (keep reading).

Version 1.01 (released October 2014)

Python implementation of a politeness classifier for requests, based on the work described in:

A computational approach to politeness with application to social factors.  	
Cristian Danescu-Niculescu-Mizil, Moritz Sudhof, Dan Jurafsky, Jure Leskovec, Christopher Potts.  
Proceedings of ACL, 2013.

We release this code hoping that others will use and improve on our work.

NOTE: If you use this API in your work please send an email to [email protected] so we can add you to our list of users. Thanks!

Further resources:

Info about our work: http://cs.cornell.edu/~cristian/Politeness.html

A web interface to the politeness model: http://politeness.cornell.edu/

The Stanford Politeness Corpus: http://cs.cornell.edu/~cristian/Politeness_files/Stanford_politeness_corpus.zip

Using this API you can:

  • classify requests with politeness.model.score (using the provided pre-trained model)

  • train new models on new data using politeness.scripts.train_model

  • experiment with new politeness features in politeness.features.vectorizer and politeness.features.politeness_strategies

Input: Requests must be pre-processed into sentences and dependency parses. We used nltk's PunktSentenceTokenizer for sentence tokenization and Stanford CoreNLP version 1.3.3 for dependency parsing. A sample of the expected format for documents is given in politeness.test_documents.
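To make the expected shape concrete, here is a minimal, hedged sketch of scoring one pre-parsed request. The field names and dependency strings below mirror the examples in politeness.test_documents but are illustrative assumptions, not a normative schema; run from the repository root so the politeness package is importable.

# Hedged sketch: score one pre-parsed request with the bundled model.
from politeness.model import score

doc = {
    "text": "Could you please look at this when you get a chance?",
    "sentences": ["Could you please look at this when you get a chance?"],
    # One list of CoreNLP-style dependency strings per sentence (illustrative).
    "parses": [[
        "aux(look-4, Could-1)", "nsubj(look-4, you-2)",
        "advmod(look-4, please-3)", "root(ROOT-0, look-4)",
        "prep_at(look-4, this-6)", "advmod(get-9, when-7)",
        "nsubj(get-9, you-8)", "advcl(look-4, get-9)",
        "det(chance-11, a-10)", "dobj(get-9, chance-11)",
    ]],
}

probs = score(doc)  # e.g. {'polite': 0.7, 'impolite': 0.3}
print(probs)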

Caveat: This work focuses on requests, not all kinds of utterances. The model's predictions on non-request utterances will be less accurate. As a bonus, our code also includes a very simple heuristic to check whether a document looks like a request (see politeness.request_utils).
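A hedged sketch of using that heuristic as a pre-filter before scoring; the helper name check_is_request is an assumption about what politeness.request_utils exposes, so check the module for the actual name:

from politeness.model import score
from politeness.request_utils import check_is_request

# doc as in the scoring sketch above
if check_is_request(doc):
    print(score(doc))
else:
    print("Does not look like a request; the score may be unreliable.")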

Requirements:

Python package requirements are listed in requirements.txt. We recommend setting up a new Python environment using virtualenv and installing the dependencies by running

pip install -r requirements.txt
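For example, the full setup might look like this (standard virtualenv usage, assuming a Unix-like shell):

virtualenv politeness-env
source politeness-env/bin/activate
pip install -r requirements.txt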

Additionally, since the code uses nltk.word_tokenize to tokenize text, you will need to download the tokenizers/punkt/english.pickle nltk resource. If you've worked with nltk before, there's a good chance you've already downloaded this model. Otherwise, open the Python interpreter and run:

import nltk
nltk.download()

In the window that opens, navigate to Models and download the Punkt Tokenizer Models.
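Alternatively, the same resource can be fetched non-interactively with the standard nltk call:

import nltk
nltk.download('punkt')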

Sanity Check:

To make sure everything's working, navigate to the code directory and run

python model.py

This should print out the politeness probabilities for 4 test documents.

Contact: Please email any questions to: [email protected] (Cristian Danescu-Niculescu-Mizil) and [email protected] (Moritz Sudhof)

politeness's People

Contributors

cristiandnm, sudhof


politeness's Issues

Dependency parsing by Stanford CoreNLP doesn't exactly match the sample examples

Hi,

I am using Stanford CoreNLP version 1.3.3 for the dependency parsing, as per the documentation. However, when I run the 4 samples presented in test_documents.py, I get slightly different parse trees and therefore slightly different politeness scores. Below is my output file with the associated parse trees.
testoutput.xlsx

I manually checked the output XML files that the Stanford tool gives me, and it is the parse trees (every type of dependency parse) that differ from the sample.

What is the problem here?

If I continue with the parse trees I am generating for my other inputs, will I get drastically different results, or is that okay?

@sudhof @cristiandnm

pip install politeness doesn't work

Tried using requirements.txt too; it still doesn't work. The error output is:

C:\Users\Arjun Sarihyan\Desktop\Project 2020>pip install -r requirements.txt
Collecting scipy==0.12.0
Using cached scipy-0.12.0.tar.gz (9.1 MB)
ERROR: Command errored out with exit status 1:
command: 'c:\program files\python37\python.exe' -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\Users\ARJUNS~1\AppData\Local\Temp\pip-install-q0239xg4\scipy\setup.py'"'"'; __file__='"'"'C:\Users\ARJUNS~1\AppData\Local\Temp\pip-install-q0239xg4\scipy\setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' egg_info --egg-base 'C:\Users\ARJUNS~1\AppData\Local\Temp\pip-install-q0239xg4\scipy\pip-egg-info'
cwd: C:\Users\ARJUNS~1\AppData\Local\Temp\pip-install-q0239xg4\scipy
Complete output (122 lines):
blas_opt_info:
blas_mkl_info:
No module named 'numpy.distutils._msvccompiler' in numpy.distutils; trying from distutils
customize MSVCCompiler
libraries mkl_rt not found in ['c:\\program files\\python37\\lib', 'C:\\', 'c:\\program files\\python37\\libs']
NOT AVAILABLE

blis_info:
  libraries blis not found in ['c:\\program files\\python37\\lib', 'C:\\', 'c:\\program files\\python37\\libs']
  NOT AVAILABLE

openblas_info:
  libraries openblas not found in ['c:\\program files\\python37\\lib', 'C:\\', 'c:\\program files\\python37\\libs']
get_default_fcompiler: matching types: '['gnu', 'intelv', 'absoft', 'compaqv', 'intelev', 'gnu95', 'g95', 'intelvem', 'intelem', 'flang']'
customize GnuFCompiler
Could not locate executable g77
Could not locate executable f77
customize IntelVisualFCompiler
Could not locate executable ifort
Could not locate executable ifl
customize AbsoftFCompiler
Could not locate executable f90
customize CompaqVisualFCompiler
Could not locate executable DF
customize IntelItaniumVisualFCompiler
Could not locate executable efl
customize Gnu95FCompiler
Could not locate executable gfortran
Could not locate executable f95
customize G95FCompiler
Could not locate executable g95
customize IntelEM64VisualFCompiler
customize IntelEM64TFCompiler
Could not locate executable efort
Could not locate executable efc
customize PGroupFlangCompiler
Could not locate executable flang
don't know how to compile Fortran code on platform 'nt'
  NOT AVAILABLE

atlas_3_10_blas_threads_info:
Setting PTATLAS=ATLAS
  libraries tatlas not found in ['c:\\program files\\python37\\lib', 'C:\\', 'c:\\program files\\python37\\libs']
  NOT AVAILABLE

atlas_3_10_blas_info:
  libraries satlas not found in ['c:\\program files\\python37\\lib', 'C:\\', 'c:\\program files\\python37\\libs']
  NOT AVAILABLE

atlas_blas_threads_info:
Setting PTATLAS=ATLAS
  libraries ptf77blas,ptcblas,atlas not found in ['c:\\program files\\python37\\lib', 'C:\\', 'c:\\program files\\python37\\libs']
  NOT AVAILABLE

atlas_blas_info:
  libraries f77blas,cblas,atlas not found in ['c:\\program files\\python37\\lib', 'C:\\', 'c:\\program files\\python37\\libs']
  NOT AVAILABLE

accelerate_info:
  NOT AVAILABLE

blas_info:
  libraries blas not found in ['c:\\program files\\python37\\lib', 'C:\\', 'c:\\program files\\python37\\libs']
  NOT AVAILABLE

blas_src_info:
  NOT AVAILABLE

Running from scipy source directory.
c:\program files\python37\lib\site-packages\numpy\distutils\system_info.py:1896: UserWarning:
    Optimized (vendor) Blas libraries are not found.
    Falls back to netlib Blas library which has worse performance.
    A better performance should be easily gained by switching
    Blas library.
  if self._calc_info(blas):
c:\program files\python37\lib\site-packages\numpy\distutils\system_info.py:1896: UserWarning:
    Blas (http://www.netlib.org/blas/) libraries not found.
    Directories to search for the libraries can be specified in the
    numpy/distutils/site.cfg file (section [blas]) or by setting
    the BLAS environment variable.
  if self._calc_info(blas):
c:\program files\python37\lib\site-packages\numpy\distutils\system_info.py:1896: UserWarning:
    Blas (http://www.netlib.org/blas/) sources not found.
    Directories to search for the sources can be specified in the
    numpy/distutils/site.cfg file (section [blas_src]) or by setting
    the BLAS_SRC environment variable.
  if self._calc_info(blas):
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\Users\ARJUNS~1\AppData\Local\Temp\pip-install-q0239xg4\scipy\setup.py", line 165, in <module>
    setup_package()
  File "C:\Users\ARJUNS~1\AppData\Local\Temp\pip-install-q0239xg4\scipy\setup.py", line 161, in setup_package
    configuration=configuration)
  File "c:\program files\python37\lib\site-packages\numpy\distutils\core.py", line 137, in setup
    config = configuration()
  File "C:\Users\ARJUNS~1\AppData\Local\Temp\pip-install-q0239xg4\scipy\setup.py", line 136, in configuration
    config.add_subpackage('scipy')
  File "c:\program files\python37\lib\site-packages\numpy\distutils\misc_util.py", line 1035, in add_subpackage
    caller_level = 2)
  File "c:\program files\python37\lib\site-packages\numpy\distutils\misc_util.py", line 1004, in get_subpackage
    caller_level = caller_level + 1)
  File "c:\program files\python37\lib\site-packages\numpy\distutils\misc_util.py", line 941, in _get_configuration_from_setup_py
    config = setup_module.configuration(*args)
  File "scipy\setup.py", line 9, in configuration
    config.add_subpackage('integrate')
  File "c:\program files\python37\lib\site-packages\numpy\distutils\misc_util.py", line 1035, in add_subpackage
    caller_level = 2)
  File "c:\program files\python37\lib\site-packages\numpy\distutils\misc_util.py", line 1004, in get_subpackage
    caller_level = caller_level + 1)
  File "c:\program files\python37\lib\site-packages\numpy\distutils\misc_util.py", line 941, in _get_configuration_from_setup_py
    config = setup_module.configuration(*args)
  File "scipy\integrate\setup.py", line 11, in configuration
    blas_opt = get_info('blas_opt',notfound_action=2)
  File "c:\program files\python37\lib\site-packages\numpy\distutils\system_info.py", line 471, in get_info
    return cl().get_info(notfound_action)
  File "c:\program files\python37\lib\site-packages\numpy\distutils\system_info.py", line 734, in get_info
    raise self.notfounderror(self.notfounderror.__doc__)
numpy.distutils.system_info.BlasNotFoundError:
    Blas (http://www.netlib.org/blas/) libraries not found.
    Directories to search for the libraries can be specified in the
    numpy/distutils/site.cfg file (section [blas]) or by setting
    the BLAS environment variable.
----------------------------------------

ERROR: Command errored out with exit status 1: python setup.py egg_info Check the logs for full command output.

And this is after creating a virtual environment
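(A note for readers hitting the same wall: scipy 0.12.0 dates from 2013 and has no binary wheel for Python 3.7, so pip falls back to building it from source, which on Windows requires BLAS libraries and a Fortran compiler; hence the failures in the log above. If you only need the code to run, rather than to reproduce the pinned environment exactly, one hedged workaround is to install a modern wheel instead:

pip install scipy

This deviates from requirements.txt, so treat it as a best-effort fix; results may differ slightly from the paper's environment.)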

Pip install does not work

Hi, did you manage to install the politeness package? From what I have tried, the pip install does not work. I also cannot figure out how to use it from bash once you have linked the URL to the Stanford CoreNLP server.

Incompatibility issue with machine-learning files under Python 3.6

I am running model.py and got the following error. It seems like the script has an incompatibility issue with sklearn under Python 3.6.

Traceback (most recent call last):
  File "model.py", line 111, in <module>
    probs = score(doc)
  File "model.py", line 94, in score
    probs = clf.predict_proba(X)
  File "/home/karenzheng/.local/lib/python3.6/site-packages/sklearn/svm/base.py", line 616, in predict_proba
    self._check_proba()
  File "/home/karenzheng/.local/lib/python3.6/site-packages/sklearn/svm/base.py", line 582, in _check_proba
    if not self.probability:
AttributeError: 'SVC' object has no attribute 'probability'

How can I solve this?
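(A note for anyone debugging the same traceback: this pattern typically means the bundled classifier was pickled with an older scikit-learn, and attributes introduced or renamed since then are missing after unpickling in a newer release. A first hedged check, nothing here is project-specific, is to compare your installed version against the pin in requirements.txt:

import sklearn
print(sklearn.__version__)  # compare against the version pinned in requirements.txt

If they differ, installing the pinned version, or retraining the model with politeness.scripts.train_model under your current scikit-learn, should resolve the mismatch.)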
