Python 3 code for training models in a multilabel environment where classes overlap. Based on code in the fiction repo, but with bug fixes and improvements.

paceofchange

Code and data to support the article, "How quickly do literary standards change?"

pagelevelhmm

Java code I used to train hidden Markov models on top of page-level classification. Weka is a dependency. Needs refactoring.

pages

Java code for mapping genres at the page level in a large collection. Originally based on pagelevelHMM.

pagetagger

Contains Java code for a page-tagging interface.

parallel-lda

Java package that partitions a corpus and runs LDA in parallel on it

period-cohort

Code and data for an experiment on the relation between individual change and cohort succession in literary history.

plot

Initial exploratory research on patterns of change across narrative time.

pmla-scripts

Data for 1924-2006 pmla model, plus scripts to turn into Gephi network.

post2015

pythonscripts

Python scripts used to wrangle collection from Hathi, mostly on a cluster.

reviews

Parsing periodical indexes and finding book reviews, 1800-2007.

riseandfall

Code and data supporting The Rise and Fall of Genre Differentiation in English-Language Fiction.

roles

Code for a topic modeling variant that allows for character level 'roles' as well as book-level 'themes.'

time

Further research on narrative pace.

tokenize

folder storing current rulesets, scripts, and metadata for tokenizing / collection building

tokenizer

Python scripts for tokenizing text files

tedunderwood Goto Github PK

Ted Underwood's Projects

Recommend Projects

Recommend Topics

Recommend Org