Dr. Joe Sutherland's Projects
A collection of code snippets that can be constructed into larger trading algorithms.
A topic-centric list of HQ open datasets in public domains. PR ☛☛☛
Quickstart kit for new Columbia Polisci PhD students.
LaTeX Template for Columbia University Dissertations
Datasets for conversational AI
Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.
Python utility that canonicalizes various financial data file formats to OFX 2, which is an XML format and hence a lot easier for other code to deal with. It recognizes OFX 1.x, OFX 2.x, QFX, QIF, and OFC.
Notes taken from Google Machine Learning Course provided to public for practice & correction.
Scraping and parsing tools for the GPO's congressional hearings dataset.
Grunt task for running an Express Server that works great with LiveReload + Watch/Regarde
Some of my solutions to HackerRank problems
Various Emacs formulae for the Homebrew package manager
Machine Learning Toolkit for Kubernetes
Coding space for the LegisLetters project.
Odds and ends. Files that wouldn't fit anywhere else.
AngularJS library for the Twitter REST Api
A nodejs module for interacting with the Quandl API.
Object oriented framework for multiple emacs modes based on indirect buffers
:exclamation: This is a read-only mirror of the CRAN R package repository. stargazer — Well-Formatted Regression and Summary Statistics Tables
Code and Data for TAD 3001-004 at NYU
extract text from any document. no muss. no fuss.
Save 1% of the Twitter firehose (the amount that's publicly available).
Code for a dynamic multilevel Bayesian model to predict US presidential elections. Written in R and Stan.
Read SAS datasets remotely (from wrds-cloud) into a Pandas dataframe.
Zipline, a Pythonic Algorithmic Trading Library