ericleasemorgan
Name: Eric Lease Morgan
Type: User
Bio: I'm employed as a librarian, but I've been consistently writing software since 1976.
Given a HathiTrust collection file, do analysis against the full text
A "suite" of scripts used to query bibliographic databases and cache the results for various reporting purposes
Given a JSON file describing the bibliographic data of arXiv, create a searchable index of the same
Given an arXiv API query, output URLs pointing to PDF files (and optionally harvest them)
Given an XML file of citations exported from bioRxiv, create a simple TSV file of metadata and locally cache the associated preprints
Create a data set of Brazilian candidates and do text mining against it
Bringing algorithms and machine learning into library collections & services
Given the short name of a Distant Reader study carrel, create a network graph and visualize it
Given a set of MARC records, create an "enhanced" set of indexes and online catalog
This repository contains a summary of the Catholic Youth Literature Project as well as some sample code.
Given an XML file describing a set of churches, create a database of the same
Given a set of PDF files, create a classification model for determining their "aboutness" regarding C. neoformans
Jekyll static site for Code4Lib.org
Given a Constellate data set identifier, do stuff with the data set's content
Transform JSTOR Constellate JSONL to a zip file suitable for Distant Reader input
Submit the CORD-19 dataset to the Distant Reader
Given a set of documents, evaluate to what degree the concept of discernment evolves
A suite of software tools designed to enable "distant reading" processes against the corpus of Early English Books Online (EEBO)
A set of Perl scripts and helper/sample files used to create ePub files from TEI files from the Alex Catalogue Of Electronic Texts
Given a directory, output a CSV file suitable for the Reader
Cache theses & dissertations from an institutional repository and convert them to plain text for analysis
Given a HathiTrust identifier, output plain text as well as PDF versions of a book in the public domain from the HathiTrust.
Given a JSON file from the HathiTrust Research Center, output "human-readable" plain text files akin to books.
A (tiny) Perl library of three subroutines and example scripts to be used against the HathiTrust Research Center
A set of tools used to do "distant reading" against sets of public domain content found in the HathiTrust digital library
A small suite of utilities to find, count, and tabulate features of a corpus
Given the root URL of ITAL's OAI data repository, create a full text index of the journal
Given a CSV file of patent titles, ultimately compute a uniqueness score for each patent
Given a Data For Research citations.xml file from JSTOR, this suite of software will cache content locally, index it, do some analysis against it, and create a few visualizations. It is meant to support "distant reading" against sets of scholarly journal articles.
Library Carpentry Workshop Overview