Giter VIP home page Giter VIP logo

boe_tabulator's Introduction

BOE_tabulator

Reads PDFs from Baltimore's archive of minutes from the Board of Estimates and places the data in a searchable table.

Table of Contents

Overview

This is where the project overview will go

Getting Started

Follow the steps below to recreate the development environment necessary to start contributing to this project

Prerequisites

  • Python version 3.6 or later

In order to check which version of python you have installed, run the following command in your command line (for Mac/Linux)

NOTE: in all of the code blocks below, lines preceded with $ indicate commands you should enter in your command line (excluding the $ itself), while lines preceded with > indicate the expected output from the previous command.

$ python --version

The output should look something like this:

> Python 3.7.7

If you don't have Python version 3.6 or later installed on your computer. Consider using pyenv to install and manage multiple versions of Python concurrently.

Installation

  1. Fork the repo -- for more information about forking, reference this guide
  2. Clone your forked repo on your local machine:
    $ git clone https://github.com/YOUR_USERNAME/BOE_tabulator.git
    
  3. Move into the directory created when you cloned the repo:
    $ cd BOE_tabulator/
    
  4. Configure the upstream repository for your local fork:
    $ git remote add upstream https://github.com/department-of-general-services/BOE_tabulator.git
    
  5. Confirm that the upstream repo has been set correctly
    $ git remote -v
    > origin    https://github.com/YOUR_USERNAME/BOE_tabulator.git (fetch)
    > origin    https://github.com/YOUR_USERNAME/BOE_tabulator.git (push)
    > upstream  https://github.com/department-of-general-services/BOE_tabulator.git (fetch)
    > upstream  https://github.com/department-of-general-services/BOE_tabulator.git (push)
    
  6. Create a new virtual environment in your local directory
    $ python -m venv env
    
  7. Activate your virtual environment
    $ source env/bin/activate
    
  8. Install necessary python packages
    $ pip install -r requirements.txt
    

Fetching the Data

  1. Open up jupyter notebooks
    $ jupyter notebook
    
  2. Open the tabulator.ipynb from the directory in the browser
  3. Run each cell of the notebook

NOTE: Running the notebook may take 10-20 minutes the first time as you download all of the pdf files from the Comptroller webpage

Usage

This is where we will specify how to use the tool

Contributing

James will add details around the workflow for contributing

boe_tabulator's People

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.