pdap-scrapers's Introduction

Welcome!

This is the GitHub home for the Police Data Accessibility Project. We're assembling a toolkit and space for shared resources. People all over the country use these resources to collect public records about the U.S. criminal legal system.

This repository is also a guide to the countless ways we use scraper code to access data. (What do we mean by web scraper?)

Note: This repo is a work in progress, especially the structure of its utilities. Don't assume everything you read here is up to date!

How to run a scraper

Right now, this requires some Python knowledge and patience. We're in the early stages: there's no automated scraper farm or fancy GUI yet. Scrapers can be run locally as needed.

  1. Install Python. Prefer a differently opinionated guide? Perhaps this is more your speed.
  2. Clone this repo.
  3. Find the scraper you wish to run. These are sorted geographically, so start by looking in /USA/....
  4. Follow the instructions in the scraper's README to get going. (If it's broken or simply out of date, please open an issue in this repo or submit a PR.)
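
Once the repo is cloned, step 3 can be sketched in Python. This is a minimal sketch, not part of the repo: the helper name `find_scraper_dirs` is hypothetical, and it assumes that every scraper lives in its own folder under `USA` with a file whose name starts with "README". Check it against the actual layout of your clone.

```python
from pathlib import Path


def find_scraper_dirs(repo_root, region="USA"):
    """List folders under repo_root/region that contain a README file.

    Hypothetical helper: assumes each scraper folder ships its own README.
    """
    base = Path(repo_root) / region
    if not base.is_dir():
        # No such region folder in this clone; nothing to list.
        return []
    found = set()
    for path in base.rglob("*"):
        # Match README, README.md, readme.txt, and similar names.
        if path.is_file() and path.name.lower().startswith("readme"):
            found.add(path.parent)
    return sorted(found)
```

Calling `find_scraper_dirs(".")` from the root of a local clone returns the candidate folders as `Path` objects. Open the README in the one you want and follow its instructions.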

Sharing back to the PDAP community

If you do something cool or interesting or fun with your shiny new data, share that in our Discord. Want to kick around an idea or share something that doesn't work as expected? Discord's a great place for that, too.

How to contribute

To write a scraper, start with CONTRIBUTING.md. Be sure to check out the /common folder!

For everything else, start with docs.pdap.io.

Resources

Here are some potentially useful tools. If you want to make additions or updates, you can edit the docs in GitHub!
