Giter VIP home page Giter VIP logo

data's Introduction

Data sets

Data sets created for stories on The Pudding, open to the public.

Pudding Story Title Story Publish Date Data Update Frequency Keywords Data
What Does a Happily Ever After Look Like? October 2023 Never music, country, radio data
They Won't Play a Lady-O on Country Radio May 2023 Never music, country, radio repo
The Greatest Unexpected NBA Performances February 2023 Never basketball, NBA repo
We think this cool study we found is flawed. Help us reproduce it. April 2022 Daily replication, randomness json
Who's in Your Wallet? April 2022 Never banknotes, currency repo
When Women Make Headlines February 2022 Never headlines, women repo
"I Kissed a Girl" to "Call Me By Your Name" June 2021 Never music, lyrics, queer, gender data
The Naked Truth March 2021 Never beauty, makeup, diversity, US repo
Winning the Internet July 2020 Sometimes newsletter repo
Police Misconduct October 2020 Never social data
Whole Foods x 15 Percent Pledge October 2020 Never retail data
How candidate diversity impacts color diversity August 2020 Never politics, design, color repo
90s Song Memory July 2020 Never music repo
The Infinite Monkey Theorem Experiment Apr 2020 Daily piano, probability json
Just how does Kidz Bop censor songs? Apr 2020 Never Kidz Bop, music, censorship repo
The Evolution of the American Census Mar 2020 Never US, census repo
Where will you need your umbrella? Feb 2020 Never weather, precipitation repo
Laughing OnLine Oct 2019 Never laughter, Reddit repo
Finding Forever Homes Oct 2019 Never dogs, PetFinder, US repo
Vocal Register and Singing Voices Aug 2019 Never music repo
Book Covers July 2019 Never books repo
Hipster Summer Reading List June 2019 Never books, library, Seattle repo
Best Year in Music June 2019 Sometimes music repo
A People Map of the UK June 2019 Never wiki, names, UK repo
A People Map of the US May 2019 Never wiki, names, US repo
Sing My Name May 2019 Never music, names, US repo
The Rise of Hyphenated Last Names in Pro Sports May 2019 Never sports, culture, names, MLB, NBA, NFL, NHL, NWSL, WNBA repo
The NBA Has A Defensive Three Seconds Problem May 2019 Never basketball, NBA repo
Colorism in High Fashion April 2019 Never fashion, diversity, US repo
EU Regions April 2019 Daily european union, eu data and analysis
NBA Spell Jam March 2019 Daily spelling, names, sankey, NBA data
Who is the Biggest Pop Star? March 2019 Never music repo
How Many High School Stars Make It to the NBA? March 2019 Never US, basketball, NBA repo
The Gyllenhaal Experiment February 2019 Never spelling, names, sankey data
The Sexualized Messages Dress Codes are Sending to Students February 2019 Never US, dress code, high school repo
The Largest Vocabulary in Hip Hop January 2019 Sometimes music data
Internet Boy Band Database November 2018 Never music, boybands, dance, US repo
Thirty Years of American Anxieties November 2018 Never advice, anxiety repo
The Winningest Cities in North American Sports November 2018 Never sports, championships, rankings, basketball, football, hockey, baseball repo
Human Terrain October 2018 Never population how-to
What Does the Path To Fame Look Like? October 2018 Never celebrities, culture repo
The Celebrity Billboard Project September 2018 Daily celebrities, culture repo
Film vs. Digital August 2018 Never film, movies repo
Women's Pockets are Inferior August 2018 Never pockets, fashion, equality, women repo
Life After Death on Wikipedia August 2018 Never Wikipedia, pageviews, death repo
What Airport Traffic Tells Us About the World's Megacities July 2018 Never cities, population, airports repo
Let's Talk About Birth Control July 2018 Never contraception, US, birth control, health, survey, CDC repo
Men are from Chelsea, Women are from Park Slope June 2018 Never gayborhoods, gay, lesbian, queer, LGBTQ, neighborhoods, pride, gender repo
The Diversity of Makeup Shades June 2018 Never beauty, makeup, Fenty, diversity, US, global repo
The Good, the Rad, and the Gnarly June 2018 Never skateboard, music, genre, gnarly repo
Baking the Most Average Chocolate Chip Cookie May 2018 Never baking, cookies, machine learning, NLP repo
One-Hit Wonders in Sports April 2018 Never sports, ranking, basketball, golf, tennis, baseball, hockey repo
The Birthday Paradox Experiment April 2018 Daily math, paradox, birthday json
A Tale of Two Cities March 2018 Never neighborhoods, business, Yelp repo
The Structure of Stand-Up Comedy February 2018 Never stand-up, comedy, Ali Wong, US repo
Greetings from Mars February 2018 Daily mars, weather, Curiosity Rover, global repo
How far is too far? An analysis of driving times to abortion clinics in the US. September 2017 Never abortion, clinics, duration, access, US repo
Free Willy & Flipper by the Numbers July 2017 Never whales, dolphins, captivity, US repo
The Timing of Baby Making May 2017 Never births, babies, county, US repo
NBA Last Two Minute Report February 2017 Daily nba, basketball, referee, US repo

data's People

Contributors

anbnyc avatar iblind avatar jadiehm avatar jeffmacinnes avatar kevinahuber avatar matthewfdaniels avatar mmcghee18 avatar proquestionasker avatar russellsamora avatar sahitisarva avatar svickars avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

data's Issues

Issues with the "incidence rate" in the "neighborhoods" analysis

@ProQuestionAsker @iblind

Inspired by your work I'm in the process of analyzing Yelp data for Berlin's neighborhoods, but I'm a little confused by the "incidence rate" you use. It's obviously the same that Katie Hempenius used. Do you happen to know where it comes from, or if she came up with it herself? The only incidence rate I know of comes from epidemiology and includes a component of time, which Katie's formula definitely does not. I've checked out some business resources and they all use the standard epidemiological definition.

Also, do you do any kind of within-category, between-district comparison in your analysis, i.e. do you rank the categories within each district and the districts within each category? As far as I can tell you don't. I ask because the second half of the formula, the part that normalizes the data, only effects the within-category, between-district rankings, but not the within-district, between-category rankings, since it is simply a constant at the within-district level. If you don't compare between districts then the normalization step is superfluous.

Hopefully you can tell me if I've maybe missed something. I plan to post my analysis on my blog in the next week or so. It's likely very few will read it, but I do plan to take up David Robinson on his offer, so if I'm lucky some people will read it. I respect your work and don't want it to seem like I've launched a surprise attack, ergo this "issue".

I have a few other objections to Katie's "incidence rate" beyond the above details. You are welcome to read and comment on them here. You'll want the "Deciding on a Metric" section that starts at line 359. It's still very much a rough draft, so some things will change, but my arguments in this section are fleshed out enough to understand what I'm aiming for.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.