venmo-data's Introduction

Venmo Transaction Dataset

What is this?

This is a dataset of over 7,000,000 transactions scraped from the Venmo public API. Venmo is an app that lets users send and receive money easily.

This data was collected as part of a data analysis project and was scraped during the following date ranges:

  • July 2018 - September 2018
  • October 2018
  • January 2019 - February 2019

I am releasing this dataset to make Venmo users aware that all of this data is publicly available for anyone to grab, without even an API key. There is some very valuable data here for any attacker conducting OSINT research.

Protect Yourself

I strongly encourage all users to switch their Venmo account to private by going to Settings > Privacy and selecting "Private", and then choosing Past Transactions > Change All to Private. Screenshot instructions are available here.

Dataset Info

The dataset is in BSON format, as it was exported from a MongoDB database. It has been compressed with xz to allow for faster transfers.

  • Compressed size: 1.03 GB
  • Uncompressed size: 10.87 GB
  • Transaction Count: 7,076,585

Each transaction contains a lot of information about the sender and receiver, but does not include dollar amounts. A sample transaction is included as sample.json.

Download Dataset

Use This Dataset

This dataset was exported from a MongoDB instance with the following settings:

  • Database name: test
  • Collection name: venmo

To load it into your own MongoDB instance:

  1. Install MongoDB
    • sudo apt install mongodb
  2. Extract .xz
    • tar xf venmo.tar.xz
    • or with progress (the extracted/ directory must already exist): pv venmo.tar.xz | tar xf - -C extracted/ --xz
  3. Restore dump
    • mongorestore --collection venmo --db test venmo.bson
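
As a quick sanity check before restoring, the documents in the extracted dump can be counted without MongoDB. The following is a minimal sketch, assuming venmo.bson follows the standard layout of concatenated BSON documents, each starting with a little-endian int32 giving its total size and ending with a zero byte. The function names are illustrative and are not part of this repository.

```python
import struct
from typing import BinaryIO, Iterator


def iter_raw_bson_documents(stream: BinaryIO) -> Iterator[bytes]:
    """Yield each BSON document in the stream as raw, undecoded bytes."""
    while True:
        header = stream.read(4)
        if not header:
            return  # clean end of file
        if len(header) < 4:
            raise ValueError("truncated length prefix")
        # The first 4 bytes are the total document size, prefix included.
        (size,) = struct.unpack("<i", header)
        if size < 5:
            raise ValueError(f"invalid document size: {size}")
        body = stream.read(size - 4)
        if len(body) != size - 4 or body[-1] != 0:
            raise ValueError("truncated or malformed document")
        yield header + body


def count_documents(path: str) -> int:
    """Count the documents in a .bson dump without decoding their fields."""
    with open(path, "rb") as handle:
        return sum(1 for _ in iter_raw_bson_documents(handle))
```

Calling count_documents("venmo.bson") on the extracted file should, if the download is intact, match the transaction count listed under Dataset Info. Decoding the fields themselves still needs a BSON library or the mongorestore step above.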

Citation

APA

Salmon, Dan. (2019). Venmo Transaction Dataset [Dataset]. Retrieved from https://github.com/sa7mon/venmo-data

MLA

Venmo Transaction Dataset. Dan Salmon, June 2019, https://github.com/sa7mon/venmo-data. Accessed (date here). Dataset.

Press

venmo-data's Issues

Bson file

Quick question! I'm not too familiar with Mongo and BSON files. I was able to torrent the file and the BSON file is sitting on my desktop. What do I do with this file? I am trying to convert it to a pandas DataFrame.

Data Collection Process

Thanks for sharing this dataset!

I just downloaded the data and am beginning to explore it a bit, but was hoping you could provide a bit of context on what this data represents. Specifically, a few questions I have are:

  • For the given timeframes you were scraping data, how much of the total transaction volume does this represent?
  • What was the frequency / strategy for scraping data?
  • Were there any filters on the data scraped, or does this represent all public transactions during those timeframes?

Basically, I am just trying to understand how this sample is related to the full population, and if I draw insights/conclusions from your data here, how I can extrapolate to make statements about the full population of Venmo transactions.

Thanks again for sharing!

Issue downloading torrent

The torrent does not appear to have any seeders and the HTTPS link isn't working. Is there any other way to download the data?

German press
