Giter VIP home page Giter VIP logo

ghas-results / aml-data Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ibm/aml-data

0.0 0.0 0.0 15 KB

The data represents financial transactions -- bank transfers, purchases, credit card transactions, checks, etc. Most of the transactions are legitimate. A few represent money laundering. The data is in CSV format. The data is generated using a multi-agent virtual world model. All of the agents in the virtual world have actions governed by stat

License: Apache License 2.0

ibm

aml-data's Introduction

AML-Data

NOTE: Although this Github repository is under the Apache-2.0 license, the actual data is released under the CDLA-Sharing-1.0 license.

DATA: https://ibm.box.com/v/AML-Anti-Money-Laundering-Data

PDF DOCUMENTATION

AML = Anti Money Laundering

This AML data is in CSV format and represents financial transactions -- bank transfers, purchases, credit card transactions, checks, etc. Most of the transactions are legitimate. A few represent money laundering. A laundering tag is provided with each transaction. With that laundering tag, AML models can use this data for training and to test their inferences.

The data is generated using a multi-agent virtual world model. All of the agents in the virtual world have actions governed by statistical distributions. Thus the model and data are NOT based on obfuscating or anonymizing real individuals. Everything is synthetic. More specifically the underlying model uses a virtual world of banks, individuals, and companies -- with individuals and companies buying items, and doing bank transfers to make payments, get supplies, pay salaries, etc. The underlying model has good and bad actors, with bad actors doing things like smuggling, extortion, illegal gambling, etc. The bad actors sometimes attempt to launder ill-gotten funds resulting in money-laundering transactions.

RELATED WORK

  • Alternate AML Data and Models: https://github.com/IBM/AMLSim

    • Focus on models
    • Data generation does not use detailed virtual world approach used here
  • Synthetic credit card transaction data: https://github.com/IBM/TabFormer

    • Labeled for fraud / not-fraud
    • Github site contains both data and Transformer-based fraud detection model

aml-data's People

Contributors

ealtman741 avatar stevemar avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.