Giter VIP home page Giter VIP logo

ecs171_group-project's Introduction

ECS 171 Final Project

What is this project about?

This project is about solving the binary classification problem (fraud detection) based on the dataset retrieved from https://www.kaggle.com/mlg-ulb/creditcardfraud. Please check the paper for more details.

Authors

Kirby Zhou, Wenhao Su, James Lemkin, Zekun Chen, Zhujun Fang,Yuqi Sha,

Rongfei Li, Hulin Wang, Damu Gao, Bo Xiao, Haoyang Li, Yizhi Huang

How to run the scripts

The file folder libs consists of multiple utility functions to draw the ROC and PR curves. Before running the script, it is extremely important to download corresponding processed Data folder AND dataset from Kaggle website to current workspace.

Another repository is merged into current repository: https://github.com/cosmobiosis/fraud_detection

Dataset Folder download link: https://github.com/DeviRule/Ecs171_group-project/tree/master/Data

Kaggle Dataset download link: https://www.kaggle.com/mlg-ulb/creditcardfraud

The output will be automatically saved to current folder or printed out during runtime.

For decision tree script, please extract code to parent directory.

Outline

The project consists of file scripts, together with utility functions inside the file folder Utility.

decision_tree_grid_search.py: decision tree

distribution.py: preprocessing distribution

preprocessing_comparision.ipynb: SMOTE performance

Adaboost_main.py: Adaboost

SNE_KNN.ipynb: TSNE and KNN

ANN_kfold.ipynb: Feed-Forward Neural Network

RF_kfold_under.ipynb: Random Forest ROC graph

RF_final_under.ipynb: Random Forest Pr Curve, Classification Report

logistic_regression_parameter_sweep_undersample.ipynb: Logistic Regression Parameter Tuning

logistic_regression_kfold.py: Logistic Regression Best Model Performance

Dependencies

  • Programming Tools: Python3 and Jupyter Notebook
  • Libraries: imblearn, sklearn, Keras, matplotlib, graphviz and pandas

Paper and Code connection

In electronic pdf paper file, every link is connected to the corresponding script file which can retrieve and reproduce the results from the paper.

Support

For any questions contact [email protected] (email expired in 2021).

Licence

The project is using MIT license.

Acknowledgement

This work was supported by course ECS 171 of UC Davis. The dataset has been collected and analysed during a research collaboration of Worldline and the Machine Learning Group (http://mlg.ulb.ac.be) of ULB (Université Libre de Bruxelles) on big data mining and fraud detection. More details on current and past projects on related topics are available on https://www.researchgate.net/project/Fraud-detection-5 and the page of the DefeatFraud project

ecs171_group-project's People

Contributors

cosmobiosis avatar devirule avatar floraqa avatar haicatw avatar like241 avatar qianlan97 avatar raycloud51 avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Forkers

haicatw

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.