Giter VIP home page Giter VIP logo

ml-for-bot-detection's Introduction

Machine Learning for Web Robot Detection

Abstract

As more and more businesses choose to offer their services on the Internet, as compared with the traditional way of operating a retail storefront, there has been a proportionate surge in antagonistic tactics which also make use of the modern Web, though operating at the edge of legality. More specifically, in order to gain a competitive advantage, business owners sometimes settle on using automated tools to sabotage or exploit the resources of another entity in the same market. Within the field of electronic commerce, this is often manifested in the form of web robots that can either do as little as cloning certain pieces of information that could benefit the operating party, or even perform a series of attacks to their competitor's infrastructure, rendering them unable to complete any transactions with their customers and thus losing out on sales.

In order to combat this issue effectively, a solution that is rising in popularity nowadays is the use of machine learning to detect underlying patterns that are able to discern between human and robot behavior. As this aspect has been studied extensively in the past, in this study we will be attempting to further develop this analytical technique by examining whether we can extract new features from the clients' access logs that are specifically related to e-commerce websites. When measuring the performance of this approach on a publicly available dataset, by training both a simple decision tree classifier and a more complex deep neural network, we were able to achieve results with an F1-score of over 94% and 98% respectively.

Dependencies

In order to execute the code in this repository you will need the following third-party libraries for Python:

Dataset

The dataset used is the following:

Zaker, Farzin, 2019, "Online Shopping Store - Web Server Logs", https://doi.org/10.7910/DVN/3QBYB5, Harvard Dataverse, V1

ml-for-bot-detection's People

Contributors

din0s avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.