Sahi Hai - A Chrome Extension to detect Malicious Websites

Introduction

Sahi Hai was built with the regular internet user in mind: someone who visits many websites in a day and may fall into the trap of a malicious site that wants to steal their information or plant malware on their system. Our Chrome extension lets the user check whether a particular website is safe to browse, using a pretrained ML model.

How it Works

The ML model extracts the following features from a URL:

Features Used

  • Having IP address
  • URL Length
  • URL Shortening service
  • Having @ symbol
  • Having double slash
  • Having dash symbol (Prefix Suffix)
  • Having multiple subdomains
  • SSL Final State
  • URL of Anchor
  • Links in tags
  • SFH - Server Form Handler
  • Submitting to email
  • Abnormal URL
  • IFrame
  • Age of Domain
  • DNS Record
  • Web Traffic - using data.alexa.com
  • Google Index
  • Statistical Reports
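
Several of the URL-based features above can be computed from the URL string alone. The following is a minimal sketch, not the project's actual `features_extraction.py`: the function names, the 1/0/-1 encoding, the length thresholds, and the short list of shortening services are all assumptions made for illustration.

```python
import ipaddress
from urllib.parse import urlparse

# Assumed encoding: 1 = looks legitimate, 0 = suspicious, -1 = looks malicious.
LEGIT, SUSPICIOUS, MALICIOUS = 1, 0, -1

# Hypothetical, deliberately short list of URL shortening services.
SHORTENERS = {"bit.ly", "goo.gl", "tinyurl.com", "t.co", "ow.ly"}


def _host(url: str) -> str:
    """Return the lowercased host part of a URL, or an empty string."""
    return (urlparse(url).hostname or "").lower()


def having_ip_address(url: str) -> int:
    """Flag URLs whose host is a raw IP address instead of a domain name."""
    try:
        ipaddress.ip_address(_host(url))
        return MALICIOUS
    except ValueError:
        return LEGIT


def url_length(url: str, short: int = 54, long: int = 75) -> int:
    """Bucket the URL by length; the thresholds are illustrative assumptions."""
    if len(url) < short:
        return LEGIT
    return SUSPICIOUS if len(url) <= long else MALICIOUS


def shortening_service(url: str) -> int:
    """Flag hosts that belong to a known shortening service."""
    return MALICIOUS if _host(url) in SHORTENERS else LEGIT


def having_at_symbol(url: str) -> int:
    """Flag any '@' in the URL."""
    return MALICIOUS if "@" in url else LEGIT


def having_double_slash(url: str) -> int:
    """Flag a '//' that appears after the scheme separator."""
    return MALICIOUS if "//" in url.split("://", 1)[-1] else LEGIT


def having_dash(url: str) -> int:
    """Flag a '-' in the host name."""
    return MALICIOUS if "-" in _host(url) else LEGIT


def multiple_subdomains(url: str) -> int:
    """Count dots in the host after dropping a leading 'www.'."""
    host = _host(url)
    if host.startswith("www."):
        host = host[4:]
    dots = host.count(".")
    if dots <= 1:
        return LEGIT
    return SUSPICIOUS if dots == 2 else MALICIOUS


def extract_url_features(url: str) -> list[int]:
    """Return the seven URL-only features in a fixed order."""
    checks = (having_ip_address, url_length, shortening_service,
              having_at_symbol, having_double_slash, having_dash,
              multiple_subdomains)
    return [check(url) for check in checks]
```

The remaining features (SSL state, domain age, DNS record, web traffic, and so on) need network lookups or the page's HTML, so they are left out of this sketch.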

We iterated multiple times during the training phase.


What Problem it Solves

Many websites today try to collect their users' data, tricking them into giving away credentials that are then used for fraud and other malicious acts. Inexperienced users have no view of what a page does behind the scenes, so they can be tricked into handing over their credentials or downloading malicious files.

We have created a Chrome extension that acts as middleware between users and malicious websites, so that users do not give their information away to such sites. It lets the user check whether a particular website is safe to browse.

Tech Stack

  • HTML - The markup language used for the extension's popup.

  • CSS - The stylesheet language used to style the extension's popup.

  • Python - The programming language used to parse features from a website and to train and test the ML model.

  • JavaScript - The scripting language used for the extension's logic and for sending requests to the served ML model.

  • PHP - The scripting language used for serving the ML model.

  • Beautiful Soup - The library used to scrape a website's content from its URL.

  • Googlesearch - The library used for performing Google searches during feature extraction.

  • whois - The package used for retrieving WHOIS information of domains during feature extraction.

  • scikit-learn - The library used for training the ML models.
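
To illustrate the kind of page-level parsing that Beautiful Soup is used for, here is a rough sketch of two of the HTML-based features (IFrame and URL of Anchor). It uses Python's standard-library `html.parser` as a stand-in for Beautiful Soup so that it runs without extra dependencies; the class and function names are hypothetical, and the project's own extraction code may differ.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class PageScanner(HTMLParser):
    """Collects the iframe count and anchor targets from raw HTML."""

    def __init__(self) -> None:
        super().__init__()
        self.iframes = 0
        self.anchors: list[str] = []

    def handle_starttag(self, tag, attrs):
        # html.parser passes tag names in lowercase.
        if tag == "iframe":
            self.iframes += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href is not None:
                self.anchors.append(href)


def scan_page(html: str, page_url: str) -> dict:
    """Return the iframe count and the share of anchors pointing off-site."""
    page_host = urlparse(page_url).hostname
    scanner = PageScanner()
    scanner.feed(html)

    external = 0
    for href in scanner.anchors:
        host = urlparse(href).hostname  # None for relative links
        if host is not None and host != page_host:
            external += 1

    total = len(scanner.anchors)
    ratio = external / total if total else 0.0
    return {"iframes": scanner.iframes, "external_anchor_ratio": ratio}
```

A classifier would then map these raw numbers to feature values, for example by treating a high share of external anchors as suspicious; where to put that cut-off is a modelling choice not covered here.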


Usage

Directory Structure

.
|-- LICENSE
|-- README.md
|-- extension
|   |-- icon.png
|   |-- manifest.json
|   |-- popup.html
|   |-- popup.js
|   `-- style.css
|-- images
|   `-- working.gif
|-- models
|   |-- mlp_model.pkl
|   `-- random_forest.pkl
|-- requirements.txt
|-- run.sh
|-- test
|   |-- __pycache__
|   |   |-- features_extraction.cpython-39.pyc
|   |   `-- patterns.cpython-39.pyc
|   |-- features_extraction.py
|   |-- features_extraction.pyc
|   |-- index.php
|   |-- markup.txt
|   |-- patterns.py
|   |-- patterns.pyc
|   `-- test.py
`-- train
    |-- data
    |   `-- web_data.arff
    |-- train_mlp.py
    `-- train_rf.py

Backend - ML Model

  1. Clone the repo.

  2. Open a terminal in the repository root and run:

    pip install -r requirements.txt 
    ./run.sh
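
Once the backend is running, a request to the served model might look like the sketch below. The endpoint address and the `url` form field are assumptions (check `run.sh` and `test/index.php` for the real values), and the function names are hypothetical.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: the backend serves test/index.php at this address.
ENDPOINT = "http://localhost:8000/index.php"


def build_check_request(site_url: str, endpoint: str = ENDPOINT) -> Request:
    """Build a form-encoded POST carrying the URL to classify."""
    # "url" is a hypothetical field name, not confirmed by the project.
    body = urlencode({"url": site_url}).encode("utf-8")
    return Request(
        endpoint,
        data=body,
        method="POST",
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )


def check_site(site_url: str, timeout: float = 10.0) -> str:
    """Send the request and return the raw response text."""
    with urlopen(build_check_request(site_url), timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")
```

Calling `check_site("http://example.com/")` only works while the backend is up; `build_check_request` can be inspected on its own without any network access.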
    

Extension

  1. Go to Chrome Settings using the three dots in the top-right corner.

  2. Select Extensions.

  3. Enable Developer mode.

  4. Click Load unpacked and select the extension folder.


Acknowledgments

Heartfelt thanks to the authors and owners of the articles that propelled us to make Sahi Hai.

We are also very grateful to the whole team of "HackNITR 2021" for providing us with the perfect platform to showcase our idea.

Contributors

raj-sahu, arabhiar, rishav139, prakharvermavr, sharmanaman00

sahi_hai's Issues

DOCKERIZE

Dockerize to promote better portability.

To Do

  • Train the ML model

  • Build the extension

  • Serve the ML model

  • Integrate the extension with the backend

  • Run scripts
