Giter VIP home page Giter VIP logo

insurance_fraud_detection's Introduction

forthebadge forthebadge forthebadge

The project is about creating a model which predicts whether presnt insurance is fraud or not based on the various data provided by them in the input form

Table of contents



Usage

(Back to top)

Man pages have been added. Checkout man colorls.

Installation

(Back to top)

  1. Install git (preferably, version >= 2.0) and python (preferably, version >=3.6) (windows) For Linux :
   sudo yum instal git -y
   sudo yum install python -y
  1. Copy the github url from the repository :
https://github.com/SiddharthaShandilya/Dementia_detection_using_AI.git
  1. Select a Directory in local system and use
git clone https://github.com/SiddharthaShandilya/Dementia_detection_using_AI.git          
*Note for `git clone command`  Please make sure that you have proper internet connection. *

*Note for `python` Please try to anaconda for running the app.*  
  1. Create a seperate virtual environment to avoid conflict between python libraries :
    python3 -m venv new-env 

In case for anaconda we can use below commands

    conda create --prefix ./env
    conda activate ./env
  1. In case tou want to Activate the virtual env follow the given instructions: ๐Ÿ‘‰ (click Here)

  2. Install all the libraries for the application.

pip3 install -r requirements.txt
  1. Once the environment is created use following commands to start
git init
dvc init
dvc dag
dvc repro
  1. Have a look at Recommended configurations and Custom configurations.



Custom configurations

(Back to top)

  1. In the project we are running flask application by using python3 which might not work so try below mentioned commands:
    python/python3 app.py
    or
    flask run 



Recommended configurations

(Back to top)

You can overwrite the existing code according to your needs and changing them.

  • Note :
  1. Please have a look at the dvc.yaml file , Here i have used python3.7 version so if your console takes python3 to run python make sure to change all the commands in dvc.yaml file.

  1. If any change regarding the file are concerned you are advised to change the config file in location ' /config/config.yaml '. For eg.. the location of dataset in thsi code is of a lcoal storage ../dementia_dataset, this might not be in you case so change the code accordingly.

Updating

(Back to top)

Want to update to the latest version of dementia_detection? make the required change and give us a pull request

git push https://github.com/SiddharthaShandilya/Dementia_detection_using_AI.git



Output

(Back to top)

For Downloading Docker Image

docker pull centos104/dementia_detection_web_app

Below is some screenshot of the web application when it is successfully launched.



Uninstallation

(Back to top)

Want to uninstall ? No issues (sob). Please feel free to open an issue regarding how we can enhance dementia_detection app.

ctrl + A, ctrl + shift + delete



Contributing

(Back to top)

Your contributions are always welcome! Please have a look at the contribution guidelines first. ๐ŸŽ‰


Future Scope

(Back to top)

we are currently working on the UI part as well as trying to make section in the web-app to upload a MRI scan to locate demtia. Feel free to improve the code or share some innovative ideas.

About Author

(Back to top)

Siddhartha Shandilya

insurance_fraud_detection's People

Contributors

siddharthashandilya avatar

Stargazers

 avatar

Watchers

 avatar

insurance_fraud_detection's Issues

[ID-14] perform EDA on the ingested data and clean the data for model training

once we are done with creating a data ingestion pipeline , we'll move towards creating a Data processing pipeline which will clean the ingested data and prepare it for model training

  • Data cleaning: Remove or impute missing values, handle duplicates and outliers, and address any other data quality issues.

  • Data transformation: Apply necessary data transformations like normalization, standardization, or encoding categorical variables.

  • Univariate analysis: Analyze the distribution and characteristics of individual variables using visualizations like histograms, density plots, box plots, and summary statistics.

  • Bivariate analysis: Examine the relationship between two variables using scatter plots, correlation coefficients, and other statistical tests.

  • Multivariate analysis: Analyze the relationship between three or more variables using techniques like heatmaps, cluster analysis, and principal component analysis (PCA).

  • Feature engineering: Create new features by combining or transforming existing ones based on insights gained from the earlier analysis.

column name validation .

Provide a function "column_name_validation" which will match the columns present in the input files with the provided schema.json file .
Put the function under src/data_processing/data_validation.py inside Raw_Data_Validation class

[ID-6] validate missing values for whole column

This function validates if any column in the csv file has all values missing. If all the values are missing, the file is not suitable for processing. Such files are moved to bad raw data.

[ID-15] Add readme file to the artifacts/data

Add readme file to the artifacts/data folder mentioning the link to raw data

  • create a README.md file inside the data directory
  • Update the readme file with the link to download the raw data

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.