Giter VIP home page Giter VIP logo

Nitinguptadu

Breast cancer detection

Problem 1 )Data Consists of Outlier in several Columns Problem 2) Data consists of Left sweeknews in several Columns Problem 3) Data consists of Zero or Missing values in several Columns Problem 4) Data is unbalanced in terms of Cancer(“0”) counts 234 and Not Cancer (“1”) counts 42 [ Ratio 5:1]

Machine Learning Prediction Results

I have applied 6 different Machine learning model with Unbalanced and Balanced data

Machine learning models

RandomForest Naive Bayes SVM KNN Logistic Regression Xgboost

Data Balance Techniques

Downsampling Techniques Near Miss

Upsampling Techniques 1) SMOTETomek 2) RandomOverSampler

In last slide I have shown the result of all machine learning models with unbalanced and balanced data

Red colour represent Highest accuracy on the basis of F1 score with existing models and techniques

Blue colour represent Second Highest accuracy on the basis of F1 score with existing models and techniques

Nitin's Projects

ankur-machine-learning- icon ankur-machine-learning-

Client: M17(https://m17.asia/en/product/17media/) a streaming platform. DataLink: ​ Client_data.zip The attached data here is generated from the live-streaming platform. Analyze the data to come up with the top 20% streamers. Using these top 20% streamers as good streamers create a classification model which can classify whether any streamer is good streamer or not. Evaluation Metric​ : F1-Score Classification Report (Using sklearn.metrics.classification_report) is also required

build-a-restful-service-that-extracts-expense-date-from-a-receipt. icon build-a-restful-service-that-extracts-expense-date-from-a-receipt.

Build a RESTful service that extracts expense date from a receipt.Deploy the service on any cloud platform like Heroku/AWS/GCP. The service should contain one API which has the following contract: Request: POST /extract_date Payload: {“base_64_image_content”: <base_64_image_content>} Response: If date is present: {“date”: “YYYY-MM-DD”} If date is not present: {“date”: null}

comaptaring-two-columns- icon comaptaring-two-columns-

This repo is for self learning purpose for comparing two numerical columns and creating a new columns for saving higher have from two colomns

dentisty.ai icon dentisty.ai

Please use the following assignment for Data Scientist position: The deadline of Assignment is 3 days. 1. Download the data-set from : https://warwick.ac.uk/fac/sci/dcs/research/tia/glascontest/download/ 2. Augment the data-set to make it efficient against shift, light variation and other noise. 3. Train a transfer learning based classifier to classify images according to benign/malignant given in the Excel file. 4. Train a transfer learning based Semantic segmentation according to the annotation in _anno.BMP file. 5. Upload the assignment on your Github in form of a IPYNB showing results on a few test set images.

docker-nlm- icon docker-nlm-

This is my first docker image this repo is for personal learning purpose

fast.ai icon fast.ai

Multi label image classification problem

faster-r-cnn-for-open-images-dataset-by-keras icon faster-r-cnn-for-open-images-dataset-by-keras

Introduction The original code of Keras version of Faster R-CNN I used was written by yhenon (resource link: GitHub .) He used the PASCAL VOC 2007, 2012, and MS COCO datasets. For me, I just extracted three classes, “Person”, “Car” and “Mobile phone”, from Google’s Open Images Dataset V4. I applied configs different from his work to fit my dataset and I removed unuseful code. Btw, to run this on Google Colab (for free GPU computing up to 12hrs), I compressed all the code into three .ipynb notebooks. Sorry for the messy structure. I wrote my exploring and experiment results for Faster R-CNN in this article in Medium. If you are in China, you cannot directly access Medium. So I make a copy in here. Project Structure Object_Detection_DataPreprocessing.ipynb is the file to extract subdata from Open Images Dataset V4 which includes downloading the images and creating the annotation files for our training. I run this part by my own computer because of no need for GPU computation. frcnn_train_vgg.ipynb is the file to train the model. The configuration and model saved path are inside this file. frcnn_test_vgg.ipynb is the file to test the model with test images and calculate the mAP (mean average precision) for the model.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.