kaggle_image-classify's Introduction

Kaggle: Image classification challenges

Experimentation

Install this tooling

A simple way to install and use these basic functions:

! pip install https://github.com/Borda/kaggle_image-classify/archive/main.zip

The Herbarium 2022: Flora of North America dataset comprises 1.05 M images of 15,501 vascular plants, which constitute more than 90% of the taxa documented in North America. The provided dataset is constrained to include only vascular land plants (lycophytes, ferns, gymnosperms, and flowering plants) and it has a long-tail distribution. Each taxon is represented by as few as seven and as many as 100 images, although more images are available.

Sample images

run notebooks in Kaggle

run notebooks in Colab

some results

Training progress with EffNet-b3 trained for 10 epochs:

Training process

Foliar (leaf) diseases pose a major threat to the overall productivity and quality of apple orchards. The current process for disease diagnosis in apple orchards is based on manual scouting by humans, which is time-consuming and expensive.

The main objective of the competition is to develop machine learning-based models to accurately classify a given leaf image from the test dataset to a particular disease category, and to identify an individual disease from multiple disease symptoms on a single leaf image.

Sample images

run notebooks in Kaggle

run notebooks in Colab

I would recommend uploading the dataset to your personal gDrive and then connecting the gDrive in the notebooks, which saves you a lot of time re-uploading the dataset whenever your Colab is reset... :]
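The recommendation above could be sketched roughly as follows. This is only an illustration, not code from this repository: the helper name `fetch_dataset` and both example paths are hypothetical, and the `google.colab` mount call is left as a comment because it only works inside a Colab runtime.

```python
import shutil
from pathlib import Path

# Inside Colab, the drive would first be mounted, for example:
#   from google.colab import drive
#   drive.mount("/content/gdrive")


def fetch_dataset(src: str, dst: str) -> Path:
    """Copy a dataset archive from a mounted drive to local disk.

    The copy is skipped when a file of the same size already exists at `dst`,
    so re-running the cell after a partial notebook restart is cheap.
    """
    src_path, dst_path = Path(src), Path(dst)
    if not src_path.is_file():
        raise FileNotFoundError(f"Dataset archive not found: {src_path}")
    if dst_path.is_file() and dst_path.stat().st_size == src_path.stat().st_size:
        return dst_path  # already copied, nothing to do
    dst_path.parent.mkdir(parents=True, exist_ok=True)
    shutil.copyfile(src_path, dst_path)
    return dst_path


# Hypothetical usage; adjust both paths to where your archive actually lives:
# fetch_dataset("/content/gdrive/MyDrive/datasets/my-dataset.zip", "my-dataset.zip")
```

Copying the archive to the local disk of the runtime, rather than reading it directly from the mounted drive, usually makes unpacking and data loading faster.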

some results

Training progress with ResNet50 trained for 10 epochs, reaching over 96% validation accuracy:

Training process

More reading

The online cataloguing information is generated by subject matter experts and includes a wide range of data. These include, but are not limited to: multiple object classifications, artist, title, period, date, medium, culture, size, provenance, geographic location, and other related museum objects within The Met’s collection. Adding fine-grained attributes to aid in the visual understanding of the museum objects will enable the ability to search for visually related objects.

Sample images

run notebooks in Kaggle

run notebooks in Colab

I would recommend uploading the dataset to your personal gDrive and then connecting the gDrive in the notebooks, which saves you a lot of time re-uploading the dataset whenever your Colab is reset... :]

some results

Training progress with ResNet50 trained for 35 epochs on the subset of labels with more than 100 samples:

training on 100 samples per class

The task is to classify each cassava image into one of five categories, indicating either a plant with a certain kind of disease or a healthy leaf.

Organizers introduced a dataset of 21,367 labeled images collected during a regular survey in Uganda. Most images were crowd-sourced from farmers taking photos of their gardens, and annotated by experts at the National Crops Resources Research Institute (NaCRRI) in collaboration with the AI lab at Makerere University, Kampala.

Sample images

run notebooks in Colab

I would recommend uploading the dataset to your personal gDrive and then connecting the gDrive in the notebooks, which saves you a lot of time re-uploading the dataset whenever your Colab is reset... :]

some results

Training progress with ResNet50 trained for 10 epochs:

Training process

kaggle_image-classify's People

Contributors

borda, edgarriba, pre-commit-ci[bot]

kaggle_image-classify's Issues

Training metrics not reported in log file

🐛 Bug

In the PyTorch Lightning Google Colab notebook, the training loss, training accuracy, and training F1 score are not logged in the metrics file.
The validation metrics are logged in the validation step and appear in the log file.
The training metrics are not logged in the training step function, so the Logger never records them and they are missing from the log file.

How can these metrics be logged?

ImportError: cannot import name 'InterpolationMode' from 'torchvision.transforms'

🐛 Bug

ImportError: cannot import name 'InterpolationMode' from 'torchvision.transforms' when trying to import PlantPathologyDM

To Reproduce

Steps to reproduce the behavior:

  1. Import PlantPathologyDM
  2. See error

Expected behavior

InterpolationMode should be imported correctly

Additional context

Tested on torchvision==0.8.1.
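A likely cause, stated as an assumption rather than a confirmed diagnosis: to my knowledge `InterpolationMode` was only added to `torchvision.transforms` in release 0.9.0, so version 0.8.1 would not provide it. The small check below sketches how a notebook could detect this up front; the helper names are hypothetical and the version threshold should be verified against the torchvision release notes.

```python
import re


def parse_version(version: str) -> tuple:
    """Extract the leading numeric parts, e.g. '0.9.0+cu111' -> (0, 9, 0)."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError(f"Unrecognized version string: {version!r}")
    # a missing patch component is treated as 0
    return tuple(int(part) for part in match.groups(default="0"))


def has_interpolation_mode(version: str) -> bool:
    """Assumption: InterpolationMode is available from torchvision 0.9.0 onwards."""
    return parse_version(version) >= (0, 9, 0)


# Hypothetical usage inside a notebook:
# import torchvision
# if not has_interpolation_mode(torchvision.__version__):
#     raise RuntimeError("Please upgrade torchvision to 0.9.0 or newer")
```

When upgrading, keep in mind that each torchvision release is built against a specific torch release, so the two packages generally need to be upgraded together.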

Dataset

Where is the dataset? Do I need to download it from Kaggle to gDrive and then work with it from there?

# copy the dataset to local drive
! rsync -ah --progress /content/gdrive/Shareddrives/Datasets/Kaggle/Plant-pathology/plant-pathology-2021-fgvc8_640.zip plant-pathology.zip

Obsolete repository description

🐛 Bug

Obsolete repository description in "About" sidebox: "Identify the type of disease present on Appletree leafs", although this repository is now a compilation of various classification tasks.

To Reproduce

Not applicable here.

Expected behavior

The description should be updated to fit current repository content.

Additional context

None

Question about dataset

Is the Plant Pathology 2021 dataset used in the PyTorch Lightning notebook a resized version of the dataset? There are some commands that are commented out:

# ! apt-get install -qq -y imagemagick
# ! mogrify -resize 960 train_images/*.jpg

Were these commands used to resize the dataset before running the notebook?
