Training and deployment of deep learning models applied to satellite and aerial imagery.

👉 satellite-image-deep-learning.com 👈

How to use this repository: if you know exactly what you are looking for (e.g. you have the paper name) you can Control+F to search for it in this page (or search in the raw markdown).

Model training

This section discusses training machine learning models.

Metrics

A number of metrics are common to all model types (but can have slightly different meanings in contexts such as object detection), whilst other metrics are very specific to particular classes of model. The correct choice of metric is particularly critical for imbalanced dataset problems, e.g. object detection

TP = true positive, FP = false positive, TN = true negative, FN = false negative
Precision is the % of correct positive predictions, calculated as precision = TP/(TP+FP)
Recall or true positive rate (TPR), is the % of true positives captured by the model, calculated as recall = TP/(TP+FN). Note that FN is not possible in object detection, so recall is not appropriate.
The F1 score (also called the F-score or the F-measure) is the harmonic mean of precision and recall, calculated as F1 = 2*(precision * recall)/(precision + recall). It conveys the balance between the precision and the recall. Ref
The false positive rate (FPR), calculated as FPR = FP/(FP+TN) is often plotted against recall/TPR in an ROC curve which shows how the TPR/FPR tradeoff varies with classification threshold. Lowering the classification threshold returns more true positives, but also more false positives. Note that since FN is not possible in object detection, ROC curves are not appropriate.
Precision-vs-recall curves visualise the tradeoff between making false positives and false negatives
Accuracy is the most commonly used metric in 'real life' but can be a highly misleading metric for imbalanced data sets.
IoU is an object detection specific metric, being the average intersect over union of prediction and ground truth bounding boxes for a given confidence threshold
[email protected] is another object detection specific metric, being the mean value of the average precision for each class. @0.5 sets a threshold for how much of the predicted bounding box overlaps the ground truth bounding box, i.e. "minimum 50% overlap"
For more comprehensive definitions checkout Object-Detection-Metrics
Metrics to Evaluate your Semantic Segmentation Model

Best practice

This section includes tips and ideas I have picked up from other practitioners including ai-fast-track, FraPochetti & the IceVision community

Almost all imagery data on the internet is in RGB format, and common techniques designed for working with this 3 band imagery may fail or need significant adaptation to work with multiband data (e.g. 13-band Sentinel 2)
In general, classification and object detection models are created using transfer learning, where the majority of the weights are not updated in training but have been pre computed using standard vision datasets such as ImageNet
Since satellite images are typically very large, it is common to tile them before processing. Alternatively checkout Fully Convolutional Image Classification on Arbitrary Sized Image -> TLDR replace the fully-connected layer with a convolution-layer
Where you have small sample sizes, e.g. for a small object class which may be under represented in your training dataset, use image augmentation
In general, larger models will outperform smaller models, particularly on challenging tasks such as detecting small objetcs
If model performance in unsatisfactory, try to increase your dataset size before switching to another model architecture
In training, whenever possible increase the batch size, as small batch sizes produce poor normalization statistics
The vast majority of the literature uses supervised learning with the requirement for large volumes of annotated data, which is a bottleneck to development and deployment. We are just starting to see self-supervised approaches applied to remote sensing data
4-ways-to-improve-class-imbalance discusses the pros and cons of several rebalancing techniques, applied to an aerial dataset. Reason to read: models can reach an accuracy ceiling where majority classes are easily predicted but minority classes poorly predicted. Overall model accuracy may not improve until steps are taken to account for class imbalance.
For general guidance on dataset size see this issue
Read A Recipe for Training Neural Networks by Andrej Karpathy
Seven steps towards a satellite imagery dataset
Implementing Transfer Learning from RGB to Multi-channel Imagery -> takes a resnet50 model pre-trained on an input of 224x224 pixels with 3 channels (RGB) and updates it for a new input of 480x400 pixels and 15 channels (12 new + RGB) using keras
How to implement augmentations for Multispectral Satellite Images Segmentation using Fastai-v2 and Albumentations
Principal Component Analysis: In-depth understanding through image visualization applied to Landsat TM images, with repo
Leveraging Geolocation Data for Machine Learning: Essential Techniques -> A Gentle Guide to Feature Engineering and Visualization with Geospatial data, in Plain English
3 Tips to Optimize Your Machine Learning Project for Data Labeling
Image Classification Labeling: Single Class versus Multiple Class Projects
Labeling Satellite Imagery for Machine Learning
Image Augmentations for Aerial Datasets
Leveraging satellite imagery for machine learning computer vision applications
Best Practices for Preparing and Augmenting Image Data for CNNs
Using TensorBoard While Training Land Cover Models with Satellite Imagery
An Overview of Model Compression Techniques for Deep Learning in Space
Visualise Embeddings with Tensorboard -> also checkout the Tensorflow Embedding Projector
Introduction to Satellite Image Augmentation with Generative Adversarial Networks - video
Use Gradio and W&B together to monitor training and view predictions
Every important satellite imagery analysis project is challenging, but here are ten straightforward steps to get started
Challenges with SpaceNet 4 off-nadir satellite imagery: Look angle and target azimuth angle -> building prediction in images taken at nearly identical look angles — for example, 29 and 30 degrees — produced radically different performance scores.
How not to test your deep learning algorithm? - bad ideas to avoid
AI products and remote sensing: yes, it is hard and yes, you need a good infra -> advice on building an in-house data annotation service
Boosting object detection performance through ensembling on satellite imagery
How to use deep learning on satellite imagery — Playing with the loss function
On the importance of proper data handling
Generate SSD anchor box aspect ratios using k-means clustering -> tutorial showing how to discover a set of aspect ratios that are custom-fit for your dataset, applied to tensorflow object detection
Transfer Learning on Greyscale Images: How to Fine-Tune Pretrained Models on Black-and-White Datasets
How to create a DataBlock for Multispectral Satellite Image Segmentation with the Fastai
A comprehensive list of ML and AI acronyms and abbreviations
Finding an optimal number of “K” classes for unsupervised classification on Remote Sensing Data -> i.e 'elbow' method
Supplement your training data with 'negative' examples which are created through random selection of regions of the image that contain no objects of interest, read Setting a Foundation for Machine Learning
The law of diminishing returns often applies to dataset size, read Quantifying the Effects of Resolution on Image Classification Accuracy
Implementing Transfer Learning from RGB to Multi-channel Imagery -> Medium article which discusses how to convert a model trained on 3 channels to more channels, adding an additional 12 channels to the original 3 channel RGB image, uses Keras
satellite-segmentation-pytorch -> explores a wide variety of image augmentations to increase training dataset size
Quantifying uncertainty in deep learning systems
How to create a custom Dataset / Loader in PyTorch, from Scratch, for multi-band Satellite Images Dataset from Kaggle -> uses the 38-Cloud dataset
How To Normalize Satellite Images for Deep Learning
ML Tooling 2022 by developmentseed

Free online compute

A GPU is required for training deep learning models (but not necessarily for inferencing), and this section lists a couple of free Jupyter environments with GPU available. There is a good overview of online Jupyter development environments on the fastai site. I personally use Colab Pro with data hosted on Google Drive, or Sagemaker if I have very long running training jobs.

Google Colab

Collaboratory notebooks with GPU as a backend for free for 12 hours at a time. Note that the GPU may be shared with other users, so if you aren't getting good performance try reloading.
Also a pro tier for $10 a month -> https://colab.research.google.com/signup
Tensorflow, pytorch & fastai available but you may need to update them
Colab Alive is a chrome extension that keeps Colab notebooks alive.
colab-ssh -> lets you ssh to a colab instance like it’s an EC2 machine and install packages that require full linux functionality

Kaggle

Free to use
GPU Kernels - may run for 1 hour
Tensorflow, pytorch & fastai available but you may need to update them
Advantage that many datasets are already available

Deployment

This section discusses how to get a trained machine learning & specifically deep learning model into production. For an overview on serving deep learning models checkout Practical-Deep-Learning-on-the-Cloud. There are many options if you are happy to dedicate a server, although you may want a GPU for batch processing. For serverless use AWS lambda.

Rest API on dedicated server

A common approach to serving up deep learning model inference code is to wrap it in a rest API. The API can be implemented in python (flask or FastAPI), and hosted on a dedicated server e.g. EC2 instance. Note that making this a scalable solution will require significant experience.

Basic API: https://blog.keras.io/building-a-simple-keras-deep-learning-rest-api.html with code here
Advanced API with request queuing: https://www.pyimagesearch.com/2018/01/29/scalable-keras-deep-learning-rest-api/
How to make a geospatial Rest Api web service with Python, Flask and Shapely - Tutorial
BMW-YOLOv4-Training-Automation -> project that demos training ML model via rest API
Basic REST API for a keras model using FastAPI
NI4OS-RSSC -> Web Service for Remote Sensing Scene Classification (RS2C) using TensorFlow Serving and Flask
Sat2Graph Inference Server -> API in Go for road segmentation model inferencing
API algorithm to apply object detection model to terabyte size satellite images with 800% better performance and 8 times less resources usage
clearcut_detection -> django backend
airbus-ship-detection -> CNN with REST API

Model serving with GRPC

GPRC is a framework for implementing Remote Procedure Call (RPC) via HTTP/2. Developed and maintained mainly by Google, it is widely used in the industry. It allows two machines to communicate, similar to HTTP but with better syntax and performance.

deploy-models-with-grpc-pytorch-asyncio

Framework specific model serving

If you are happy to live with some lock-in, these are good options:

Tensorflow serving is limited to Tensorflow models
TensorRT_Inference -> An oriented object detection framework based on TensorRT
Pytorch serve is easy to use, limited to Pytorch models, and can be deployed via AWS Sagemaker, See pl-lightning-torchserve-neptune-template
sagemaker-inference-toolkit -> Serve machine learning models within a Docker container using AWS SageMaker

Framework agnostic model serving

The Triton Inference Server provides an optimized cloud and edge inferencing solution. Read CAPE Analytics Uses Computer Vision to Put Geospatial Data and Risk Information in Hands of Property Insurance Companies
RedisAI is a Redis module for executing Deep Learning/Machine Learning models and managing their data

Using lambda functions - i.e. serverless

Using lambda functions allows inference without having to configure or manage the underlying infrastructure

On AWS either use regular lambdas from AWS or SageMaker Serverless Inference
Object detection inference with AWS Lambda and IceVision (PyTorch) with repo
Deploying PyTorch on AWS Lambda
Example deployment behind an API Gateway Proxy

Inferencing on large images

Models are typically trained and inferenced on relatively small images, e.g. 640x640 pixels for YOLOv5m. To inference on a large image it is necessary to use a sliding window over the image, inference on each window, then combine the results. However lower confidence predicitons will be made at the edges of the window where objects may be partially cropped. To overcome this a framework called sahi has been developed. An example of how to use sahi with yolo is here. For an example of using threading to process a large image see Fast-Large-Image-Object-Detection-yolov7

Models in the browser

The model is run in the browser itself on live images, ensuring processing is always with the latest model available and removing the requirement for dedicated server side inferencing

Classifying satellite imagery - Made with TensorFlow.js YoutTube video

Model optimisation for deployment

The general approaches are outlined in this article from NVIDIA which discusses fine tuning a model pre-trained on synthetic data (Rareplanes) with 10% real data, then pruning the model to reduce its size, before quantizing the model to improve inference speed. There are also toolkits for optimisation, in particular ONNX which is framework agnostic.

MLOps

MLOps is a set of practices that aims to deploy and maintain machine learning models in production reliably and efficiently.

How to Build MLOps Pipelines with GitHub Actions

Model monitoring

Once your model is deployed you will want to monitor for data errors, broken pipelines, and model performance degradation/drift ref

Blog post by Neptune: Doing ML Model Performance Monitoring The Right Way
whylogs -> Profile and monitor your ML data pipeline end-to-end

Model tracking, versioning, specification & compilation

dvc -> a git extension to keep track of changes in data, source code, and ML models together
Weights and Biases -> keep track of your ML projects. Log hyperparameters and output metrics from your runs, then visualize and compare results and quickly share findings with your colleagues
geo-ml-model-catalog -> provides a common metadata definition for ML models that operate on geospatial data
hummingbird -> a library for compiling trained traditional ML models into tensor computations, e.g. scikit learn model to pytorch for fast inference on a GPU
deepchecks -> Deepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort
pachyderm -> Data Versioning and Pipelines for MLOps. Read Pachyderm + Label Studio which discusses versioning and lineage of data annotations

AWS

Host your data on S3 and metadata in a db such as postgres
For batch processing use Batch. GPU instances are available for batch deep learning inferencing.
If processing can be performed in 15 minutes or less, serverless Lambda functions are an attractive option owing to their ability to scale. Note that lambda may not be a particularly quick solution for deep learning applications, since you do not have the option to batch inference on a GPU. Creating a docker container with all the required dependencies can be a challenge. To get started read Using container images to run PyTorch models in AWS Lambda and for an image classification example checkout this repo. Also read Processing satellite imagery with serverless architecture which discusses queuing & lambda. Sagemaker also supports server less inference, see SageMaker Serverless Inference. For managing a serverless infrastructure composed of multiple lambda functions use AWS SAM and read How to continuously deploy a FastAPI to AWS Lambda with AWS SAM
Sagemaker is an ecosystem of ML tools accessed via a hosted Jupyter environment & API. Read Build GAN with PyTorch and Amazon SageMaker, Run computer vision inference on large videos with Amazon SageMaker asynchronous endpoints, Use Amazon SageMaker to Build, Train, and Deploy ML Models Using Geospatial Data
SageMaker Studio Lab competes with Google colab being free to use with no credit card or AWS account required
Deep learning AMIs are EC2 instances with deep learning frameworks preinstalled. They do require more setup from the user than Sagemaker but in return allow access to the underlying hardware, which makes debugging issues more straightforward. There is a good guide to setting up your AMI instance on the Keras blog. Read Deploying the SpaceNet 6 Baseline on AWS
Specifically created for deep learning inferencing is AWS Inferentia
Rekognition custom labels is a 'no code' annotation, training and inferencing service. Read Training models using Satellite (Sentinel-2) imagery on Amazon Rekognition Custom Labels. For a comparison with Azure and Google alternatives read this article
Use Glue for data preprocessing - or use Sagemaker
To orchestrate basic data pipelines use Step functions. Use the AWS Step Functions Workflow Studio to get started. Read Orchestrating and Monitoring Complex, Long-running Workflows Using AWS Step Functions and checkout the aws-step-functions-data-science-sdk-python
If step functions are too limited or you want to write pipelines in python and use Directed Acyclic Graphs (DAGs) for workflow management, checkout hosted AWS managed Airflow. Read Orchestrate XGBoost ML Pipelines with Amazon Managed Workflows for Apache Airflow and checkout amazon-mwaa-examples
When developing you will definitely want to use boto3 and probably aws-data-wrangler
For managing infrastructure use Terraform. Alternatively if you wish to use TypeScript, JavaScript, Python, Java, or C# checkout AWS CDK, although I found relatively few examples to get going using python
AWS Ground Station now supports data delivery to Amazon S3
Redshift is a fast, scalable data warehouse that can extend queries to S3. Redshift is based on PostgreSQL but has some differences. Redshift supports geospatial data.
AWS App Runner enables quick deployment of containers as apps
AWS Athena allows running SQL queries against CSV files stored on S3. Serverless so pay only for the queries you run
If you are using pytorch checkout the S3 plugin for pytorch which provides streaming data access
Amazon AppStream 2.0 is a service to securely share desktop apps over the internet
aws-gdal-robot -> A proof of concept implementation of running GDAL based jobs using AWS S3/Lambda/Batch
Building a robust data pipeline for processing Satellite Imagery at scale using AWS services & Airflow
Using artificial intelligence to detect product defects with AWS Step Functions -> demonstrates image classification workflow
sagemaker-defect-detection -> demonstrates object detection training and deployment
How do you process space data and imagery in low earth orbit? -> Snowcone is a standalone computer that can run AWS services at the edge, and has been demonstraed on the ISS (International space station)
Amazon OpenSearch -> can be used to create a visual search service
Automated Earth observation using AWS Ground Station Amazon S3 data delivery
Satellogic makes Earth observation data more accessible and affordable with AWS
Analyze terabyte-scale geospatial datasets with Dask and Jupyter on AWS
How SkyWatch built its satellite imagery solution using AWS Lambda and Amazon EFS
Identify mangrove forests using satellite image features using Amazon SageMaker Studio and Amazon SageMaker Autopilot
Detecting invasive Australian tree ferns in Hawaiian forests
Improve ML developer productivity with Weights & Biases: A computer vision example on Amazon SageMaker
terraform-aws-tile-service -> Terraform module to create a vector tile service using Amazon API Gateway and S3
sagemaker-ssh-helper -> A helper library to connect into Amazon SageMaker with AWS Systems Manager and SSH
Hosting YOLOv8 PyTorch models on Amazon SageMaker Endpoints
Automatically convert satellite imagery to Cloud-Optimized GeoTIFFs for hosting in Amazon S3
How to deploy your ML model using DagsHub+MLflow+AWS Lambda

Google Cloud

For storage use Cloud Storage (AWS S3 equivalent)
For data warehousing use BigQuery (AWS Redshift equivalent). Visualize massive spatial datasets directly in BigQuery using CARTO
For model training use Vertex (AWS Sagemaker equivalent)
For containerised apps use Cloud Run (AWS App Runner equivalent but can scale to zero)

Microsoft Azure

Azure Orbital -> Satellite ground station and scheduling services for fast downlinking of data
ShipDetection -> use the Azure Custom Vision service to train an object detection model that can detect and locate ships in a satellite image
SwimmingPoolDetection -> Swimming pool detection with Azure Custom Vision
Geospatial analysis with Azure Synapse Analytics and repo
AIforEarthDataSets -> Notebooks and documentation for AI-for-Earth managed datasets on Azure

State of the art engineering

Compute and data storage are on the cloud. Read how Planet and Airbus use the cloud
Traditional data formats aren't designed for processing on the cloud, so new standards are evolving such as COG and STAC
Google Earth Engine and Microsoft Planetary Computer are democratising access to 'planetary scale' compute
Google Colab and others are providing free acces to GPU compute to enable training deep learning models
No-code platforms and auto-ml are making ML techniques more accessible than ever
Serverless compute (e.g. AWS Lambda) mean that managing servers may become a thing of the past
Custom hardware is being developed for rapid training and inferencing with deep learning models, both in the datacenter and at the edge
Supervised ML methods typically require large annotated datasets, but approaches such as self-supervised and active learning require less or even no annotation
Computer vision traditionally delivered high performance image processing on a CPU by using compiled languages like C++, as used by OpenCV for example. The advent of GPUs are changing the paradigm, with alternatives optimised for GPU being created, such as Kornia
Whilst the combo of python and keras/tensorflow/pytorch are currently preeminent, new python libraries such as Jax and alternative languages such as Julia are showing serious promise

Web apps

Flask is often used to serve up a simple web app that can expose a ML model

FastMap -> Flask deployment of deep learning model performing segmentation task on aerial imagery building footprints
Querying Postgres with Python Fastapi Backend and Leaflet-Geoman Frontend
cropcircles -> a purely-client-side web app originally designed for accurately cropping circular center pivot irrigation fields from aerial imagery
django-large-image -> Django endpoints for working with large images for tile serving
Earth Classification API -> Flask based app that serves a CNN model and interfaces with a React and Leaflet front-end
Demo flask map app -> Building Python-based, database-driven web applications (with maps!) using Flask, SQLite, SQLAlchemy and MapBox
Building a Web App for Instance Segmentation using Docker, Flask and Detectron2
greppo -> Build & deploy geospatial applications quick and easy. Read Build a geospatial dashboard in Python using Greppo
localtileserver -> image tile server for viewing geospatial rasters with ipyleaflet, folium, or CesiumJS locally in Jupyter or remotely in Flask applications. Checkout bokeh-tiler
flask-geocoding-webapp -> A quick example Flask application for geocoding and rendering a webmap using Folium/Leaflet
flask-vector-tiles -> A simple Flask/leaflet based webapp for rendering vector tiles from PostGIS
Crash Severity Prediction -> using CAS Open Data and Maxar Satellite Imagery, React app
wildfire-detection-from-satellite-images-ml -> simple flask app for classification
SlumMappingViaRemoteSensingImagery -> learning slum segmentation and localization using satellite imagery and visualising on a flask app
cloud-removal-deploy -> flask app for cloud removal
clearcut_detection -> research & web-service for clearcut detection
staticmaps-function -> A FastAPI that can generate maps using the py-staticmaps package. Designed for deployment to Azure Functions

Neural nets in space

Processing on board a satellite allows less data to be downlinked. e.g. super-resolution image might take 8 images to generate, then a single image is downlinked. Other applications include cloud detection and collision avoidance.

Lockheed Martin and USC to Launch Jetson-Based Nanosatellite for Scientific Research Into Orbit - Aug 2020 - One app that will run on the GPU-accelerated satellite is SuperRes, an AI-based application developed by Lockheed Martin, that can automatically enhance the quality of an image.
Intel to place movidius in orbit to filter images of clouds at source - Oct 2020 - Getting rid of these images before they’re even transmitted means that the satellite can actually realize a bandwidth savings of up to 30%
WorldFloods will pioneer the detection of global flood events from space, launched on June 30, 2021. This paper describes the model which is run on Intel Movidius Myriad2 hardware capable of processing a 12 MP image in less than a minute
How AI and machine learning can support spacecraft docking with repo uwing Yolov3
exo-space -> startup with plans to release an AI hardware addon for satellites
Sony’s Spresense microcontroller board is going to space -> vision applications include cloud detection, more details here
Palantir Edge AI in Space -> using NVIDIA Jetson for ship/aircraft/cloud detection & land cover segmentation
Spiral Blue -> startup building edge computers to run AI analytics on-board satellites
RaVAEn -> a lightweight, unsupervised approach for change detection in satellite data based on Variational Auto-Encoders (VAEs) with the specific purpose of on-board deployment. It flags changed areas to prioritise for downlink, shortening the response time
AWS successfully runs AWS compute and machine learning services on an orbiting satellite in a first-of-its kind space experiment

chihiron / model-training-and-deployment Goto Github PK

model-training-and-deployment's Introduction