Giter VIP home page Giter VIP logo

datasets's Introduction

Warwick Electron Microscopy Datasets

DOI

This repository is for the preprint|paper "Warwick Electron Microscopy Datasets". It supplements datasets by providing scripts used to curate datasets and their variants, and to create both static and interactive visualizations.

There are three main datasets containing 19769 experimental STEM images, 17266 experimental TEM images and 98340 simulated TEM exit wavefunctions. Datasets are available here.

Scrips and data for variational autoencoders (VAEs) and modified t-distributed stochastic neighbor embedding (tSNE) are in the vaegan subdirectory. Pretrained VAEs are here.

Interactive Visualizations

Interactive visualizations can be created by running display_visualization_files.py. Change values of file location variables (in the script) to display their visualization:

SAVE_DATA: Full save location of a NumPy file containing a dataset. For example, from the datasets main page.
SAVE_FILE: Full save location of a NumPy file containing tSNE map points. Files for each visualization are in this repository and have filenames in the form "tsne_*.npy" for PCA and vae_tsne_*.npy for VAE, where * is a wildcard.

An optional extra parameter, USE_FRAC, controls the portion of data points that are displayed. Use a value lower than 1 if a visualization is slow/unresponsive for a large dataset.

Other Contents

There are a few folders:

create_96x96: Scripts to downsample examples to 96x96.
cropping: Scripts to crop 512x512 regions from full images.
mining_scripts: An assortment of mining scrips used to curate micrographs.
stem_full_shapes: Scripts to investigate the distribution of STEM full images shapes.
vaegan: Source code and pretrained models for VAEs, and source code and precompiled binaries for modified tSNE implementations.

In addition, there are a few noteable fles:

create_static_displays: Creates tSNE visualizations with map points and images.
create_table_images: Example TEM and STEM images are selected using their positions in tSNE visualizations.
create_visualization_files: Ouputs NumPy files containing dataset principal componets and tSNE visualizations.

Scripts to simulate wavefunctions are here. They have the form "run_simulations*.py", where * is a wildcard.

Contact

Jeffrey Ede: [email protected]

Example Visualization

An example tSNE visualization for 19769 96x96 crops from STEM images. It was created by training a VAE to encode images as 64-dimensional means and standard deviations of normal distributions. Standard deviations were then used to weight the clustering of means in 2 dimensions by tSNE. Images are shown at 500 randomly sampled points.

datasets's People

Contributors

jeffrey-ede avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.