Giter VIP home page Giter VIP logo

data_sci_tutorials's Introduction

Data Science Tutorials

license

A series of Jupyter Notebook tutorials, relating to libraries and methods relevant to Vizzuality's Data Science Group.

Start docker via

./jupyter.sh develop

N.b. if the above does not execute you may need to chown +x jupyter.sh.

Then open a browser to 0.0.0.0:8888 to connect to the Jupyter server.

Notebooks and data for this repo are stored in ./Work.

data_sci_tutorials's People

Contributors

01painadam avatar aagm avatar benlaken avatar enriquecornejo avatar guizar avatar ikerey avatar nrigheriu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

data_sci_tutorials's Issues

Tutorial covering Global Forest Watch data creation steps

GFW has some bespoke methods of encoding information into RGB+A channels, and decoding these data in a front-end application.
We should produce a tutorial explaining the steps taken to go from the original Hansen dataset to the final visualisation on GFW, so that 1) this knowledge can be spread effectively in the group, and 2) so that we can re-use and improve on these techniques in the future.

Create tutorial demonstrating use of netcdf

Create a notebook to demonstrate how to use Python to:

  • Simple data access (both local and remote)
  • extract metdatada
  • obtain array slices
  • Plotting of data
  • use metadata info to identify closest points in space to desired locations (use of xray library?)
  • Show how to use a widget to navigate through time dimension of a dataset?
  • show how to save slices of data as a geotiff?

Tutorial demonstrating method of minifying raster data

We often need to convert raster data from scientific datasets into lightweight versions meant for fast loading tiles. We should detail some standard methods of extracting and minifying (i.e. making the files lightweight).
For example, we can document how we would perform an ad hoc/per project assessment of what information is essential for a dataset to maintain, and what detail is not necessary, and a corresponding description of how to subsequently convert values into different formats (e.g. like mapping continuous temperature measurements into 256 integers), and saving the data as unsigned 16-bit integers, and finally compressing these data.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.