Giter VIP home page Giter VIP logo

fyp's Introduction

Will Fitch Final Year Project

This is a repository for my final year project at Brunel University London, implementing a neural regression of greenhouse gas data using fastai and transfer learning. If you're interested in the dissertation itself, email me at [email protected].

TODO

  • Move model training logic to GEE so the entire import pipeline becomes unnecessary. TFlow implementation available in GEE.
  • Train model on a planetary scale. Super normal stuff.
    • Probably worth starting on London, then the southern UK, then the rest of the UK, then based on individual countries.
  • Export model from GEE.
  • Write a basic Django REST server that uses the exported GEE model to predict coordinates.
  • Write a frontend webpage to display the functionality offered by the Django server.

fyp's People

Contributors

wrfitch avatar

Stargazers

 avatar

Watchers

 avatar  avatar

fyp's Issues

Investigate overuse of "scale" parameter

In the image import pipeline, the data is being continually scaled and rescaled - optimise this pipeline by reducing the use of scaling until absolutely necessary.

Improving Model Accuracy

Current accuracy given 224px images =~ 11% +- 2
Target <= 5%

Options:

  • Experiment with different network architecture - see fastai model zoo.
    • compare networks, see what works in background literature.
  • tune other hyperparameters
    • optimisers
    • loss function
    • batch size
    • epochs
  • Create an ensemble of networks and combine predictive power.
    • Existing kernel measurement idea re: take a load of nearby measurements and incorporate them into a tab recommender.
  • Data
    • Give augmentation another go
      • only use rotations OR flips?
    • Import far more data to get a good prediction?

Clean up import code

I mean, dear god. fix file structure and remove unnecessary rubbish. Write import script into files if streaming architecture ends up being necessary.

Clear up bg_research dir

It's too cluttered. It's a handy place to store research links, but the PDFs are getting out of hand.

Resolve image distortion

Currently the PNG exports of the GeoTIFF files are distorted. Examine whether the GeoTIFF files are distorted in GIMP and resolve however possible. This may be due to a scaling issue somewhere in the export pipeline.

Remove old 160-px image pipeline

Now that we've definitively proven the old 160px image pipeline is inferior to the new 224px, replace immediately

Incidentally, testing it with even larger images may be useful.

Fix image brightness

All the images are weirdly dark. This could be resolved in fastai, but I'd prefer to limit that kind of operation where possible.

Notebook Cleanup

There's a lot of code that doesn't need to be sat in these notebooks, cluttering the notebooks so they aren't so readable. Extract shared code and unnecessary verbosity.

Cut Release Branch

Cut a disso release branch with clean code and minimal notebook output, depending on where the notebook is needed. This may also include a model, though that should be kept out of the git tree.

Refactor Architecture

Currently it's organised in a very awkward way, based around these notebooks. Look for options to make more elegant.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.