Giter VIP home page Giter VIP logo

rbpn-vsr-p's Introduction

Full-length video upscaling

Current Progress

Method

To perform VSR (Video Super Resolution), we perform MISO-SR on the target frame using concatenated neighboring frames and corresponding dense optical flows. The recurrent module of this network repeats itself using subsequent iterations over each of 6 neighboring frames. The input frame is bicubically upscaled to provide a color-correct baseline for the network to add onto.

image

Fig 3. RBPN (Recurrent-Back-Projection Network)

Current setbacks

  • Training the RBPN was very time-consuming with time for one epoch being around 35 mins. This was due to the limited RAM on free cloud-based GPU services such as Kaggle which could only serve small batch sizes.
  • Had to train with a small dataset which hindered output quality as larger datasets would make training times infeasible.
  • Training would often stall and losses tended to oscillate regardless of tuning hyperparameters resulting in blank/totally black outputs after each epoch.
  • Problem identified was Gradient Explosion and was mitigated by introducing batch-norm layers; however the oscillating loss was still persistent.
  • Could not identify modifications needed to produce color-correct frames with few artifacts.
  • Output would capture image structure and content but would fail to perceive the correct colors leading to frames with bright hues.
  • Localized patches of intense color would appear over the frames in seemingly random locations producing unwanted artifacts.
  • The model is inefficient when used with any video. Example: a 7-second video takes 12 mins to upscale despite using the GPU...

Currently Working On

  • Designing first draft of website to learn basics of HTML and CSS
  • Researching basicVSR++

Further plans

  • Develop a seamless and interactive interface to the model via a website using Flask as the framework alongside Javascript and HTML/CSS
  • Explore basicVSR++ as an alternative to RBPN considering the infeasibility of the model's real-world use case.

Associated links

References

rbpn-vsr-p's People

Contributors

doobiusp avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.