Giter VIP home page Giter VIP logo

computational-creativity's Introduction

Computational-Creativity

QMUL Coursework

CNN_StyleTransfer.ipynb first needs to be ran in order to get the stylized image which then can be fed into the image caption model. All the instructions to run the code has been given in CNN_StyleTransfer.ipynb notebook.

I've used me.jpg as a original content image and fire.jpg as a style image.

The CNN_StyleTransfer.ipynb needs to be ran on Google Colab in order to use Colab form tools. Flask has also been used to create a web UI to upload the content and style image. Warning: The CNN_StyleTransfer.ipynb notebook will ask for the access to your Google drive and then will create the directories in which to upload the content and style images.

I used Kaggle Kernel to build the image captioning model. Although this makes the workflow not seamless as it involves switching from google colab to kaggle kernel, kaggle offers a range of data repositories which can be very large for local storage. For example, Flickr8k is more than 1GB while glove 6b can be more than 800MB which I was able to get on Kaggle without any internet bandwidth. Therefore I'd recommend to use Kaggle Kernel to run the image captioning notebook. In order to use Kaggle Kernal, you need to setup Kaggle account at https://www.kaggle.com/ and here are the instructions to setup kaggle kernal https://www.kaggle.com/getting-started/44939

If you decide to run everything locally;

Flickr8k can be downloaded from the below link; https://github.com/jbrownlee/Datasets/releases/tag/Flickr8k

Glove 6b embeddings used for pre-trained embedding in the image captioning can be downloaded from the below link; https://nlp.stanford.edu/projects/glove/

GPU is required for the accelerated experimentations which is already provided in Google Colab and Kaggle Kernel.

All the libraries imported on to the CNN_StyleTransfer.ipynb notebook can be ran on Google Colab and image captioning model's notebook on Kaggle Kernel. Flask library has been installed to run on Google Colab. Installation command is stated on to the CNN_StyleTransfer.ipynb notebook.

If you decide to run the notebooks locally, I suggest to use pip package manager.

Reference: The notebook of CNN style network has been adapted from the Computation Creativity Week 4 Labs at Queen Mary University and the notebook of image captioning has been adapted from Kaggle Kernel https://www.kaggle.com/rishabhchaurasia7/image-captioning-on-flickr8k-dataset

computational-creativity's People

Contributors

rahul-ahuja avatar

Stargazers

 avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.