mahalrs / newsgen Goto Github PK

View Code? Open in Web Editor NEW

0.0 2.0 0.0 365 KB

Multi-Modal Image Generation for News Stories

License: Apache License 2.0

TypeScript 8.01% Jupyter Notebook 24.97% Python 67.03%

clip dalle-mini multi-modal text-image transformers vqgan vqgan-clip

newsgen's People

Contributors

Watchers

newsgen's Issues

Update requirements.txt

requirements.txt is empty. Add required dependencies to it.

Add additional news sources to crawl-input.json

Right now TechCrunch is the only target news source in the crawl-input.json. We should add additional news sources to the list for a diverse news data.

Update encode_data.py to save train/val/test splits in separate files

Currently data loading is very slow. Speed can be improved by splitting the data files into train/val/test. Update encode_data.py. Also, remove raw text (headlines, captions, image paths, etc. to reduce size). Save using pytorch.save to compress it.

Fix VQGAN to use PyTorch Lightning

Original VQGAN implementation used PyTorch Lightning module but we converted to PyTorch NN module. It is much easier to do distributed training with PyTorch Lightning, so let's convert it to Lightning module.

crawler: normalize urls

Crawler needs to normalize urls. For example, https://www.example.com and https://www.example.com/ are the same and crawler shouldn't treat them as separate.

Change Adam to AdamW with linear schedule with warmup

Use AdamW from transformers instead of Adam. Use linear schedule with warmup, transformers.get_linear_schedule_with_warmup.

Add training script to fine-tune VQGAN

Using PyTorch Lightning, add an easy to use training/fine-tuning script.

Add script to encode dataset (image tokens)

Using VQGAN encoder, convert all images in the given dataset to image tokens. These image tokens along with tokens from BART encoder (encoded captions/news headline) are fed to BART decoder to train it to generate image tokens given encoded captions/headlines.

We can do this as part of data transform step, however, doing it beforehand will speed up the training.

Fix create_subset.py to take subset size as percentage

create_subset.py takes a fixed subset size argument. Fix it to take subset size as percentage.

Fix VQGAN notebook training loop

Since current VQGAN model is using NN module and not Lightning module, the training loop should handle calls such as optimizer.step, loss.backward(), optimizer.zero_grad(), etc.

Update Readme

Update Readme with instruction to run Newsgen.

Log images every 100 mini-batches

Currently we are logging images every 1000 mini batches during validation and testing. However, we will not have enough images logged in case we have a small dataset or higher number of devices. Either we should change it to 100 mini-batches or take this value as a command line argument.

mahalrs / newsgen Goto Github PK

newsgen's People

Contributors

Watchers

newsgen's Issues

Recommend Projects

Recommend Topics

Recommend Org