Giter VIP home page Giter VIP logo

Comments (16)

DesaleF avatar DesaleF commented on May 14, 2024 3

Yes I have managed to train and evaluate the model on my own custom dataset, I don't have test set in my dataset, I have only train and val set. so the test.yaml file points to the val set.

img: val.img.tsv
hw: val.hw.tsv
label: val.label.tsv
feature: val.feature.tsv
caption: val_caption.json

this is my test.yaml and val.yaml fil content.
But first you need to create the .tsv and .json files. If you need help in this regard I can point you the issues which helped me to prepare those files

from oscar.

xiyinmsu avatar xiyinmsu commented on May 14, 2024 1

We have saved all running scripts to: https://github.com/microsoft/Oscar/blob/master/MODEL_ZOO.md

Is this what you are looking for?

from oscar.

xiyinmsu avatar xiyinmsu commented on May 14, 2024 1

the train_yaml is from our dataset, you can download it: https://github.com/microsoft/Oscar/blob/master/DOWNLOAD.md

from oscar.

xiyinmsu avatar xiyinmsu commented on May 14, 2024 1

To get the image captioning dataset, run this:
wget https://biglmdiag.blob.core.windows.net/oscar/datasets/coco_caption.zip

Then train.yaml is a yaml file specifies the data like features and labels used in training. It comes with the dataset.

from oscar.

xjli avatar xjli commented on May 14, 2024 1

MIN_BOXES=0, MAX_BOXES=100
conf_thresh = 0.2 or 0.4, I forget the exact value, great probability it is 0.2

from oscar.

runzeer avatar runzeer commented on May 14, 2024

The run_captioning.py file has the train_yaml config but I have not found this.

from oscar.

runzeer avatar runzeer commented on May 14, 2024

The file for generating the downloaded files?

from oscar.

runzeer avatar runzeer commented on May 14, 2024

Recently, I tried to generate the COCO caption feature files according to your format. The model you used is the bottom-up-attention,right? https://github.com/peteanderson80/bottom-up-attention
If so, there exists some hyperparameters that I can not understand. I found most of the images in you feature tsv contain 20~30 bounding boxes. So if convenient,could you share the MIN_BOXES , MAX_BOXES as well as the conf_thresh hyperparameter?

from oscar.

runzeer avatar runzeer commented on May 14, 2024

The bottom up model is trained in Visual Genome dataset,right?

from oscar.

xjli avatar xjli commented on May 14, 2024

Yes

from oscar.

DesaleF avatar DesaleF commented on May 14, 2024

I want to test the model with my own images and now I generated the features and labels. however when I run the script oscar/run_captioning.py It needs test.yaml. Do I have to download the whole training data to test the model on my own images?
This is what I run on my terminal:
python oscar/run_captioning.py
--do_test --do_eval --test_yaml test.yaml
--per_gpu_eval_batch_size 64 --num_beams 5
--max_gen_length 20 --eval_model_dir oscar/checkpoint-29-66420

from oscar.

rachs avatar rachs commented on May 14, 2024

I want to test the model with my own images and now I generated the features and labels. however when I run the script oscar/run_captioning.py It needs test.yaml. Do I have to download the whole training data to test the model on my own images?
This is what I run on my terminal:
python oscar/run_captioning.py
--do_test --do_eval --test_yaml test.yaml
--per_gpu_eval_batch_size 64 --num_beams 5
--max_gen_length 20 --eval_model_dir oscar/checkpoint-29-66420

@DesaleF Were you able to figure out the test.yaml file to test the model on your own images? I'm running into the same issue with not knowing how create test.yaml

from oscar.

vakadanaveen avatar vakadanaveen commented on May 14, 2024

Can some one tell me how to get only a part of the dataset for training ? 22GB to huge. Also, how to do inference on a downloaded pretrained model ? how to get the test.yaml file ?

from oscar.

arushirai1 avatar arushirai1 commented on May 14, 2024

DesaleF

Hey it would be really nice if you could share those links or an example of what you created. I need to do the same :/

from oscar.

arushirai1 avatar arushirai1 commented on May 14, 2024

Can some one tell me how to get only a part of the dataset for training ? 22GB to huge. Also, how to do inference on a downloaded pretrained model ? how to get the test.yaml file ?

Also just to confirm, do we need the whole download to run inference? I am not sure if the tokenizer is already in the repo or if its in that massive download.

from oscar.

via815via avatar via815via commented on May 14, 2024

Yes I have managed to train and evaluate the model on my own custom dataset, I don't have test set in my dataset, I have only train and val set. so the test.yaml file points to the val set.

img: val.img.tsv
hw: val.hw.tsv
label: val.label.tsv
feature: val.feature.tsv
caption: val_caption.json

this is my test.yaml and val.yaml fil content. But first you need to create the .tsv and .json files. If you need help in this regard I can point you the issues which helped me to prepare those files

How to get label.tsv? Thanks!

from oscar.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.