dsg's People

Contributors

inf800, j-rausch, rashitig

dsg's Issues

figure graphic

Is there a setting I can pass when running demo.py so that the hOCR output includes the div for graphic figures found in the image, or is this supposed to happen automatically? I have tested several images and no figures are being detected so far. However, I noticed that the figure div is present in the sysdemo output.

Installation process

Half the repos referenced in the installation process cannot be found. PyTorch 1.11 is no longer served by pip, and neither is the required NumPy version.

Is there a simpler installation process to your knowledge? I would really like to try your code but the wall of installs is preventing me from going forward.

Where is the arXiv dataset?

Thank you for publishing this great work.
I successfully ran the demo using the Eperiodica dataset.
However, I couldn't find the arXiv dataset in your Google Drive link.
Is it possible to publish the arXiv dataset?
Thank you in advance.

Fine-tuning/training on our own custom dataset

Hi, thank you for the great work.

I see that the repo only has prediction/demo code available. Could you please add the fine-tuning code and describe the custom dataset format that the model accepts? Could you also add the details required to train on a custom dataset?

Thank you

OCR

Hi, how can I get the text of an image through OCR? Is something already implemented, or do I need to do it outside of this project?

TXT files in sysdemo

Hi, and thank you for this great project.

I am trying to test on my own image, and I don't know how to generate the accompanying .txt file for the image.
I see there is one in the sysdemo folder.
How do I generate this file?
Thank you.

Can you provide an explanatory file for relationship annotation?

Thanks for your great project! I would like to use the datasets you've provided for my research. However, I'm having difficulty interpreting the rules for relationship annotation in the provided files. Could you please clarify the annotation guidelines?
(two images attached)

Why can a figure have so many parents? Also, why can a row be the parent of so many entities?

Some questions about TXT files

Hi, thank you for this great project!
I'm trying to test my own images and generated an OCR file using Tesseract, but its format is different from the OCR file in sysdemo.
What do the two numbers at the beginning of the OCR file in sysdemo mean? And are the four numbers following each word the coordinates?
I look forward to your reply.
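
For anyone comparing their own OCR output against that layout, here is a minimal sketch of a reader for it. It assumes only what the question describes: two leading numbers, then repeated groups of one word followed by four numbers, all separated by whitespace. It does not assume what the numbers mean, and this layout has not been confirmed against the repository. The names `WordBox` and `parse_word_box_text` and the sample text are hypothetical.

```python
from typing import List, NamedTuple, Tuple


class WordBox(NamedTuple):
    """One word and the four numbers that follow it (meaning unconfirmed)."""
    text: str
    values: Tuple[float, float, float, float]


def parse_word_box_text(content: str) -> Tuple[Tuple[float, float], List[WordBox]]:
    """Split the text into two header numbers and (word, four numbers) groups.

    Assumption: tokens are whitespace-separated and words contain no spaces.
    """
    tokens = content.split()
    if len(tokens) < 2 or (len(tokens) - 2) % 5 != 0:
        raise ValueError("expected 2 header numbers followed by groups of 5 tokens")

    header = (float(tokens[0]), float(tokens[1]))
    words = []
    for i in range(2, len(tokens), 5):
        a, b, c, d = (float(t) for t in tokens[i + 1 : i + 5])
        words.append(WordBox(tokens[i], (a, b, c, d)))
    return header, words


# Made-up sample in the assumed layout, for illustration only.
sample = "612 792\nHello 10 20 50 32\nworld 55 20 95 32\n"
header, words = parse_word_box_text(sample)
```

Loading both the sysdemo file and a converted Tesseract file this way makes it easier to compare value ranges side by side, which may help in guessing whether the four numbers are corner coordinates or a position plus width and height.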

Evaluation on eperiodica and arxivdoc

Hi Johannes, thanks for your great project! I would like to build on your datasets and work in my own research. However, I haven't found the evaluation scripts needed to reproduce the results in the DSG paper.
I tried to adapt the training command for evaluation like this:

    CUDA_VISIBLE_DEVICES=0 python scripts/train_doc_SG_head.py \
        --config-file ./configs/sgg_end2end_EP.yaml \
        --num-gpus 1 \
        --eval-only \
        --resume \
        MODEL.ROI_SCENEGRAPH_HEAD.PREDICT_USE_VISION True \
        OUTPUT_DIR ./output/eval \
        MODEL.WEIGHTS ./checkpoints/DSG_E2E_eperiodica/dsg_e2e_eperiodica_checkpoint.pth

However, it fails with the following error:

    TypeError: _evaluate_predictions_on_coco() got an unexpected keyword argument 'use_fast_impl'

I do not know what the correct command is.

Thanks! I look forward to your reply!
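
As general background on this kind of error: Python raises this TypeError when a function is called with a keyword argument that its installed definition does not accept. That is often a sign that the calling code and the installed library version disagree. The sketch below reproduces the pattern with a stand-in function and shows one possible workaround that drops unsupported keywords before calling. `evaluate_predictions` and `call_with_supported_kwargs` are hypothetical names, not part of this repository or any library, and whether dropping the argument is appropriate here is an assumption that has not been verified.

```python
import inspect


def call_with_supported_kwargs(func, *args, **kwargs):
    """Call func, dropping keyword arguments its signature does not accept."""
    params = inspect.signature(func).parameters
    # If func itself takes **kwargs, everything can be passed through unchanged.
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()):
        return func(*args, **kwargs)
    supported = {k: v for k, v in kwargs.items() if k in params}
    return func(*args, **supported)


# Hypothetical stand-in for an evaluation helper whose installed version
# lacks the `use_fast_impl` parameter. It is not the real library function.
def evaluate_predictions(coco_gt, coco_results, iou_type):
    return {"iou_type": iou_type, "num_results": len(coco_results)}


# Calling it directly with the extra keyword reproduces the error pattern.
try:
    evaluate_predictions(None, [], "bbox", use_fast_impl=True)
except TypeError as err:
    error_message = str(err)

# The wrapper filters out the unsupported keyword and the call succeeds.
result = call_with_supported_kwargs(
    evaluate_predictions, None, [], "bbox", use_fast_impl=True
)
```

Aligning the library version with the one the repository was written against is usually the cleaner fix. A filter like this only hides the mismatch.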
