Giter VIP home page Giter VIP logo

suyukun666 / ufo Goto Github PK

View Code? Open in Web Editor NEW
279.0 8.0 47.0 286.92 MB

Official PyTorch implementation of the “A Unified Transformer Framework for Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection”. (TMM2023)

License: MIT License

Python 99.57% Shell 0.43%
video-salient-object-detection matching co-saliency-detection co-segmentation deep-learning pytorch inpainting video

ufo's People

Contributors

lianchengmingjue avatar suyukun666 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

ufo's Issues

Dataset share?

Hi~
Could you share the Pascal voc Dataset you used in experiment?
Thank you!
Best wishes

问题已解决

您好,非常想学习这篇代码,由于自身硬件设备限制无法完成训练,可否分享一下训练好的权重文件?

关于模型性能的问题

作者您好,最近我尝试了下UFO模型,并且训练了Co-Segmentation和Video Salient Object Detection任务。但在测试过程中,有些令我比较疑惑的问题,特地来问下。

  1. 采用原始模型权重进行测试的结果与我重新训练模型的测试结果不一致。

    • Co-Segmentation任务
      原始模型权重的测试结果
      image
      重新训练模型的测试结果
      image

    • Video Salient Object Detection任务
      原始模型权重的测试结果
      image
      重新训练模型的测试结果
      image
      两者间相差是比较大的,重新训练的模型均采用仓库中的配置,未作任何改动。目前,我还不清楚是什么原因。

A doubt about VSOD task being unsupervised

Hi author, I would like to do further research based on your work, but I have some doubts that I hope you can answer for me. You mentioned in your paper that your model is unsupervised for the VSOD task, but I see that the code uses groundtruth for supervision, what is the reason for this?

add model to Hugging Face

Hi, would you be interested in adding UFO to Hugging Face Hub? The Hub offers free hosting, and it would make your work more accessible and visible to the rest of the ML community. We can setup an organization or a user account under which UFO can be added similar to github.

Example from other organizations:
Keras: https://huggingface.co/keras-io
Microsoft: https://huggingface.co/microsoft
Facebook: https://huggingface.co/facebook

Example spaces with repos:
github: https://github.com/salesforce/BLIP
Spaces: https://huggingface.co/spaces/akhaliq/BLIP

github: https://github.com/facebookresearch/omnivore
Spaces: https://huggingface.co/spaces/akhaliq/omnivore

and here are guides for adding spaces/models/datasets to your org

How to add a Space: https://huggingface.co/blog/gradio-spaces
how to add models: https://huggingface.co/docs/hub/adding-a-model
uploading a dataset: https://huggingface.co/docs/datasets/upload_dataset.html

Please let us know if you would be interested and if you have any questions, we can also help with the technical implementation.

How to deal with the noisy ground truth of COCO-SEG?

I observed that several ground truth segmentation masks provided in the COCO-SEG for certain categories are noisy i.e. the masks are several other objects are also present along with the mask of the desired object. Some examples are cup, tennis racket (humans are segmented along with the racket in the ground truth mask). How should we deal with this? Do you pre-preprocess this dataset somehow? Thanks.

Co-saliency detection test set

Hello authors, I am currently evaluating the UFO model for co-saliency detection. Could you please share with me the test set you used for evaluating model performance for co-saliency detection. Is that part of the three datasets: CoCA, Cosal2015, CoSoD3k or the entire datasets you use for testing? Thanks.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.