fawnliu / tris
[ICCV 2023] Official code release of our paper "Referring Image Segmentation Using Text Supervision"
License: MIT License
Thank you for your great work. I have a question. In the first stage of training, each image is paired with one positive text and N negative texts taken from other samples. Why does stage 1 still need to select from the response maps generated by all of these pairs? Why not select only the response map of the positive pair?
Hi,
Thank you for sharing your work!
I noticed that in demo.py, the get_transform() method does not respect the size argument. It is hardcoded to resize to (224, 224):
Line 22 in b45f660
I believe this should be (size, size), which would be 320px by default. I can confirm that this change results in a heatmap that is a lot closer to the example in your readme.
I'm still not sure why my heatmap doesn't match your example 100%, but the results are impressive nonetheless. 🙂
Thanks to the authors for sharing the code. I have the following question:
When computing the attention map for the visual features, is Av in the line below an all-ones tensor? Only one language vector is used as the key, and the softmax is applied over the last dimension, which has size 1.
Line 122 in b45f660
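The referenced line is not reproduced in this thread, so the following does not confirm what the repository code does. It is only a minimal sketch of the general property the question relies on: a softmax taken over an axis of length 1 always returns 1.0, whatever the input score. The `softmax` helper and the example scores are illustrative assumptions, not code from the repository.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a flat list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical setup: four visual positions, each scored against a single
# language vector, so the last dimension has length 1.
scores = [[2.7], [-1.3], [0.0], [15.2]]

# Softmax over the last dimension, one row at a time.
attn = [softmax(row) for row in scores]

print(attn)  # every entry is 1.0, regardless of the raw score
```

If the softmax really is applied over a dimension of size 1, the resulting weights carry no information, which is the concern raised above. Normalizing over the visual positions instead would give non-trivial weights.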
When will your group release the paper and code?
I would like to know how long it took to train your model on a single RTX 3090.
I ran the steps in the order you specified, but the following step fails:
Train IRNet and generate pseudo masks.
cd IRNet
dir=../output
CUDA_VISIBLE_DEVICES=0,1,2,3 python run_sample_refer.py --cam_out_dir $dir/refcocog_umd/cam --ir_label_out_dir $dir/refcocog_umd/ir_label --ins_seg_out_dir $dir/refcocog_umd/ins_seg --train_list $dir/refcocog_umd/refcocog_train_names.json --cam_eval_thres 0.15 --work_space output_refer/refcocog_umd --num_workers 8 --irn_batch_size 96 --cam_to_ir_label_pass True --train_irn_pass True --make_ins_seg_pass True
error: No such file or directory: '../output/refcocog_umd/refcocog_train_names.json'
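Because the command runs after `cd IRNet`, the relative path `../output` is resolved against the IRNet directory. The thread does not say which earlier step is supposed to produce `refcocog_train_names.json`, so the sketch below does not try to generate it. It only checks that the expected input exists before launching the long-running job. The helper name `missing_inputs` is hypothetical and not part of the repository.

```python
from pathlib import Path


def missing_inputs(paths):
    """Return the subset of the given paths that do not exist on disk."""
    return [str(p) for p in map(Path, paths) if not p.exists()]


# Same layout as in the command above; run this from inside the IRNet directory.
out_dir = Path("../output")
required = [out_dir / "refcocog_umd" / "refcocog_train_names.json"]

missing = missing_inputs(required)
if missing:
    print("Missing inputs; run the step that produces them first:")
    for path in missing:
        # Show the absolute location so a wrong working directory is easy to spot.
        print("  ", Path(path).resolve())
else:
    print("All required inputs found.")
```

Printing the resolved absolute path helps distinguish a file that was never generated from a command started in the wrong working directory.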