yhw-yhw / d2hc-rmvsnet Goto Github PK
View Code? Open in Web Editor NEWThe official repository of the paper "Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking" (ECCV2020 Spotlight)
License: MIT License
The official repository of the paper "Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking" (ECCV2020 Spotlight)
License: MIT License
Excuse me.in paper,when training you set N = 3 ,D = 128, epochs=6 Batch size = 6 on 2 NVIDIA TITAN RTX graphics cards.But,in code train.sh
you set N = 4 ,D = 196, epochs=10 Batch size = 1 .When I attempt to train this network follow your train.sh ,batch =2 on 2 NVIDIA 3090(24G),11 days if training ends and GPU utilization below 46%,very slow.if batch=3,out cuda memory.why?
Hi, I'm asking about the implementation details. You mentioned in your paper the network for DTU is trained on 2 GPU with batchsize 6, but in your script train.sh, I don't see this. Also, is the resolution for DTU inference 480 x 640 according to eval_dtu.sh? Maybe it's not mentioned in the paper. Thank you for clarification!
Can you share your pretrained model on the BlendMVS dataset since I want to run your code in the outdoor scene
Thanks so much for open sourcing this amazing work!
What would be the license for this? If possible it would be great to add a LICENSE file.
Thanks!
From table 3 it seems dynamic consistency fusion increases the score by 1.65 which is a lot, so I want to test it on other methods to see if it consistently increases the score for any kind of depth prediction result. Have you tried to apply this module on other methods such as MVSNet? Should be just a drop-in replacement.
In your code however, I cannot find what is written in the paper.
There are two fusion functions, one in eval.py
Lines 268 to 283 in 7ebeb16
fusion.py
Lines 192 to 214 in 7ebeb16
Anyway, they don't do what's described in the paper, so I wonder what's your final implementation to get the good results, and how is that different from the paper.
File "fusion.py", line 376, in filter_depth
color = ref_img[:, : , :][valid_points] # hardcoded for DTU dataset
IndexError: boolean index did not match indexed array along dimension 1; dimension is 479 but corresponding boolean dimension is 480
Hello, your paper results are excellent. But I have tested a group of photos taken by myself. What's the matter when the fusion.sh stage reports that the bool dimension here is not correct? (My photos have been processed with colmap and colmap2mvsnet.py and are available correctly)
Hi,
I am doing training of D2HC-RMVSNet. I am using D=192, nviews=3 and batch=1, image_scale=0.25.
I am using the BlendedMVS dataset for training on one RTX 3080 Ti (12GB mem).
However, unfortunately, at d=160 of the regularization stage, I am running out of memory.
I am not using apex for automatic mixed precision.
My question is, in the training stage, will the recurrent structure will save memory? It seems it does not.
Thanks,
Han
Hello, I tested blended MVs before, using the model you gave me. The graphics card is 3090, but the result is very poor. How do you set the test parameters and get the results
This is a cool work! I appreciate it a lot.
When do you expect to release it so that I can have a try?
Are the parameters in eval_tanks.sh inconsistent with that on papers? Thanks
I tried increasing the max h max w, or reducing the threshold. But still only around 0.7 f2score for the reference Ignatius scene, which can be around 0.8 according to the official Tanks website.
hello , thanks for share. i wonder the dynamic consistency checking noted in the paper is not the same with the code in fusion.py. maybe i did not get it ,could you give some guidance?
Hi, since you reported memory usage in the paper and I noticed you have commented some memory usage in the code, so I am wondering if you use some package or the command "nvidia-smi" to report the memory. Thank you!
Hi, Which datasets is your pretrained model trained on, DTU or BlendedMVS?
1080Ti * 4, batch_size = 4
RuntimeError: CUDA out of memory. Tried to allocate 20.00 MiB (GPU 0; 10.92 GiB total capacity; 10.09 GiB already allocated; 5.00 MiB free; 19.09 MiB cached)
In what file is TEST_DATA_FOLDER set?
Similarly, could you please clarify what is intended by "Set MODEL_FOLDER to ckpt and model_ckpt_index to checkpoint_list."? That is, what files should be modified?
Hi,
Thanks for your excellent work. I'm wondering will you release the blended training code & dataset code? I have retrianed this model using BlendedMVS by my implement-code, but the models performance is worse than your pretrained model.
Thanks!
Hi, since I am working on a similar project and would like to use D2HC-RMVSNet as a reference. So, currently I have run it on Tanks and Temples training set and got the following results, which is very good:
f1 score | Barn | Caterpillar | Church | Courthouse | Ignatius | Meetingroom | Truck | mean
D2HC RMVSNet | 0.6555 | 0.6104 | 0.5421 | 0.594 | 0.8193 | 0.4095 | 0.7392 | 0.62428571
So I just want to make sure it is consistent with your best performance on that data (if you have these results)
(although I think I have followed your code exactly and it should be consistent)
Thank you!!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.