Giter VIP home page Giter VIP logo

Comments (9)

zhangkui669 avatar zhangkui669 commented on July 29, 2024 1

@mrharicot , I found another unsupervised method in herehttps://github.com/tinghuiz/SfMLearner. and he also use dispnet network, and i solved the stereo model training problem according to this method.

from monodepth.

mrharicot avatar mrharicot commented on July 29, 2024

Hi,

I did not figure out why the stereo model became unstable.
It never happened in our previous Torch implementation.

The only real change between both versions comes from the implementation of the bilinear sampler and the deconvolutions which have been replaced with resize-convolutions.

You could also try adding some regularization to the network and see if it helps.

from monodepth.

zhangkui669 avatar zhangkui669 commented on July 29, 2024

Thanks for your reply. I will try to solve the problems of stereo model according to your advices.

Another question:
Mono version thrained with stereo rectified image pairs, the main loss is the image reconstruction loss, but the disparity exist only when the stereo image pairs existed. At same depth, the disparity will change if the baseline or focal length changed.
The model is trained with image pairs have same baseline, focus length. If I use another camera which have different baseline or focus length to achive stereo image pairs. I doubt that this mono model won't work.

from monodepth.

mrharicot avatar mrharicot commented on July 29, 2024

Hi,

I am not quite sure of what you mean by "the disparity exist only when the stereo image pairs existed".

If you test on images which have a different focal length the result will probably be less accurate.
However you can fine-tune a pretrained on data with a different baseline, which is what we did in the paper. You could also possibly mix both datasets, however it might be confusing for the network as the same depth will result in a different disparity. I suspect that the network would overfit and learn to identify which dataset the image is from.

from monodepth.

zhangkui669 avatar zhangkui669 commented on July 29, 2024

Hi,

By "the disparity exist only when the stereo image pairs existed" , I means that disparity exist only when scene is imaged more than one viewpoint, and may be different if the position changed betweent different viewpoint.
So , I think the main Limitations of your methods is cannot used to different cameras.

Another question, I did not recurrence results of stereo model in your paper, can you tell me the params setting when you are training the stereo model?

thanks a lot!

from monodepth.

mrharicot avatar mrharicot commented on July 29, 2024

Indeed the current model only supports one camera at a time, nothing is stopping it from learning from different cameras/baseline at the same time but I predict the results won't be great.

The stereo model was trained with the exact same parameters as the mono version.
We found out that the model became unstable after ~12 epochs, which somehow did not happen in our previous Torch implementation. We thus stop training after 12 epochs.

from monodepth.

JackHenry1992 avatar JackHenry1992 commented on July 29, 2024

Hi, @mrharicot , I also want to evaluate this method for indoor scenes, can you provide some pretrained model and datasets for indoor scenes .
@Aryayay , can you share your results in indoor_scenes predicted by monodepth?
It will be very grateful if you can upload your indoor_dataset, cause I can not find suitable stereo indoor dataset.

from monodepth.

mrharicot avatar mrharicot commented on July 29, 2024

I do not have models nor datasets to share for indoor stereo unfortunately.

from monodepth.

zhangkui669 avatar zhangkui669 commented on July 29, 2024

@JackHenry1992 , my indoor dataset is quiet simple, just use a stereo camera and capture some video.

from monodepth.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.