
Comments (5)

413090531 commented on August 31, 2024

After I changed the CUDA and PyTorch versions, I trained a model with results similar to the paper's. Thanks again for your thoughtful answer; I'm going to close the issue.

from lite-mono.

noahzn commented on August 31, 2024

Hi! Thanks for the feedback.

  1. It is difficult to say where the discrepancy comes from. Different PyTorch or CUDA versions could be one of the reasons, see this link. I have tried training on the TITAN Xp (CUDA11.0, PyTorch 1.7.1) and NVIDIA A40 (CUDA 11.8, PyTorch 1.12.1), and I can get similar or even better results than those reported in the paper. What changes did you make to kitti_dataset.py and mono_dataset.py? Since your results are already very close, I assume that your training is converging to a local minimum.
  2. Monodepth2 also uses shuffle=True, see their code here. This may improve the robustness of the training.
  3. I initialized the networks for training without pre-trained weights. This might improve the performance of the training. However, if you load the pre-trained weights, the initialized weights in the encoder will be overwritten by the ImageNet weights.
  4. You can use tensorboard to check whether the training is going well. Please use this command: tensorboard --logdir ./tmp/your_saved_model_folder. If your training converges to a local minimum, you can simply train again. Or, you may increase the drop_path rate to 0.3 if the training is overfitting. Since Monodepth2 uses nn.ReflectionPad2d (see this link), which is not a deterministic operation, we cannot get exactly the same results every time, even with fixed random seeds. Also, I noticed that training can converge quickly and the best result might be achieved at epoch 16 or 17, so please check all the epochs.
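The caveat about fixed seeds in point 4 can be sketched generically. The snippet below uses only Python's standard library to show what seed fixing does and does not guarantee; the torch-specific calls appear only as comments, and this is an illustration, not code from the Lite-Mono repository:

```python
import random

def set_seed(seed: int) -> None:
    """Fix pseudo-random state so repeated runs draw the same numbers.

    In a PyTorch project one would typically also call:
        torch.manual_seed(seed)
        torch.cuda.manual_seed_all(seed)
        numpy.random.seed(seed)
    Even then, ops with non-deterministic CUDA kernels (e.g. the
    backward pass of nn.ReflectionPad2d) can make two runs differ.
    """
    random.seed(seed)

set_seed(42)
run_a = [random.random() for _ in range(3)]
set_seed(42)
run_b = [random.random() for _ in range(3)]
# Identical seeds give identical draws for deterministic generators.
assert run_a == run_b
```

This is why fixing seeds alone cannot make two Lite-Mono trainings bit-identical: the seed pins the random draws, but a non-deterministic kernel in the middle of the pipeline still varies between runs.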


413090531 commented on August 31, 2024

Hi! Thanks for your detailed reply!

1. I found out that the problem is in mono_dataset.py:

       color_aug = transforms.ColorJitter(
           self.brightness, self.contrast, self.saturation, self.hue)

       color_aug = transforms.ColorJitter.get_params(
           self.brightness, self.contrast, self.saturation, self.hue)

   Your code uses the first form, and I need to use the second. On other computers, loading data exactly as in your code works; I found this is because the code in the forward() part of torchvision's transforms.py differs between versions, so on my machine I need to call .get_params(). Will adding .get_params() affect the experimental results?
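For intuition, the practical difference between the two call styles can be sketched with a toy class (ToyColorJitter is invented for illustration and is not torchvision's implementation): calling the transform object resamples the jitter parameters on every image, while get_params lets you sample them once and apply the identical augmentation to every frame of a training item.

```python
import random

class ToyColorJitter:
    """Toy stand-in for transforms.ColorJitter (illustration only)."""

    def __init__(self, brightness: float):
        self.brightness = brightness

    def __call__(self, pixel: float) -> float:
        # Resamples a brightness factor on EVERY call, so two frames
        # pushed through the same object get different augmentations.
        return pixel * self.get_params(self.brightness)

    @staticmethod
    def get_params(brightness: float) -> float:
        # Sample the random factor once; the caller can reuse it to
        # apply the identical augmentation to a whole frame triplet.
        return random.uniform(1.0 - brightness, 1.0 + brightness)

factor = ToyColorJitter.get_params(0.2)      # sample once ...
frames = [0.5, 0.6, 0.7]
augmented = [p * factor for p in frames]     # ... apply consistently
```

Which of the two call styles torchvision expects depends on the installed version, which is consistent with the version-dependent forward() code described above.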

2. Points 2 and 3 are understood.

3. I will pay attention to the details you mentioned and try training again.

I'll let you know when my training produces similar results. Thanks again!


lonelysheeep commented on August 31, 2024

Hello, thanks for the great code. I downloaded the pretrained model and ran train.py to reproduce Lite-Mono, but I got different results.

      | abs_rel | sq_rel | rmse  | rmse_log | a1    | a2    | a3
paper | 0.107   | 0.765  | 4.561 | 0.183    | 0.886 | 0.963 | 0.983
own   | 0.107   | 0.804  | 4.633 | 0.185    | 0.885 | 0.962 | 0.982

Among them, the second and third metrics (sq_rel and rmse) are quite different from those in the paper. I did not make changes to trainer.py, options.py, depth_encoder.py, or depth_decoder.py. Due to an unknown error on my computer, I changed kitti_dataset.py and mono_dataset.py; the new versions are from Monodepth2.

  1. I want to know why my results are not good. Is it because of kitti_dataset.py and mono_dataset.py?
  2. Why is shuffle in the DataLoader set to True?

         self.train_loader = DataLoader(
             train_dataset, self.opt.batch_size, shuffle=True,
             num_workers=self.opt.num_workers, pin_memory=True, drop_last=True)

  3. Why use def _init_weights(self, m) in the encoder and decoder?
  4. In addition to the settings in options.py and trainer.py, what details do I need to pay attention to in order to reproduce the results of the Lite-Mono model?

Looking forward to your reply, thank you.

Hello, my reproduction code has not been working well. Can you send me your reproduction code and instructions, please?


413090531 commented on August 31, 2024

Hello, my reproduction code is exactly the same as the code uploaded by the author of the paper, with CUDA 11.0 and PyTorch 1.7.1. Please check your code and match these CUDA and PyTorch versions.

