Giter VIP home page Giter VIP logo

magnet's People

Contributors

baegwangbin avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

magnet's Issues

Loss of D-net

Hi, Excellent work!
I found the loss of D-Net is nll = (torch.square(mu - gt_depth) / (2 * var)) + (0.5 * torch.log(var)) in here.
I have two questions below:

  1. will the loss turns negative as log(1e-5) < 0 (take 1e-5 for example)
  2. how long does the training step of D-net take for each dataset?

Thank you for your reply!!

Negative frame index/name

Hi,

Trying to test the multi-view model via

python test_MaGNet.py ./test_scripts/magnet/scannet.txt

but, got FileNotFoundError due to negative image index/name (-15.jpg).
Anyone got the same error?

Btw, monocular model (python test_DNet.py ./test_scripts/dnet/scannet.txt) works fine.

Error During F-Net Training

When I try to run

python train_FNet.py train_scripts/fnet/scannet.txt

I get the following error:

Traceback (most recent call last):
File "MaGNet/venv/lib/python3.8/site-packages/torch/multiprocessing/spawn.py", line 69, in _wrap
fn(i, *args)
File "MaGNet/train_FNet.py", line 238, in main_worker
train(model, args, device=args.gpu)
File "MaGNet/train_FNet.py", line 130, in train
utils.visualize_F(args, ref_img, gt_dmap, gt_dmap_mask, pred_dmap, total_iter)
File "MaGNet/utils/utils.py", line 286, in visualize_F
pred_emap = np.abs(pred_dmap - gt_dmap)
ValueError: operands could not be broadcast together with shapes (480,640) (120,160)

I think the problem is caused by this line:

pred_dmap = F.interpolate(pred_dmap, size=[img.shape[2], img.shape[3]], mode='nearest')

resizing the predicted depth to the image dimensions (which have a 4 times higher resolution than the ground truth depth).

I removed the respective line as a workaround. Does that make sense or am I missing some logic?

GPU?

How many GPUs are needed to reproduce the results?

The meaning of predicted depth

Hello, I'm curious about the meaning of model prediction. Does it represent a relative depth or absolute depth in meters?

A quession about D_net!

thanks for your nice work!
I trained the D_net on DTU dataset, the training loss declined normally(avg depth_error 5mm), but on the validation datase, the loss is high(avg depth_error 50mm). That seems like an overfit,.Could you tell me how to deal with this problem?

Details results of D-Net

Excellent work!
Hi, I'm using another dataset to train and test D-Net! Could you please show us the results of D-Net after training and testing?
like abs_rel abs_diff sq_rel rmse rmse_log irmse log_10 silog a1 a2 a3 NLL for each scene?
Thank you so much!

[SOLVED] _pickle.UnpicklingError: invalid load key, '<'.

In my case, after running ckpts/download.py, the test command makes an error.

...
  File "/home/chung/anaconda3/envs/magnet/lib/python3.6/site-packages/torch/serialization.py", line 585, in load
    return _legacy_load(opened_file, map_location, pickle_module, **pickle_load_args)
  File "/home/chung/anaconda3/envs/magnet/lib/python3.6/site-packages/torch/serialization.py", line 755, in _legacy_load
    magic_number = pickle_module.load(f, **pickle_load_args)
_pickle.UnpicklingError: invalid load key, '<'.

The problem was the .pt file which was not properly downloaded by the provided code (download.py)
Instead, I recommend using gdown to download the .pt files: for example,

## pip install gdown
gdown 1bbzfboj6XkfFhoJ54Iiqc5Ylj95A015M -O ./ckpts/DNET_scannet.pt

or you can run those codes with python easily

import os 
os.system("gdown 1bbzfboj6XkfFhoJ54Iiqc5Ylj95A015M -O ./ckpts/DNET_scannet.pt")
os.system("gdown 1ugDr67UOanpQZMlPopiM8OihUexhPql4 -O ./ckpts/FNET_scannet.pt")
...

and it works. :)

How do you predict the scale of the depth

the current model input only has relative pose. I don't see any input of model related to depth scale (like depth range or absolute value of extrinsics). If a dataset has mixied scales of depth of images (drone data and regular photo), your training/inference can fail.

"_compute_cost_CW" function in homography.py

I'm confused why the normalized pixel coordinates need do the following operation:
src_coords[src_coords > 10.0] = 10.0 src_coords[src_coords < -10.0] = -10.0
The "F.grid_sample" function requires the coordinates in the range of -1 to 1 rather than -10 to 10. The coordinates out of [-1, 1] will be invalid thus produce 0 at the corresponding position of output.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.