Comments (2)
Hi,
Firstly, thanks a lot for your wonderful work. I'm facing a problem when training the model with 4 GPUs. The DataLoader worked fine, but then I got an error like this:
Any suggestions will be appreciated!
Loaded 22441 images. Totally 717 invalid pairs are filtered
Loaded 491 images. Totally 206 invalid pairs are filtered
-> Local random sampler
Start training:
Loaded 22441 images. Totally 717 invalid pairs are filtered
Loaded 491 images. Totally 206 invalid pairs are filtered
-> Local random sampler
Start training:
Loaded 22441 images. Totally 717 invalid pairs are filtered
Loaded 491 images. Totally 206 invalid pairs are filtered
-> Local random sampler
Start training:
Loaded 22441 images. Totally 717 invalid pairs are filtered
Loaded 491 images. Totally 206 invalid pairs are filtered
-> Local random sampler
Start training:
Traceback (most recent call last):
  File "/project/6064028/tmp/code/idisc/scripts/train_DDP.py", line 288, in <module>
    main_worker(config, args)
  File "/project/6064028/tmp/code/idisc/scripts/train_DDP.py", line 168, in main_worker
    with context as fp, model.no_sync() as no_sync:
  File "/project/6064028/tmp/idisc/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1269, in __getattr__
    raise AttributeError("'{}' object has no attribute '{}'".format(
AttributeError: 'IDisc' object has no attribute 'no_sync'
Traceback (most recent call last):
  File "/project/6064028/tmp/code/idisc/scripts/train_DDP.py", line 288, in <module>
    main_worker(config, args)
  File "/project/6064028/tmp/code/idisc/scripts/train_DDP.py", line 168, in main_worker
    with context as fp, model.no_sync() as no_sync:
  File "/project/6064028/tmp/idisc/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1269, in __getattr__
    raise AttributeError("'{}' object has no attribute '{}'".format(
AttributeError: 'IDisc' object has no attribute 'no_sync'
Hello, I have the same problem and would like to know how you solved it. Could you please tell me?
from idisc.
I've solved this problem. It occurs when distributed training is not actually active: `no_sync()` is a method of PyTorch's DistributedDataParallel wrapper, so a bare `IDisc` model does not have it. My machine is a single node with multiple GPUs, and Slurm is not installed. After I modified the code to run DDP training without Slurm, it works normally. If anyone encounters the same problem, this may help.
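For anyone making the same change, here is a minimal sketch of the two pieces it usually involves. It is not the actual patch from this thread, and I have not checked how `train_DDP.py` reads its environment. The helper names `resolve_dist_env` and `grad_sync_context` are hypothetical. The sketch assumes the script is started by a launcher such as `torchrun`, which sets `RANK`, `WORLD_SIZE` and `LOCAL_RANK`, and falls back to the Slurm variables `SLURM_PROCID`, `SLURM_NTASKS` and `SLURM_LOCALID` when those are present instead.

```python
import contextlib
import os


def resolve_dist_env(env=None):
    """Return (rank, world_size, local_rank) from launcher environment variables.

    Hypothetical helper: prefers the variables set by torchrun, then the
    Slurm ones, and otherwise assumes a single non-distributed process.
    """
    env = os.environ if env is None else env
    if "RANK" in env and "WORLD_SIZE" in env:
        # Set by torchrun on a machine without Slurm.
        return (
            int(env["RANK"]),
            int(env["WORLD_SIZE"]),
            int(env.get("LOCAL_RANK", 0)),
        )
    if "SLURM_PROCID" in env and "SLURM_NTASKS" in env:
        # Set by Slurm for each task of a job step.
        return (
            int(env["SLURM_PROCID"]),
            int(env["SLURM_NTASKS"]),
            int(env.get("SLURM_LOCALID", 0)),
        )
    # No launcher detected: single process, no gradient synchronisation.
    return 0, 1, 0


def grad_sync_context(model, skip_sync):
    """Return model.no_sync() only if the model provides it.

    A model that is not wrapped in DistributedDataParallel has no
    `no_sync`, so a do-nothing context is returned instead of raising
    AttributeError.
    """
    no_sync = getattr(model, "no_sync", None)
    if skip_sync and callable(no_sync):
        return no_sync()
    return contextlib.nullcontext()
```

With something like this in place, the resolved rank and world size would be passed to `torch.distributed.init_process_group`, the model would be wrapped in `torch.nn.parallel.DistributedDataParallel`, and the script would be started with `torchrun --nproc_per_node=4` followed by its usual arguments. The guard in `grad_sync_context` only avoids the crash. Multi-GPU gradient averaging still requires the model to be wrapped in DDP.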