Comments (3)
The main error, which is partly crippled in your output, is this:
/usr/bin/ld: ne peut trouver -lcudnn
collect2: error: ld returned 1 exit status
This means, it did not found cuDNN.
from returnn.
You need to set some environment variables, so it can correctly link to cudnn.
You can try to put something like this (for your version of cudnn and your username and location of cudnn) into your ~/.bashrc (assuming that the .so files are in /home/voigtlaender/cudnn_v5/)
#cuddn v5.1
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/voigtlaender/cudnn_v5/
export LIBRARY_PATH=$LIBRARY_PATH:/home/voigtlaender/cudnn_v5/
export CPATH=$CPATH:/home/voigtlaender/cudnn_v5/
from returnn.
Thank you for your responses, it is working now.
from returnn.
Related Issues (20)
- AttributeError: 'DistributedDataParallel' object has no attribute 'num_iterations' HOT 1
- Torch distributed: Every worker reserves memory on GPU 0 HOT 9
- Torch distributed error: ncclSystemError: Call to bind failed : Cannot assign requested address HOT 2
- PyTorch distributed: eval distributed as well
- ReturnnDumpHDFJob bug HOT 4
- PT DistributedDataParallel with mixed precision training HOT 5
- hdf_dump not working with SprintCacheDataset + seq_list_filter_file HOT 5
- param_dropout doesn't work with TF2.4 HOT 3
- RF CausalAttention get_sequence_mask_broadcast bug HOT 3
- PT potential CUDA mem leak? HOT 2
- `psutil` `_read_smaps_file` takes lots of time HOT 4
- Hang in `uvm_ioctl` in kernel HOT 2
- PyTorch CUDA OOM in distributed training HOT 7
- PyTorch distributed training, could not unlink the shared memory file
- PyTorch distributed training, hang in `all_reduce(_has_data ...`, after exception, Timed out waiting 1800000ms for send operation to complete HOT 4
- PyTorch training, some epochs very slow HOT 5
- PyTorch training RuntimeError: cuFFT error: CUFFT_INTERNAL_ERROR HOT 2
- PyTorch collect model statistics
- PyTorch recover after CUDA OOM with restart does not work with CUDA HOT 3
- PyTorch distributed training CPU OOM with sync_on_cpu HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from returnn.