Comments (9)
@nerdcha I see that there is a script to build the GPU image. But is there any documentation on how I could pull the GPU image without building it from source?
from docker-python.
I think you might need to use nvidia-docker to make use of your GPU.
from docker-python.
GPU support is something that we're going to add soon.
from docker-python.
I was successfully able to build and run the GPU image from gpu.Dockerfile.
However, running a GPU script inside that image I hit this:
THCudaCheck FAIL file=/opt/conda/conda-bld/pytorch_1535491974311/work/aten/src/THC/THCGeneral.cpp line=74 error=35 : CUDA driver version is insufficient for CUDA runtime version
Traceback (most recent call last):
  File "kesnet.py", line 115, in <module>
    display_imgs(np.asarray(md.trn_ds.denorm(x)))
  File "/opt/conda/lib/python3.6/site-packages/fastai/dataset.py", line 252, in denorm
    if type(arr) is not np.ndarray: arr = to_np(arr)
  File "/opt/conda/lib/python3.6/site-packages/fastai/core.py", line 60, in to_np
    if isinstance(v, torch.cuda.HalfTensor): v=v.float()
RuntimeError: cuda runtime error (35) : CUDA driver version is insufficient for CUDA runtime version at /opt/conda/conda-bld/pytorch_1535491974311/work/aten/src/THC/THCGeneral.cpp:74
I am able to run this script on Kaggle.com and outside of the Docker image.
I've tried on 2 machines. One has Nvidia Driver Version: 410.48 and the other has Nvidia Driver Version: 396.54.
I'm not an expert at Docker or CUDA, so this could be something simple I've misconfigured, but I thought I'd post here in case there's an issue or the GPU Docker image isn't ready for mass consumption yet.
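One way to narrow down error 35 is to compare the host driver version with the minimum that the image's CUDA runtime requires. The following is only a sketch under assumptions: `parse_driver_version` and `driver_is_sufficient` are hypothetical helper names, not part of the Kaggle image, and the minimum version has to be looked up in NVIDIA's CUDA release notes for whichever CUDA runtime the image ships.

```python
import re


def parse_driver_version(smi_text):
    """Extract the driver version from nvidia-smi style text.

    Returns a tuple of ints such as (410, 48), or None if no
    "Driver Version: X.Y" string is present.
    """
    match = re.search(r"Driver Version:\s*([0-9]+(?:\.[0-9]+)*)", smi_text)
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


def driver_is_sufficient(driver, minimum):
    """Compare two version tuples; True if driver is at least minimum."""
    return driver >= minimum
```

The text to parse could come from `subprocess.run(["nvidia-smi"], capture_output=True, text=True).stdout` on the host. The `minimum` tuple is something you supply yourself after checking NVIDIA's compatibility table.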
from docker-python.
Hi @breckuh,
Both Nvidia driver versions should be fine.
Have you updated any packages inside your GPU image?
from docker-python.
@rosbo, I have not.
I just downloaded one of the base Nvidia Docker images and was not able to detect the GPU inside even that container ("CUDA_ERROR_NO_DEVICE: no CUDA-capable device is detected"), so my issue appears to be more of a system-wide one and not with this Kaggle image per se. I will keep digging. Thanks!
from docker-python.
@breckuh are you using the docker NVIDIA runtime?
https://github.com/NVIDIA/nvidia-docker
# you will need to add some apt repos first, see link above
sudo apt-get install -y nvidia-docker2
docker run --runtime=nvidia --rm -it kaggle/python-build-gpu /bin/bash
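To confirm that the NVIDIA runtime is actually registered with Docker before running the image, something like the sketch below may help. It assumes that `docker info --format '{{json .Runtimes}}'` prints a JSON object keyed by runtime name. `has_nvidia_runtime` is a hypothetical helper, and the sample strings in the usage note are illustrative, not captured output.

```python
import json


def has_nvidia_runtime(runtimes_json):
    """Return True if the JSON object of Docker runtimes has an "nvidia" key."""
    runtimes = json.loads(runtimes_json)
    return isinstance(runtimes, dict) and "nvidia" in runtimes
```

For example, `has_nvidia_runtime('{"runc": {"path": "runc"}}')` returns False, which would suggest that nvidia-docker2 is not installed or that the Docker daemon has not been restarted since installing it.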
from docker-python.
@rosbo I am not! Thanks. Attempting to install now (but running into unrelated typical dependency problems :) ).
from docker-python.
I was able to get the Docker NVIDIA runtime installed and can run jobs on the GPU in the base NVIDIA CUDA devel image. I can now see the GPUs with nvidia-smi from within the running kaggle/python-build-gpu image, but I still get the "CUDA driver version is insufficient for CUDA runtime version" error message, same as above.
Still investigating...
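Since nvidia-smi sees the GPUs but PyTorch still fails, it may help to check which CUDA version the installed PyTorch build targets and compare it with what the host driver supports. A minimal sketch, where `cuda_report` is a hypothetical helper name; it uses `torch.version.cuda` and `torch.cuda.is_available()`, and returns early if torch is not importable:

```python
def cuda_report():
    """Collect basic CUDA facts as seen by PyTorch.

    Returns a dict; if torch cannot be imported, only
    {"torch_installed": False} is returned.
    """
    try:
        import torch
    except ImportError:
        return {"torch_installed": False}
    return {
        "torch_installed": True,
        "torch_version": torch.__version__,
        # CUDA version this PyTorch build was compiled against (None for CPU builds)
        "built_for_cuda": torch.version.cuda,
        # False when no usable driver/GPU combination is found
        "cuda_available": torch.cuda.is_available(),
    }
```

Running `print(cuda_report())` inside the container shows the `built_for_cuda` value to compare against the driver version reported by nvidia-smi.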
from docker-python.