
cuda_tensorflow_opencv's Introduction

Dockerfile with Nvidia GPU support for TensorFlow and OpenCV

Revision: 20220815

1. About

For TensorFlow GPU support, you will need to build (or pull) the cudnn_ version.

The base OS for these container images is Ubuntu, or DockerHub's nvidia/cuda images (themselves based on Ubuntu). More details on the Nvidia base images are available at https://hub.docker.com/r/nvidia/cuda/ . In particular, please note that "By downloading these images, you agree to the terms of the license agreements for NVIDIA software included in the images"; further details on the DockerHub version are at https://docs.nvidia.com/cuda/eula/index.html#attachment-a

Version history:

  • 20191107: builds a non-CUDA version: tensorflow_opencv.
  • 20191210: builds a CuDNN version: cudnn_tensorflow_opencv
  • 20200211: making use of Docker 19.03's GPU support and adding information about the OpenCV builds in the OpenCV_BuildConf directory.
  • 20200327: added Protobuf, WebP, GStreamer and Eigen to the OpenCV build.
  • 20200423: added support for OpenCV 3.4.10 and 4.3.0, and added GStreamer plugins to the build. Also added Nvidia Jetson Nano build steps in the JetsonNano directory.
  • 20200615: TensorFlow is built from source. Note that TensorFlow will not have GPU support unless it was compiled with CUDNN support.
  • 20200803: added PyTorch. Removal of cudnn_ version for CUDA 9.2 with TensorFlow 2.3.0 (minimum needed was 10.1)
  • 20201204: added support for Python 3.7 for TensorFlow 1 builds and Python 3.8 for Tensorflow 2 builds (makes use of the deadsnakes/ppa and changes the default python3). Warning: only pip3 installed packages will work, not apt-get installed ones (Python 3.6 is still the default for Ubuntu 18.04)
  • 20210211: added support for CUDA 11 (using Nvidia's Ubuntu 20.04 based container). CUDA 9.2 and 10.2 versions are still created from Nvidia's Ubuntu 18.04 based containers.
  • 20210414: updated for OpenCV 3.4.14 and 4.5.2, removed cuda_ from build target (and built containers)
  • 20210420: moved base container for CPU builds to be based on Ubuntu 20.04 (from 18.04), limiting GPU TF1 builds to CuDNN7, patching TF1 builds (for Python 3.8 and grpc), all containers provide python 3.8 at minimum.
  • 20210601: Added TF 2.5.0 builds for CUDA 11. Providing both CUDA 11.2 and 11.3 builds, as the Nvidia driver needed to run 11.3 is still in beta (465, with 460 as current stable). v2 tag (same release) includes TF2.5.0 with CUDA 10.2
  • 20210810: Added TF 2.5.1 and OpenCV 3.4.15+4.5.3
  • 20210812: Added TF 2.6.0 and updated CUDA to 11.4.1
  • 20211027: Update OpenCV to 3.4.16+4.5.4
  • 20211029: Update min CUDA 11 version to 11.3 (to match PyTorch requirement), and added a README section about testing PyTorch on GPU
  • 20211220: Added TF 2.6.2 and updated PyTorch. Expected to be the last release to build TF1 or Ubuntu 18.04 based images.
  • 20211222: Added TF 2.7.0 with CUDA 11 only, which removed Ubuntu 18.04 base images.
  • 20220103: Updated OpenCV to 4.5.5.
  • 20220308: Updated Jetson directory (renamed from JetsonNano)
  • 20220318: Added TF 2.8.0 and updated PyTorch
  • 20220329: Added jupyter_to and jupyter_cto builds to DockerHub
  • 20220331: Added Unraid specific releases
  • 20220510: Updated base containers, including Nvidia's new package signing key
  • 20220521: Update to TF 2.9.0
  • 20220525: Update to TF 2.9.1
  • 20220530: Building FFmpeg 4.4.2 and PyTorch 1.11 from source
  • 20220815: Update to OpenCV 4.6.0 and PyTorch 1.12.1

tensorflow_opencv (aka to):

  • Builds containers with TensorFlow and OpenCV. Also installs Jupyter, Keras, numpy, pandas, PyTorch and X11 support.
  • Can be used on systems without a Nvidia GPU; the runDocker.sh script will set up proper X11 passthrough
  • For macOS X11 passthrough, install the latest XQuartz server and activate Security -> Allow connections from network clients (you must log out for it to take effect)
  • Pre-built containers available on DockerHub: https://hub.docker.com/r/datamachines/tensorflow_opencv

cudnn_tensorflow_opencv (aka cto):

  • Builds an Nvidia GPU optimized version of TensorFlow and OpenCV. Also installs Jupyter, Keras, numpy, pandas, PyTorch and X11 support.
  • As of the 20200615 version, both OpenCV and TensorFlow are compiled within the container.
  • OpenCV integrated additional CUDNN support after October 2019, see CUDA backend for the DNN module.
  • For CUDNN, the CUDA backend for DNN module requires CC 5.3 or higher.
  • Pre-built containers available on DockerHub: https://hub.docker.com/r/datamachines/cudnn_tensorflow_opencv
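
For example, to pull the most recent (at the time of this writing) GPU build from DockerHub; the tag follows the naming convention described in section 2:

docker pull datamachines/cudnn_tensorflow_opencv:11.6.2_2.9.1_4.6.0-20220815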

jetson_tensorflow_opencv (see the Jetson directory):

jupyter_to (content in the Jupyter_build directory):

  • Jupyter Notebook container built FROM the tensorflow_opencv (to) container.
  • Pre-built containers available on DockerHub: https://hub.docker.com/r/datamachines/jupyter_to
  • a "user" version (current user's UID and GID are passed to the internal user) can be built using make JN_MODE="-user" jupyter_to

jupyter_cto (content in the Jupyter_build directory):

  • Jupyter Notebook container built FROM the cudnn_tensorflow_opencv (cto) container.
  • Pre-built containers available on DockerHub: https://hub.docker.com/r/datamachines/jupyter_cto
  • a "user" version (current user's UID and GID are passed to the internal user) can be built using make JN_MODE="-user" jupyter_cto

jupyter_to-unraid and jupyter_cto-unraid (specialization of the Jupyter_build builds):

  • Jupyter Notebook containers built FROM the tensorflow_opencv (to) and cudnn_tensorflow_opencv (cto) containers, with a sudo-capable jupyter user using Unraid-specific uid and gid. Comes preconfigured with a password for the UI (dmc)
  • Pre-built containers available on DockerHub: https://hub.docker.com/r/datamachines/jupyter_to-unraid and https://hub.docker.com/r/datamachines/jupyter_cto-unraid
  • Built using make JN_MODE="-unraid" jupyter_all
  • Unraid's templates published and containers available in Unraid's "Community Applications" as "Jupyter-TensorFlow_OpenCV" and "Jupyter-CuDNN_TensorFlow_OpenCV"

The Builds-DockerHub.md file is a quick way of seeing the list of pre-built container images. When available, a "BuildInfo" will give the end user a deeper look at the capabilities and installed versions of a given container, in particular the GPU architectures it was compiled for (see https://en.wikipedia.org/wiki/CUDA#GPUs_supported ). This is useful for deciding whether you would benefit from re-compiling some container(s) for your specific hardware.

It is possible to use those as FROM for your Dockerfile; for example: FROM datamachines/cudnn_tensorflow_opencv:11.3.1_2.8.0_4.5.5-20220318

2. Docker images tag naming

The image tags follow the cuda_tensorflow_opencv naming order. As such, 10.2_1.15.3_3.4.10-20200615 refers to CUDA 10.2, TensorFlow 1.15.3 and OpenCV 3.4.10.

Docker images are also tagged with the date (YYYYMMDD) of the Dockerfile they were built from, added at the end of the tag string (following a dash character), such that cuda_tensorflow_opencv:10.2_1.15.3_3.4.10-20200615 is built from the Dockerfile dated June 15th, 2020.

Similarly, the tensorflow_opencv and cudnn_tensorflow_opencv tags follow the same naming convention.

3. Building the images

Building a GPU container requires nvidia-docker and its nvidia container runtime to be the default. To do so, after installation, add "default-runtime": "nvidia" to the /etc/docker/daemon.json configuration file and restart the docker daemon using sudo systemctl restart docker, as described in https://docs.nvidia.com/dgx/nvidia-container-runtime-upgrade/index.html#using-nv-container-runtime . You can check that it is valid by running docker run --rm -it nvidia/cuda:11.2.1-runtime nvidia-smi from the command line (note that this command does not include --runtime nvidia). If you get details on your GPU(s) setup, nvidia is your default runtime (use docker info for further details).
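
For reference, a minimal /etc/docker/daemon.json making nvidia the default runtime could look like the following sketch (the runtime path may differ on your system):

{
    "default-runtime": "nvidia",
    "runtimes": {
        "nvidia": {
            "path": "nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}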

Building a CPU image requires the nvidia runtime to be disabled: comment out the default-runtime line above and restart docker.

On GPU builds, we make a best effort to specify the GPU architectures; please let us know if some that should be present are missing.

The tag for any image built locally will contain the datamachines/ organization prefix that is found in the publicly released pre-built container images.

Use the provided Makefile by running make to get a list of targets to build:

  • make build_all will build all container images
  • make tensorflow_opencv to build all the tensorflow_opencv container images
  • make cudnn_tensorflow_opencv will build all the cudnn_tensorflow_opencv container images
  • use a direct tag to build a specific version (from the list provided by the call to make); for example, make cudnn_tensorflow_opencv-10.2_2.2.0_4.3.0 will build the datamachines/cudnn_tensorflow_opencv:10.2_2.2.0_4.3.0-20200615 container image (if such a build is available; see the Docker image tag ending and the list of available Docker images to be built for accurate values).

The Builds-DockerHub.md file will give you quick access to the BuildInfo-OpenCV and BuildInfo-TensorFlow (if available) for a given compilation. Building an image takes time, but we encourage you to modify the Dockerfile to reflect your specific needs. If you run a specific make target, you will see the values of the parameters passed to the build; simply set their default ARG values to what matches your needs and compile manually, bypassing make by using a form of docker build --tag="mycto:tag" .

4. A note on supported GPU in the Docker Hub builds

In some cases, a minimum Nvidia driver version is needed to run a specific version of CUDA. Table 1: CUDA Toolkit and Compatible Driver Versions and Table 2: CUDA Toolkit and Minimum Compatible Driver Versions, as well as the nvidia-smi command on your host, will help you determine if a specific version of CUDA will be supported.
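
For example, the following standard nvidia-smi invocation prints just the installed driver version, which you can compare against those tables:

nvidia-smi --query-gpu=driver_version --format=csv,noheader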

It is important to note that not all GPUs are supported in the Docker Hub provided builds. The containers are built for a "compute capability (version)" (as defined in the GPU supported Wikipedia page) of 6.0 and above (i.e. Pascal and above). If you need a different compute capability, please edit the DNN_ARCH_CUDA value in the Makefile to match the architecture that you need to build. Then type make to see the entire list of containers that the release you have obtained can build, and use the exact tag that you want in order to build it locally (on Ubuntu, you will need at least docker and build-essential installed to do this). For example, from the 20210211 release, you can make cudnn_tensorflow_opencv-11.2.0_2.4.1_4.5.1. We cannot promise that a self-built Docker image will build or be functional. Building one such container takes a lot of CPU and can take many hours, so we recommend you build only the target you need.

5. Using the container images

The runDocker.sh script present in the source directory allows users to run the built images. By default, it will set up the X11 passthrough (for Linux and macOS) and give the user a /bin/bash prompt within the running container, as well as mount the calling directory as /dmc. A user can test that X11 is functional by using a simple X command such as xlogo from the command line.

To use it, the full name of the container image should be passed as the CONTAINER_ID environment variable. For example, to use datamachines/cudnn_tensorflow_opencv:10.2_2.2.0_4.3.0-20200615, run CONTAINER_ID=datamachines/cudnn_tensorflow_opencv:10.2_2.2.0_4.3.0-20200615 ./runDocker.sh. Note that runDocker.sh can be called from any location using its full path, so that a user can mount their current working directory as /dmc in the running container in order to access local files.

runDocker.sh can take multiple arguments; running it without any argument will provide a list of those arguments.

As of Docker 19.03, GPU support is native to the container runtime; as such, we have shifted from the use of nvidia-docker to the native docker [...] --gpus all. We understand not every user wants to use all the GPUs installed on their system; to change this option, edit the D_GPUS line in the first few lines of runDocker.sh to reflect the parameters that best fit your system or needs. GPU support is only enabled for the cuda_ and cudnn_ images.

Note that the base container runs as root. If you want to run it as a non-root user, add -u $(id -u):$(id -g) to the docker command line, but ensure that you have access to the directories you will work in. This can be done using the -e command line option of runDocker.sh, as shown below.
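
For example (a sketch; adapt the container tag to one you have pulled or built):

CONTAINER_ID=datamachines/tensorflow_opencv:2.9.1_4.6.0-20220815 ./runDocker.sh -e "-u $(id -u):$(id -g)"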

6. Additional details

  • About OpenCV and GPU: In cuda_tensorflow_opencv (resp. cudnn_tensorflow_opencv), OpenCV is compiled with CUDA (resp. CUDA+CuDNN support), but note that not all of OpenCV's functions are optimized. This is true in particular for some of the contrib code.

  • A note about opencv-contrib-python: The python version of cv2 built within the container already includes the "contrib" code (except the "non-free" portion; see the Makefile for additional details). opencv-contrib-python installs another version of cv2 (as in import cv2), so please be aware that you might lose some of the compiled optimizations.

  • Testing GPU availability for TensorFlow: In the test directory, you will find a tf_hw.py script. You can test it with a cudnn_ container by adapting the following command; a minimal equivalent check is sketched after it:

CONTAINER_ID="datamachines/cudnn_tensorflow_opencv:10.2_1.15.3_4.3.0-20200615" ../runDocker.sh -X -N -c python3 -- /dmc/tf_hw.py

7. Examples of use

7.1. Simple OpenCV picture viewer

If a user places a picture (named pic.jpg) in the directory to be mounted as /dmc, and creates the following example script (naming it display_pic.py3):

import cv2

# Load the picture from the directory mounted as /dmc
img = cv2.imread('pic.jpg')
if img is None:
    raise SystemExit("pic.jpg not found or could not be decoded")
print(img.shape, " ", img.size)
# Display the image in an X11 window; press any key to close it
cv2.imshow('image', img)
cv2.waitKey(0)
cv2.destroyAllWindows()

Then, adapting PATH_TO_RUNDOCKER in CONTAINER_ID=datamachines/cudnn_tensorflow_opencv:10.2_2.2.0_4.3.0-20200615 PATH_TO_RUNDOCKER/runDocker.sh, and running cd /dmc; python3 display_pic.py3 from the provided bash interactive shell, will display the picture from the mounted directory on the user's X11 display.
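
Putting it together (adjust the tag and path for your setup):

CONTAINER_ID=datamachines/cudnn_tensorflow_opencv:10.2_2.2.0_4.3.0-20200615 PATH_TO_RUNDOCKER/runDocker.sh
# then, within the container's bash prompt:
cd /dmc; python3 display_pic.py3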

7.1.1. Using OpenCV DNN

This requires a cudnn_tensorflow_opencv container and the use of a form of the --gpus docker option (ex: docker [...] --gpus all [...])

In your python3 code, make sure to ask OpenCV to use a CUDA backend. This can be achieved by adding code similar to:

import cv2

# Load a model with one of the cv2.dnn readers; readNet() infers the
# framework from the file extensions (the file names here are placeholders)
net = cv2.dnn.readNet('model.weights', 'model.cfg')
net.setPreferableBackend(cv2.dnn.DNN_BACKEND_CUDA)
net.setPreferableTarget(cv2.dnn.DNN_TARGET_CUDA)

You can see more details with this OpenCV tutorial: YOLO - object detection

We note that other targets are available, for example net.setPreferableTarget(cv2.dnn.DNN_TARGET_CUDA_FP16)

7.2. Using GPU TensorFlow in your code (only for cudnn- versions)

Code written for Tensorflow should follow principles described in https://www.tensorflow.org/guide/using_gpu

In particular, the section https://www.tensorflow.org/guide/using_gpu#allowing_gpu_memory_growth might be needed to allow proper use of the GPU's memory:

config = tf.ConfigProto()
config.gpu_options.allow_growth = True
session = tf.Session(config=config, ...)

Note that this often allocates all the GPU memory to one TensorFlow client. If you intend to run multiple TensorFlow containers, limiting the memory available to each container's TensorFlow can be achieved as described in https://stackoverflow.com/questions/34199233/how-to-prevent-tensorflow-from-allocating-the-totality-of-a-gpu-memory by instead specifying the fraction of the GPU memory to be used:

config = tf.ConfigProto()
config.gpu_options.per_process_gpu_memory_fraction=0.125
session = tf.Session(config=config, ...)
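
Note that tf.ConfigProto and tf.Session are TensorFlow 1 APIs. For the TensorFlow 2 builds, the equivalent controls live under tf.config; a minimal sketch (the 1024 MB limit is an arbitrary example):

import tensorflow as tf

gpus = tf.config.list_physical_devices('GPU')
for gpu in gpus:
    # Grow GPU memory use as needed instead of allocating it all up front
    # (must be set before the GPUs are initialized)
    tf.config.experimental.set_memory_growth(gpu, True)
# Alternatively, cap the memory handed to this process on the first GPU:
# tf.config.experimental.set_virtual_device_configuration(
#     gpus[0],
#     [tf.config.experimental.VirtualDeviceConfiguration(memory_limit=1024)])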

The built Docker images do NOT install any models; add/build/download your own in your Dockerfile that is FROM datamachines/cudnn_tensorflow_opencv:10.2_2.2.0_4.3.0-20200615

For example:

FROM datamachines/cudnn_tensorflow_opencv:10.2_2.2.0_4.3.0-20200615

# Download tensorflow object detection models
RUN GIT_SSL_NO_VERIFY=true git clone -q https://github.com/tensorflow/models /usr/local/lib/python3.6/dist-packages/tensorflow/models

# Install downloaded models
ENV PYTHONPATH "$PYTHONPATH:/usr/local/lib/python3.6/dist-packages/tensorflow/models/research:/usr/local/lib/python3.6/dist-packages/tensorflow/models/research/slim"
RUN cd /usr/local/lib/python3.6/dist-packages/tensorflow/models/research && protoc object_detection/protos/*.proto --python_out=.

7.3. Using Jupyter-Notebook (A note on exposing ports)

Note: As of 20220329, jupyter_to (FROM tensorflow_opencv) and jupyter_cto (FROM cudnn_tensorflow_opencv) containers (using the root user), built following the method listed below, are published on DockerHub.

By choice, the core containers built do not expose any ports or start any services; this is left to the end user. To start one, the simplest solution is to base a new container FROM one of those containers, expose a port, and start said service to be able to access it.

For example, to start and expose Jupyter Notebook (on port 8888) from the cudnn_tensorflow_opencv container, one could write the following Dockerfile and tag it as jupnb:local:

FROM datamachines/cudnn_tensorflow_opencv:11.3.1_2.8.0_4.5.5-20220318
EXPOSE 8888
CMD jupyter-notebook --ip=0.0.0.0 --port=8888 --no-browser --allow-root

and build it using docker build --tag jupnb:local .

In order to mount weights within the container, or another persistent location for code and such, use one or more -v /local/directory:/container/mount options to mount your "local directory" inside the running instance at the "container mount" location. In particular, the base FROM container uses /dmc as its WORKDIR; the started notebook will therefore be in this directory. Use -v /path/to/mount:/dmc to make it available as the default in the Jupyter Notebook.

Start the built container using a command similar to docker run --gpus all -p 8888:8888 -v /path:/dmc jupnb:local to: 1) use the GPUs (only available in the cto build), 2) publish the container's port 8888 to the local system's port 8888, and 3) mount /path as /dmc within the container. When the container is started, an http://127.0.0.1:8888/ based URL will be shown with the access token. Using this URL in a web browser will grant access to the running instance of Jupyter Notebook.

To exit Jupyter Notebook, you can either docker kill the running instance or press the Quit button within the WebUI.

7.3.1. Build of an allow-root Jupyter Notebook container

An --allow-root container is what the make jupyter_to or make jupyter_cto command lines are designed to build. They will docker build the Jupyter_build/Dockerfile using a recent FROM (as listed in the output of make), giving a Jupyter Notebook run with --allow-root within the container. This is the container we have started building and releasing on DockerHub as of 20220329.

7.3.2. Build of a user permission Jupyter Notebook container

Note: this is mostly for Linux builds; volumes mounted on macOS's Docker Desktop are already mounted using the user's permissions.

We support the build of a user-permission Jupyter Notebook, i.e. a build where a sudo-capable jupyter user with a specific uid and gid runs the Jupyter Notebook within the container. To support this, we created Jupyter_build/Dockerfile-user.

This container will be named jupyter_to-user and requires a manual build, which is started using (the process is similar for the _cto version):

 make jupyter_to JN_MODE="-user"

By default, the build will obtain the make user's uid and gid and use those when creating the jupyter user within the container. This is to support easy permission management of data mounted within the container at runtime.

If you prefer to control the jupyter user's uid and gid, specify those on the command line at build time:

make jupyter_to JN_MODE="-user" JN_UID="1001" JN_GID="1002"

Please note that the tool will not check if the uid or gid are valid or compatible with the container's base image (Ubuntu).

It is also possible to manually control the FROM image to be another to or cto release by replacing the JUPBC docker build argument; the Makefile is here as a convenience to build versions for the same release. For example, to build a jupyter_to-user:2.8.0_3.4.16 image, with 1003 as the jupyter user's uid and gid:

cd Jupyter_build; docker build --build-arg JUPBC="datamachines/tensorflow_opencv:2.8.0_3.4.16-20220318" --build-arg JUID=1003 --build-arg JGID=1003 -f Dockerfile-user --tag="jupyter_to-user:2.8.0_3.4.16" .

7.3.3. Build of an Unraid specific Jupyter Notebook container

Note: this build is designed to run on an unraid.net server; it specializes the "user" build by using Unraid's specific uid and gid and uses a default password for the WebUI.

Build the to and cto versions using one of:

make jupyter_to JN_MODE="-unraid"
make jupyter_cto JN_MODE="-unraid"

This version uses a default user password (dmc) when started.

7.4. Testing Yolo v4 on your webcam (Linux and GPU only)

It is possible to run YOLOv4 using a custom container, building it from source.

In this example we will build YOLOv4, enabling GPU support (compute capabilities 6.1, 7.5 and 8.6), CUDNN, OPENCV, OPENMP, and the generation of libdarknet.so, which can be used by the darknet.py example.

Copy the following lines into a Dockerfile:

FROM datamachines/cudnn_tensorflow_opencv:11.3.1_2.7.0_4.5.5-20220103

RUN mkdir -p /darknet \
    && wget -q --no-check-certificate -c https://github.com/AlexeyAB/darknet/archive/refs/tags/yolov4.tar.gz -O - | tar --strip-components=1 -xz -C /darknet \
    && cd /darknet \
    && perl -i.bak -pe 's%^(GPU|CUDNN|OPENCV|OPENMP|LIBSO)=0%$1=1%g;s%(compute\_61\])%$1 -gencode arch=compute_75,code=[sm_75,compute_75] -gencode arch=compute_86,code=[sm_86,compute_86]%' Makefile \
    && make

WORKDIR /darknet
CMD /bin/bash

In the same directory where the Dockerfile is, build it using docker build --tag "cto_darknet:local" .

Once the build is completed, download from https://github.com/AlexeyAB/darknet#pre-trained-models the cfg-file and weights-file you intend to use; for our examples, we use yolov4.cfg and yolov4.weights.

From the directory where both files are, run (adapting RUNDOCKERDIR to the location of the script):

CONTAINER_ID="cto_darknet:local" RUNDOCKERDIR/runDocker.sh -e "--privileged -v /dev/video0:/dev/video0" -c /bin/bash

Here, we are telling the script to pass extra (-e) parameters to the docker command line, running in privileged mode (for hardware access) and passing the webcam device (/dev/video0) to the container. By default, this command will also enable X11 display passthrough and mount the current directory (where the cfg and weights files are) as /dmc.

Because the cfg/weights are accessible in /dmc, and X11 and the webcam can be accessed, running the following command within the newly started container (which starts in /darknet) will start your webcam (video0) and run Yolo v4 on what it sees:

./darknet detector demo cfg/coco.data /dmc/yolov4.cfg /dmc/yolov4.weights

For developers, /darknet will also contain libdarknet.so, which is needed to use python3 with darknet.py and darknet_video.py.

7.4.1. Darknet Python bindings

Darknet now provides direct python bindings in the form of darknet_images.py and darknet_video.py. To test those, simply use the previously built container (cto_darknet:local) and run/adapt the following examples (in the default work directory, i.e. /darknet):

darknet_images.py example using one of the provided images:

python3 darknet_images.py --weights /dmc/yolov4.weights --config_file /dmc/yolov4.cfg --input data/horses.jpg

darknet_video.py example using webcam:

python3 darknet_video.py --weights /dmc/yolov4.weights --config_file /dmc/yolov4.cfg

darknet_video.py example using video file:

python3 darknet_video.py --weights /dmc/yolov4.weights --config_file /dmc/yolov4.cfg --input /dmc/video.mp4

Note: the .py source code takes additional options; run with -h to get the command line help.

7.5. Testing PyTorch with CUDA

PyTorch provides examples to test it. Those can be found at https://github.com/pytorch/examples

Here we will test the "Super Resolution" example (for more details, see https://github.com/pytorch/examples/tree/master/super_resolution)

In the directory where the source for cuda_tensorflow_opencv is:

# First, obtain a copy of the examples
git clone --depth 1 https://github.com/pytorch/examples.git pytorch-examples
# Start a recent container (adapt the CONTAINER_ID for your test), this will mount the current working directory as /dmc
CONTAINER_ID="datamachines/cudnn_tensorflow_opencv:11.3.1_2.6.0_4.5.4-20211029" ./runDocker.sh
# Go into the super resolution example directory
cd pytorch-examples/super_resolution
# Train the model (command line copied from the example README.md, will download the dataset the first time) on GPU (remove the --cuda to use CPU)
python main.py --upscale_factor 3 --batchSize 4 --testBatchSize 100 --nEpochs 30 --lr 0.001 --cuda
# Test the trained super resolver (also copied from example README.md) on GPU
python super_resolve.py --input_image dataset/BSDS300/images/test/16077.jpg --model model_epoch_30.pth --output_filename out.png --cuda
# Note1: If you train on GPU you need to test on GPU: ie make sure to use the --cuda in both command lines
# Note2: You can "time python" to see the speedup from your GPU (using --cuda) versus your CPU (without the --cuda)
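
Before training, you can also confirm that PyTorch sees the GPU from within the container with a quick check:

import torch

# True if a CUDA device is visible and usable from PyTorch
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("Device:", torch.cuda.get_device_name(0))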


cuda_tensorflow_opencv's Issues

CUDA 11

Do you have CUDA 11 support avail?

Problem on making tensorflow work with gpu (for 10.2_2.1.0_4.3.0-20200423)

First, Thanks for helping me out last time.

On working with the "datamachines/cudnn_tensorflow_opencv:10.2_2.1.0_4.3.0-20200423" image, I have no problem enabling cuda support, but when I try to use Tensorflow with the gpu it is not able to detect my GPU, i.e. when I enter "import tensorflow as tf;print(len(tf.config.experimental.list_physical_devices('GPU')))" it returns 0

Is there a way to fix it or do I need to download another image with cuda10.1?

Please help me out

If possible, please also mention a way to install darknetpy in any of the images (I think it would be a very good enhancement for ML docker images like this)

Error when running the container on

Hello, I am really thankful for the images and the prebuilt containers.

But I am encountering this error when trying to train a CV model with the prebuilt container datamachines/cudnn_tensorflow_opencv:11.6.2_2.9.1_4.6.0-20220815 (this ERROR is logged on a Vertex AI custom job, as I can't test the setup locally, having no GPU on my local machine)

I was running the job with n1-standard-4 machine with NVIDIA_TESLA_K80 attached.

2023-02-11 18:02:27.461890: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/lib:/usr/local/nvidia/lib:/usr/local/nvidia/lib64\n

2023-02-11 18:02:27.461987: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.\n

It seems like that the environment variable is not set correctly. But I thought the prebuilt container already take care of that.

Any help is really appreciated. Really thankful for your time!

Unable to EXPOSE port

After running the image with cudnn I run the command "jupyter-notebook --ip=0.0.0.0 --port=8888 --no-browser --allow-root" but I am not able to open the notebook in Chrome. Please help me out.

Building Legacy Tensorflow

Is it possible to build with an older version of tensorflow 1.14.0?
I changed the stable version of TF to this ^, but it fails to build and complains:

You have bazel 4.1.0 installed.
Please downgrade your bazel installation to version 0.25.2 or lower to build TensorFlow! To downgrade: download the installer for the old version
(from https://github.com/bazelbuild/bazel/releases) then run the installer.
-- bazel command to run:
bazel build --verbose_failures --config=opt --config=v1 --config=cuda //tensorflow/tools/pip_package:build_pip_package
Starting local Bazel server and connecting to it...

Changing the LATEST_BAZEL version in the makefile doesn't seem to work.

DockerFiles

Hi,
Thanks for providing the builds in hub.docker.
I'm looking to make some changes to one of the images to include NVDEC in the opencv build and be able to use cv2.cudacodec.createVideoReader
Are the original DockerFile available somewhere?
Thanks

Support for the latest version of PyTorch 1.8.0, Tensorflow 2.3.1 and OpenCV 4.5.0

@mmartial - Stumbled across your repo and found it to be quite handy! Thanks for putting this together.

We're looking to create a docker image that contains the latest combinations of:
-Tensorflow 2.3.1
-Cuda 10.1, 10.2, 11.0
-PyTorch 1.7.0 and corresponding versions of torchvision (unsure which version is correct)

  • OpenCV 4.5.0

Any chance you could add support for these combinations?

Thanks!

1.15.2 images are built without tensorflow-gpu (irrespective of CUDA or CUDNN)

Looks like all 1.15.2 images are built with tensorflow and not tensorflow-gpu.

In Makefile:
@$(eval CTO_TENSORFLOW_PYTHON=$(shell if [ "A${CTO_TMP}" == "A${STABLE_TF1}" ]; then echo "tensorflow==${STABLE_TF1}"; else if [ ${CTO_SC} == 1 ]; then echo "tensorflow==${STABLE_TF2}"; else echo "tensorflow-gpu==${STABLE_TF2}"; fi; fi))

Is this intentional?

UNRAID build: change webui password

Hi! Thanks for this.

I am wondering if you could help me find out how to change the password of the webui?

I've tried these things:

  • adding a new variable within the Unraid template
  • manually changing the value in /home/jupyter/.jupyter/jupyter_notebook_config.json

.. to no avail.

Any idea? Thanks

Problem with jetsonnano-cuda_tensorflow_opencv:10.0_2.1.0_4.3.0-20200515

Thanks for providing these docker-images. I hoped to get around compiling opencv ;-)
I used the docker image jetsonnano-cuda_tensorflow_opencv:10.0_2.1.0_4.3.0-20200515

But when doing
import cv2 in python3.6 I get the error libnppc.so.10 not found.
I assume the base-nvidia docker has upgraded to jetpack 4.4 which uses cuda 10.2
(instead of jetpack 4.3 which uses cuda 10.0)

Might I be right?
Is the solution to use another base-container in Makefile/Dockerfile: instead of
nvcr.io/nvidia/deepstream-l4t:4.0.2-19.12-base resp nvcr.io/nvidia/l4t-base:r32.3.1?

I cannot figure out what to use instead as the nvcr-catalog is ... well I cannot find anything there :-(

Any help is appreciated.
Thanks

Docker build fail for Jetson Nano 20210218

Hello,

My docker build fail at step 27/41

 ---> Running in f4cf6fcad4d7
Traceback (most recent call last):
  File "setup.py", line 12, in <module>
    import torch
  File "/usr/local/lib/python3.6/dist-packages/torch/__init__.py", line 189, in <module>
    _load_global_deps()
  File "/usr/local/lib/python3.6/dist-packages/torch/__init__.py", line 142, in _load_global_deps
    ctypes.CDLL(lib_path, mode=ctypes.RTLD_GLOBAL)
  File "/usr/lib/python3.6/ctypes/__init__.py", line 348, in __init__
    self._handle = _dlopen(self._name, mode)
OSError: libcurand.so.10: cannot open shared object file: No such file or directory
The command '/bin/sh -c mkdir -p /usr/local/src/torchvision   && wget -q --no-check-certificate https://github.com/pytorch/vision/archive/v0.8.2.tar.gz -O - | tar --strip-components=1 -xz -C /usr/local/src/torchvision   && cd /usr/local/src/torchvision   && python3 setup.py install    && rm -rf /root/.cache/pip /usr/local/src/torchvision' returned a non-zero code: 1
Makefile:95: recipe for target 'actual_build' failed
make[2]: *** [actual_build] Error 1
make[2]: Leaving directory '/home/lisalink/cuda_tensorflow_opencv/JetsonNano'
Makefile:68: recipe for target 'build_prep' failed
make[1]: *** [build_prep] Error 2
make[1]: Leaving directory '/home/lisalink/cuda_tensorflow_opencv/JetsonNano'
Makefile:65: recipe for target 'jetsonnano-cuda_tensorflow_opencv-10.2_2.3_4.5.1' failed
make: *** [jetsonnano-cuda_tensorflow_opencv-10.2_2.3_4.5.1] Error 2

I installed Jetpack 4.5.0 without doing anything else. Not even apt update or apt upgrade
Please can you help with that issue?

Unraid python3 packages on restart

Pardon my ignorance.

I have to pip install some packages for my use case. As some of you know, after a docker restart the packages are not there anymore and have to be reinstalled.

Is there a way to keep my installed packages (apt-get or pip) to remain persistent on docker reboot?

Thank you!

Build failed - make cudnn_tensorflow_opencv-10.2_2.4.1_4.5.2

Hi
I would like to build an image with cuda support 10.2 - tf 2.4.1 and opencv 4.5.2 but the build failed when compiling tensorflow.
Thanks for your help

[18,609 / 24,292] Compiling tensorflow/core/grappler/optimizers/arithmetic_optimizer.cc [for host]; 16s local ... (12 actions running)
[18,875 / 24,292] Compiling tensorflow/core/kernels/conv_2d_gpu_uint8.cu.cc [for host]; 84s local ... (12 actions running)
[19,033 / 24,292] Compiling tensorflow/core/kernels/pad_op_gpu.cu.cc [for host]; 189s local ... (12 actions running)
[19,153 / 24,292] Compiling tensorflow/core/kernels/pad_op_gpu.cu.cc [for host]; 636s local ... (12 actions running)
[19,154 / 24,292] Compiling tensorflow/core/kernels/pad_op_gpu.cu.cc [for host]; 1168s local ... (12 actions, 11 running)
ERROR: /usr/local/src/tensorflow/tensorflow/core/kernels/linalg/BUILD:193:18: C++ compilation of rule '//tensorflow/core/kernels/linalg:matrix_square_root_op' failed (Exit 4): crosstool_wrapper_driver_is_not_gcc failed: error executing command
(cd /root/.cache/bazel/bazel_root/bbcc73fcc5c2b01ab08b6bcf7c29e42e/execroot/org_tensorflow &&
exec env -
LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64
PATH=/root/.cache/bazelisk/downloads/bazelbuild/bazel-3.7.2-linux-x86_64/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
PWD=/proc/self/cwd
external/local_config_cuda/crosstool/clang/bin/crosstool_wrapper_driver_is_not_gcc -MD -MF bazel-out/host/bin/tensorflow/core/kernels/linalg/objs/matrix_square_root_op/matrix_square_root_op.pic.d '-frandom-seed=bazel-out/host/bin/tensorflow/core/kernels/linalg/objs/matrix_square_root_op/matrix_square_root_op.pic.o' -DTENSORFLOW_USE_CUSTOM_CONTRACTION_KERNEL -DTENSORFLOW_USE_MKLDNN_CONTRACTION_KERNEL -DHAVE_SYS_UIO_H -DTF_USE_SNAPPY -DCURL_STATICLIB -DPLATFORM_LINUX -DENABLE_CURL_CLIENT -DOPENSSL_IS_BORINGSSL -DEIGEN_MPL2_ONLY '-DEIGEN_MAX_ALIGN_BYTES=64' '-DEIGEN_HAS_TYPE_TRAITS=0' -D__CLANG_SUPPORT_DYN_ANNOTATION -iquote . -iquote bazel-out/host/bin -iquote external/com_google_absl -iquote bazel-out/host/bin/external/com_google_absl -iquote external/nsync -iquote bazel-out/host/bin/external/nsync -iquote external/eigen_archive -iquote bazel-out/host/bin/external/eigen_archive -iquote external/gif -iquote bazel-out/host/bin/external/gif -iquote external/libjpeg_turbo -iquote bazel-out/host/bin/external/libjpeg_turbo -iquote external/com_google_protobuf -iquote bazel-out/host/bin/external/com_google_protobuf -iquote external/com_googlesource_code_re2 -iquote bazel-out/host/bin/external/com_googlesource_code_re2 -iquote external/farmhash_archive -iquote bazel-out/host/bin/external/farmhash_archive -iquote external/fft2d -iquote bazel-out/host/bin/external/fft2d -iquote external/highwayhash -iquote bazel-out/host/bin/external/highwayhash -iquote external/zlib -iquote bazel-out/host/bin/external/zlib -iquote external/local_config_cuda -iquote bazel-out/host/bin/external/local_config_cuda -iquote external/local_config_tensorrt -iquote bazel-out/host/bin/external/local_config_tensorrt -iquote external/double_conversion -iquote bazel-out/host/bin/external/double_conversion -iquote external/snappy -iquote bazel-out/host/bin/external/snappy -iquote external/curl -iquote bazel-out/host/bin/external/curl -iquote external/boringssl -iquote bazel-out/host/bin/external/boringssl -iquote external/jsoncpp_git -iquote bazel-out/host/bin/external/jsoncpp_git -iquote external/aws -iquote bazel-out/host/bin/external/aws -iquote external/aws-c-common -iquote bazel-out/host/bin/external/aws-c-common -iquote external/aws-c-event-stream -iquote bazel-out/host/bin/external/aws-c-event-stream -iquote external/aws-checksums -iquote bazel-out/host/bin/external/aws-checksums -iquote external/mkl_dnn -iquote bazel-out/host/bin/external/mkl_dnn -Ibazel-out/host/bin/external/local_config_cuda/cuda/virtual_includes/cuda_headers_virtual -Ibazel-out/host/bin/external/local_config_tensorrt/virtual_includes/tensorrt_headers -Ibazel-out/host/bin/external/local_config_cuda/cuda/virtual_includes/cudnn_header -Ibazel-out/host/bin/external/local_config_cuda/cuda/virtual_includes/cublas_headers_virtual -Ibazel-out/host/bin/external/local_config_cuda/cuda/virtual_includes/cusolver_headers_virtual -isystem external/nsync/public -isystem bazel-out/host/bin/external/nsync/public -isystem external/eigen_archive -isystem bazel-out/host/bin/external/eigen_archive -isystem external/gif -isystem bazel-out/host/bin/external/gif -isystem external/com_google_protobuf/src -isystem bazel-out/host/bin/external/com_google_protobuf/src -isystem external/farmhash_archive/src -isystem bazel-out/host/bin/external/farmhash_archive/src -isystem external/zlib -isystem bazel-out/host/bin/external/zlib -isystem external/local_config_cuda/cuda -isystem bazel-out/host/bin/external/local_config_cuda/cuda -isystem external/local_config_cuda/cuda/cuda/include 
-isystem bazel-out/host/bin/external/local_config_cuda/cuda/cuda/include -isystem external/double_conversion -isystem bazel-out/host/bin/external/double_conversion -isystem external/curl/include -isystem bazel-out/host/bin/external/curl/include -isystem external/boringssl/src/include -isystem bazel-out/host/bin/external/boringssl/src/include -isystem external/jsoncpp_git/include -isystem bazel-out/host/bin/external/jsoncpp_git/include -isystem external/aws/aws-cpp-sdk-core/include -isystem bazel-out/host/bin/external/aws/aws-cpp-sdk-core/include -isystem external/aws/aws-cpp-sdk-s3/include -isystem bazel-out/host/bin/external/aws/aws-cpp-sdk-s3/include -isystem external/aws/aws-cpp-sdk-transfer/include -isystem bazel-out/host/bin/external/aws/aws-cpp-sdk-transfer/include -isystem external/aws-c-common/include -isystem bazel-out/host/bin/external/aws-c-common/include -isystem external/aws-c-event-stream/include -isystem bazel-out/host/bin/external/aws-c-event-stream/include -isystem external/aws-checksums/include -isystem bazel-out/host/bin/external/aws-checksums/include -isystem external/local_config_cuda/cuda/cublas/include -isystem bazel-out/host/bin/external/local_config_cuda/cuda/cublas/include -isystem external/local_config_cuda/cuda/cusolver/include -isystem bazel-out/host/bin/external/local_config_cuda/cuda/cusolver/include -isystem external/mkl_dnn/include -isystem bazel-out/host/bin/external/mkl_dnn/include -isystem external/mkl_dnn/src -isystem bazel-out/host/bin/external/mkl_dnn/src -isystem external/mkl_dnn/src/common -isystem bazel-out/host/bin/external/mkl_dnn/src/common -isystem external/mkl_dnn/src/cpu -isystem bazel-out/host/bin/external/mkl_dnn/src/cpu -isystem external/mkl_dnn/src/cpu/gemm -isystem bazel-out/host/bin/external/mkl_dnn/src/cpu/gemm -isystem external/mkl_dnn/src/cpu/xbyak -isystem bazel-out/host/bin/external/mkl_dnn/src/cpu/xbyak -Wno-builtin-macro-redefined '-D__DATE="redacted"' '-D__TIMESTAMP="redacted"' '-D__TIME
="redacted"' -fPIC -U_FORTIFY_SOURCE '-D_FORTIFY_SOURCE=1' -fstack-protector -Wall -fno-omit-frame-pointer -no-canonical-prefixes -fno-canonical-system-headers -DNDEBUG -g0 -O2 -ffunction-sections -fdata-sections -g0 -w -Wno-sign-compare -g0 '-std=c++14' -DEIGEN_AVOID_STL_ARRAY -Iexternal/gemmlowp -Wno-sign-compare '-ftemplate-depth=900' -fno-exceptions '-DGOOGLE_CUDA=1' '-DTENSORFLOW_USE_NVCC=1' -msse3 -pthread '-DGOOGLE_CUDA=1' -c tensorflow/core/kernels/linalg/matrix_square_root_op.cc -o bazel-out/host/bin/tensorflow/core/kernels/linalg/_objs/matrix_square_root_op/matrix_square_root_op.pic.o)
Execution platform: @local_execution_config_platform//:platform
x86_64-linux-gnu-gcc-7: internal compiler error: Killed (program cc1plus)
Please submit a full bug report,
with preprocessed source if appropriate.
See file:///usr/share/doc/gcc-7/README.Bugs for instructions.
Target //tensorflow/tools/pip_package:build_pip_package failed to build
ERROR: /usr/local/src/tensorflow/tensorflow/tools/pip_package/BUILD:69:10 C++ compilation of rule '//tensorflow/core/kernels/linalg:matrix_square_root_op' failed (Exit 4): crosstool_wrapper_driver_is_not_gcc failed: error executing command
(cd /root/.cache/bazel/bazel_root/bbcc73fcc5c2b01ab08b6bcf7c29e42e/execroot/org_tensorflow &&
exec env -
LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64
PATH=/root/.cache/bazelisk/downloads/bazelbuild/bazel-3.7.2-linux-x86_64/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
PWD=/proc/self/cwd
external/local_config_cuda/crosstool/clang/bin/crosstool_wrapper_driver_is_not_gcc -MD -MF bazel-out/host/bin/tensorflow/core/kernels/linalg/objs/matrix_square_root_op/matrix_square_root_op.pic.d '-frandom-seed=bazel-out/host/bin/tensorflow/core/kernels/linalg/objs/matrix_square_root_op/matrix_square_root_op.pic.o' -DTENSORFLOW_USE_CUSTOM_CONTRACTION_KERNEL -DTENSORFLOW_USE_MKLDNN_CONTRACTION_KERNEL -DHAVE_SYS_UIO_H -DTF_USE_SNAPPY -DCURL_STATICLIB -DPLATFORM_LINUX -DENABLE_CURL_CLIENT -DOPENSSL_IS_BORINGSSL -DEIGEN_MPL2_ONLY '-DEIGEN_MAX_ALIGN_BYTES=64' '-DEIGEN_HAS_TYPE_TRAITS=0' -D__CLANG_SUPPORT_DYN_ANNOTATION -iquote . -iquote bazel-out/host/bin -iquote external/com_google_absl -iquote bazel-out/host/bin/external/com_google_absl -iquote external/nsync -iquote bazel-out/host/bin/external/nsync -iquote external/eigen_archive -iquote bazel-out/host/bin/external/eigen_archive -iquote external/gif -iquote bazel-out/host/bin/external/gif -iquote external/libjpeg_turbo -iquote bazel-out/host/bin/external/libjpeg_turbo -iquote external/com_google_protobuf -iquote bazel-out/host/bin/external/com_google_protobuf -iquote external/com_googlesource_code_re2 -iquote bazel-out/host/bin/external/com_googlesource_code_re2 -iquote external/farmhash_archive -iquote bazel-out/host/bin/external/farmhash_archive -iquote external/fft2d -iquote bazel-out/host/bin/external/fft2d -iquote external/highwayhash -iquote bazel-out/host/bin/external/highwayhash -iquote external/zlib -iquote bazel-out/host/bin/external/zlib -iquote external/local_config_cuda -iquote bazel-out/host/bin/external/local_config_cuda -iquote external/local_config_tensorrt -iquote bazel-out/host/bin/external/local_config_tensorrt -iquote external/double_conversion -iquote bazel-out/host/bin/external/double_conversion -iquote external/snappy -iquote bazel-out/host/bin/external/snappy -iquote external/curl -iquote bazel-out/host/bin/external/curl -iquote external/boringssl -iquote bazel-out/host/bin/external/boringssl -iquote external/jsoncpp_git -iquote bazel-out/host/bin/external/jsoncpp_git -iquote external/aws -iquote bazel-out/host/bin/external/aws -iquote external/aws-c-common -iquote bazel-out/host/bin/external/aws-c-common -iquote external/aws-c-event-stream -iquote bazel-out/host/bin/external/aws-c-event-stream -iquote external/aws-checksums -iquote bazel-out/host/bin/external/aws-checksums -iquote external/mkl_dnn -iquote bazel-out/host/bin/external/mkl_dnn -Ibazel-out/host/bin/external/local_config_cuda/cuda/virtual_includes/cuda_headers_virtual -Ibazel-out/host/bin/external/local_config_tensorrt/virtual_includes/tensorrt_headers -Ibazel-out/host/bin/external/local_config_cuda/cuda/virtual_includes/cudnn_header -Ibazel-out/host/bin/external/local_config_cuda/cuda/virtual_includes/cublas_headers_virtual -Ibazel-out/host/bin/external/local_config_cuda/cuda/virtual_includes/cusolver_headers_virtual -isystem external/nsync/public -isystem bazel-out/host/bin/external/nsync/public -isystem external/eigen_archive -isystem bazel-out/host/bin/external/eigen_archive -isystem external/gif -isystem bazel-out/host/bin/external/gif -isystem external/com_google_protobuf/src -isystem bazel-out/host/bin/external/com_google_protobuf/src -isystem external/farmhash_archive/src -isystem bazel-out/host/bin/external/farmhash_archive/src -isystem external/zlib -isystem bazel-out/host/bin/external/zlib -isystem external/local_config_cuda/cuda -isystem bazel-out/host/bin/external/local_config_cuda/cuda -isystem external/local_config_cuda/cuda/cuda/include 
-isystem bazel-out/host/bin/external/local_config_cuda/cuda/cuda/include -isystem external/double_conversion -isystem bazel-out/host/bin/external/double_conversion -isystem external/curl/include -isystem bazel-out/host/bin/external/curl/include -isystem external/boringssl/src/include -isystem bazel-out/host/bin/external/boringssl/src/include -isystem external/jsoncpp_git/include -isystem bazel-out/host/bin/external/jsoncpp_git/include -isystem external/aws/aws-cpp-sdk-core/include -isystem bazel-out/host/bin/external/aws/aws-cpp-sdk-core/include -isystem external/aws/aws-cpp-sdk-s3/include -isystem bazel-out/host/bin/external/aws/aws-cpp-sdk-s3/include -isystem external/aws/aws-cpp-sdk-transfer/include -isystem bazel-out/host/bin/external/aws/aws-cpp-sdk-transfer/include -isystem external/aws-c-common/include -isystem bazel-out/host/bin/external/aws-c-common/include -isystem external/aws-c-event-stream/include -isystem bazel-out/host/bin/external/aws-c-event-stream/include -isystem external/aws-checksums/include -isystem bazel-out/host/bin/external/aws-checksums/include -isystem external/local_config_cuda/cuda/cublas/include -isystem bazel-out/host/bin/external/local_config_cuda/cuda/cublas/include -isystem external/local_config_cuda/cuda/cusolver/include -isystem bazel-out/host/bin/external/local_config_cuda/cuda/cusolver/include -isystem external/mkl_dnn/include -isystem bazel-out/host/bin/external/mkl_dnn/include -isystem external/mkl_dnn/src -isystem bazel-out/host/bin/external/mkl_dnn/src -isystem external/mkl_dnn/src/common -isystem bazel-out/host/bin/external/mkl_dnn/src/common -isystem external/mkl_dnn/src/cpu -isystem bazel-out/host/bin/external/mkl_dnn/src/cpu -isystem external/mkl_dnn/src/cpu/gemm -isystem bazel-out/host/bin/external/mkl_dnn/src/cpu/gemm -isystem external/mkl_dnn/src/cpu/xbyak -isystem bazel-out/host/bin/external/mkl_dnn/src/cpu/xbyak -Wno-builtin-macro-redefined '-D__DATE="redacted"' '-D__TIMESTAMP="redacted"' '-D__TIME
="redacted"' -fPIC -U_FORTIFY_SOURCE '-D_FORTIFY_SOURCE=1' -fstack-protector -Wall -fno-omit-frame-pointer -no-canonical-prefixes -fno-canonical-system-headers -DNDEBUG -g0 -O2 -ffunction-sections -fdata-sections -g0 -w -Wno-sign-compare -g0 '-std=c++14' -DEIGEN_AVOID_STL_ARRAY -Iexternal/gemmlowp -Wno-sign-compare '-ftemplate-depth=900' -fno-exceptions '-DGOOGLE_CUDA=1' '-DTENSORFLOW_USE_NVCC=1' -msse3 -pthread '-DGOOGLE_CUDA=1' -c tensorflow/core/kernels/linalg/matrix_square_root_op.cc -o bazel-out/host/bin/tensorflow/core/kernels/linalg/_objs/matrix_square_root_op/matrix_square_root_op.pic.o)
Execution platform: @local_execution_config_platform//:platform
INFO: Elapsed time: 3964.637s, Critical Path: 1182.82s
INFO: 19166 processes: 8754 internal, 10412 local.
FAILED: Build did NOT complete successfully
FAILED: Build did NOT complete successfully
Command exited with non-zero status 1
0.30user 3.15system 1:06:07elapsed 0%CPU (0avgtext+0avgdata 12632maxresident)k
10141048inputs+40outputs (42403major+19178minor)pagefaults 0swaps

Docker builds failing on Jetson NX

Hi, I'm not sure if this is meant to work on the NX too, or if you've only tested on Nanos?

I've cloned the repo to the NX, and tried building with

make tensorflow_opencv
and then
make cudnn_tensorflow_opencv

The first gave:

Step 24/42 : RUN curl -s -Lo /usr/local/bin/bazel https://github.com/bazelbuild/bazelisk/releases/download/v${LATEST_BAZELISK}/bazelisk-linux-amd64   && chmod +x /usr/local/bin/bazel   && mkdir -p /usr/local/src/tensorflow   && cd /usr/local/src   && wget -q --no-check-certificate -c https://github.com/tensorflow/tensorflow/archive/v${CTO_TENSORFLOW_VERSION}.tar.gz -O - | tar --strip-components=1 -xz -C /usr/local/src/tensorflow   && cd /usr/local/src/tensorflow   && fgrep _TF_MAX_BAZEL configure.py | grep '=' | perl -ne '$lb="'${LATEST_BAZEL}'";$brv=$1 if (m%\=\s+.([\d\.]+).$+%); sub numit{@g=split(m%\.%,$_[0]);return(1000000*$g[0]+1000*$g[1]+$g[2]);}; if (&numit($brv) > &numit($lb)) { print "$lb" } else {print "$brv"};' > .bazelversion   && bazel clean   && chmod +x /tmp/tf_build.sh   && time /tmp/tf_build.sh ${CTO_TF_CUDNN} ${CTO_TF_OPT}   && time ./bazel-bin/tensorflow/tools/pip_package/build_pip_package /tmp/tensorflow_pkg   && time pip3 install /tmp/tensorflow_pkg/tensorflow-*.whl   && rm -rf /usr/local/src/tensorflow /tmp/tensorflow_pkg /tmp/bazel_check.pl /tmp/tf_build.sh /tmp/hsperfdata_root /root/.cache/bazel /root/.cache/pip /root/.cache/bazelisk
 ---> Running in 3059bbf7dc96
/bin/sh: 1: bazel: Exec format error
The command '/bin/sh -c curl -s -Lo /usr/local/bin/bazel https://github.com/bazelbuild/bazelisk/releases/download/v${LATEST_BAZELISK}/bazelisk-linux-amd64   && chmod +x /usr/local/bin/bazel   && mkdir -p /usr/local/src/tensorflow   && cd /usr/local/src   && wget -q --no-check-certificate -c https://github.com/tensorflow/tensorflow/archive/v${CTO_TENSORFLOW_VERSION}.tar.gz -O - | tar --strip-components=1 -xz -C /usr/local/src/tensorflow   && cd /usr/local/src/tensorflow   && fgrep _TF_MAX_BAZEL configure.py | grep '=' | perl -ne '$lb="'${LATEST_BAZEL}'";$brv=$1 if (m%\=\s+.([\d\.]+).$+%); sub numit{@g=split(m%\.%,$_[0]);return(1000000*$g[0]+1000*$g[1]+$g[2]);}; if (&numit($brv) > &numit($lb)) { print "$lb" } else {print "$brv"};' > .bazelversion   && bazel clean   && chmod +x /tmp/tf_build.sh   && time /tmp/tf_build.sh ${CTO_TF_CUDNN} ${CTO_TF_OPT}   && time ./bazel-bin/tensorflow/tools/pip_package/build_pip_package /tmp/tensorflow_pkg   && time pip3 install /tmp/tensorflow_pkg/tensorflow-*.whl   && rm -rf /usr/local/src/tensorflow /tmp/tensorflow_pkg /tmp/bazel_check.pl /tmp/tf_build.sh /tmp/hsperfdata_root /root/.cache/bazel /root/.cache/pip /root/.cache/bazelisk' returned a non-zero code: 2
Makefile:202: recipe for target 'actual_build' failed
make[3]: *** [actual_build] Error 2
make[3]: Leaving directory '/home/chicken/datamachines/cuda_tensorflow_opencv'
Makefile:145: recipe for target 'build_prep' failed
make[2]: *** [build_prep] Error 2
make[2]: Leaving directory '/home/chicken/datamachines/cuda_tensorflow_opencv'
Makefile:138: recipe for target 'tensorflow_opencv-1.15.5_3.4.14' failed
make[1]: *** [tensorflow_opencv-1.15.5_3.4.14] Error 2
make[1]: Leaving directory '/home/chicken/datamachines/cuda_tensorflow_opencv'
Makefile:132: recipe for target 'tensorflow_opencv' failed
make: *** [tensorflow_opencv] Error 2

And the second gave:

Step 4/43 : RUN apt-get update -y --fix-missing  && apt-get install -y --no-install-recommends     apt-utils     locales     wget     ca-certificates   && apt-get clean
 ---> Running in 52221b94691a
standard_init_linux.go:211: exec user process caused "exec format error"
The command '/bin/sh -c apt-get update -y --fix-missing  && apt-get install -y --no-install-recommends     apt-utils     locales     wget     ca-certificates   && apt-get clean' returned a non-zero code: 1
Makefile:202: recipe for target 'actual_build' failed
make[3]: *** [actual_build] Error 1
make[3]: Leaving directory '/home/chicken/datamachines/cuda_tensorflow_opencv'
Makefile:145: recipe for target 'build_prep' failed
make[2]: *** [build_prep] Error 2
make[2]: Leaving directory '/home/chicken/datamachines/cuda_tensorflow_opencv'
Makefile:141: recipe for target 'cudnn_tensorflow_opencv-9.2_1.15.5_3.4.14' failed
make[1]: *** [cudnn_tensorflow_opencv-9.2_1.15.5_3.4.14] Error 2
make[1]: Leaving directory '/home/chicken/datamachines/cuda_tensorflow_opencv'
Makefile:135: recipe for target 'cudnn_tensorflow_opencv' failed
make: *** [cudnn_tensorflow_opencv] Error 2

Did I miss a step?

cudnn_tensorflow_opencv docker image does not detect GPU when running opencv dnn module

Hi, I need opencv's dnn module to construct a machine learning model and do a single forward pass. When I run the python code without the cudnn_tensorflow_opencv docker image, the video I am processing takes about 45 seconds. When I use the cudnn_tensorflow_opencv docker image it takes the same time. I can monitor the GPU activity through nvidia-smi: it is not used. Also, opencv (or anything else) does not print any info about any found GPU. I use docker version > 20, and I also used the flag --gpus all just in case. Nothing happens. How can I use the GPU with opencv's dnn module?

RuntimeError: CUDA runtime implicit initialization on GPU:0 failed. Status: device kernel image is invalid

Hi,
Thanks for sharing the images and comprehensive instructions! They are very helpful!

When I use this image datamachines/cudnn_tensorflow_opencv:10.2_2.3.1_4.5.0-20201204, tf will have problems. In fact when I run the two following lines from python,
from tensorflow.python.client import device_lib
print(device_lib.list_local_devices())
it will result in this error:

  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python3.8/dist-packages/tensorflow/python/framework/test_util.py", line 131, in gpu_device_name
    for x in device_lib.list_local_devices():
  File "/usr/local/lib/python3.8/dist-packages/tensorflow/python/client/device_lib.py", line 43, in list_local_devices
    _convert(s) for s in _pywrap_device_lib.list_devices(serialized_config)
RuntimeError: CUDA runtime implicit initialization on GPU:0 failed. Status: device kernel image is invalid
