Giter VIP home page Giter VIP logo

Comments (2)

TengHuo avatar TengHuo commented on July 20, 2024

Pytorch 1.0.1的错误与1.1的不一样

Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Core was generated by `python -m emg.app --model cnn --gesture_num 8 --lr 0.01 --epoch 200 --train_bat'.
Program terminated with signal SIGFPE, Arithmetic exception.

#0  0x00007ff0473f991a in cudnn::gemm::conv2dFindConfig(cudnnContext*, cudnnTensor4dStruct*, cudnnFilter4dStruct*, cudnnConvolutionStruct*, cudnnTensor4dStruct*, cudnn::gemm::Conv2dType_t, cudnn::gemm::Conv2dConfig&)
    () from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
[Current thread is 1 (Thread 0x7ff01d499700 (LWP 4472))]
(gdb) bt
#0  0x00007ff0473f991a in cudnn::gemm::conv2dFindConfig(cudnnContext*, cudnnTensor4dStruct*, cudnnFilter4dStruct*, cudnnConvolutionStruct*, cudnnTensor4dStruct*, cudnn::gemm::Conv2dType_t, cudnn::gemm::Conv2dConfig&)
    () from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#1  0x00007ff047459018 in cudnnGetConvolution4dBackwardDataWorkspaceSize_internal(cudnnContext*, cudnnFilter4dStruct*, cudnnTensor4dStruct*, cudnnConvolutionStruct*, cudnnTensor4dStruct*, cudnnConvolutionBwdDataAlgo_t, unsigned long*) ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#2  0x00007ff047459280 in cudnnGetConvolution4dBackwardDataWorkspaceSize(cudnnContext*, cudnnFilter4dStruct*, cudnnTensor4dStruct*, cudnnConvolutionStruct*, cudnnTensor4dStruct*, cudnnConvolutionBwdDataAlgo_t, unsigned long*) () from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#3  0x00007ff047170f68 in cudnnGetConvolutionBackwardDataWorkspaceSizeInternal(cudnnContext*, cudnnFilterStruct*, cudnnTensorStruct*, cudnnConvolutionStruct*, cudnnTensorStruct*, cudnnConvolutionBwdDataAlgo_t, unsigned long*) () from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#4  0x00007ff04717f101 in cudnnStatus_t getConvolution<cudnnConvolutionBwdDataAlgo_t, cudnnConvolutionBwdDataAlgoPerf_t, 6, 8>(find_get_conv_params, int, int*, cudnnConvolutionBwdDataAlgoPerf_t*) ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#5  0x00007ff04716beae in cudnnGetConvolutionBackwardDataAlgorithm ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#6  0x00007ff04526bf98 in at::native::algorithm_search<cudnnConvolutionBwdDataAlgo_t>::getAlgorithm(at::native::ConvolutionArgs const&, cudnnConvolutionBwdDataAlgo_t*) ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#7  0x00007ff0452683e7 in at::native::raw_cudnn_convolution_backward_input_out(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::ArrayRef<long>, c10::ArrayRef<long>, c10::ArrayRef<long>, long, bool, bool)
    () from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#8  0x00007ff045268cda in at::native::cudnn_convolution_backward_input(char const*, c10::ArrayRef<long>, at::TensorArg const&, at::TensorArg const&, c10::ArrayRef<long>, c10::ArrayRef<long>, c10::ArrayRef<long>, long, bool, bool) () from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#9  0x00007ff045268ef7 in at::native::cudnn_convolution_backward_input(c10::ArrayRef<long>, at::Tensor const&, at::Tensor const&, c10::ArrayRef<long>, c10::ArrayRef<long>, c10::ArrayRef<long>, long, bool, bool) ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#10 0x00007ff045343a9b in at::CUDAFloatType::cudnn_convolution_backward_input(c10::ArrayRef<long>, at::Tensor const&, at::Tensor const&, c10::ArrayRef<long>, c10::ArrayRef<long>, c10::ArrayRef<long>, long, bool, bool) const () from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#11 0x00007ff045266fca in at::native::cudnn_convolution_backward(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::ArrayRef<long>, c10::ArrayRef<long>, c10::ArrayRef<long>, long, bool, bool, std::array<bool, 3ul>) () from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#12 0x00007ff045343992 in at::CUDAFloatType::cudnn_convolution_backward(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::ArrayRef<long>, c10::ArrayRef<long>, c10::ArrayRef<long>, long, bool, bool, std::array<bool, 3ul>) const ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so
#13 0x00007ff03a401984 in torch::autograd::VariableType::cudnn_convolution_backward(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::ArrayRef<long>, c10::ArrayRef<long>, c10::ArrayRef<long>, long, bool, bool, std::array<bool, 3ul>) const ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libtorch.so.1
#14 0x00007ff03a232b16 in torch::autograd::generated::CudnnConvolutionBackward::apply(std::vector<torch::autograd::Variable, std::allocator<torch::autograd::Variable> >&&) ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libtorch.so.1
#15 0x00007ff03a207d56 in torch::autograd::Engine::evaluate_function(torch::autograd::FunctionTask&) ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libtorch.so.1
#16 0x00007ff03a209e80 in torch::autograd::Engine::thread_main(torch::autograd::GraphTask*) ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libtorch.so.1
#17 0x00007ff03a206a47 in torch::autograd::Engine::thread_init(int) ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libtorch.so.1
#18 0x00007ff06861733a in torch::autograd::python::PythonEngine::thread_init(int) ()
   from /home/teng/miniconda3/envs/py36/lib/python3.6/site-packages/torch/lib/libtorch_python.so
#19 0x00007ff07944c678 in std::execute_native_thread_routine_compat (__p=<optimised out>)
    at /opt/conda/conda-bld/compilers_linux-64_1534514838838/work/.build/x86_64-conda_cos6-linux-gnu/src/gcc/libstdc++-v3/src/c++11/thread.cc:94
#20 0x00007ff07d5e16db in start_thread (arg=0x7ff01d499700) at pthread_create.c:463
#21 0x00007ff07d30a88f in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:95

from realtime-semg.

TengHuo avatar TengHuo commented on July 20, 2024

reinstall Nvidia driver 430.14, never see the crash again

from realtime-semg.

Related Issues (3)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.