Comments (4)
报错补充:LAUNCH INFO 2024-06-17 17:01:27,787 Pod failed
LAUNCH ERROR 2024-06-17 17:01:27,788 Container failed !!!
Container rank 3 status failed cmd ['/data/miniconda/envs/py38_ocr/bin/python3', '-u', 'tools/train.py', '-c', 'configs/det/ch_PP-OCRv3/ch_PP-OCRv3_det_student.yml', '-o', 'Global.pretrained_model=pretrain_models/ch_PP-OCRv3_det_distill_train/student.pdparams'] code -9 log log/workerlog.3
env {'NV_LIBCUBLAS_VERSION': '11.3.1.68-1', 'NVIDIA_VISIBLE_DEVICES': 'GPU-1314610d-0066-5daa-2856-78e48d9c6b8f,GPU-4408b4f1-7084-1763-3194-0190b4d7e396,GPU-4b5437d0-12bf-5148-3c67-1eeee3c02a2b,GPU-97e8a70d-1660-3d0c-f53e-f7d6e12d27ac', 'KUBERNETES_SERVICE_PORT_HTTPS': '443', 'TRAEFIK_WEB_SERVICE_PORT_80_TCP_PROTO': 'tcp', 'ZHANGZHIYING_IDE_PORT_8080_TCP_PROTO': 'tcp', 'NEXUS3_SERVICE_PORT_WEB': '8081', 'COLORTERM': 'truecolor', 'NV_NVML_DEV_VERSION': '11.2.67-1', 'ZHANGZHIYING_IDE_PORT_8080_TCP': 'tcp://10.10.114.16:8080', 'NV_CUDNN_PACKAGE_NAME': 'libcudnn8', 'KUBERNETES_SERVICE_PORT': '443', 'TERM_PROGRAM_VERSION': '1.85.1', 'POSTRESQL_POSTGRESQL_PORT_5432_TCP': 'tcp://10.10.70.210:5432', 'NV_LIBNCCL_DEV_PACKAGE': 'libnccl-dev=2.8.4-1+cuda11.2', 'CONDA_EXE': '/data/miniconda/bin/conda', 'TRAEFIK_WEB_SERVICE_PORT_80_TCP_ADDR': '10.10.172.32', '_CE_M': '', 'NV_LIBNCCL_DEV_PACKAGE_VERSION': '2.8.4-1', 'TRAEFIK_WEB_SERVICE_PORT_WEB': '80', 'HOSTNAME': 'xulian-ide-99bdfd69c-lrhk2', 'ZHANGZHIYING_IDE_PORT_8080_TCP_PORT': '8080', 'ZENTAO_SERVICE_PORT': 'tcp://10.10.215.153:80', 'NVIDIA_REQUIRE_CUDA': 'cuda>=11.2 brand=tesla,driver>=418,driver<419 brand=tesla,driver>=440,driver<441 driver>=450', 'XULIAN_IDE_PORT_8080_TCP_PORT': '8080', 'TRAEFIK_WEB_PORT_80_TCP_ADDR': '10.10.147.238', 'NV_LIBCUBLAS_DEV_PACKAGE': 'libcublas-dev-11-2=11.3.1.68-1', 'TRAEFIK_WEB_PORT': 'tcp://10.10.147.238:80', 'NV_NVTX_VERSION': '11.2.67-1', 'NV_ML_REPO_ENABLED': '1', 'ZENTAO_SERVICE_SERVICE_HOST': '10.10.215.153', 'NEXUS3_SERVICE_HOST': '10.10.17.156', 'NEXUS3_PORT': 'tcp://10.10.17.156:8081', 'ZHANGZHIYING_IDE_SERVICE_HOST': '10.10.114.16', 'NV_CUDA_CUDART_DEV_VERSION': '11.2.72-1', 'NV_LIBCUSPARSE_VERSION': '11.3.1.68-1', 'NV_LIBNPP_VERSION': '11.2.1.68-1', 'POSTRESQL_POSTGRESQL_SERVICE_PORT': '5432', 'NCCL_VERSION': '2.8.4-1', 'VSCODE_PROXY_URI': 'https://xulian.test.bytebroad.com/proxy/{{port}}/', 'TRAEFIK_WEB_PORT_80_TCP_PORT': '80', 'ZHANGZHIYING_IDE_PORT_8080_TCP_ADDR': '10.10.114.16', 'XULIAN_IDE_PORT': 'tcp://10.10.231.36:8080', 'ZHOUPAN_IDE_PORT': 'tcp://10.10.95.128:8080', 'PWD': '/root/workspace/paddleocr_train', 'CONDA_ROOT': '/data/miniconda', 'XULIAN_IDE_PORT_8080_TCP': 'tcp://10.10.231.36:8080', 'CONDA_PREFIX': '/data/miniconda/envs/py38_ocr', 'NV_CUDNN_PACKAGE': 'libcudnn8=8.1.1.33-1+cuda11.2', 'ZENTAO_SERVICE_PORT_80_TCP_ADDR': '10.10.215.153', 'NVIDIA_DRIVER_CAPABILITIES': 'compute,utility', 'NV_LIBNPP_PACKAGE': 'libnpp-11-2=11.2.1.68-1', 'TRAEFIK_DASHBOARD_SERVICE_PORT_DASHBOARD': '8080', 'NV_LIBNCCL_DEV_PACKAGE_NAME': 'libnccl-dev', 'VSCODE_GIT_ASKPASS_NODE': '/root/code-server/lib/node', 'TRAEFIK_WEB_PORT_443_TCP_PROTO': 'tcp', 'NV_LIBCUBLAS_DEV_VERSION': '11.3.1.68-1', 'TRAEFIK_DASHBOARD_PORT_8080_TCP_PORT': '8080', 'TRAEFIK_DASHBOARD_PORT_8080_TCP': 'tcp://10.10.150.194:8080', 'NV_LIBCUBLAS_DEV_PACKAGE_NAME': 'libcublas-dev-11-2', 'POSTRESQL_POSTGRESQL_PORT_5432_TCP_PORT': '5432', 'NV_CUDA_CUDART_VERSION': '11.2.72-1', 'HOME': '/root', 'ZENTAO_SERVICE_PORT_80_TCP_PROTO': 'tcp', 'LANG': 'en_US.UTF-8', 'KUBERNETES_PORT_443_TCP': 'tcp://10.10.0.1:443', 'LS_COLORS': 'rs=0:di=01;34:ln=01;36:mh=00:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=40;31;01:mi=00:su=37;41:sg=30;43:ca=30;41:tw=30;42:ow=34;42:st=37;44:ex=01;32:.tar=01;31:.tgz=01;31:.arc=01;31:.arj=01;31:.taz=01;31:.lha=01;31:.lz4=01;31:.lzh=01;31:.lzma=01;31:.tlz=01;31:.txz=01;31:.tzo=01;31:.t7z=01;31:.zip=01;31:.z=01;31:.dz=01;31:.gz=01;31:.lrz=01;31:.lz=01;31:.lzo=01;31:.xz=01;31:.zst=01;31:.tzst=01;31:.bz2=01;31:.bz=01;31:.tbz=01;31:.tbz2=01;31:.tz=01;31:.deb=01;31:.rpm=01;31:.jar=01;31:.war=01;31:.ear=01;31:.sar=01;31:.rar=01;31:.alz=01;31:.ace=01;31:.zoo=01;31:.cpio=01;31:.7z=01;31:.rz=01;31:.cab=01;31:.wim=01;31:.swm=01;31:.dwm=01;31:.esd=01;31:.jpg=01;35:.jpeg=01;35:.mjpg=01;35:.mjpeg=01;35:.gif=01;35:.bmp=01;35:.pbm=01;35:.pgm=01;35:.ppm=01;35:.tga=01;35:.xbm=01;35:.xpm=01;35:.tif=01;35:.tiff=01;35:.png=01;35:.svg=01;35:.svgz=01;35:.mng=01;35:.pcx=01;35:.mov=01;35:.mpg=01;35:.mpeg=01;35:.m2v=01;35:.mkv=01;35:.webm=01;35:.ogm=01;35:.mp4=01;35:.m4v=01;35:.mp4v=01;35:.vob=01;35:.qt=01;35:.nuv=01;35:.wmv=01;35:.asf=01;35:.rm=01;35:.rmvb=01;35:.flc=01;35:.avi=01;35:.fli=01;35:.flv=01;35:.gl=01;35:.dl=01;35:.xcf=01;35:.xwd=01;35:.yuv=01;35:.cgm=01;35:.emf=01;35:.ogv=01;35:.ogx=01;35:.aac=00;36:.au=00;36:.flac=00;36:.m4a=00;36:.mid=00;36:.midi=00;36:.mka=00;36:.mp3=00;36:.mpc=00;36:.ogg=00;36:.ra=00;36:.wav=00;36:.oga=00;36:.opus=00;36:.spx=00;36:*.xspf=00;36:', 'NEXUS3_PORT_8081_TCP': 'tcp://10.10.17.156:8081', 'ZHOUPAN_IDE_SERVICE_PORT_HTTP': '8080', 'CUDA_VERSION': '11.2.0', 'NV_LIBCUBLAS_PACKAGE': 'libcublas-11-2=11.3.1.68-1', 'ZHOUPAN_IDE_PORT_8080_TCP_ADDR': '10.10.95.128', 'TRAEFIK_DASHBOARD_SERVICE_SERVICE_HOST': '10.10.210.90', 'CONDA_PROMPT_MODIFIER': '(py38_ocr) ', 'XULIAN_IDE_SERVICE_HOST': '10.10.231.36', 'GIT_ASKPASS': '/root/code-server/lib/vscode/extensions/git/dist/askpass.sh', 'ZHOUPAN_IDE_SERVICE_PORT': '8080', 'ZHANGZHIYING_IDE_PORT': 'tcp://10.10.114.16:8080', 'NV_LIBNPP_DEV_PACKAGE': 'libnpp-dev-11-2=11.2.1.68-1', 'XULIAN_IDE_PORT_8080_TCP_ADDR': '10.10.231.36', 'TRAEFIK_WEB_PORT_80_TCP': 'tcp://10.10.147.238:80', 'TRAEFIK_DASHBOARD_SERVICE_PORT_8080_TCP_PORT': '8080', 'TRAEFIK_DASHBOARD_PORT': 'tcp://10.10.150.194:8080', 'NV_LIBCUBLAS_PACKAGE_NAME': 'libcublas-11-2', 'ZENTAO_SERVICE_PORT_80_TCP': 'tcp://10.10.215.153:80', 'TRAEFIK_WEB_SERVICE_HOST': '10.10.147.238', 'NV_LIBNPP_DEV_VERSION': '11.2.1.68-1', 'VSCODE_GIT_ASKPASS_EXTRA_ARGS': '', 'TRAEFIK_WEB_SERVICE_PORT_80_TCP': 'tcp://10.10.172.32:80', 'NEXUS3_SERVICE_PORT': '8081', 'TRAEFIK_DASHBOARD_PORT_8080_TCP_ADDR': '10.10.150.194', 'LESSCLOSE': '/usr/bin/lesspipe %s %s', 'TERM': 'xterm-256color', 'NV_ML_REPO_URL': 'https://developer.download.nvidia.com/compute/machine-learning/repos/ubuntu2004/x86_64', 'NV_LIBCUSPARSE_DEV_VERSION': '11.3.1.68-1', 'CE_CONDA': '', 'TRAEFIK_WEB_SERVICE_SERVICE_PORT': '80', 'LESSOPEN': '| /usr/bin/lesspipe %s', 'TRAEFIK_WEB_SERVICE_PORT': '80', 'ZHOUPAN_IDE_PORT_8080_TCP_PORT': '8080', 'TRAEFIK_WEB_SERVICE_PORT_80_TCP_PORT': '80', 'LIBRARY_PATH': '/usr/local/cuda/lib64/stubs', 'NV_CUDNN_VERSION': '8.1.1.33', 'VSCODE_GIT_IPC_HANDLE': '/tmp/vscode-git-e94dce1104.sock', 'CONDA_SHLVL': '3', 'ZHOUPAN_IDE_PORT_8080_TCP_PROTO': 'tcp', 'POSTRESQL_POSTGRESQL_SERVICE_HOST': '10.10.70.210', 'POSTRESQL_POSTGRESQL_PORT_5432_TCP_PROTO': 'tcp', 'TRAEFIK_DASHBOARD_SERVICE_HOST': '10.10.150.194', 'SHLVL': '2', 'POSTRESQL_POSTGRESQL_PORT_5432_TCP_ADDR': '10.10.70.210', 'NV_CUDA_LIB_VERSION': '11.2.0-1', 'NVARCH': 'x86_64', 'TRAEFIK_DASHBOARD_SERVICE_PORT': 'tcp://10.10.210.90:8080', 'KUBERNETES_PORT_443_TCP_PROTO': 'tcp', 'ZHANGZHIYING_IDE_SERVICE_PORT_HTTP': '8080', 'NV_CUDNN_PACKAGE_DEV': 'libcudnn8-dev=8.1.1.33-1+cuda11.2', 'TRAEFIK_WEB_SERVICE_SERVICE_HOST': '10.10.172.32', 'KUBERNETES_PORT_443_TCP_ADDR': '10.10.0.1', 'NV_CUDA_COMPAT_PACKAGE': 'cuda-compat-11-2', 'ZHOUPAN_IDE_SERVICE_HOST': '10.10.95.128', 'ZENTAO_SERVICE_PORT_80_TCP_PORT': '80', 'CONDA_PYTHON_EXE': '/data/miniconda/bin/python', 'NV_LIBNCCL_PACKAGE': 'libnccl2=2.8.4-1+cuda11.2', 'LD_LIBRARY_PATH': '/data/miniconda/envs/py38_ocr/lib/python3.8/site-packages/cv2/../../lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64', 'POSTRESQL_POSTGRESQL_SERVICE_PORT_TCP_POSTGRESQL': '5432', 'ZENTAO_SERVICE_SERVICE_PORT': '80', 'TRAEFIK_WEB_PORT_443_TCP': 'tcp://10.10.147.238:443', 'CONDA_DEFAULT_ENV': 'py38_ocr', 'XULIAN_IDE_SERVICE_PORT_HTTP': '8080', 'NEXUS3_PORT_8081_TCP_PROTO': 'tcp', 'KUBERNETES_SERVICE_HOST': '10.10.0.1', 'TRAEFIK_WEB_PORT_80_TCP_PROTO': 'tcp', 'ZHANGZHIYING_IDE_SERVICE_PORT': '8080', 'KUBERNETES_PORT': 'tcp://10.10.0.1:443', 'KUBERNETES_PORT_443_TCP_PORT': '443', 'VSCODE_GIT_ASKPASS_MAIN': '/root/code-server/lib/vscode/extensions/git/dist/askpass-main.js', 'TRAEFIK_DASHBOARD_SERVICE_PORT_8080_TCP_ADDR': '10.10.210.90', 'TRAEFIK_WEB_SERVICE_PORT_WEBSECURE': '443', 'BROWSER': '/root/code-server/lib/vscode/bin/helpers/browser.sh', 'PATH': '/data/miniconda/envs/py38_ocr/bin:/data/miniconda/condabin:/data/miniconda/envs/py38/bin:/data/miniconda/condabin:/root/code-server/lib/vscode/bin/remote-cli:/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/data/miniconda/envs/py38/bin:/data/miniconda/bin:/root/code-server/bin:/root/clangd_17.0.3/bin:/root/golang/bin', 'NODE_EXEC_PATH': '/root/code-server/lib/node', 'XULIAN_IDE_PORT_8080_TCP_PROTO': 'tcp', 'TRAEFIK_DASHBOARD_SERVICE_PORT_8080_TCP_PROTO': 'tcp', 'NV_LIBNCCL_PACKAGE_NAME': 'libnccl2', 'NV_LIBNCCL_PACKAGE_VERSION': '2.8.4-1', 'XULIAN_IDE_SERVICE_PORT': '8080', 'NEXUS3_PORT_8081_TCP_ADDR': '10.10.17.156', 'TRAEFIK_DASHBOARD_PORT_8080_TCP_PROTO': 'tcp', 'CONDA_PREFIX_1': '/data/miniconda', 'TRAEFIK_DASHBOARD_SERVICE_PORT_8080_TCP': 'tcp://10.10.210.90:8080', 'CONDA_PREFIX_2': '/data/miniconda/envs/py38', 'TRAEFIK_DASHBOARD_SERVICE_SERVICE_PORT': '8080', 'TRAEFIK_WEB_PORT_443_TCP_PORT': '443', 'POSTRESQL_POSTGRESQL_PORT': 'tcp://10.10.70.210:5432', 'NEXUS3_PORT_8081_TCP_PORT': '8081', 'OLDPWD': '/root/workspace', 'TRAEFIK_WEB_PORT_443_TCP_ADDR': '10.10.147.238', 'TERM_PROGRAM': 'vscode', 'ZHOUPAN_IDE_PORT_8080_TCP': 'tcp://10.10.95.128:8080', 'VSCODE_IPC_HOOK_CLI': '/tmp/vscode-ipc-11f41773-ceee-4dec-8b14-b37683de6c18.sock', '': '/data/miniconda/envs/py38_ocr/bin/python3', 'LC_CTYPE': 'C.UTF-8', 'CUSTOM_DEVICE_ROOT': '', 'OMP_NUM_THREADS': '1', 'QT_QPA_PLATFORM_PLUGIN_PATH': '/data/miniconda/envs/py38_ocr/lib/python3.8/site-packages/cv2/qt/plugins', 'QT_QPA_FONTDIR': '/data/miniconda/envs/py38_ocr/lib/python3.8/site-packages/cv2/qt/fonts', 'POD_NAME': 'cvxshq', 'PADDLE_MASTER': '10.20.59.202:39009', 'PADDLE_GLOBAL_SIZE': '4', 'PADDLE_LOCAL_SIZE': '4', 'PADDLE_GLOBAL_RANK': '3', 'PADDLE_LOCAL_RANK': '3', 'PADDLE_NNODES': '1', 'PADDLE_CURRENT_ENDPOINT': '10.20.59.202:39013', 'PADDLE_TRAINER_ID': '3', 'PADDLE_TRAINERS_NUM': '4', 'PADDLE_RANK_IN_NODE': '3', 'PADDLE_TRAINER_ENDPOINTS': '10.20.59.202:39010,10.20.59.202:39011,10.20.59.202:39012,10.20.59.202:39013', 'FLAGS_selected_gpus': '3', 'PADDLE_LOG_DIR': '/root/workspace/paddleocr_train/log'}
LAUNCH INFO 2024-06-17 17:01:27,789 ------------------------- ERROR LOG DETAIL -------------------------
LAUNCH INFO 2024-06-17 17:01:35,863 Exit code -9
from paddleocr.
你进'/root/workspace/paddleocr_train/log'看看,报错日志应该在这里面
from paddleocr.
Related Issues (20)
- cpu环境下onnxruntime推理报错 HOT 1
- rec 模型训练,训练突然变差的情况越来越频繁了
- 加入一些带阴影效果的文字训练rec模型,正常无阴影的文字反而容易出现错字了,怎么避免?
- Error PaddleRE: The shape of tensor assigned value must match the shape of target shape: [512, 3], but now shape is [513, 3] HOT 3
- 运行的官方demo效果这么差么,这识别的都是啥呀,还是我的用法不对 HOT 11
- 最新的release2.7.5代码无法运行 HOT 1
- SER index error: IndexError: (OutOfRange) label value should less than the shape of axis dimension when label value(2) not equal to ignore_index(-100)
- 自己训练ch_PP-OCRv4_rec_svtr_large.yml,导出模型时报错,怎么解决
- 自己训练rec svtr ocr4模型导出后推理报错
- NotImplementedError: (Unimplemented) Delete Weight Dequant Linear Op Pass is not supported for per-channel quantization HOT 4
- KIE: question about num_classes parameter
- 中文版面分析CDLA,自己训练出来的验证集bbox ap比官方的picodet_lcnet_x1_0_fgd_layout_cdla低好几个点
- PaddleOCR returning only the first page when performing ocr on a PDF
- 短数字无法检测问题 HOT 1
- kie_ser训练问题
- Paddle OCR 推理模型转ONNX,固定shape后,ONNX结果相差很大,不固定shape,结果与paddle推理模型保持一致,这个问题要怎么处理哇: HOT 1
- 关于[中文混合拼音]的长文本OCR方案请教(eg: 灿烂的笑róng) HOT 5
- indonesian 是哪个字典?没有找到id 的 HOT 2
- PPStructure版面分析时对原图返回的是figure,对原图灰度化后却返回table是为什么,请问怎么指定他返回figure? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from paddleocr.