
x-anylabeling's Introduction

X-AnyLabeling-Demo.mp4


🥳 What's New ⏏️

  • Mar. 2024:
    • 🤗 Release the latest version 2.3.5 🤗
  • Feb. 2024:
    • Release version 2.3.4.
    • Enable label display feature.
    • Release version 2.3.3.
    • ✨✨✨ Support YOLO-World model.
    • Release version 2.3.2.
    • Support YOLOv9 model.
    • Support the conversion from a horizontal bounding box to a rotated bounding box.
    • Support label deletion and renaming; see the documentation for details.
    • Support quick tag correction; see this document for guidance.
    • Release version 2.3.1.
  • Jan. 2024:
    • 👏👏👏 Combining CLIP and SAM models for enhanced semantic and spatial understanding. An example can be found here.
    • 🔥🔥🔥 Adding support for the Depth Anything model in the depth estimation task.
    • Release version 2.3.0.
    • Support YOLOv8-OBB model.
    • Support RTMDet and RTMO models.
    • Release a Chinese license plate detection and recognition model based on YOLOv5.
  • Dec. 2023:
    • Release version 2.2.0.
    • Support EdgeSAM to optimize for efficient execution on edge devices with minimal performance compromise.
    • Support YOLOv5-Cls and YOLOv8-Cls models.
  • Nov. 2023:
    • Release version 2.1.0.
    • Support InternImage model (CVPR'23).
    • Release version 2.0.0.
    • Added support for Grounding-SAM, combining GroundingDINO with HQ-SAM to achieve SOTA zero-shot high-quality predictions!
    • Enhanced support for HQ-SAM model to achieve high-quality mask predictions.
    • Support the PersonAttribute and VehicleAttribute model for multi-label classification task.
    • Introducing a new multi-label attribute annotation functionality.
    • Release version 1.1.0.
    • Support pose estimation: YOLOv8-Pose.
    • Support object-level tagging with yolov5_ram.
    • Add a new feature enabling batch labeling for arbitrary unknown categories based on Grounding-DINO.
  • Oct. 2023:
    • Release version 1.0.0.
    • Add a new feature for rotation box.
    • Support YOLOv5-OBB models trained on DroneVehicle and DOTA-v1.0/v1.5/v2.0.
    • SOTA Zero-Shot Object Detection - GroundingDINO is released.
    • SOTA Image Tagging Model - Recognize Anything is released.
    • Support YOLOv5-SAM and YOLOv8-EfficientViT_SAM union task.
    • Support YOLOv5 and YOLOv8 segmentation task.
    • Release Gold-YOLO and DAMO-YOLO models.
    • Release MOT algorithms: OC_Sort (CVPR'23).
    • Add a new feature for small object detection using SAHI.
  • Sep. 2023:
    • Release version 0.2.4.
    • Release EfficientViT-SAM (ICCV'23), SAM-Med2D, MedSAM and YOLOv5-SAM.
    • Support ByteTrack (ECCV'22) for MOT task.
    • Support PP-OCRv4 model.
    • Add video annotation feature.
    • Add yolo/coco/voc/mot/dota export functionality.
    • Add the ability to process all images at once.
  • Aug. 2023:
    • Release version 0.2.0.
    • Release LVMSAM and its variants: BUID, ISIC, and Kvasir.
    • Support lane detection algorithm: CLRNet (CVPR'22).
    • Support 2D human whole-body pose estimation: DWPose (ICCV'23 Workshop).

👋 Brief Introduction ⏏️

X-AnyLabeling stands out as a robust annotation tool seamlessly incorporating an AI inference engine alongside an array of sophisticated features. Tailored for practical applications, it is committed to delivering comprehensive, industrial-grade solutions for image data engineers. This tool excels in swiftly and automatically executing annotations across diverse and intricate tasks.

🔥 Highlight ⏏️

🗝️Key Features

  • Supports inference acceleration using GPU.
  • Handles both image and video processing.
  • Allows single-frame and batch predictions for all tasks.
  • Supports custom models and secondary development.
  • Enables one-click import and export of mainstream label formats such as COCO, VOC, YOLO, DOTA, MOT, and MASK.
  • Covers a range of visual tasks, including classification, detection, segmentation, caption, rotation, tracking, estimation, and OCR.
  • Supports various image annotation styles, including polygons, rectangles, rotated boxes, circles, lines, points, as well as annotations for text detection, recognition, and KIE.

⛏️Model Zoo

  • Object Detection
  • SOD with SAHI
  • Facial Landmark Detection
  • 2D Pose Estimation
  • 2D Lane Detection
  • OCR
  • MOT
  • Instance Segmentation
  • Image Tagging
  • Grounding DINO
  • Recognition
  • Rotation
  • SAM
  • BC-SAM
  • Skin-SAM
  • Polyp-SAM

For more details, please refer to 👉 model_zoo 👈

📋 Usage ⏏️

Shortcut Function
d Open next file
a Open previous file
p or [Ctrl+n] Create polygon
o Create rotation
r or [Ctrl+r] Create rectangle
i Run model
q Add a positive point (SAM mode)
e Add a negative point (SAM mode)
b Quickly clear all points (SAM mode)
g Group selected shapes
u Ungroup selected shapes
s Hide selected shapes
w Show selected shapes
Ctrl + q Quit
Ctrl + i Open image file
Ctrl + o Open video file
Ctrl + u Load all images from a directory
Ctrl + e Edit label
Ctrl + j Edit polygon
Ctrl + c Copy selected shapes
Ctrl + v Paste selected shapes
Ctrl + d Duplicate polygon
Ctrl + g Display overview annotation statistics
Ctrl + h Toggle shape visibility
Ctrl + p Toggle keep previous mode
Ctrl + y Toggle auto use last label
Ctrl + m Run all images at once
Ctrl + a Enable auto annotation
Ctrl + s Save current annotation
Ctrl + l Toggle label visibility
Ctrl + t Toggle text visibility
Ctrl + Shift + s Change output directory
Ctrl - Zoom out
Ctrl + 0 Zoom to Original
[Ctrl++, Ctrl+=] Zoom in
Ctrl + f Fit window
Ctrl + Shift + f Fit width
Ctrl + z Undo the last operation
Ctrl + Delete Delete file
Delete Delete polygon
Esc Cancel the selected object
Backspace Remove selected point
↑ → ↓ ← Move selected object with arrow keys
z/x/c/v Rotate selected rotated box with keyboard

📧 Contact ⏏️

🤗 Enjoying this project? Please give it a star! 🤗

If you find this project helpful or interesting, consider starring it to show your support. If you have any questions or encounter any issues while using this project, feel free to reach out for assistance.

✅ License ⏏️

This project is released under the GPL-3.0 license.

🙏🏻 Acknowledgments ⏏️

I extend my heartfelt thanks to the developers and contributors of the projects LabelMe, LabelImg, roLabelImg, AnyLabeling, and Computer Vision Annotation Tool. Their dedication and contributions have played a crucial role in shaping the success of this project.

🏷️ Citing ⏏️

BibTeX

If you use this software in your research, please cite it as below:

@misc{X-AnyLabeling,
  year = {2023},
  author = {Wei Wang},
  publisher = {GitHub},
  organization = {CVHub},
  journal = {GitHub repository},
  title = {Advanced Auto Labeling Solution with Added Features},
  howpublished = {\url{https://github.com/CVHub520/X-AnyLabeling}}
}

x-anylabeling's People

Contributors

clay-nuyoah, cvhub520, david-19940718, kbaicai, lartpang, limuhua, pairzhu


x-anylabeling's Issues

Downloaded the GPU .exe and installed CUDA 11.6 and cuDNN 8.5.0.69 as required by onnx 1.14, but loading the Segment Anything (ViT-B Quant) model still fails

Hi,
I downloaded the GPU version of the executable. Loading the Segment Anything model fails with: onnxruntime::python::CreateExecutionProviderInstance CUDA_PATH is set but CUDA wasn't able to be loaded. Please install the correct version of CUDA and cuDNN as mentioned in the GPU requirements page, make sure they're in PATH, and that your GPU is supported.

Here are my steps; please help me figure out what went wrong. Many thanks!

1. Checked the CUDA version with nvcc -V:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2021 NVIDIA Corporation
Built on Fri_Dec_17_18:28:54_Pacific_Standard_Time_2021
Cuda compilation tools, release 11.6, V11.6.55
Build cuda_11.6.r11.6/compiler.30794723_0

2. Installed the cuDNN 8.5.0.96 version required by onnx and copied all lib, include, and bin files into the CUDA 11.6 folder.

3. Ran bandwidthTest.exe and deviceQuery.exe from C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.6\extras\demo_suite to confirm the installation succeeded.

4. The CPU version of the .exe runs the Segment Anything (ViT-B Quant) model without problems; only the GPU version fails.

5. Checked the downloaded CUDA and cuDNN versions against onnxruntime's requirements.

Error when loading segmentation-related models

vietanhdev/anylabeling#86

The same error has been reported in the original project's issue tracker. In my case: I originally had CUDA 11.8 and cuDNN 8.9.3 installed. The original GPU build loaded the onnxruntime model and ran detection successfully, but the program crashed after I clicked only the third object. After restarting, the model could no longer be loaded and this error appeared. Installing the supposedly correct CUDA and cuDNN versions did not help.

Switching to X-AnyLabeling, the same error persists. Notably, the YOLO segmentation models newly added in X-AnyLabeling also trigger it, while plain YOLO models do not.

Poor results on my own images

Screenshots 2023-06-20 225310 and 2023-06-20 223646: why does the model work well on the vegetable sample image but fail to segment anything in my own images? Is it a model-loading problem or an image problem?

Object list: filter by class and sort objects by image position for quick review

  1. Viewing by class: in edit mode, support selecting all objects in the object list whose label matches the one chosen in the label dialog.
  2. Sorting the object list: with multi-object annotation, and especially auto annotation, objects are often not listed in top-to-bottom image order. A button that re-sorts the object list by position (left to right, top to bottom) would make it possible to step through objects with the keyboard arrow keys in edit mode, even when zoomed in, and check them from top to bottom.

Crash when loading a custom model

The program exits immediately, with no error or message at all.

type: test
name: testA
display_name: Test
model_path: D:/Auto/best.onnx
input_width: 640
input_height: 640
score_threshold: 0.45
classes:
  - enemy
  - friend
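The config itself looks valid, so one plausible cause of a silent crash (an assumption, not confirmed by the report) is an invisible Unicode control character pasted into model_path from Windows Explorer, which makes the path unresolvable while looking identical by eye. A minimal sketch for sanitizing the path before loading:

```python
import unicodedata
from pathlib import Path

def clean_path(raw: str) -> str:
    """Strip invisible Unicode control/format characters (e.g. U+202A,
    LEFT-TO-RIGHT EMBEDDING) that clipboard copies from Windows Explorer
    sometimes prepend to a path."""
    return "".join(ch for ch in raw if unicodedata.category(ch) not in ("Cf", "Cc"))

raw = "\u202aD:/Auto/best.onnx"  # path as it may appear after copy-paste
path = clean_path(raw)
print(repr(path))                # 'D:/Auto/best.onnx'
print(Path(path).exists())       # check the model file is actually reachable
```

If the cleaned path resolves while the raw one does not, retyping the path by hand in the yaml should fix the crash.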

Multi-object tracking

Can it automatically annotate multi-object tracking datasets in MOT format?

Label conversion fails: 'gbk' codec can't decode byte 0xff in position 0: illegal multibyte sequence

Windows 10
Python 3.10

(.venv) D:\Projects\X-AnyLabeling>python tools/label_converter.py --src_path source/valid/ --dst_path source/valid/ --mode custom2yolo

# The error is as follows
 File "D:\Projects\X-AnyLabeling\tools\label_converter.py", line 162, in custom_to_yolov5
    data = json.load(f.read())
UnicodeDecodeError: 'gbk' codec can't decode byte 0xff in position 0: illegal multibyte sequence
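A plausible fix, sketched under the assumption that the converter opens annotation files with the platform default encoding (gbk on Chinese Windows): pass an explicit encoding. Byte 0xff at position 0 usually indicates a UTF-16 byte-order mark, so a file actually saved as UTF-16 would instead need encoding="utf-16". Note also that the traceback shows json.load(f.read()), which passes a string where json.load expects a file object.

```python
import json

def load_annotation(path: str) -> dict:
    # "utf-8-sig" reads plain UTF-8 and also tolerates a UTF-8 BOM;
    # it avoids depending on the Windows default codec (gbk).
    with open(path, "r", encoding="utf-8-sig") as f:
        return json.load(f)  # pass the file object, not f.read()
```

This is a sketch of the usual remedy, not the project's confirmed patch; load_annotation is a hypothetical helper name.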

How to load a custom YOLO-NAS model for annotation

Hi, I trained a yolo-nas-s model and converted it to an ONNX file. It loads without errors, but no objects are ever detected. What could be the problem?
yaml configuration:

type: yolo_nas
name: yolo_nas_s-r20230615
display_name: YOLO-NAS-S Deci-AI
model_path: yolo_nas_s.onnx
input_width: 640
input_height: 640
nms_threshold: 0.45
score_threshold: 0.5
classes:
  - connector
  - ic
  - capacitor_sme
  - capacitor_smc
  - diode
  - usb
  - power_jack
  - clock
  - so
  - sot23
  - xresistor
  - push_button
  - header
  - resistor
  - del
  - fuse

Result: nothing is detected at all.

How do I add a custom model?

Hello, I have converted a PaddleSeg model (pp_liteseg_stdc2) to ONNX format and would like to use it in the software. What do I need to do?

[BUG] Custom2YOLO export only runs once, and only when the output folder does not exist yet

Error:

(XAnyLabelingenv) D:\gitlib\github\BioVbreed\X-AnyLabeling>python tools/label_converter.py --src_path imgDB\images --dst_path imgDB\label\130xyolo --classes imgDB\label\classes.txt --mode custom2yolo
Starting conversion to custom2yolo format...
Converting files:   0%|                                                                                                          | 1/1820 [00:00<04:01,  7.52file/s]
Traceback (most recent call last):
  File "tools/label_converter.py", line 399, in <module>
    main()
  File "tools/label_converter.py", line 372, in main
    os.makedirs(args.dst_path, exist_ok=False)
  File "C:\Users\zhenyuchen\.conda\envs\XAnyLabelingenv\lib\os.py", line 223, in makedirs
    mkdir(name, mode)
FileExistsError: [WinError 183] Cannot create a file when that file already exists: 'imgDB\\label\\130xyolo'

Output: (screenshot)
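The traceback shows the converter calling os.makedirs(args.dst_path, exist_ok=False), which raises as soon as the output directory already exists. A minimal sketch of the usual fix (assuming re-running into an existing output directory is acceptable for this converter):

```python
import os

def ensure_output_dir(dst_path: str) -> None:
    # exist_ok=True lets the converter be re-run into an existing output
    # directory instead of raising FileExistsError on the second run.
    os.makedirs(dst_path, exist_ok=True)
```

ensure_output_dir is a hypothetical helper name; in the actual script the change would be the exist_ok argument on the existing makedirs call.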

Annotate everything at once

Thank you very much for sharing. After importing a model, I can only annotate one frame at a time; could an option be added to generate annotations for all images in one pass?

Error loading a custom YOLOv8 model

Loading a custom YOLOv8 model fails with the error shown in the attached screenshot; the configuration file is as follows:

type: yolov8
name: YOLOv8s-parachute
display_name: YOLOv8s-parachute
model_path: E:\project\9_parachute_annotation\model\best.onnx
input_width: 640
input_height: 640
nms_threshold: 0.7
score_threshold: 0.25
confidence_threshold: 0.25
classes:
  - parachute

How to download and add models manually

Open X-AnyLabeling-main\anylabeling\configs\auto_labeling and find the yaml file of the model you want to use, e.g. segment_anything_vit_h_quant.
Copy it to wherever you want to keep it and open it with a text editor. Use any download tool to fetch the two onnx files referenced by encoder_model_path and decoder_model_path.
Copy both files next to the yaml, then edit encoder_model_path and decoder_model_path to point to those local paths.
Finally, choose "Load Custom Model" in the model selector and pick the new yaml file.
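Following the steps above, an edited yaml might look like this (a sketch only; the field values and file names are illustrative, not copied from the real config):

```yaml
type: segment_anything
name: segment_anything_vit_h_quant
display_name: Segment Anything (ViT-H Quant)
# Downloaded onnx files placed in the same folder as this yaml:
encoder_model_path: sam_vit_h_encoder.quant.onnx
decoder_model_path: sam_vit_h_decoder.quant.onnx
```

Keeping the onnx files beside the yaml lets the paths stay short and portable.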

Question about switching annotation file formats

First of all, thank you for your work; this tool is very interesting. One thing I could not find in the tool: the highlights mention "support for converting to standard COCO-JSON, VOC-XML and YOLOv5-TXT file formats". Is this feature built in, or do I need to write a conversion script myself?

How do I point the downloaded .exe at a conda environment?

The downloaded exe runs, but when loading a model there is no option to select my own conda environment, short of building from source.

Problem when loading a model

I trained YOLOv8 myself, exported an ONNX model, and wrote a config file modeled on the official yaml, but loading my model in X-AnyLabeling raises two errors:
First: Error in loading model: OpenCV(4.7.0) D:\a\opencv-python\opencv-python\opencv\modules\dnn\src\onnx\onnx_importer.cpp:1073: error: (-2:Unspecified error) in function 'cv::dnn::dnn4_v20221220::ONNXImporter::handleNode'> Node [[email protected]]:(onnx_node!/model.22/Split) parse error: opencv(4.7.0) D:\a\opencv-python\opencv-python\opencv\modules\dnn\src\layers\slice_layer.cpp:274: error: (-215:Assertion failed) splits > 0 && inpShape[axis_rw] % split>
Second: Error in loading model: OpenCV(4.7.0) D:\a\opencv-python\opencv-python\opencv\modules\dnn\src\graph_simplifier.cpp:76: error: (-212:Parsing error) Input node with name /model.22/Gather_2_output_0 not found in function 'cv::dnn::Subg
I could not find anything about these online. What is going on?

Integrate the free remote YOLO detection API provided by https://ultralytics.com

The official YOLOv8 platform at https://ultralytics.com is free to use. If a dataset can be uploaded to the web, i.e. has no strong confidentiality requirements, it can be uploaded to the official YOLOv8 platform, trained via Colab, and the resulting model run on the platform's free inference service, which could be used for iterative pre-labeling.
Although ultralytics can also export ONNX, loading it in X-AnyLabeling raises the same error as #4 (#4 (comment)).

However, the free detection service can be called via cURL or Python, and it returns JSON results.

After a model fails to load, the program must be restarted to switch models

Environment: Ubuntu 20 + conda.
After selecting a new model, if the download fails (probably a network issue), the program must be restarted before another model can be chosen; otherwise it stays stuck in the state shown in the first screenshot (terminal log in the second). Since the download is very slow, could a cancel-download button be added? Then I could download the files in a browser and copy them into the right directory myself.

Some problems converting json to COCO format

Thank you for the tool; this project is very interesting. Following the tutorial, converting json to COCO format reports:

Traceback (most recent call last):
  File "label_converter.py", line 247, in <module>
    main()
  File "label_converter.py", line 230, in main
    converter.to_coco(input_dir, output_dir)
  File "label_converter.py", line 71, in to_coco
    x_min = min(points[0][0], points[1][0])
IndexError: list index out of range

Following the hint, I found this code at line 71:

x_min = min(points[0][0], points[1][0])
y_min = min(points[0][1], points[1][1])
x_max = max(points[0][0], points[1][0])
y_max = max(points[0][1], points[1][1])

width = x_max - x_min
height = y_max - y_min

This appears to describe a box, but my annotations are keypoints, so each shape has only a single coordinate pair. How should this be resolved? Or must every keypoint be enclosed in a bounding box? Looking forward to your reply.
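A possible generalization (an assumption about intent, not the project's confirmed fix): derive the box from however many points the shape contains, so single-keypoint shapes no longer index out of range. shape_to_bbox is a hypothetical helper name:

```python
def shape_to_bbox(points):
    """Compute a COCO-style [x, y, w, h] box from any number of (x, y)
    points, instead of assuming exactly two rectangle corners."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    x_min, y_min = min(xs), min(ys)
    return [x_min, y_min, max(xs) - x_min, max(ys) - y_min]
```

For a single keypoint this yields a zero-size box; whether that is acceptable for the downstream COCO consumer is a separate decision.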

COCO2custom converter is not working

Hello,

I am trying to convert a dataset from coco to labelme format and it seems there's some error dealing with dictionaries:

python tools/label_converter.py --src_path ~/Downloads/Tayqan_1.v5i.coco/train/_annotations.coco.json --dst_path ~/Downloads/Tayqan_1.v5i.coco/train/ --img_path ~/Downloads/Tayqan_1.v5i.coco/train/ --mode coco2custom
Starting conversion to coco2custom format...
Traceback (most recent call last):
  File "tools/label_converter.py", line 399, in <module>
    main()
  File "tools/label_converter.py", line 392, in main
    converter.coco_to_custom(args.src_path, args.dst_path, args.img_path)
  File "tools/label_converter.py", line 289, in coco_to_custom
    "imagePath": img_dic[dic_info["file_name"]],
KeyError: 'PHOTO-2022-12-07-00-31-19-2_jpg.rf.00a6a50ec0c8efe0815b459e14226c46.jpg'

I think the problem is that the code uses the image filename as a dictionary key.

I am attaching my json in case you want to check the error.
_annotations.coco.zip
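One plausible workaround (an assumption based on the KeyError above, not a confirmed fix): normalize both sides of the lookup to bare filenames, since COCO "file_name" fields sometimes carry a directory prefix that the on-disk image index lacks, or vice versa. build_image_index is a hypothetical helper name:

```python
import os

def build_image_index(coco_images):
    # Key the lookup table by bare filename so a directory prefix in
    # "file_name" (or its absence) no longer causes a KeyError.
    return {os.path.basename(rec["file_name"]): rec for rec in coco_images}
```

The converter's img_dic lookup would then use os.path.basename on the queried name as well.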

Custom model loads, but inference produces no results

I trained a model on the yolov7 main branch and exported it to ONNX with:
python export.py --weights /lvol1/jjpeng/yolov7/runs/train/terminal_status_landi_newland_pax_newpos_verfone/weights/best.pt --end2end --simplify --topk-all 100 --iou-thres 0.65 --conf-thres 0.35 --img-size 1280 1280 --max-wh 1280
The export completed without problems, and running the ONNX model directly verified that it works (results in the screenshot).

But after configuring it as a custom model per the tutorial, loading seems fine, yet running it produces no output at all.
My yaml config file is in the attached screenshot.
