您好，请问image_reasoning - clevr数据集具体是哪个？我按文章中的引用找到了<a href="https://cs.stanford.edu/peopl

您好，图像数据都是用的<a href="https://huggingface.co/datasets/MMInstruction/M3IT" rel="nofollow"

在输出的时候还是遇到了一些问题，还得请教下您。下面是我的code： <div class="snippet-clipboard-content notranslat

clevr数据集的使用 about ask-anything HOT 15 OPEN

LiJiaqi96 commented on May 27, 2024

clevr数据集的使用

from ask-anything.

Comments (15)

Andy1621 commented on May 27, 2024 1

对滴

from ask-anything.

Andy1621 commented on May 27, 2024

您好，图像数据都是用的M3IT中提供的。

from ask-anything.

LiJiaqi96 commented on May 27, 2024

谢谢，看了下M3IT，里面json中image是一长串字符，如何将它们对应到VideoChat2给出的“train/39065.jpg”这样的形式？

from ask-anything.

Andy1621 commented on May 27, 2024

我们是根据M3IT给的标注，根据序列idx生成的idx.jpg

from ask-anything.

LiJiaqi96 commented on May 27, 2024

没太明白...想请教下如何将M3IT中的"image_str"和CLEVR数据集中具体的image名称对应起来呢？

from ask-anything.

Andy1621 commented on May 27, 2024

image_str是base64字符串，可以直接读取。我们是转成了RGB图像，image名称是根据for循环遍历M3IT中的数据，对应的idx生成的，不是根据原始CLEVR数据得到的。

from ask-anything.

LiJiaqi96 commented on May 27, 2024

明白了！您的idx对应的是使用datasets加载数据后遍历的idx对吧？

from ask-anything.

LiJiaqi96 commented on May 27, 2024

好的，感谢您的解答

from ask-anything.

LiJiaqi96 commented on May 27, 2024

在输出的时候还是遇到了一些问题，还得请教下您。下面是我的code：

import os
import base64
import datasets

save_dir = "clevr_M3IT"
ds = datasets.load_dataset("./datasets/M3IT/", "clevr", split="train", streaming=True)
cur_dir = os.path.join(save_dir, "train")
i = 0
for d in ds:
    image = base64.decodebytes(d["image_base64_str"][0].encode())
    with open(cur_dir+f"/{i}.jpg", "wb") as fh:
        fh.write(image)
    i += 1

在输出了一些图片后，我手动看了下部分图片的内容，发现它们并不能和您在HF发布的OpenGVLab/VideoChat2-IT中的QA匹配，比如train/90.jpg，

[ { "a": "The answer is cylinder.", "i": "Analyze the given image and respond to the associated question with a correct answer.", "q": "There is a green object that is behind the small rubber cylinder that is to the left of the matte cylinder to the right of the gray thing; what is its shape?" } ]

from ask-anything.

Andy1621 commented on May 27, 2024

奇怪，我们这边不是这个图嘞，我让当时处理的小伙伴康康

from ask-anything.

LiJiaqi96 commented on May 27, 2024

好的，感谢~

from ask-anything.

Andy1621 commented on May 27, 2024

你好，找小伙伴check了一下，对于某些数据集（如CLEVR），M3IT里给的meta信息里有image_index，对于其他数据集，通过for循环的index得到

from ask-anything.

LiJiaqi96 commented on May 27, 2024

原来如此，不过好像在CLEVR的metadata里没有看到image_index，代码是：

ds = datasets.load_dataset("./datasets/M3IT/", "clevr", split="train", streaming=True)
ds.info

from ask-anything.

clevr数据集的使用 about ask-anything HOT 15 OPEN

Comments (15)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent