
product1m's People

Contributors

zhanxlin


product1m's Issues

How can the model be used for retrieval when the input image contains multiple products?

Your paper is very valuable, but while reading it I ran into several questions about the implementation details, and I hope you can help clarify them.
First, you use the proposals produced by the RPN as the input image embeddings, but this raises a problem: you keep the top-10 to top-36 proposals by score thresholding without applying NMS, so there will inevitably be proposal regions with very high overlap. If some of them are masked out, can they really be reconstructed? For example, if the text describes 2 products and the RPN produces 10 regions of interest, and one of them is masked out directly, how is that region recovered? Or is my understanding off? Please correct me if so.
Second, I could not follow the description in Section 4.5 of the paper. In one place you say the element-wise product of the image and text tokens in the final Co-Transformer output serves as the representation of a single product, and in another place you say the outputs of the text/image transformers used for contrastive learning are concatenated as the retrieval feature. Which is it? (See the sketch below for the two readings I am comparing.)
Third, following on from the previous point: if the features are concatenated, then for an input containing multiple products each product should have its own representation, yet concatenating the [CLS] tokens only gives a representation of the whole image, doesn't it?
I would appreciate your guidance.
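To make the second question concrete, below is a minimal sketch (my own illustration, not the authors' code) of the two readings of Section 4.5 being compared. The tensor names `co_img_tokens`, `co_txt_tokens`, `img_cls`, and `txt_cls` are hypothetical placeholders for the Co-Transformer outputs and the [CLS] outputs of the contrastive image/text encoders, and the mean pooling used in the first reading is only a guess.

```python
import torch

def product_feature_cotransformer(co_img_tokens: torch.Tensor,
                                  co_txt_tokens: torch.Tensor) -> torch.Tensor:
    """Reading 1: element-wise product of the pooled image and text outputs
    of the Co-Transformer (one vector per image-text pair)."""
    img_pooled = co_img_tokens.mean(dim=1)   # (B, D); pooling choice is an assumption
    txt_pooled = co_txt_tokens.mean(dim=1)   # (B, D)
    return img_pooled * txt_pooled           # (B, D)

def product_feature_concat(img_cls: torch.Tensor,
                           txt_cls: torch.Tensor) -> torch.Tensor:
    """Reading 2: concatenate the [CLS] outputs of the separate image/text
    encoders trained with the contrastive loss."""
    return torch.cat([img_cls, txt_cls], dim=-1)  # (B, 2D)

if __name__ == "__main__":
    B, N, M, D = 2, 36, 32, 768
    f1 = product_feature_cotransformer(torch.randn(B, N, D), torch.randn(B, M, D))
    f2 = product_feature_concat(torch.randn(B, D), torch.randn(B, D))
    print(f1.shape, f2.shape)  # torch.Size([2, 768]) torch.Size([2, 1536])
```

Note that in Reading 2 the result is a single vector per input, which is exactly why the third question arises for multi-product images.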

Can the RPN model be open-sourced?

Hi, would it be possible to provide an open-source version of this model, so that everyone can reproduce and learn from it? Thank you!

About the performance gap between the CLIP* model and our reimplementation

Many thanks for your code. I implemented a CLIP-like architecture with vit-base-patch16 and bert-base-uncased, with an image input size of 224. The model is optimized with only a contrastive loss, and I use the whole image as input, just as ViT does (I suppose you use region features as input). Using your released evaluation code, I got a much higher score (mAP@10 = 87.7) than the CLIP* result in Table 2. (See the sketch below for a concrete version of this setup.)
Apart from the region input, are there any other differences between our implementation and the paper, such as initialization or model structure? Could you please share more details?
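For reference, here is a minimal sketch of the setup described above: a CLIP-style dual encoder built from Hugging Face `google/vit-base-patch16-224` and `bert-base-uncased`, trained with a symmetric in-batch contrastive (InfoNCE) loss on the whole image. The projection dimension, pooling via the [CLS] token, and the initial temperature are my assumptions and may differ from both the issue author's code and the paper's CLIP* baseline.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from transformers import ViTModel, BertModel

class DualEncoder(nn.Module):
    def __init__(self, embed_dim: int = 512):
        super().__init__()
        self.image_encoder = ViTModel.from_pretrained("google/vit-base-patch16-224")
        self.text_encoder = BertModel.from_pretrained("bert-base-uncased")
        self.image_proj = nn.Linear(self.image_encoder.config.hidden_size, embed_dim)
        self.text_proj = nn.Linear(self.text_encoder.config.hidden_size, embed_dim)
        self.logit_scale = nn.Parameter(torch.tensor(2.6592))  # log(1/0.07), as in CLIP

    def forward(self, pixel_values, input_ids, attention_mask):
        # Take the [CLS] token of each encoder, project, and L2-normalize.
        img = self.image_encoder(pixel_values=pixel_values).last_hidden_state[:, 0]
        txt = self.text_encoder(input_ids=input_ids,
                                attention_mask=attention_mask).last_hidden_state[:, 0]
        img = F.normalize(self.image_proj(img), dim=-1)
        txt = F.normalize(self.text_proj(txt), dim=-1)
        return img, txt

def contrastive_loss(img_emb, txt_emb, logit_scale):
    # Symmetric InfoNCE over the in-batch image-text pairs.
    logits = logit_scale.exp() * img_emb @ txt_emb.t()
    targets = torch.arange(logits.size(0), device=logits.device)
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))
```

At retrieval time the two normalized embeddings would simply be compared by dot product (or concatenated, per the question in the first issue above).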

Also, would you release the model weights of CAPTURE, and especially the weights of your custom RPN, for reimplementation?

About the pretrained RPN for MultiProduct Detection

Thanks for sharing your wonderful work!

In the paper, you mentioned that you trained a new RPN for multi-product detection, but I could not find a pretrained checkpoint for this RPN module on the GitHub page. Will this RPN be open-sourced?

Thanks again, have a nice day!
