Comments (8)
(1) We do not attempt to train fashionbert only with image-text-matching task. Since we evaluate fashionbert model with fashion-gen dataset, we donot perform fine-tuning again.
(2) Need to resize the image patch to fit the feature extraction.
(3) Resnet50
from easytransfer.
Thanks a lot!
Do you use the specific masks for image patche in training set? Besides, in your code, a pertained imagebert is used for initializing the fashionbert. Which dataset the imagebert is pretrained based on?
from easytransfer.
Thanks a lot!
Do you use the specific masks for image patche in training set? Besides, in your code, a pertained imagebert is used for initializing the fashionbert. Which dataset the imagebert is pretrained based on?
- randomly mask the image patches
- fashionbert is continuely pretrained based on google bert(base) with fashionGen dataset
from easytransfer.
I am confused about the pertained model you loaded in the code
where the 'pretrain_model_name_or_path=pai-imagebert-base-en' instead of ‘= google-bert-base-en’
from easytransfer.
When you run_train.sh first time, ez will automatically download model to modelZooBasePath. Pleas check your os.getenv("HOME") / eztransfer_modelzoo.
flags.DEFINE_string("modelZooBasePath", default=os.path.join(os.getenv("HOME"), ".eztransfer_modelzoo"), help="eztransfer_modelzoo")
from easytransfer.
Yes. What I am confused is about the downloaded model.
In the readme file, the command for pretrained model is 'pretrain_model_name_or_path=pai-imagebert-base-en' instead of ‘= google-bert-base-en’. Should I change it to googlebert when making pretrain on FashionGen?
from easytransfer.
You can conduct two experiments to compare the two pretrained models.
from easytransfer.
Thanks a lot!
Do you use the specific masks for image patche in training set? Besides, in your code, a pertained imagebert is used for initializing the fashionbert. Which dataset the imagebert is pretrained based on?
- randomly mask the image patches
- fashionbert is continuely pretrained based on google bert(base) with fashionGen dataset
Thank you for your excellent work. May I ask one following-up question about FashionBERT?
Could you please give some details about the ratios of positive pairs and negative pairs during the training process? Like in one mini-batch, 50% of examples are positive and the rest are negative. Thanks a lot!
from easytransfer.
Related Issues (20)
- 请问text在转化为id时使用的是哪个版本的BERT vocab? HOT 3
- pai-bert-tiny-zh模型不收敛 HOT 1
- EasyTransfer/examples/clue_quick_start.ipynb在colab上运行出错 HOT 1
- ERROR: File system scheme 'oss' not implemented
- Meta Finetune problem HOT 3
- Meta-fine problem HOT 1
- tsv文件点击后无法自动下载 HOT 1
- export to pb models occured an error:"TypeError: __int__ returned non-int (type NoneType)" HOT 1
- 在TensorFlow12.3版本下测试hcnn模型报错 HOT 1
- 8卡的线性度很低
- FashionBERT pretrained model error
- 知识蒸馏
- pai-imagebert-base-en是在英文<text, image>语料库上训练的吗?有没有在中文数据集上预训练好的模型?
- 使用pai-bert-tiny-zh model ,报错ValueError: not enough values to unpack (expected 5, got 3),使用pai-imagebert-base-en可以正常跑完
- 钉钉社群二维码过期了 HOT 1
- meta-finetune是否可以用于多标签分类
- wechat QR code is outmoded
- 论文Learning to Expand: Reinforced Response Expansion for Information-seeking Conversations 源码共享
- Dingtalk QR code is outdated
- 用EasyTransfer训练的bert模型能用huggingface transformers推理吗?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from easytransfer.