camtl / ca-mtl Goto Github PK
View Code? Open in Web Editor NEWConditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
In your paper, you mentioned that this model frozen parameters in the 1-6 layers. However, in your code, the parameters in first six layers implemented by BertLayer9
seems not been frozen.
It looks like there are arguments (percent_of_max_data_size
& uniform_mt_sampling
) missing that are consumed by the MultiTaskTrainer
for the __len__
method.
Line 162 in f994a44
This is causing errors in some of the super methods.
https://github.com/huggingface/transformers/blob/b42586ea560a20dcadb78472a6b4596f579e9043/src/transformers/trainer.py#L367
I've traced them through the repo but found only this reference and do not see them listed in the cl args.
How would you suggest mitigating?
Hi there,
Thanks for sharing the code. I am pretty excited to try the code. However, the performance cannot reproduce the paper results.
I ran the code with command:
python run.py --model_name_or_path CA-MTL-base --data_dir $DATA_DIR --output_dir $OUTPUT_DIR --do_train --do_eval
The results for
cola is 0.00286100001416597
mnli is 0.39949057564951607
rte is 0.4657039711191336
qqp is 0.6150630719762552
sst-2 is 0.6502293577981652
qnli is 0.6637378729635731
This result is a complete failure. Could you provide the command to reproduce your results in paper?
Should change
python run.py --model_name_of_path CA-MTL-base --data_dir $DATA_DIR --output_dir $OUTPUT_DIR --do_train
change model_name_of_path
to model_name_or_path
Random Joe looking at your paper (very interesting work by the way)
I was looking at the code and this line struck me. It's just mapping an index to itself. This attribute self.task_id_2_task_idx
is used only here (idem for BERT-base) which ends up doing nothing when we compare task_type
before and after.
All of this to get embedding indexes. Just wanted to raise your attention in case that was not the expected behaviour.
Hi. I am working to adapt your code to another application and am running into an error here:
Line 193 in f994a44
Package versions align with requirements.txt
Any suggestions?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.