Giter VIP home page Giter VIP logo

ca-mtl's People

Contributors

am1n3e avatar camtl avatar jpilaul avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

ca-mtl's Issues

frozen parameters

In your paper, you mentioned that this model frozen parameters in the 1-6 layers. However, in your code, the parameters in first six layers implemented by BertLayer9 seems not been frozen.

AttributeError: 'MultiTaskTrainingArguments' object has no attribute 'uniform_mt_sampling'

It looks like there are arguments (percent_of_max_data_size & uniform_mt_sampling) missing that are consumed by the MultiTaskTrainer for the __len__ method.

if self.args.uniform_mt_sampling:

This is causing errors in some of the super methods.
https://github.com/huggingface/transformers/blob/b42586ea560a20dcadb78472a6b4596f579e9043/src/transformers/trainer.py#L367

I've traced them through the repo but found only this reference and do not see them listed in the cl args.

How would you suggest mitigating?

Cannot reproduce paper results

Hi there,

Thanks for sharing the code. I am pretty excited to try the code. However, the performance cannot reproduce the paper results.

I ran the code with command:
python run.py --model_name_or_path CA-MTL-base --data_dir $DATA_DIR --output_dir $OUTPUT_DIR --do_train --do_eval

The results for
cola is 0.00286100001416597
mnli is 0.39949057564951607
rte is 0.4657039711191336
qqp is 0.6150630719762552
sst-2 is 0.6502293577981652
qnli is 0.6637378729635731

This result is a complete failure. Could you provide the command to reproduce your results in paper?

error with model_name_of_path

Should change

python run.py --model_name_of_path CA-MTL-base --data_dir $DATA_DIR --output_dir $OUTPUT_DIR --do_train

change model_name_of_path to model_name_or_path

`CaMtlBaseEncodre.task_id_2_task_idx` is, maybe, problematic

Random Joe looking at your paper (very interesting work by the way)

I was looking at the code and this line struck me. It's just mapping an index to itself. This attribute self.task_id_2_task_idx is used only here (idem for BERT-base) which ends up doing nothing when we compare task_type before and after.

All of this to get embedding indexes. Just wanted to raise your attention in case that was not the expected behaviour.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.