talschuster / crosslingualcontextualemb
Cross-Lingual Alignment of Contextual Word Embeddings
License: MIT License
This is not a code issue, but I have a question about your paper.
Footnote 3 in Section 3.3 (Supervised Anchored Alignment) says: "In practice, we may have multiple target words for a single source word, and the extension is straight-forward."
My understanding is that D is a dictionary of (source, target) word pairs. For a single source word i that has multiple target words (e.g., 'bear' in the source language, English, can be translated to multiple target words), D would contain more than one entry of the form (i, D(i)). So the dictionary D would look like:
....
(source, target)
(bear, target word for 'animal bear')
(bear, target word for 'to have')
....
....
Say
a = anchor vector for the source word 'bear'
b = anchor vector for the target word 'animal bear'
c = anchor vector for the target word 'to have'
then we learn the mapping from both pairs (a, b) and (a, c).
Is my understanding correct?
What I find hard to understand intuitively is how this method can lead to the results shown in Figure 2, where distinct point clouds are aligned with their corresponding distinct words in the target language.
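If that reading is right, the "straightforward extension" is simply to repeat the source anchor once per translation and solve the usual orthogonal Procrustes problem over all pairs. Below is a minimal NumPy sketch of that idea with toy vectors; it is my own illustration of the technique, not the authors' code, and all names and dimensions are made up.

```python
import numpy as np

def procrustes_align(src_anchors, tgt_anchors):
    """Solve the orthogonal Procrustes problem: find an orthogonal W
    minimizing ||src @ W.T - tgt||_F over paired anchor rows."""
    # Cross-covariance of the paired anchor vectors.
    M = tgt_anchors.T @ src_anchors
    U, _, Vt = np.linalg.svd(M)
    return U @ Vt  # orthogonal mapping W

# Toy anchors: the source word "bear" (vector a) is paired with two
# target anchors b and c, so its row appears twice on the source side.
rng = np.random.default_rng(0)
a = rng.normal(size=4)   # anchor for source "bear"
b = rng.normal(size=4)   # anchor for target "animal bear"
c = rng.normal(size=4)   # anchor for target "to have"

src = np.stack([a, a])   # source anchor repeated per translation
tgt = np.stack([b, c])   # one row per target word
W = procrustes_align(src, tgt)
```

Since W stays orthogonal, repeating a source anchor just weights that word more in the least-squares fit; it cannot map the single anchor a onto both b and c exactly, which may be why individual contexts (rather than the anchor alone) still separate in Figure 2.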
Thank you for the nice work. I read your paper and the embedding visualizations look very nice, but I can't find the code for generating them in this repo.
I'm also trying to visualize contextualized embeddings (for BERT). How did you produce those figures? Thank you so much for your help.
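In case it helps others with the same question: a common way to get such plots is to collect one contextual vector per token occurrence and project them to 2-D (PCA or t-SNE) before scattering. A hedged NumPy-only sketch, where the random `vectors` stand in for real ELMo/BERT token embeddings:

```python
import numpy as np

def pca_2d(vectors):
    """Project token vectors to 2-D with plain PCA (centered SVD)."""
    X = vectors - vectors.mean(axis=0)
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    return X @ Vt[:2].T  # coordinates along the top two components

# Hypothetical setup: one contextual embedding per occurrence of a word
# (e.g. from ELMo or BERT); here random data stands in for them.
rng = np.random.default_rng(1)
vectors = rng.normal(size=(50, 128))
points = pca_2d(vectors)  # (50, 2), ready for a scatter plot

# With matplotlib one would then plot roughly:
#   plt.scatter(points[:, 0], points[:, 1])
```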
Is it possible to train an anchored LM for few-shot parsing?
I've trained ELMo weights for several other languages (e.g., Finnish, Chinese; I'm considering contributing them to your repo), and now I want to align them (including each LSTM layer) to the English space, as you did.
However, it seems you did not release the code for generating such an alignment matrix (like those below).
P.S.: as far as I can tell, you only released the code for generating the anchors, which has nothing to do with computing the alignment matrix.
If I've misunderstood the approach, please give me a hint and correct me.
Could I have your reply on this issue?
Thanks a lot.
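For reference, once an alignment matrix W exists (learned, e.g., with MUSE or anchored Procrustes), applying it to every LSTM layer's outputs is just a matrix product per layer. A hypothetical NumPy sketch; the shapes and function name are illustrative and not from this repo:

```python
import numpy as np

def align_layers(layer_embeddings, W):
    """Apply a (d, d) alignment matrix to each layer's token embeddings,
    mapping them into the shared (English) space."""
    return [emb @ W.T for emb in layer_embeddings]

# Hypothetical shapes: two LSTM layers, 7 tokens, 1024-dim ELMo vectors.
rng = np.random.default_rng(2)
layers = [rng.normal(size=(7, 1024)) for _ in range(2)]
W = np.eye(1024)  # stand-in for a learned alignment matrix
aligned = align_layers(layers, W)
```

With the identity stand-in the embeddings pass through unchanged; in practice W would come from solving the alignment on anchor pairs for each layer separately.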
Hi, when running demo.py, models/option262.json is not present by default. Are we supposed to download it separately?
Hi!
Could you please also share the alignment matrices for the unsupervised part of your work?
Thanks
Hi,
Thank you for this interesting work. I am currently extending this approach on top of mBERT and would need to generate the English mapping from scratch. Did you learn the W matrix for English by aligning English to itself using MUSE? Wouldn't that be redundant?
Thanks,
This is not an issue, but a question. Could you tell me more about how the anchors were generated for your experiments? The paper says anchors were computed from the evaluation set (which amounts to 5% of the total CoNLL data), but I'd like to know more details, such as exactly how many sentences were used for each language.
Thanks!
Hi,
I've read your article and the README here, and I'm a little confused about two statements:
We provide the alignment of the first LSTM output of ELMo to English
and also:
Note that our alignments were done on the second layer of ELMo
My understanding is that the "first LSTM output" is not the second layer. Am I misreading the quotes? Which layer do you align?
Thanks!
@TalSchuster I'm wondering how you generated the pretrained models.
I have the following questions:
Did you use https://github.com/allenai/bilm-tf with the default arguments?
How long did you train each model?
And how many GPUs, and of what kind, did you use?