Comments (8)
By my experience, the transformer decoder works well on other tasks. I personally haven't tried BERT as an encoder for generation.
What summarization data are you using?
Have you tried replacing transformer decoder with an RNN decoder for a quick comparison?
from texar.
I am using CNN data for building the model. I have not tried RNN decoder will try to do that .
But can you please check the code once in the given link if something is wrong.
from texar.
I was able to debug the the problem. Bert weights got disturbed and it was giving similar embeddings irrespective of example I am passing so the context of the sentence at the encoder is not correctly captured. Is there a way where I can set trainable as false for Bert based encoder directly?
from texar.
You can use tf.stop_gradient
to disable gradient backpropagation to BERT, or specify the variables
argument when calling tf.contrib.layers.optimize_loss
to exclude BERT variables
from texar.
I am still facing the same issue.
Can you please elaborate on how you solved it?
from texar.
Is this the correct way?
hparams=tf.stop_gradient(hparams)
from texar.
This seems to be working:
encoder_output = tf.stop_gradient(encoder_output)
Is this the right way ?
from texar.
allvars = tf get_trainable_variables()
nonBert =[v for v in allvars if 'bert' not in v]
train_op = tx.core.get_train_op(
mle_loss,
learning_rate=learning_rate,
variables=non Bert,
global_step=global_step,
hparams=opt)
from texar.
Related Issues (20)
- BERT as encoder and a transformer as a decoder. HOT 2
- Is there anyway to empty cache/memory after loading get-2 into our model?
- "infer_greedy" and "infer_sample" for GPT2 Decoder cannot work correctly HOT 2
- How to get the source data set and target set in PairedTextData?
- Is there anyway to train "big data" using transformer? HOT 1
- In Text-Style-Transfer Example, how do you use the model on a chosen sentence? HOT 5
- TypeError: unsupported operand type(s) for +=: 'float' and 'list' HOT 3
- ModuleNotFoundError: No module named 'texar' HOT 1
- ImportError: No module named 'texar.tf' HOT 1
- ERROR: Directory '.' is not installable. Neither 'setup.py' nor 'pyproject.toml' found. HOT 8
- Text Style transfer - Train on my own dataset HOT 4
- save model and predict
- Importing ABC directly from collections will be removed in Python 3.10
- How do I print the output value of the function tx.modules.MLPTransformConnector()
- ASYML project suggestions
- Output is not as expected(style does not getting transferred)
- Question about the installation about the version 0.2.1 HOT 1
- Performance issues in the program
- pip install -e error
- Dependencies in `requirements.txt` have module conflicts.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from texar.