vinairesearch / bartpho
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)
License: MIT License
Thank you so much for this fantastic work. I'm currently doing research on the BART model and have some questions about BARTpho that need clarification.
I have been using the training paradigm from https://github.com/yixinL7/BRIO for text summarization. They used the pretrained BART model facebook/bart-large-cnn as the baseline and achieved good results on the CNN/DM dataset. I figured I could replace the baseline model with BARTpho and do the same with my custom dataset, but the cross-validation results during training were quite poor no matter how I changed the configuration.
So my questions are:
Hello @datquocnguyen ,
Can you provide sample code for the text summarization task with this model?
Thank you.
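A minimal inference sketch follows, assuming the Hugging Face `transformers` and `torch` packages are installed. The released BARTpho checkpoints are pretrained denoisers, so useful summaries would presumably require a checkpoint that has first been fine-tuned on summarization data. The helper names `generation_settings` and `summarize`, the decoding values, and the 1024-token truncation limit are illustrative assumptions, not settings published by the authors.

```python
def generation_settings(max_new_tokens=64, num_beams=4):
    """Collect decoding options in one place (all values are illustrative)."""
    return {
        "max_new_tokens": max_new_tokens,
        "num_beams": num_beams,
        "no_repeat_ngram_size": 3,
        "early_stopping": True,
    }


def summarize(text, model_name, **overrides):
    """Generate a summary with a seq2seq checkpoint.

    `model_name` is assumed to be a BARTpho checkpoint that has already been
    fine-tuned for summarization (a local path or a hub id).
    """
    # Imported lazily so generation_settings() works without these packages.
    import torch
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    model.eval()

    # Truncate long documents; 1024 is an assumed input limit.
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=1024)
    settings = {**generation_settings(), **overrides}

    with torch.no_grad():
        output_ids = model.generate(
            input_ids=inputs["input_ids"],
            attention_mask=inputs["attention_mask"],
            **settings,
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call would look like `summarize(document, "path/to/finetuned-checkpoint", num_beams=2)`, where the path is a placeholder. For the word-level variant, the input text would likely need to be word-segmented before tokenization.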
Hi authors,
AttributeError: module transformers.models.bartpho has no attribute BartphoTokenizer
I'm using Google Colab. Could you let me know how to overcome this issue?
Thanks.
Hi Mr. Đạt,
Do you plan to release a version 2 of bartpho-word-base, trained on 20 GB of Wikipedia and news text plus 120 GB of text from OSCAR-2301, like phobert-base?
I have a question about multiple mask tokens. For example, given TXT = "chúng tôi [mask] nghiên [mask] viên", I want to get the top_k predictions for the [mask] at position 1 and for the [mask] at position 2 at the same time. Does the model support this?
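One way to read off predictions for several masks at once is to run a single forward pass and take the top-k entries of the output logits at every mask position. The sketch below shows only that selection step in plain Python; the function name `top_k_per_mask` is hypothetical, and it assumes the logits have already been converted to nested lists (for example with `.tolist()`). With a real tokenizer, the mask id would come from `tokenizer.mask_token_id`, and the mask string is probably `<mask>` rather than `[mask]`. Since BART-style models are trained to fill whole spans, per-position top-k is an approximation rather than a joint prediction.

```python
import heapq


def top_k_per_mask(input_ids, logits, mask_token_id, k=5):
    """Return {position: [(token_id, score), ...]} for every mask position.

    input_ids: list of token ids for one sequence.
    logits:    list of per-position score rows, one row per input position.
    """
    results = {}
    for position, token_id in enumerate(input_ids):
        if token_id != mask_token_id:
            continue
        row = logits[position]
        # Indices of the k highest scores at this position, best first.
        best = heapq.nlargest(k, range(len(row)), key=lambda i: row[i])
        results[position] = [(i, row[i]) for i in best]
    return results


# Toy example: vocabulary of 4 tokens, mask id 9 at positions 2 and 4.
toy_ids = [0, 7, 9, 8, 9, 2]
toy_logits = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.1, 2.0, -1.0, 0.5],
    [0.0, 0.0, 0.0, 0.0],
    [3.0, 0.0, 1.5, -2.0],
    [0.0, 0.0, 0.0, 0.0],
]
toy_result = top_k_per_mask(toy_ids, toy_logits, mask_token_id=9, k=2)
```

Applying a softmax to each selected row before ranking would give probabilities instead of raw scores; the ranking itself stays the same.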
Could you update decoder_start_token_id in the model config? I found it set to None in config.json.
Thank you.
How many epochs was BARTpho trained for?
Hi @datquocnguyen,
I am very interested in your project. I followed your tutorial and tried to train a new model, but I cannot find config.json in fairseq-bartpho-word.zip. Can you tell me how to get it?
Thank you.
"I have tried as instructed here: [https://github.com//issues/2#issuecomment-1146988402]"
"I have additionally followed the instructions provided in this CSV file format:
When I load BARTpho-word via AutoModelForSeq2SeqLM to fine-tune it for text summarization, decoder_start_token_id is None. How can I load the model correctly, or how can I fine-tune BARTpho for text summarization on my dataset? My code is below:
from transformers import AutoModelForSeq2SeqLM
model = AutoModelForSeq2SeqLM.from_pretrained(model_args.model_name_or_path)
print(model.config.decoder_start_token_id)  # prints None
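A possible workaround is to fill in the missing value before fine-tuning. The sketch below rests on an assumption, not on a setting confirmed by the authors: MBart-style models in `transformers` appear to build decoder inputs by wrapping the last non-pad label token around to the front, which for labels ending in `</s>` makes `</s>` the first decoder token. `shift_tokens_right` here is a plain-Python mirror of that idea for a single sequence, `ensure_decoder_start_token` is a hypothetical helper, and the token ids (`<s>`=0, `<pad>`=1, `</s>`=2) are illustrative.

```python
from types import SimpleNamespace


def shift_tokens_right(label_ids, pad_token_id):
    """Shift one right-padded label sequence to form decoder inputs.

    The last non-pad token wraps around to position 0 and every other
    token moves one step to the right.
    """
    num_real = sum(1 for t in label_ids if t != pad_token_id)
    start_token = label_ids[num_real - 1]
    return [start_token] + list(label_ids[:-1])


def ensure_decoder_start_token(config, fallback_id):
    """Set config.decoder_start_token_id only when it is missing."""
    if getattr(config, "decoder_start_token_id", None) is None:
        config.decoder_start_token_id = fallback_id
    return config


# Illustrative ids (assumed): <s>=0, <pad>=1, </s>=2
labels = [0, 5, 6, 2, 1, 1]
decoder_inputs = shift_tokens_right(labels, pad_token_id=1)

# Stand-in for model.config, so the sketch runs without downloading a model.
config = ensure_decoder_start_token(
    SimpleNamespace(decoder_start_token_id=None), fallback_id=2
)
```

With a real model this would presumably be called as `ensure_decoder_start_token(model.config, tokenizer.eos_token_id)` right after loading, so that generation starts from the same token the decoder saw during training. Whether `</s>` or `<s>` is the better choice for BARTpho is worth confirming with the authors.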
Where can I find the source code related to Sections 3 and 4 of the paper?
Hi authors,
I plan to pretrain a BARTpho model on my custom Vietnamese datasets with the denoising objective (text infilling + sentence permutation, as described in your paper). I checked all the issues and found a related one, #8; however, I still cannot find any example or notebook at the HF link you gave that shows how to pretrain BART on a custom dataset in a denoising manner.
Could you please point me to a resource for pretraining BART? I would be very grateful.
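The two noising steps named above can be sketched in plain Python to make the objective concrete. This is a simplified illustration, not the authors' pipeline or the fairseq implementation: it works on whitespace tokens instead of subwords, the function names are hypothetical, and the defaults (`mask_ratio=0.3`, `lam=3.0`, mask string `<mask>`) are assumed values that should be checked against the paper. Each sampled span is replaced by a single mask token, and a zero-length span inserts a mask without removing anything.

```python
import math
import random


def sample_poisson(lam, rng):
    """Draw one Poisson-distributed integer using Knuth's algorithm."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def permute_sentences(sentences, rng):
    """Return the sentences in a random order (the input is not modified)."""
    shuffled = list(sentences)
    rng.shuffle(shuffled)
    return shuffled


def infill_text(tokens, rng, mask_token="<mask>", mask_ratio=0.3, lam=3.0):
    """Replace sampled spans with one mask token each.

    At most round(len(tokens) * mask_ratio) tokens are removed in total.
    """
    budget = int(round(len(tokens) * mask_ratio))
    start_prob = mask_ratio / lam  # rough per-token chance of starting a span
    noised, i = [], 0
    while i < len(tokens):
        if budget > 0 and rng.random() < start_prob:
            span = min(sample_poisson(lam, rng), budget, len(tokens) - i)
            noised.append(mask_token)
            budget -= span
            if span == 0:
                # Zero-length span: the mask is inserted, the token is kept.
                noised.append(tokens[i])
                i += 1
            else:
                i += span  # skip the tokens covered by the mask
        else:
            noised.append(tokens[i])
            i += 1
    return noised


def noise_document(sentences, rng, **kwargs):
    """Permute sentences, then apply text infilling to the joined tokens."""
    tokens = " ".join(permute_sentences(sentences, rng)).split()
    return " ".join(infill_text(tokens, rng, **kwargs))
```

During pretraining, the noised string would serve as the encoder input and the original document as the decoder target. Passing a seeded `random.Random` instance keeps the corruption reproducible.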