
Comments (6)

brightmart avatar brightmart commented on May 18, 2024

You can give it a try.
Be aware that there are some differences between BERT and ALBERT in modeling.py.
Why do you want to train a multilingual model?

from albert_zh.

kewin1807 avatar kewin1807 commented on May 18, 2024

I want to use the model with Vietnamese. I know the important change is parameter sharing. How can I train it on my language? Thanks for the support =)


brightmart avatar brightmart commented on May 18, 2024

1. You can change vocab.txt in ./albert_config, then set non_chinese to True when creating the pretraining data with create_pretraining_data.py.
2. Then pretrain using run_pretraining.py.
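
For step 1, here is a minimal sketch of how a replacement vocab.txt could be produced from a raw corpus. It is an illustration only, not the repo's own tooling: `build_vocab` and `write_vocab` are hypothetical helper names, and the whitespace word-level split is a simplifying assumption (BERT-style vocabularies are normally built with WordPiece subwords). The five special tokens are the ones BERT-style vocab files usually start with.

```python
from collections import Counter
from pathlib import Path

# Special tokens that BERT-style vocab files conventionally list first.
SPECIAL_TOKENS = ["[PAD]", "[UNK]", "[CLS]", "[SEP]", "[MASK]"]


def build_vocab(lines, min_count=1, lower=True):
    """Hypothetical helper: special tokens first, then words by descending frequency.

    Assumption: tokens are whitespace-separated words, not WordPiece subwords.
    """
    counts = Counter()
    for line in lines:
        text = line.lower() if lower else line
        counts.update(text.split())
    # Sort by frequency (descending), then alphabetically so the output is deterministic.
    words = sorted(
        (w for w, c in counts.items() if c >= min_count and w not in SPECIAL_TOKENS),
        key=lambda w: (-counts[w], w),
    )
    return SPECIAL_TOKENS + words


def write_vocab(vocab, path):
    """Hypothetical helper: write one token per line, as vocab.txt files are laid out."""
    Path(path).write_text("\n".join(vocab) + "\n", encoding="utf-8")


if __name__ == "__main__":
    corpus = ["xin chào", "xin cảm ơn"]  # tiny stand-in for a real corpus file
    for token in build_vocab(corpus):
        print(token)
```

The resulting file would take the place of vocab.txt in ./albert_config before running create_pretraining_data.py with non_chinese set to True.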


kewin1807 avatar kewin1807 commented on May 18, 2024

Okay, thanks for the support. Best repo =)


kewin1807 avatar kewin1807 commented on May 18, 2024

I tried pretraining on my dataset. The loss is very small, but the accuracy does not improve. How can I improve the result?


geekboood avatar geekboood commented on May 18, 2024

@brightmart Can we have a multilingual model for just Chinese and English? In practical scenarios we run into many English words in app names, music titles, the names of all of Apple's products, and so on, and Google's multilingual model covers too many languages. Our daily life cannot do without English. You can see that Apple tries to use pure Chinese in its products, for example replacing Finder with 访达, which I think is a total mess.
A language model for just Chinese and English could have a huge impact on both research and industry, and many multilingual tasks could benefit from it.

