jiangtaojy / mlm_bert_traning Goto Github PK
View Code? Open in Web Editor NEW基于mlm方式的带有纠错功能的拼音转汉字bert预训练模型,pinyin correcter,基于pytorch框架实现
基于mlm方式的带有纠错功能的拼音转汉字bert预训练模型,pinyin correcter,基于pytorch框架实现
[email protected] 非常感谢作者大大!
你好,请问能否提供预训练模型,且我有几个问题:
1、请问你的预训练数据是多少?
2、这个模型是否是个通用模型,在特定场景下也需要如图bert一样去调优?
3、这个方法在asr文本纠错中的效果如何?
本人刚开始做asr文本纠错,不知道能否给点建议啥的 O(∩_∩)O哈哈~ 谢谢大佬
您好,想试一下您的预训练语言模型:[email protected], 谢谢
大佬训练数据能分享一下吗?[email protected] 非常感谢
请问训练的时候,都没办法保存训练结果,该怎么修正呢 ?
另外,训练时loss从10几开始,然后慢慢往下降,这样是正常的吗?
我的训练数据是没有音符的,后面对应的label也是使用unicode
你好,可以给一个训练后的checkpoint吗?谢谢
您好,不知是否方面分享一下模型文件呢?
我的邮箱是[email protected]。感谢大佬
谢谢,如果是百度网盘最好了
感谢!!
自己预训练的话 是需要用utils.py的模糊音算法来构造一批数据是吧,构造数据错误特征要自己从声学模型中找,构造后再投入训练。
作者你好 求一份模型 邮箱 [email protected]
你好,我在使用transformer进行拼音转写的时候,不带有纠错功能,在通用数据上,准确率可以达到97-98%左右,请问你这个模型在专有领域上可以达到多少准确率,麻烦发一下模型,看一看效果谢谢啦,[email protected]
[email protected]
谢谢大佬!
您好,
RT, 想试一下您的预训练语言模型:[email protected], 谢谢!!!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.