Comments (9)
- 您不需要生成自己的Word_Sense_Sememe_File,这个文件是通用的,它是直接从HowNet中抽取得到,一般来讲足以覆盖新的语料。
- 可以直接使用我提供的Word_Sense_Sememe_File做对齐,它包含了HowNet中的所有词。
from se-wrl.
谢谢解答!
from se-wrl.
您好,我不知道是不是我理解有误:
Word_Sense_Sememe_File 文件中,有一些我无法理解的现象,比如第12行的
“也 1”
“也”字可以理解为词,但是后面紧跟着的数字“1”怎么解释呢?
我发现 Word_Sense_Sememe_File 文件后面存在很多这样的情况——一个词后面跟着一个数字1——这应该怎么理解?
from se-wrl.
表示在hownet里没有对这个词的解释,只有一个词,没有对应的sense和sememe,所以标1
from se-wrl.
谢谢指教!
from se-wrl.
如果“在hownet里没有对这个词的解释”,那么 SE-WRL 模型应该怎么训练这个词的 embedding 呢?
from se-wrl.
模型随机初始化这个词的embedding,每次预测这个单词的出现概率的时候,都会用这个embedding作为在当前上下文环境下的词表示。因为我们认为这个单词只有一个含义,所以词表示不会随着上下文变化而改变。训练过程和skip-gram类似。
from se-wrl.
谢谢您耐心的解答!
from se-wrl.
建议加一个文件格式说明的文件。
from se-wrl.
Related Issues (20)
- vectors.bin HOT 5
- 数据预处理文件 & 其他语言 & 效率问题 HOT 8
- 训练和评估问题 HOT 2
- 训练结果 HOT 1
- 您好,我目前已经跑出了SAT的结果,在similarity和analogy的mean rank指标上表现都很好,但唯独在analogy的accuracy指标上与论文中的结果相差很远。我的参数设置如下:
- how to get the "HowNet.txt" HOT 2
- muti-embedding for one context word
- VocabFile problem HOT 1
- 论文打不开,提示缺少字体
- 词义消歧 HOT 3
- word_vec的作用 HOT 3
- 未能复现 SSA 结果
- 未能复现结果
- 语料库 HOT 1
- MST.c 运行时段错误 HOT 1
- 您好,关于pretrained词表规模的一点问题。 HOT 1
- 请问只有一个sense的词的sememe id怎么得到? HOT 1
- 模型应用 HOT 5
- 请问这个模型用C写的初衷是什么呢? HOT 1
- 可以公开训练好的word embedding文件吗
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from se-wrl.