Comments (2)
UPDATE:
I downloaded the code again compiled and ran it. It now shows this classification: GCAT--GCRIM, for the sample input:
{"doc_label": ["Computer--MachineLearning--DeepLearning","Neuro--ComputationalNeuro"],"doc_token": ["I", "love", "deep", "learning"],"doc_keyword": ["deep learning"],"doc_topic": ["AI", "Machine learning"]}
is it correct? anybody can confirm, please?
Also, it seems training the model with the same dataset multiple times does not produce the same prediction result. Is it what we expect? I am confused.
Last question. In the training dataset the doc_labels are like "CCAT--C31--C312" which is what the algorithm learns to classify when "doc_token" is given. But why do we need "doc_label" (isn't the "doc_label" we are trying to predict?) in the input that we want to predict? and why the "doc_label" texts are in "word" format rather than "code" format? It might be a trivial question to someone who is familiar with the work but I am kind of new in this area. Any input would be highly appreciated. Thanks in advance.
from neuralnlp-neuralclassifier.
UPDATE:
I downloaded the code again compiled and ran it. It now shows this classification: GCAT--GCRIM, for the sample input:{"doc_label": ["Computer--MachineLearning--DeepLearning","Neuro--ComputationalNeuro"],"doc_token": ["I", "love", "deep", "learning"],"doc_keyword": ["deep learning"],"doc_topic": ["AI", "Machine learning"]}
is it correct? anybody can confirm, please?
If you only edit hierarchical
and model_name
in train.hmcn.json
, your training is still using the default toy dataset data/rcv1_train.hierar.json
and vocabulary data/rcv1.taxonomy
. Of course your model can only output labels defined in data/rcv1.taxonomy
.
BTW, HMCN is a model designed for hierarchical classification, so you should not set hierarchical
to false
when using HMCN.
Last question. In the training dataset the doc_labels are like "CCAT--C31--C312" which is what the algorithm learns to classify when "doc_token" is given. But why do we need "doc_label" (isn't the "doc_label" we are trying to predict?) in the input that we want to predict? and why the "doc_label" texts are in "word" format rather than "code" format? It might be a trivial question to someone who is familiar with the work but I am kind of new in this area. Any input would be highly appreciated. Thanks in advance.
In prediction, the label you provide is actually not used, we need it because training and prediction share the same code, and we need label in training.
from neuralnlp-neuralclassifier.
Related Issues (20)
- why use word frequency as word id? any good to do it? HOT 1
- does TNeuralNLP-NeuralClassifier use fixed length as input with padding or non-fixed length ? HOT 1
- how to handle unknow vocabulary on Tencent / NeuralNLP-NeuralClassifier HOT 1
- Hi, is there an example predict.json file for reference? HOT 1
- HMCN RuntimeError: The size of tensor a (13) must match the size of tensor b (10) at non-singleton dimension 1 HOT 2
- f-score 等于 0 HOT 1
- very low performance HOT 1
- HMCN模型在Evaluation时报错 HOT 1
- 构建dict_rcv1/doc_label.dict 时采用的是json文件而非taxonomy文件 HOT 9
- 使用Hierar效果反而不如Flat HOT 3
- RCV1数据集 HOT 1
- HMCN模型的代码实现里面,没有Hierarchical Violation Penalty loss ? HOT 2
- HMCN运行报错 HOT 6
- 运行python train.py conf/train.hmcn.json 报错RuntimeError: 'lengths' argument should be a 1D CPU int64 tensor, but got 1D cuda:0 Long tensor HOT 1
- 请问这个层次分类支持bert吗? HOT 1
- HMCN是怎么同时考虑层次标签之间关系的呢? HOT 1
- 在进行中文文本分类出现了这个错误TypeError: cannot unpack non-iterable NoneType object
- 训练hierarchical classifier with HMCN应该如何修改分类的层级
- 设置“region_embedding” 运行报错“RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!”
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from neuralnlp-neuralclassifier.