zhoujx4 / duee Goto Github PK

View Code? Open in Web Editor NEW

70.0 70.0 10.0 11.49 MB

百度2021年语言与智能技术竞赛多形态信息抽取赛道事件抽取部分torch版baseline

Python 100.00%

duee's Introduction

Hi there 👋

About me

NLP算法工程师一枚🤓 多实践、多交流、多思考

公众号: NLP煎饼摊
微信: ZHOU-JXX
知乎
 谷歌学术

duee's People

Stargazers

Watchers

Forkers

qcwthu masterkmp rela0426 pink-duck-chao xiaoanshi zygao95 73ad yy2lyx beeevita jiangxinke

duee's Issues

有关DuEE-1.0预测结果后处理，生成预测文件部分的问题

您好，这部分需要运行的文件应该是duee_1_postprocess.py这个文件吧？您在readme里面写的是duee_1_data_prepare.py，这里是不是写的有些问题？

训练DuEE-Fin数据的enum分类时，batch_loss=nan

我用的本机电脑训练批次是8，learning_rate=1e-6、1e-5、1e-4都试过了还是nan
我感觉是批次太小导致情况没有概括全，训练批次为1我也试过了不行如下图：

DuEE缺少data文件夹

问下数据处理的时候调用的from data.data_utils import schema_process, data_process
data这个文件夹不存在，能发下么

f1分数

有的预测结果event_list是空的，提交上去计算出来的f1值都是0，请问这两者之间有关系吗，还是说提交文件的格式有错误

运行duee predict_cls出现的问题

运行:

CUDA_VISIBLE_DEVICES=2 python predict_cls.py --dataset=DuEE-Fin --event_type=enum -- max_len=256 --per_gpu_eval_batch_size=32 --model_name_or_path=/home/user/pretrained-model/chinese-roberta-wwm-ext-large --fine_tunning_model_path=./output/DuEE-Fin/enum/best_model.pkl --test_json=./data/DuEE-Fin/sentence/test.json

报错:
predict_cls.py: error: the following arguments are required: --fine_tunning_model_path, --test_json

请问大佬数据集DuEE1.0中的tsv文件和预测用的duee_test1.json在哪里下载到的，找了好久测试集tsv格式和json格式的数据不太一样?

您好！我在复现时碰到以下问题：
File "run_ner.py", line 179, in
main()
File "run_ner.py", line 161, in main
eval_p, eval_r, eval_f1, eval_loss = evaluate(args, eval_iter, model, metric)
File "run_ner.py", line 46, in evaluate
n_infer, n_label, n_correct = metric.compute(batch["all_seq_lens"], preds, batch['all_labels'])
File "/home//DuEE/metric/metric.py", line 74, in compute
] for sent_index in range(len(lengths))]
File "/home//DuEE/metric/metric.py", line 74, in
] for sent_index in range(len(lengths))]
File "/home/***/DuEE/metric/metric.py", line 73, in
for index in labels[sent_index][:lengths[sent_index]]
KeyError: -1

打印出来label
[[-1 26 26 ... -1 -1 -1]
[-1 26 26 ... -1 -1 -1]
[-1 26 26 ... 26 26 26]
...
[-1 26 26 ... -1 -1 -1]
[-1 26 26 ... -1 -1 -1]
[-1 26 26 ... -1 -1 -1]]

对比一下，除了与paddle的ChunkEvaluator类中，相关的下标不同以外

好像没有其他区别。
请问有没有解决方法？

model = bert_lstm_crf(args.model_name_or_path, args.id2label,num_classes=args.num_classes,
rnn_hidden_size=args.rnn_hidden_size,rnn_layers=args.rnn_layers)
    model.cuda()
    net = nn.DataParallel(model)
    model=net.module
    # model.to(args.device)

但是运行过程中，GPU的利用率情况是下面这样的：

依旧是1个GPU在跑，而且显存已经快爆了，但是利用率却不高。另外一个GPU一直不动。在你的代码里，有一个参数是do_distri_train，跟它有关系吗？
求帮忙，谢谢~~