Giter VIP home page Giter VIP logo

Comments (10)

yumath avatar yumath commented on July 18, 2024

@pomeloking01 是有可能会出现这种情况。我的理解是基于条件随机场的参数计算输出结果时,因为需要计算整个序列的联合概率,求得最优标记序列,神经网络输出的轻微扰动有可能会导致这一结果出现一定的变化。

from bertner.

wjxdp avatar wjxdp commented on July 18, 2024

@pomeloking01 是有可能会出现这种情况。我的理解是基于条件随机场的参数计算输出结果时,因为需要计算整个序列的联合概率,求得最优标记序列,神经网络输出的轻微扰动有可能会导致这一结果出现一定的变化。

我也遇到一样的问题,但是打印每层输出发现,bert_embedding每次输出都不一样,想不明白是哪里的问题,作者可以帮忙看看么?

from bertner.

LHMdanchaofan avatar LHMdanchaofan commented on July 18, 2024

@pomeloking01 是有可能会出现这种情况。我的理解是基于条件随机场的参数计算输出结果时,因为需要计算整个序列的联合概率,求得最优标记序列,神经网络输出的轻微扰动有可能会导致这一结果出现一定的变化。

我也遇到一样的问题,但是打印每层输出发现,bert_embedding每次输出都不一样,想不明白是哪里的问题,作者可以帮忙看看么?

当你预测时,应该把bert_embedding的is_training改为False,否则的话预测的时候会dropout。导致预测的结果有差异。

from bertner.

FancyZhang96 avatar FancyZhang96 commented on July 18, 2024

作者,您好,请问一下您的进行实验的配置是怎样的?用到的服务器的性能是怎样的,我用自己的电脑是直接内存爆炸的,可以请教一下吗?

from bertner.

yumath avatar yumath commented on July 18, 2024

@FancyZhang96 没有GPU吗?那你把BatchSize设置为1,看你的内存能不能带得动,这样还不行的话,你得想办法弄台设备。

from bertner.

pomeloking01 avatar pomeloking01 commented on July 18, 2024

from bertner.

FancyZhang96 avatar FancyZhang96 commented on July 18, 2024

@FancyZhang96 没有GPU吗?那你把BatchSize设置为1,看你的内存能不能带得动,这样还不行的话,你得想办法弄台设备。
您好,我基于自己的数据进行实验,引入Bert之后效果没有原来的BiLSTM-CRF效果好,请问能指教一下原因吗?

from bertner.

yumath avatar yumath commented on July 18, 2024

@FancyZhang96 你给的信息太少了,没法分析。

from bertner.

FancyZhang96 avatar FancyZhang96 commented on July 18, 2024

@FancyZhang96 你给的信息太少了,没法分析。

我之前用了自己的数据集在您提到的BiLSTM-CRF中训练F值达到了91%,参数都没有改,然后将数据集放在了这个引入BERT中模型中,也是参照您的参数,没有改动,F值只有88%。

from bertner.

yumath avatar yumath commented on July 18, 2024

@FancyZhang96 是领域数据吗?你如果有问题,请另开一个issue,避免与本issue主题无关的讨论影响别人,本issue close了

from bertner.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.