
commonlit-readability-prize-silver-medal-solution's Introduction

2021 Kaggle Featured Code Competition: CommonLit Readability Prize (Silver Medal Solution, Final Rank 96/3633 teams)


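  • Task: build algorithms that rate the complexity of reading passages used in grade 3-12 classrooms, i.e. estimate how readable and easy to understand a text is.
  • Evaluation metric: RMSE
  • Competition data
    • id: unique ID of each excerpt
    • url_legal: source URL of the text; blank in the test set
    • license: license of the source material; blank in the test set
    • excerpt: the text passage whose readability is to be predicted
    • target: readability score, the value to predict
    • standard_error: a measure of the spread of scores among the multiple raters for each excerpt; not included in the test data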
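As a reminder of what the metric measures, RMSE is the square root of the mean squared difference between predicted and true `target` values. A minimal sketch in plain Python follows; the function name `rmse` and the sample numbers are illustrative only and do not come from the competition data.

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error between two equally long sequences."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))

# Illustrative values only: one perfect prediction, one that is off by 2.
score = rmse([0.0, 1.0], [0.0, 3.0])
```

Lower is better; a model that always predicted the exact target would score 0.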

Solution: RoBERTa Large/Base + Attention/Mean Head

ITPT====>>Finetune====>>{RoBERTa Large/Base + Attention/Mean Head}====>>Inference

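• Split the data into k folds stratified by target range (1.train-val-split)
• Continue pre-training on the training corpus provided by the competition, using the MLM task
• Fine-tune by attaching additional layers to the pre-trained model's output; the main pooling heads used are AttentionHead and MeanPooling
• Ensembling: a weighted sum of model predictions, with weights set according to public leaderboard scores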
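The first and last steps in this list can be sketched in plain Python. This is an illustration under stated assumptions, not the code from the notebooks: `assign_folds` and `blend` are hypothetical names, the equal-width binning and the defaults (`n_folds=5`, `n_bins=10`) are assumptions, and the blend weights in the usage lines are made up.

```python
import random

def assign_folds(targets, n_folds=5, n_bins=10, seed=42):
    """Give every sample a fold id so each fold covers the whole target range.

    Targets are grouped into equal-width bins (an assumption), shuffled
    within each bin, then dealt round-robin across the folds.
    """
    lo, hi = min(targets), max(targets)
    width = (hi - lo) / n_bins
    if width == 0:
        width = 1.0  # all targets identical: everything lands in bin 0
    bins = {}
    for idx, t in enumerate(targets):
        b = min(int((t - lo) / width), n_bins - 1)  # clamp the maximum value
        bins.setdefault(b, []).append(idx)
    rng = random.Random(seed)
    folds = [0] * len(targets)
    counter = 0
    for b in sorted(bins):
        members = bins[b]
        rng.shuffle(members)
        for idx in members:
            folds[idx] = counter % n_folds
            counter += 1
    return folds

def blend(predictions, weights):
    """Weighted average of several models' prediction lists."""
    total = sum(weights)
    n = len(predictions[0])
    return [sum(w * p[i] for w, p in zip(weights, predictions)) / total
            for i in range(n)]

# Usage with made-up numbers.
fold_ids = assign_folds([i / 10 for i in range(-30, 20)])
blended = blend([[1.0, 2.0], [3.0, 4.0]], weights=[1.0, 3.0])
```

Dividing by the sum of the weights keeps the blended output on the same scale as the individual predictions, which matters for an RMSE-scored regression target.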

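  • ITPT: continued pre-training
    BERT is pre-trained on a general-domain corpus, so when it is applied to text classification in a specific domain, the data distribution will differ to some extent. In that case, further pre-training is worth considering. Within-task pre-training: BERT is further pre-trained on the training data of the target task. (2.clrp-pretrain)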
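To make the MLM objective concrete, here is a minimal sketch of the token-masking step using the BERT recipe: select about 15% of tokens, then replace 80% of those with the mask token, 10% with a random token, and leave 10% unchanged. It works on plain string tokens so it runs without any model; `mask_tokens`, the tiny vocabulary, and the `<mask>` string are assumptions made for illustration and are not taken from the pre-training notebook.

```python
import random

def mask_tokens(tokens, vocab, mask_token="<mask>", mlm_prob=0.15, seed=None):
    """Return (inputs, labels) for masked language modelling.

    labels[i] holds the original token at selected positions and None
    elsewhere, so the loss would only be computed on selected positions.
    """
    rng = random.Random(seed)
    inputs, labels = [], []
    for tok in tokens:
        if rng.random() < mlm_prob:
            labels.append(tok)            # the model must recover this token
            r = rng.random()
            if r < 0.8:
                inputs.append(mask_token)         # 80%: replace with mask
            elif r < 0.9:
                inputs.append(rng.choice(vocab))  # 10%: random token
            else:
                inputs.append(tok)                # 10%: keep unchanged
        else:
            inputs.append(tok)
            labels.append(None)           # position ignored by the loss
    return inputs, labels

# Usage on a toy sentence.
sentence = "the quick brown fox jumps over the lazy dog".split()
masked, targets = mask_tokens(sentence, vocab=sentence, seed=0)
```

In practice the masking is redrawn for every batch, so the model sees different masked positions for the same excerpt across epochs.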

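  • Features from different layers
    Each BERT layer captures different features of the input text. The model is fine-tuned on features from different layers, and the test error of each choice is recorded to find the most effective features and improve accuracy. (3.clrp-finetune-roberta-large)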
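Whichever layer's features are used, they have to be reduced from one vector per token to one vector per excerpt before a regression layer can predict the score. The two heads named in this solution, mean pooling and attention pooling, can be sketched in plain Python as below. This is a simplified illustration: `mean_pool` and `attention_pool` are hypothetical names, and the attention score is reduced to a single dot product with a vector `w` passed in by hand, whereas a trained head would learn its scoring parameters.

```python
import math

def mean_pool(hidden, mask):
    """Average the token vectors whose mask value is 1 (padding is skipped)."""
    kept = [vec for vec, m in zip(hidden, mask) if m]
    if not kept:
        raise ValueError("mask selects no tokens")
    dim = len(kept[0])
    return [sum(vec[d] for vec in kept) / len(kept) for d in range(dim)]

def attention_pool(hidden, mask, w):
    """Weighted average of token vectors.

    Each real token gets the score w . h; padded tokens get -inf so that
    the softmax assigns them zero weight.
    """
    scores = [sum(wi * hi for wi, hi in zip(w, vec)) if m else -math.inf
              for vec, m in zip(hidden, mask)]
    top = max(scores)
    if top == -math.inf:
        raise ValueError("mask selects no tokens")
    exps = [math.exp(s - top) for s in scores]  # subtract max for stability
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(hidden[0])
    return [sum(a * vec[d] for a, vec in zip(weights, hidden))
            for d in range(dim)]

# Toy input: two real tokens and one padding token (mask 0).
hidden = [[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]
mask = [1, 1, 0]
pooled_mean = mean_pool(hidden, mask)
pooled_attn = attention_pool(hidden, mask, w=[0.0, 0.0])
```

With an all-zero scoring vector every real token receives the same weight, so attention pooling reduces to mean pooling; a learned scoring function lets the head emphasise some tokens over others.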

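  • Layer-wise discriminative learning rates
    Assigning lower learning rates to the lower layers proved effective for fine-tuning RoBERTa-Large; a suitable setting is ξ=0.9 (the layer-wise decay factor) and lr=3.0e-5. Here, 24 is the number of encoder layers in the Large model; change it to 12 when using the base model.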
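The decay schedule can be written out directly. The sketch below assumes the top encoder layer receives the base learning rate and each layer below it is scaled by a further factor of ξ; `layerwise_lrs` is a hypothetical helper, and how the embeddings and the regression head are treated is left out because the text above does not specify it.

```python
def layerwise_lrs(base_lr=3.0e-5, decay=0.9, num_layers=24):
    """Learning rate per encoder layer, index 0 = lowest layer.

    The top layer gets base_lr; every step down multiplies by `decay`.
    """
    return [base_lr * decay ** (num_layers - 1 - k) for k in range(num_layers)]

# RoBERTa-Large has 24 encoder layers, RoBERTa-Base has 12.
large_lrs = layerwise_lrs(num_layers=24)
base_lrs = layerwise_lrs(num_layers=12)
```

Each value would then become the `lr` of one optimizer parameter group holding that layer's weights, so that a single optimizer steps all layers at their own rates.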

Final Rank: 96/3633


Another silver medal earned as a solo team, which serves as a check on what nearly half a year of studying NLP has yielded :)

Contributors

leo1998-lu
