Giter VIP home page Giter VIP logo

icme2019_short_video_understanding_challenge_rank14's Introduction

ICME2019短视频内容理解与推荐竞赛Rank 14方案

比赛链接

队员:Yumho, byxshr

文件说明

  • utils.py: 提供了生成label均值特征的函数
  • preprocess.py: 视频特征和音频特征预处理
  • model.py: 模型文件
  • track2.py: 训练文件

机器配置

  • 内存256G,TITAN Xp 12G显存 * 2

模型

基于DeepCTR的xDeepFM模型,做了些修改来支持视频特征和音频特征的输入。其中视频特征和音频特征通过128->embedding_size的神经网络做embedding,拼接到所有特征的embedding向量后面。

具体参数设置可查看track2.py

特征

  1. 原始特征(uid, user_city, item_id, author_id, item_city, music_id, did, video_duration)
  2. 计数特征,即统计某个字段的出现次数(uid, did, item_id, author_id, uid-author_id)
  3. label均值特征,即根据某个字段分组统计每个分组的标签均值(uid, did, item_id, uid-author_id, uid-did, did-channel)
  4. nunique特征,例如uid_item_nunique,是统计每个uid下有多少不同的item_id,等频离散化
    • uid_icity_nunique
    • uid_item_nunique
    • uid_author_nunique
    • uid_music_nunique
    • item_ucity_nunique
    • item_uid_nunique
    • author_uid_nunique
  5. 视频特征
  6. 音频特征
  7. 标题特征,提取视频标题的不重复字段,当作序列特征输入,最后做sum pooling得到embedding向量

成绩

最终成绩是跑10次取平均;
track1:a/b榜都是15;
track2:a榜第14,分数为0.79405 (0.73,0.93);b榜第14,分数为0.79485 (0.74,0.93)

参考

icme2019_short_video_understanding_challenge_rank14's People

Contributors

rearcher avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.