Giter VIP home page Giter VIP logo

siamese_dssm's Introduction

siamese_dssm

v1.0

simaese 判断句子相似度。

v2.0

添加 基于siamese的句子相似度排序,类似于 搜索召回

v3.0

添加 dssm,判断句子相似度

v4.0

dssm和 siamese融合,强化句子相似度排序

目前处于v3.0阶段

入口文件:train.py 执行方式:python train.py 句向量召回测试: infer.py 优化

语料:corpus.txt

所用版本: python=3.5.2 tensorflow=1.3.0

优化方式: 目前已做优化:

    1.余弦距离计算方式完善
    
    2.添加激活函数
    
尚待优化:

    1.更改相似度计算方式及损失函数,余弦距离+方差 改为其他诸如 交叉熵等等;

    2.更改句子向量获取方式,rnn改为cnn;

    3.rnn输出,output或者state作为下一步的变量

siamese_dssm's People

Contributors

zheng5yu9 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.