
benchmark-nlp-2018's Introduction

Benchmark report

Setup

We use three corpora, Chatbot, Ask Ubuntu and Web Applications, as described in the paper http://workshop.colips.org/wochat/@sigdial2017/documents/SIGDIAL22.pdf

Corpus            Number of intents  Train  Test
Chatbot           2                  100    106
Ask Ubuntu        5                  53     109
Web Applications  8                  30     59

For the Ask Ubuntu and Web Applications corpora, there is a specific None intent for sentences that should not be matched with any of the other intents.

Intent classification results

Although the paper benchmarks both intent classification and entity extraction, we focus only on intent classification. We compute the F1 score for each corpus and the overall F1:

Platform \ Corpus  Chatbot  Ask Ubuntu  Web Applications  Overall
Botfuel            0.98     0.90        0.80              0.91
Luis               0.98     0.90        0.81              0.91
API (DialogFlow)   0.93     0.85        0.80              0.87
Watson             0.97     0.92        0.83              0.92
RASA               0.98     0.86        0.74              0.88
Snips              0.96     0.83        0.78              0.89
Recast             0.99     0.86        0.75              0.89

  • The Botfuel result was obtained using the Botfuel Trainer.
  • The results for Luis, API (now DialogFlow), Watson and RASA are taken from the paper (intent classification part only).
  • The Snips result is computed from the results on their GitHub page (https://github.com/snipsco/nlu-benchmark/tree/master/2018-01-Braum-et-al-extension). They report both intent classification and entity extraction; we extracted the intent classification part.
  • The Recast result was produced by us using the Recast API and UI (the UI served to verify manually that the intents and entities are the same as in the training set).
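
As a rough illustration of how a per-corpus F1 and an overall F1 could be computed from intent predictions, here is a minimal sketch. It assumes micro-averaged F1 (true positives, false positives and false negatives pooled over all intents, with None treated as an ordinary intent) and an overall score obtained by pooling the test sentences of all corpora. This README does not state which averaging was used, so both choices are assumptions, and the function names and toy labels are hypothetical.

```python
def micro_f1(gold, predicted):
    """Micro-averaged F1 for single-label intent classification.

    gold and predicted are equal-length lists of intent labels.
    Assumption: counts are pooled over all intents, None included.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")

    tp = fp = fn = 0
    for g, p in zip(gold, predicted):
        if g == p:
            tp += 1  # correct intent
        else:
            fp += 1  # the predicted intent received a wrong hit
            fn += 1  # the gold intent was missed

    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def overall_micro_f1(corpora):
    """Overall F1 over several corpora, pooling all test sentences.

    corpora is a list of (gold_labels, predicted_labels) pairs.
    """
    gold = [g for golds, _ in corpora for g in golds]
    predicted = [p for _, preds in corpora for p in preds]
    return micro_f1(gold, predicted)


# Toy labels for illustration only; these are not benchmark data.
corpus_a = (["FindConnection", "DepartureTime"], ["FindConnection", "DepartureTime"])
corpus_b = (["None", "Setup", "Setup", "None"], ["None", "Setup", "None", "Setup"])

print(micro_f1(*corpus_a))
print(micro_f1(*corpus_b))
print(overall_micro_f1([corpus_a, corpus_b]))
```

Under these assumptions every test sentence receives exactly one predicted intent, so micro-averaged F1 coincides with plain accuracy. A macro average over intents would generally give different numbers.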

benchmark-nlp-2018's People

Contributors

tcnguyen


benchmark-nlp-2018's Issues

Results with Recast.AI

Hi, I have a question about how you came up with the Recast.AI results. Do you know when you made your bot on their platform?
I am getting slightly different results: 0.97 on Chatbot, 0.83 on AskUbuntu and 0.72 on WebApplications.
