Giter VIP home page Giter VIP logo

standard-benchmark's People

Contributors

elshize avatar

Watchers

 avatar  avatar  avatar  avatar

standard-benchmark's Issues

Repo name

We should change the repo/project name to something more generic. The major objective is to run a big query benchmark but it's becoming more than that.

For instance, it could be used by anyone just to build collections, compress and such, and then work on that for, say, feature extraction. This is more or less my use case in query routing project. I will want to do all from prasing through indexing and wand data, but then I'll do some custom stuff. I will definitely reuse that and having something like that would help with reproducibility of any paper we publish in the future that is based on PISA codebase.

So, any suggestions?

Add option to print certain logs to file

For example, there's a lot of logs that are printed out from PISA itself, like parsing collection. However, it might be good to have the ability to save only [EXEC] logs to see what has been executed throughout the run.

trec_eval

trec_eval is now considered a system dependency but it's small and fast to compile, and it would be nice to make it part of the project. Not sure how yet, I'll think of a good approach but this is not a priority so I'm creating this issue.

Take multiple topic files

Like so:

runs:
    - collection: wapo
      type: evaluate
      topics:
        - /data/collections/WashingtonPost.v2/topics.core18.txt
        - /data/collections/WashingtonPost.v2/topics.core18.txt
      qrels:
        - /data/collections/WashingtonPost.v2/qrels.core18.txt
        - /data/collections/WashingtonPost.v2/qrels.core18.txt
      output: wapo.trec

Failing test (sometimes?)

---- run::tests::test_evaluate_simple_topics stdout ----
eval_results = 
eval_results = 
thread 'run::tests::test_evaluate_simple_topics' panicked at 'assertion failed: `(left == right)`
  left: `EchoOutput([])`,
 right: `EchoOutput(["/tmp/build.J86mqR353kVz/trec_eval -q -a qrels /tmp/build.J86mqR353kVz/output.trec.wand.results"])`', src/run.rs:175:9
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace.

Testing against gold standard

In run's config, both bench and eval, there should be a way of pointing to a gold standard. If different (?), it should fail. For use in, say, Jenkins or Travis.

Override config

I want to provide an override for any given program run from executor.

This would be an "unstable" feature in a way that it will not ensure it's correct and just pass it to the program as is. It is not ideal but on the other hand very flexible. It would be for example used when waiting for a feature to be implemented.

I think in general this should be discouraged. If we wanted something this unstable, we could have done it in python or bash. I really want these things to fail early whenever they can, preferably when loading config.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.