Giter VIP home page Giter VIP logo

pig-bench's Introduction

PIG: Performance Introspection of Graph Databases

The explosion of graph data in social and biological networks, recommendation systems, provenance databases, etc. makes graph storage and processing of paramount importance.

PIG is a new graph benchmarking framework, which provides both a methodology for evaluating graph database performance and a mechanism to carry out such evaluations. It takes a hierarchical approach to benchmarking. The suite has three layers of benchmarks:

  • Primitive operations such as reading and writing vertices and edges
  • Composite access patterns such as extracting k-hop neighborhoods
  • Graph algorithms such as shortest path finding and computing centrality metrics

This framework allows for comparisons between systems as well as single system introspection. Such introspection allows one to evaluate the degree to which systems exploit their knowledge of graph access patterns. The suite also comes with a web interface that makes it easy to run benchmarks and to visualize and analyze the collected data.

Quick-Start

To run PIG, you will need:

After installing all the prerequisites and checking out the source code of PIG, cd into the graphdb-bench directory and type:

mvn install

You can then start the web interface using:

./runWebInterfaceServer.sh

This will start a server on port 8080. Or you can run the benchmark tools directly from the command-line using:

./runBenchmarkSuite.sh`

Use the --help option to get the list of available commands or +help to see advanced options and options for configuring the JVM.

Configuration

To edit the configuration of PIG, please edit the following file:

graphdb-bench/src/main/resources/com/tinkerpop/bench/bench.properties`

You can also override many options using command-line arguments and/or the web interface.

Datasets

You can generate your own datasets using fgftool distributed as a part of Blueprints Extensions (one of the prerequisites of PIG). You can also download datasets with up to 1 million nodes from here:

https://drive.google.com/folderview?id=0B3jkRHQ7nKvnbDhsWHBySVV6VVk&usp=sharing

Place the datasets in the directory specified in the configuration file. The default is data/datasets in the project directory.

Publications

  • Peter Macko, Daniel Margo, and Margo Seltzer. Performance Introspection of Graph Databases. 6th International Systems and Storage Conference (SYSTOR '13), Haifa, Israel, June 2013. (pdf)

pig-bench's People

Contributors

joshsh avatar okram avatar pmacko86 avatar

Stargazers

 avatar

Watchers

James Cloos avatar

pig-bench's Issues

Error during mvn install

What steps will reproduce the problem?
1. mvn install
2.
3.

What is the expected output? What do you see instead?

Expected: Successful
Instead: Error
[ERROR] Failed to execute goal on project graphdb-bench: Could not resolve 
dependencies for project com.tinkerpop:graphdb-bench:jar:0.1-SNAPSHOT: Failed 
to collect dependencies for [net.sf.jopt-simple:jopt-simple:jar:4.3 (compile), 
net.sf.opencsv:opencsv:jar:2.3 (compile), edu.harvard.pass:cpl:jar:1.01 
(compile), commons-lang:commons-lang:jar:2.4 (compile), 
commons-io:commons-io:jar:2.4 (compile), 
com.tinkerpop.gremlin:gremlin-groovy:jar:2.3.0 (compile), 
com.tinkerpop.gremlin:gremlin-java:jar:2.3.0 (compile), 
com.tinkerpop:pipes:jar:2.3.0 (compile), 
com.tinkerpop.blueprints.extensions:blueprints-extensions-core:jar:1.0-SNAPSHOT 
(compile), 
com.tinkerpop.blueprints.extensions:blueprints-extensions-impl:jar:1.0-SNAPSHOT 
(compile), 
com.tinkerpop.blueprints.extensions:blueprints-bdb-graph:jar:1.0-SNAPSHOT 
(compile), 
com.tinkerpop.blueprints.extensions:blueprints-sql-graph:jar:1.0-SNAPSHOT 
(compile), junit:junit:jar:4.5 (compile), log4j:log4j:jar:1.2.14 (compile), 
org.eclipse.jetty:jetty-server:jar:8.1.5.v20120716 (compile)]: Failed to read 
artifact descriptor for edu.harvard.pass:cpl:jar:1.01: Could not transfer 
artifact edu.harvard.pass:cpl:pom:1.01 from/to tinkerpop-repository 
(http://tinkerpop.com/maven2): Access denied to: 
http://tinkerpop.com/maven2/edu/harvard/pass/cpl/1.01/cpl-1.01.pom , 
ReasonPhrase:Forbidden. -> [Help 1]


What version of the product are you using? On what operating system?

Latest version on Xubuntu 13.03
Oracle JDK 7

Original issue reported on code.google.com by [email protected] on 22 Jul 2013 at 8:15

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.