anoma / zkp-compiler-shootout Goto Github PK

View Code? Open in Web Editor NEW

118.0 23.0 13.0 20.84 MB

Evaluating & benchmarking ZKP compilation strategies.

Home Page: https://anoma.github.io/zkp-compiler-shootout/

License: GNU General Public License v3.0

Rust 40.38% Common Lisp 32.76% Makefile 0.21% Emacs Lisp 0.04% Parrot 26.60%

benchmarking zero-knowledge zkp

zkp-compiler-shootout's People

Contributors

$fraccaman avatar$

Stargazers

Watchers

Forkers

lispc ctrl-siberkaptan gmh5225 risc0 clvv brianretford junkicide tritonvm austinabell binadamu-isiyoonekana muhtasimtanmoy bidost

zkp-compiler-shootout's Issues

Reorganize Rust Structure

The current structure of having Sudoku folder with the risc0 code was fine before, however I wish to scale up the repo, and have multiple similar folders per idea. Thus I propose the following folder structure

Program Name [Sudoku]
- Approach Name with relevant code [Risc0, Halo2, Alucard, etc]
  - Code with an explicitly exported module
Benchmark (This contains all benchmarks which can be run. A Test property will be had so each benchmark can be run standalone)

Lower Benchmark timing

It seems like running the benchmarker takes 15 minutes on my machine. I need to reduce the number of repeat tests from 100 to 20.

Consider Benchmarking Libraries

Currently we are using criterion to test rust programs. This programs seems quite nice for benchmarking programs, however I should conduct a study of available benchmarking tools.

Another important consideration is how do we benchmark programs not written in rust? It would be cool if we can get a csv file out of the times, and import it into some system, which does the statistical projection for us. This way all our statistics look and feel consistent.

Add More Hashing Benchmarks

Currently I've added a Blake2 benchmark for risc0, but it would be nice to get a good basis of hash functions bench marked:

Known hashing functions to benchmark and compare

It would be great to get contributions for all ZKVM machines. Ideally we can get all the machines to have the same algorithm, so we can compare speeds more easily.

I should also verify the answer somehow.

For the Blake 3: Miden example I take it on faith value that I'm feeding the inputs correct and getting the correct output

Triton: Preserving Expected Block Order

as seen in #16, there is no easy way to jump to a label without affecting the call stack. This demands that abstractions like loop or if after their termination, will have to have their exit code labels be placed after the label they were called from.

(def nonsense-example
  (tagbody 
   :nonesense-entry-point
     (dup 0)
     (if (begin (push 2) add)
         (begin (push 3) add))
   ;; In real code, this is generated by if, not written in code explicitly
   :after-if 
     (push 4)
     (loop swap (push 3) add swap)
   ;; In real code, this is generated by loop, not written in code explicitly
   :after-loop
     (dup 0)
     (push 100)
     lt skiz recurse return))

This should generate code to look something like

nonsense-entry-point:
  dup0
  skiz
  call if-wraper
after-if:
  push 4
  call loop
after-loop:
  dup0
  push 100
  lt skiz recurse return

if-wraper: ...
loop: ....

where loop and if-wrapper: are filled in with their proper code. However note where after-if: and after-loop: come. Their ordering was set to be after whatever the current block at the time was!

This behavior is not trivial to implement, as we can not rely on all gotos's to get the ordering naturally like a SSA style control flow graph.

In fact, due to how ordering works, we have to make sure we move down the any labels that were created before the continues-at point to the end, as the if/loop return label may have labels that already implicitly follow it!

We therefore have to carefully remember this ordering ourselves. Thankfully I propose the following solutions:

Solution 1: Flip the Chessboard, Anything that Branches, is now an adhoc procedure!

This solution is simpler than the second solution and is more elegant, however it needs some build up.

When we say something like (loop swap (push 3) add swap), how we think of the control graph, is that it calls into some loop boiler plate with the user logic inside, then it returns back to the caller.

Thus instead of thinking of concepts like loop and if as primitives or like a normal instruction, we can think of it like a higher order procedure!

Namely, an invocation of loop or if creates a brand new procedure, with the code the user specified being inside the generated procedure.

Since these blocks always call and return they are safe to move anywhere. This completely removes any need for reordering logic.

All that needs to happen from the code standpoint is:

Extend labels with a notion of created procedures that we accumulate.
When making an abstraction which takes user code and has any branches, remember to mark it as a procedure!
- This puts a burden upon the abstraction writer, however hopefully it should be obvious when this happens
  - Maybe if I have enough examples I can nicely abstract it away from the author...

Solution 2: Extending Tlabels with Ordering and Hashtables

We change tlabels from being just a a list of blocks, to being a record containing the following fields.

A current ordering
A hashtable mapping the keyword label of a block to the block itself.
A current block that we are adding instructions onto.
A list of the current explicit follows.
An enum of :front, :end, or nil. For current blocks without labels, telling them where they go in the ordering

What the 4. point does, is when we finalize the block, we will move all nodes between the current node and what follows to the end.

Thus if we have

:a :b :c :d :e

and we say :d follows :a, then the ordering list will now look like

:a :d :e :b :c

This method is slow and is O(n^2) in the number of explicit follows. However if this is found to slow down the speed of compilation, then I can implement a O(n) method by some sort of numbering.

A note about merging tlabels

An important means of combination for tlables is appending an instruction or a set of instructions to the front.

This will serve as the modified version of my existing triton:cons-instructions-to-labels

Consing an opcode (push, call, etc.):
1. if there is a label for the current block:
  1. Finish the block, updating/adding it to the hashtable at its label
  2. Create a new block, with the enum field set to :front
2. If there is no label for the current block
  1. Just cons onto the current block!
Consing a label:
1. If there is a label for the current block
  - Finish the block, updating/adding it to the hashtable at its label
  - Create a new block, and place it's ordering to the front of the list, and set the enum to nil
2. If there is no label for the current block
  - Add the label to the current block
  - Add the new label to the ordering, at the front or end depening on what the value of the enum is.
Consing a block:
1. If there is no current block
  - Then set the current block to the given block
2. If there is no label for the current block
  - Merge the two blocks. Note if the block we are consing has a label, then simply call the logic for consing a label
3. If there is a label for the current block
  1. Finish the block, updating/adding it to the hashtable at its label
  2. Add block, and call logic for consing a label
Consing a tlabels:
1. Merge the hashtables.
2. If there is no name for the current block and the consed tlabels is not empty:
  - gensym an unique label for the current block.
3. Finalize the current block
4. Take the tlabels's current block as our own, along with its enum value

Add a notion of blocks
- blocks are a label + code inside
- nested blocks are not allowed!
Add a notion of collected labels
- These are all the labels in a function
- This also serves as an abstraction mechanism
  - Since functions like if create new labels to continue at, it is important to be able to compose this with code that comes next
  - Meaning that if we end off with a continuation address, and some code after it in it's own block, we should unify the jump points and give the block two names or unify the names
  - This means that we can effectively compose any abstractions, even those which create branches
#17
Add a notion of functions
- Functions will collect all labels, and compose an entry point
Make Program encompass this
- This means that a program will now be some some code that also keeps the relevant functions in mind and generates out the proper calls.
- Further this would be the entry point to the circuit

Re-enable RISC0

Currently due to compiler tool chain versions, risc0 is now disabled.

I'd ideally get this working again.

Add Cairo to the shootout

Cairo is a rather neat project, we should add it to the benchmark.

However, Cairo has a few ways of operating so we should test the following provers

For the compiler I recommend using

https://github.com/lambdaclass/cairo-vm

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.