Giter VIP home page Giter VIP logo

vitte-rs's Introduction

Vitte-rs : A way to sample from your collection

This library is a Rust port of Method D from "An Efficient Algorithm for Sequential Random Sampling", Jeffrey Scott Vitter, ACM Transactions on Mathematical Software, 13(1), March 1987, 58-67

Description

To quote the paper itself :

The problem is to draw a random sample of size n without replacement from a file containing N records; the n records must appear in the same order in the sample as they do in the file. Another formulation is to form a sorted random set of n elements from (1, 2, . . . , N). The sample size n is typically very small relative to the file size N.

In other words : "How to sample in order a collection as fast as possible ?"

Usage

use vitte_rs::sampler::Sampler;

fn main() {
    let size_of_collection = 1_000_000_000;
    let size_of_sampled = 1_000_000;
    let v: Vec<u64> = (1..size_of_collection).collect();
    let l = v.len();
    let p = v
        .into_iter()
        .sample(size_of_sampled, l, 13)
        .collect::<Vec<_>>();
    println!("{:?}", p);
}

Etymology

The name of the crate is a pun with the name of the original researcher (Jeffrey Scott Vitter) and the french word for quickly ("vite").

vitte-rs's People

Contributors

thesirc avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.