baker's People

Contributors

arl, danimaribeiro, dependabot[bot], guerinoni, lfarroco, lucarin91

baker's Issues

Performance chapter on the site

On an AWS EC2 instance of size c5.2xlarge, Baker can read zstandard records from S3, uncompress them and apply a basic filtering logic, compressing them back on local files using ~90% of capacity of each vCPU (8 in total) and ~3.5GB of RAM.

It reads and writes a total of 94 million records in less than 9 minutes, that’s 178k records per second.

On a c5.2xlarge instance (48 vCPUs) the same test takes 2 minutes, so that’s a speed of 775k records per second.

This excerpt is from https://getbaker.io/docs/performance/. The instance size in its last paragraph is wrong: the README correctly states that the second test ran on a c5.12xlarge instance.

bug: does not compile on windows

Installing Baker on Windows 10 with the command go get github.com/AdRoll/baker fails:

go: github.com/AdRoll/baker upgrade => v0.0.0-20201209102217-af7b7c12682b
go: downloading github.com/charmbracelet/glamour v0.2.0
go: downloading github.com/sirupsen/logrus v1.4.2
go: downloading github.com/rasky/toml v0.1.1-0.20160309013025-90bcb678a72a
go: downloading github.com/konsorten/go-windows-terminal-sequences v1.0.2
go: downloading golang.org/x/sys v0.0.0-20200413165638-669c56c373c4
go: downloading github.com/yuin/goldmark v1.2.0
go: downloading github.com/muesli/termenv v0.6.0
go: downloading github.com/muesli/reflow v0.1.0
go: downloading github.com/mattn/go-runewidth v0.0.9
go: downloading github.com/alecthomas/chroma v0.7.3
go: downloading github.com/microcosm-cc/bluemonday v1.0.2
go: downloading github.com/olekukonko/tablewriter v0.0.4
go: downloading github.com/lucasb-eyer/go-colorful v1.0.3
go: downloading github.com/google/goterm v0.0.0-20190703233501-fc88cf888a3f
go: downloading github.com/dlclark/regexp2 v1.2.0
go: downloading github.com/danwakefield/fnmatch v0.0.0-20160403171240-cbb64ac3d964
# github.com/AdRoll/baker
..\..\dvgamerr\go\pkg\mod\github.com\!ad!roll\baker@v0.0.0-20201209102217-af7b7c12682b\help_markdown.go:56:38: not enough arguments in call to syscall.Syscall
..\..\dvgamerr\go\pkg\mod\github.com\!ad!roll\baker@v0.0.0-20201209102217-af7b7c12682b\help_markdown.go:56:39: undefined: syscall.SYS_IOCTL
..\..\dvgamerr\go\pkg\mod\github.com\!ad!roll\baker@v0.0.0-20201209102217-af7b7c12682b\help_markdown.go:58:11: undefined: syscall.TIOCGWINSZ
go version go1.15.6 windows/amd64

Thanks.

Doesn't work on windows

I tried the quick start example and got this error:

# github.com/AdRoll/baker/input/inpututils
..\..\..\go\pkg\mod\github.com\!ad!roll\[email protected]\input\inpututils\fastreader.go:59:3: unknown field 'Setpgid' in struct literal of type syscall.SysProcAttr
