Giter VIP home page Giter VIP logo

data-kale's Introduction

Data Kale

PyPI version

Data kale is a simple data lake intended to abstract away an S3 compatible backend like Wasabi.

Kale

Configuration

credential

s3-access-key

The access key to the s3 backend

s3-secret-key

The access secret to the s3 backend

data

root

The location of the local directory that will contain all the repositories directories. Ex. data.root is set to /data then the data for the repository namespace-puddle will end up in /data/namespace-puddle.

Note: Just as pathlib.Path(.), the default is to be relative to home directory, i.e. data is ~/data. It supports expanduser so it's easier to just be explicit with ~/data.

Example

~/.kale.toml:

[credentials]
s3-access-key = "ACCESS"
s3-secret-key = "SECRET"

[data]
root = "~/data"

Install

virtualenv venv
source venv/bin/activate
pip install -e .

Run

Download

python -m data_kale.download namespace-puddle

Upload

python -m data_kale.upload namespace-puddle

List

python -m data_kale.list_remote

Test

Install

source venv/bin/activate
python setup.py test

Run

source venv/bin/activate
pytest

Publish

Use pypi-publish-template

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.