dut3062796s / fluid

This project forked from fluid-cloudnative/fluid

Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)

Home Page: http://pasa-bigdata.nju.edu.cn/fluid/index.html

License: Apache License 2.0

Dockerfile 0.09% Makefile 1.00% Go 89.92% Smarty 0.45% Shell 5.23% Mustache 3.30%

fluid's Introduction

What is Fluid?

Fluid is an open source, Kubernetes-native distributed dataset orchestrator and accelerator for data-intensive applications, such as big data and AI applications. It is hosted by the Cloud Native Computing Foundation (CNCF) as a sandbox project.

What is NEW!

  • Apr. 27th, 2021. Fluid accepted by CNCF! Fluid was accepted as an official CNCF Sandbox Project by the CNCF Technical Oversight Committee (TOC) with a majority vote after the review process. A new beginning for Fluid!
  • Mar. 16th, 2021. Fluid v0.5.0 is RELEASED! It provides various new features, such as on-the-fly dataset scale out/in, metadata backup, and support for FUSE global mode. Please check the CHANGELOG for details.
  • Nov. 6th, 2020. Fluid v0.4.0 is RELEASED! It provides various features and bug fixes, such as automatically prefetching a dataset before it is used. Please check the CHANGELOG for details.
  • Oct. 1st, 2020. Fluid v0.3.0 is RELEASED! It provides various features and bug fixes, such as data access acceleration for Persistent Volumes and hostPath mode in Kubernetes. Please check the CHANGELOG for details.

Features

  • Native Support for DataSet Abstraction

    Make the abilities needed by data-intensive applications natively supported functions, to achieve efficient data access and reduce the cost of multidimensional management.

  • Cloud Data Warm-up and Access Acceleration

    Fluid empowers distributed cache capacity (Alluxio inside) in Kubernetes with observability, portability, and horizontal scalability.

  • Co-Orchestration for Data and Application

    During application scheduling and data placement in the cloud, Fluid takes both the application's characteristics and the data location into consideration to improve performance.

  • Support Multiple Namespaces Management

    Users can create and manage datasets in multiple namespaces.

  • Support Heterogeneous Data Source Management

    Unify data access for OSS, HDFS, Ceph, and other underlying storage systems.
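
As an illustration of the Co-Orchestration for Data and Application feature listed above, here is a minimal Go sketch of data-aware placement. It is not Fluid's scheduler: the `Node` type, the `CachedDatasets` field, the `pickNode` function, and the scoring weights are all hypothetical, chosen only to show how the location of cached data can be weighed together with an application's resource needs.

```go
package main

import "fmt"

// Node is a hypothetical, simplified view of a cluster node (not a Fluid type).
type Node struct {
	Name           string
	FreeCPU        int             // free CPU cores on the node
	CachedDatasets map[string]bool // datasets that already have cached data here
}

// pickNode returns the name of a node that fits the requested CPU and,
// where possible, already caches the dataset. It returns "" if nothing fits.
func pickNode(nodes []Node, dataset string, cpu int) string {
	best := ""
	bestScore := -1
	for _, n := range nodes {
		if n.FreeCPU < cpu {
			continue // the application does not fit on this node
		}
		score := n.FreeCPU - cpu // leftover capacity acts as a mild tie-breaker
		if n.CachedDatasets[dataset] {
			score += 1000 // assumed weight: strongly prefer nodes holding the data
		}
		if score > bestScore {
			best, bestScore = n.Name, score
		}
	}
	return best
}

func main() {
	nodes := []Node{
		{Name: "node-a", FreeCPU: 16, CachedDatasets: map[string]bool{}},
		{Name: "node-b", FreeCPU: 4, CachedDatasets: map[string]bool{"imagenet": true}},
	}
	// node-b has less free CPU but already caches the dataset.
	fmt.Println(pickNode(nodes, "imagenet", 2)) // prints "node-b"
}
```

In this sketch, a request that needs more CPU than the caching node can offer falls back to a node without the data, which mirrors the trade-off between application requirements and data locality.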

Key Concepts

Dataset: A set of logically related data that will be used by a computing engine, such as Spark for big data and TensorFlow for AI scenarios. Dataset management has multiple dimensions, such as security, version management, and data acceleration. We hope to start with data acceleration and, from there, provide support for the management of datasets.

Runtime: The execution engine that implements dataset security, version management, and data acceleration, and defines a series of lifecycle interfaces that you can implement.

AlluxioRuntime: A runtime that comes from Alluxio. It is an engine that supports data management and caching of datasets. Fluid manages and schedules Alluxio Runtime to achieve dataset visibility, elastic scaling, and data migration.
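
To make the relationship between a Dataset and a Runtime more concrete, here is a minimal Go sketch of a lifecycle interface and one toy implementation. It does not reproduce Fluid's actual API: the `Dataset` struct, the `Runtime` interface, its method names (`Setup`, `Scale`, `Teardown`), and the `toyRuntime` type are all hypothetical and exist only to show what "a series of lifecycle interfaces that you can implement" might look like in practice.

```go
package main

import (
	"errors"
	"fmt"
)

// Dataset is a hypothetical stand-in for a set of logically related data.
type Dataset struct {
	Name      string
	MountPath string // assumed: location of the data in the underlying storage
}

// Runtime is a hypothetical lifecycle interface an execution engine implements.
type Runtime interface {
	Setup(ds Dataset) error   // bind the engine to a dataset
	Scale(replicas int) error // change the number of cache workers
	Teardown() error          // release the dataset and all workers
}

// toyRuntime is an in-memory implementation used only for illustration.
type toyRuntime struct {
	dataset  *Dataset
	replicas int
}

func (r *toyRuntime) Setup(ds Dataset) error {
	if r.dataset != nil {
		return errors.New("runtime is already bound to a dataset")
	}
	r.dataset = &ds
	r.replicas = 1 // start with a single worker
	return nil
}

func (r *toyRuntime) Scale(replicas int) error {
	if r.dataset == nil {
		return errors.New("runtime is not set up")
	}
	if replicas < 0 {
		return errors.New("replicas must be non-negative")
	}
	r.replicas = replicas
	return nil
}

func (r *toyRuntime) Teardown() error {
	r.dataset = nil
	r.replicas = 0
	return nil
}

func main() {
	rt := &toyRuntime{}
	var engine Runtime = rt // toyRuntime satisfies the lifecycle interface

	if err := engine.Setup(Dataset{Name: "demo", MountPath: "hdfs://example/data"}); err != nil {
		fmt.Println("setup failed:", err)
		return
	}
	if err := engine.Scale(3); err != nil {
		fmt.Println("scale failed:", err)
		return
	}
	fmt.Println("workers:", rt.replicas) // prints "workers: 3"
	_ = engine.Teardown()
}
```

Keeping the lifecycle behind an interface is what would let different engines be swapped in without changing the code that manages datasets.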

Prerequisites

  • Kubernetes version > 1.14, with CSI support
  • Golang 1.12+
  • Helm 3

Quick Start

You can follow our Get Started guide to quickly start a testing Kubernetes cluster.

Documentation

See our documentation at docs for more in-depth installation steps and instructions for production.

Quick Demo

Demo 1: Accelerate Remote File Accessing with Fluid

Demo 2: Machine Learning with Fluid

Demo 3: Accelerate PVC with Fluid

Demo 4: Preload dataset with Fluid

Demo 5: On-the-fly dataset cache scaling

Roadmap

See ROADMAP.md for the roadmap details. It may be updated from time to time.

Community

Feel free to reach out if you have any questions. The maintainers of this project are reachable via:

DingTalk:

Contributing

Contributions are very welcome and greatly appreciated. See CONTRIBUTING.md for details on submitting patches and the contribution workflow.

Adopters

If you are interested in Fluid and would like to share your experiences with others, you are warmly welcome to add your information to the ADOPTERS.md page. We will continuously discuss new requirements and feature designs with you in advance.

Open Source License

Fluid is under the Apache 2.0 license. See the LICENSE file for details. It is vendor-neutral.

Code of Conduct

Fluid adopts the CNCF Code of Conduct.
