Giter VIP home page Giter VIP logo

TuringCell: Run Linux over Paxos/Raft

TuringCell Logo

This article belongs to the open source project TuringCell. This article focuses on the principle, a simple proof of Turing Cell computing model, and its corresponding implementation -- the core design of TuringCell computer.

Turing Cell Model is a computer model which runs over distributed consensus algorithm (like Paxos/Raft). TuringCell computer is an open source implementation of the Turing Cell Model. With TuringCell, you can easily add the characteristics of a Paxos/Raft group like high availability, consensus, and fault tolerance to nearly any already existing software. At the meantime, TuringCell is a business friendly opensource project. Its core strength is from an open and inclusive opensource community. No matter where you come from and what language you speak, you can join in to develop and change the future of TuringCell equally and freely!

Author: Sen Han <[email protected]>

中文文档 Tweet

Table of Content

0 What is Turing Cell?

turingcell_computer_and_common_computer_arch

Turing Machine + Paxos/Raft = Turing Cell

A Turing Cell computer is an implementation of Turing Cell model, which just like the common computer is the implementation of the Turing Machine model.

A Turing Cell computer has no big differences with a common computer except it runs over a Paxos/Raft group and thus it has all those natural supports for high availabilty, fault tolerance, consensus and any property that a Paxos/Raft group could have (others like membership changement, multi-master in Paxos and etc).

So, you can run any operating system and its corresponding userspace applications over a Turing Cell computer just as you could do on a common computer as long as this operating system supports the ISA and I/O devices of this certain Turing Cell computer implementation. In addition to the running of Linux which the OS needs MMU, you could also even choose to run a RTOS without MMU requirement or even the bare metal way if you really care about the performance of your distributed applications.

What is "Cell"?

The cell is one of the fundamental elements of biological tissue. After receiving a certain degree of non-fatal injury, the general biological tissue can recover from the damage, which is called "regeneration" in biology. At the same time of regeneration, the tissue can ensure that its function in the overall system is in a certain degree of the normal state. One of the initial design intentions of the TuringCell computer is to have a certain degree of similar biological regeneration feature.

1 Details and Proof of Turing Cell Model

1.1 Replicated State Machine Model

tape_of_RSM

What is Replicated State Machine (RSM)?

 Based on the distributed consensus algorithm (for example, by combining infinite Paxos instances sequentially, for more information about this topic please refer to 《Paxos Made Easy》), to build a consensus, high-available and fault-tolerant distributed infinitely-long instruction tape. Several state machine executors with the same initial state execute the instructions on the tape sequentially. If each instruction on the tape is mathematically deterministic, then it is certain that when all state machine executors finish the execution of a same instruction at the same position on the tape, the internal state of all state machines must be the same. This model is called the "replicated state machines model". The essence of the replication state machine is to obtain a consensus, high-available and fault-tolerant distributed state machine through a consensus, high-available and fault-tolerant distributed infinitely-long instruction tape.

Definition of the mathematical deterministic function 𝘮𝘥𝑓:

Mathematical deterministic function 𝘮𝘥𝑓In any case, as long as the mdf function is given a certain input state S1, then it must be able to uniquely and definitely map to a certain output state S2

  S2 = mdf(S1)

In this way, we can get the mdf-version definition of the RSM

 Multiple state machine executors with the same initial state execute the instructions on the distributed instruction tape sequentially. The instruction on the tape is a kind of mdf function. Due to the characteristics of the mdf function, then it is certain that when all state machine executors finished the executing of the same 𝘮𝘥𝑓 function at the same position on the tape, the internal states of all these state machines are necessarily the same. This model is called the "replicated state machine" model.

mdf-RSM-diagram

1.2 Common Computer Running Model

state_registers_of_common_computer

state{cpu regsters' state, memory state, i/o devices' state}

In a sense, the running of the common computer can be viewed as a continuous sequence of the state transition.

To simplify the model, we will only discuss the computer has single-cycle-instruction architecture below. Of course, these conclusions can be easily extended to more general computers.

 Suppose that the state of the running computer at the beginning of the clock cycle t1 is denoted as S1, and the state of the beginning of the clock cycle t2 is denoted as S2. It is known that t2 > t1, then there must exist a function f of state mapping, which satisfies

S2 = f(S1)

 Then we can regard the running of the computer as the sequential execution of one after another such f-functions (that is, the f-function is equivalent to the CPU instruction to execute in the current clock cycle).

f-common-computer-diagram

1.3 TuringCell Theorem

In summary, we could get

TuringCell TheoremFor an running common computer 𝒞, its 𝑓 function is 𝑓n, where n is any non-negative integer. There also exists a replicated state machine running model ℛ, and its 𝘮𝘥𝑓 function is 𝘮𝘥𝑓n, where n is any non-negative integer. If the proposition "𝑓n and 𝘮𝘥𝑓n are all equivalent" is true, then we could call 𝒞 and ℛ is equivalent from the perspective of state transition.

For a common computer, all CPU instructions can be regarded as a mdf function. The only thing that introduces "random" into the state of the system is the I/O device -- the fundamental reason of that is the nature of I/O is the interaction between the system and the outside world. In order to mdf-functionalize all these I/O devices, we can move all the parts that could generate "random" data out of the RSM itself -- that is, by introducing the I/O mechanism via the definition of special I/O instructions in the RSM, thus the RSM can interact with the outside world from the perspective of mdf function.

Then, we could get

The equivalence of state transition between 𝒞 and ℛ stated in TuringCell theorem do exist and could also be implemented

2 The Design and Implementation of TuringCell Computer

turingcell_computer_architecture

The picture above is a diagram of the overall architecture of the TuringCell computer.

In fact, there are many choices for the implementation of the distributed consensus algorithm underneath the RSM. For example, in order to pursue the ultimate performance, you can choose to customize special-purpose hardware to achieve that goal, or even choose to use synchronous communication link to ensure the performance of communication. But in order to let the implementation be more general and low-cost, we choose to use the distributed Paxos/Raft group composed of common computers as the underlying consensus algorithm implementation of RSM.

Read this article《How Does TuringCell Computer Run》to get more details about the running of TuringCell computer.

TuringCell-Computer-Architecture-v0.1-and-v1.0

TuringCell computer v0.1 is for the smallest prototype verification, and v1.0 version is the 1st stable version which could be used in the production environment.

2.1 Choice of CPU

x86 or x64
    Cost of Implementation: 
        Large amount of engineering cost, simply all physical work. Because the decoder of CISC CPU is more 
        complicated, the interpretation performance of CISC-CPU will have certain disadvantages compared to 
        RISC-CPU
    Ecological prosperity of compiler toolchain: 
        Excellent
    Maintainability: 
        Poor
ARM
    Cost of Implementation: 
        RSIC has rather low implementation cost, simple decoding, and instructions are more concise than CISC
    Ecological prosperity of compiler toolchain: 
        Good
    Maintainability: 
        Good
MIPS
    Cost of Implementation: 
        RSIC has rather low implementation cost, simple decoding, and instructions are more concise than CISC; 
        MIPS is even simpler than ARM
    Ecological prosperity of compiler toolchain: 
        Not so good compared to ARM
    Maintainability:
        Good

Hardware digital circuits like AISC/FPGA is very good at CISC instruction decoding because they are just simple 
parallelable combinatorial logic circuits whereas the common instruction-executing-serially CPU is not so good 
at this kind of task. In contrast with CISC, RISC instructions decoding is much simpler and straightforward, 
thus the performance penalty of instruction decoding would be much smaller.

So, it is finally decided that the v0.1 version would choose to emulate the ARMv4 architecture and further the 
ARMv5 architecture in the 1st stable release version, i.e. v1.0.

In addition, the TuringCell computer also supports addons for extending other kinds of CPU implementations.

2.2 Core I/O Devices

Timer: It is essential. For example, it is often necessary when we want to implement time sharing and preemption between multi-tasks in the operating system.

UART: It is probably the simplest general-purpose I/O device that can interact with the outside world. For example, it can be used as a console, or simply as a general data link for communication with other external systems.

Disk: A very common persistent storage block devices.

Interrupt controller: Listens for the event status of all I/O devices, and notifies the CPU of the information interested by the CPU in the form of events, which provides the CPU with another option besides polling the status register of I/O devices.

In addition, the TuringCell computer also supports addons for extending other kinds of I/O devices implementations.

2.3 Distributed Infinitely-Long Instruction Tape

As the prototype verification version, V0.1 chooses to use the etcd underneath the RSM temporarily.

As the first stable release version, v1.0 will be implemented with a TuringCell Paxos group which developed by the TuringCell community itself, including but not limited to the following features:

  1. Egalitarianism
  2. Multi-Master
  3. Out-of-order and parallel chosen
  4. Dynamic election and with a variable election weight of each master
  5. Membership changement
  6. Single-RTT chosen for ordinary operations
  7. Optimizations for the complex WAN environment
  8. This service can be a basic component in any other project, and the goal is to become one of the best choices in the industry's open source implementations of distributed consensus algorithm
  9. Any other cool ideas is welcome

2.4 More Detailed Design

For more detailed design of TuringCell computer please refer to this document.

3 Community, Support and Cooperation

Welcome to join the TuringCell community!

Your participation, support and feedback are essential to this open source project! The exchange and collision of ideas, openness and inclusiveness, equality and freedom are always the charms of open source! Become a member of the TuringCell community, let us build the next exciting distributed opensource project together!

You can choose to join the mailing list, Wechat group, apply to be a member of TuringCell GitHub organization, share and spread, ask questions, star/watch/follow, donate, etc. to support this project.

In addition, any form of cooperation is very welcome. Please contact me.

4 Donate

Thank you very much for your generous donation!

5 History of Versions

v0.01 2017.5
    Sen Han <[email protected]>

v0.2  2017.12-2018.2 
    Sen Han <[email protected]>

v0.9   2020.5 
    Sen Han <[email protected]>

6 Copyright and License

Author: Sen Han <[email protected]>

Website: https://turingcell.org/

License: This article is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License, except the picture of the TuringCell Logo which is under the Creative Commons Attribution-NoDerivatives 4.0 International License.

TuringCell's Projects

logdb icon logdb

A Customize Logdb Implementation for Veela

paxos-made-easy icon paxos-made-easy

Paxos Made Easy: The Geometric Meaning and Geometric Proof of Paxos Algorithm

veela icon veela

A Paxos Library (Work in Progress)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.