Comments (6)

zuston commented on April 28, 2024

And how can coordinators be upgraded partially, so that each one is decommissioned and recommissioned gracefully? Is there a best practice?

from firestorm.

jerqi commented on April 28, 2024

It looks like all coordinators serve all Spark jobs, rather than only one of them serving online traffic (as in a master/slave setup). Each Spark job picks one coordinator to get its assignment from, following the quorum sequence.

The coordinator is only responsible for partition assignment.

If two coordinators are serving online, RSS still works when one of them dies. When all coordinators are dead, the existing shuffleIDs still work, but new shuffle requests will fail.

If I am wrong, feel free to let me know. Could anyone provide more detailed architecture design info?

Your understanding is right.
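
The failover behavior described above can be illustrated with a minimal sketch. It assumes the client simply walks the configured coordinator list in order and uses the first one that answers. The names `get_assignment`, `AllCoordinatorsDown`, and the `request` callback are hypothetical and are not Firestorm's client API.

```python
from typing import Callable, Sequence


class AllCoordinatorsDown(Exception):
    """Raised when no coordinator in the configured list can be reached."""


def get_assignment(
    coordinators: Sequence[str],
    request: Callable[[str], dict],
) -> dict:
    """Try each coordinator in configured order and return the first answer.

    `request` is a hypothetical callback that asks one coordinator for a
    partition assignment and raises ConnectionError if it is unreachable.
    """
    errors = []
    for address in coordinators:
        try:
            return request(address)
        except ConnectionError as exc:
            # This coordinator is down; fall through to the next one.
            errors.append(f"{address}: {exc}")
    # Nothing answered, so a new shuffle cannot get an assignment.
    raise AllCoordinatorsDown("; ".join(errors))
```

In this sketch, a shuffle that already holds its assignment never calls `get_assignment` again, which matches the point that existing shuffleIDs keep working while only new requests fail.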

jerqi commented on April 28, 2024

And how can coordinators be upgraded partially, so that each one is decommissioned and recommissioned gracefully? Is there a best practice?

We can upgrade the coordinators one by one. As long as one coordinator is active, the cluster can provide normal service. Our internal team is also working on a K8s operator, which we are currently testing. The work will be submitted to this repo soon.
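
A one-by-one upgrade along these lines could be scripted roughly as follows. This is only a sketch under stated assumptions: `rolling_upgrade`, `is_healthy`, and `upgrade` are hypothetical names, and how a coordinator is actually stopped, replaced, and health-checked depends on the deployment.

```python
from typing import Callable, Iterable, List


def rolling_upgrade(
    coordinators: Iterable[str],
    is_healthy: Callable[[str], bool],
    upgrade: Callable[[str], None],
) -> List[str]:
    """Upgrade coordinators one at a time, keeping at least one other serving.

    `is_healthy` and `upgrade` are hypothetical, deployment-specific callbacks.
    """
    nodes = list(coordinators)
    upgraded = []
    for node in nodes:
        others = [n for n in nodes if n != node]
        # Never take a coordinator down unless another one can serve requests.
        if not any(is_healthy(n) for n in others):
            raise RuntimeError(
                f"refusing to upgrade {node}: no other coordinator is healthy"
            )
        upgrade(node)  # stop, replace, restart (deployment-specific)
        # Confirm the node is serving again before moving to the next one.
        if not is_healthy(node):
            raise RuntimeError(f"{node} did not come back healthy after upgrade")
        upgraded.append(node)
    return upgraded
```

The check before each step is the design point: with a single coordinator the function refuses to proceed, because taking it down would leave new shuffle requests with nowhere to go.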

zuston commented on April 28, 2024

If shuffle servers store shuffle data on local disks, can they still be deployed on K8s? Or will the operator only be used to deploy coordinators on K8s?

And is there any plan to let specific Spark jobs communicate with a specific coordinator (the one that has been upgraded) instead of following the quorum sequence defined in the config?

jerqi commented on April 28, 2024

1. We will deploy shuffle servers that use local disks on K8s.
2. No such plan.

zuston commented on April 28, 2024

Closing this. Thanks for your explanation, @jerqi.
