
Comments (8)

hiboyang avatar hiboyang commented on September 3, 2024

The shuffle server enforces a max-bytes limit to protect itself; see https://github.com/uber/RemoteShuffleService/blob/master/src/main/java/com/uber/rss/execution/ShuffleExecutor.java#L81

You could change that value if your shuffle data exceeds that limit.

from remoteshuffleservice.

Lobo2008 avatar Lobo2008 commented on September 3, 2024

Thanks, I'll try it.


mayurdb avatar mayurdb commented on September 3, 2024

Hi @Lobo2008, as Bo mentioned, let us know whether the max app shuffle data size per server is the issue. If it is, you should see an RssTooMuchDataException in the stack trace.

If that's not the issue, please check:

  • Are you using the latest master?
  • What are the task time and the shuffle data written for the failing task?


Lobo2008 avatar Lobo2008 commented on September 3, 2024

Hi @mayurdb

  • It's the latest version. I cloned and compiled the master branch in April 2022.
  • No RssTooMuchDataException ever occurred, only RssNetworkException.
  • I have re-run the app without changing the size as Bo suggested (I'll try that later), and so far it is running well. I'll post the details once the application finishes or fails.
  • I wonder whether DEFAULT_APP_MAX_WRITE_BYTES=3TB limits the shuffle size of a single stage or the cumulative size of all shuffle-write stages in one application. Stage-6 wrote 3TB but still works fine.


cpd85 avatar cpd85 commented on September 3, 2024

I think that DEFAULT_APP_MAX_WRITE_BYTES is actually a per-server limit, so if you write 3TB of data but distribute it evenly across multiple servers, you would not run into the issue.
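
To make the per-server reading concrete, here is a minimal sketch of the arithmetic. It is not the RSS implementation: the function and server names are hypothetical, the 3TB value is taken from the figure quoted in this thread and assumed to mean 3 * 1024^4 bytes, and treating "over the limit" as strictly greater than is also an assumption.

```python
# Hypothetical sketch of a per-server write limit; names are illustrative,
# not part of the RemoteShuffleService API.
TB = 1024 ** 4

# Assumed value, based on the 3TB figure quoted in this thread.
APP_MAX_WRITE_BYTES_PER_SERVER = 3 * TB


def servers_over_limit(bytes_per_server, limit=APP_MAX_WRITE_BYTES_PER_SERVER):
    """Return the servers whose written bytes for one app exceed the limit."""
    return [name for name, written in bytes_per_server.items() if written > limit]


# 3TB in total, spread evenly over 4 servers: each server holds 0.75TB.
even = {f"rss-{i}": 3 * TB // 4 for i in range(4)}

# 4TB landing on a single server, for example because of skew.
skewed = {"rss-0": 4 * TB, "rss-1": 0}

print(servers_over_limit(even))
print(servers_over_limit(skewed))
```

Under these assumptions, the total shuffle size of a stage matters less than how much of it lands on any single server.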


Lobo2008 avatar Lobo2008 commented on September 3, 2024

I guess so.


Lobo2008 avatar Lobo2008 commented on September 3, 2024

The application finished successfully. I also found that the exception "hit exception writing heading bytes" is caused by one or more of the RSS servers running out of disk space.
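
For anyone hitting the same error, a quick way to rule out a full disk is to check free space on each RSS host. This is a minimal sketch using only the Python standard library; the function name, the path, and the threshold are placeholders to adapt to wherever your servers write shuffle files, not anything defined by RSS.

```python
import shutil


def has_free_space(path, required_bytes):
    """Return True if the filesystem containing `path` has at least
    `required_bytes` of free space."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    return usage.free >= required_bytes


# Hypothetical usage: "." stands in for the shuffle data directory,
# and 100 GiB is an arbitrary threshold.
if not has_free_space(".", 100 * 1024 ** 3):
    print("Low disk space on this host")
```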


hiboyang avatar hiboyang commented on September 3, 2024

Cool, glad you found the cause, and thanks for the update!

