Comments (1)
Hi @Lobo2008, how much memory usage do you see in RSS server? RSS server gets shuffle data in small blocks from Spark mapper, and write the block to disk. It will not cache large amount of data in memory. Thus curious how much memory you see RSS server uses.
Normally the bottleneck of RSS server will be disk io and network bandwidth since Spark applications write/read a lot of data from there. You could start with 10 to 50 spark executors mapping to one RSS server. Then observe disk/network metrics on RSS server, and adjust accordingly.
from remoteshuffleservice.
Related Issues (20)
- [Spark 3] RSS performance with Adaptive Skew Join Optimization HOT 3
- Corrupted block detected during decompression
- spark 3.0 HOT 4
- Using remote shuffle service with Spark operator HOT 2
- Shuffle Files Storage Is stored by default.Whether alluxio storage is supported and how to implement it. HOT 5
- write amplification HOT 2
- fault tolerance of restarting server HOT 7
- Does RSS support multiple StreamServers on the same node? HOT 4
- Metrics in ScheduledMetricCollector
- hit exception writing heading bytes XXXXX HOT 8
- How long the shuffle data of each ShuffleStage will be stored in RSS nodes? HOT 6
- Root directory not configurable via Helm chart
- Disk damage causes failure HOT 10
- Rss shuffle data size is much larger than external shuffle service HOT 6
- Can Rss have stage retry when one server is down? HOT 13
- what may cause RssInvalidServerVersionException? HOT 2
- Does zeus only support jdk 11 + HOT 2
- Does Rss support YARN executor preemption?
- Spark 3.1/3.2 failed sql skew and local reader tests HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from remoteshuffleservice.