Comments (7)
Do you have the solution of the problem?
from incubator-uniffle.
Yes, it is be testing in our production environment. I will watch it for a while. If it's OK, I will create a pr
from incubator-uniffle.
Could you share your solution? We can discuss first.
from incubator-uniffle.
Could you share your solution? We can discuss first.
- In server side, if
requireBufferId
not found when send data, thrown an exception. - In client side, if fail to send data, require buffer again.
from incubator-uniffle.
cc @colinmjj . There seems not be cases in our production environment. But I think the analysis is correct. What do you think?
from incubator-uniffle.
I think @xianjingfeng is right, with current implementation, OOM will happen if requireBufferId
was expired in Shuffle Server already, this maybe caused by GC, network problem, high workload in shuffle server etc.
It's better to have the limitation to accept the data with requireBufferId
only to avoid such problem.
from incubator-uniffle.
closed by #157
from incubator-uniffle.
Related Issues (20)
- [FEATURE] Show logs in Dashboard
- [FEATURE] Show IO/CPU/Disk usage in Dashboard
- [Bug] The StorageManager cache might not function effectively under heavy IO pressure
- [Bug] Occasionally encountering IllegalReferenceCountException when releasing ShuffleIndexResult
- [Bug] [Operator] ShuffleSever cannot be deleted even though there are no more application. HOT 4
- [Bug] ShuffleTaskInfo may leak when app is removed. HOT 1
- [FEATURE] Determine whether data can be written and read based on the actual disk IO situation
- [Bug] app localdisk folder remains when app is expired
- [Improvement] Use thread pool to control the concurrency of data reading threads when enabling Netty HOT 1
- [Improvement] Optimize CompositeByteBuf initialization for better performance when getting memory data HOT 1
- [FEATURE] Add rpc queued time and rpc process time. HOT 1
- [FEATURE] Add gauge metrics for reading data
- [Improvement] Set Netty as the default server type
- [Umbrella] Release 0.9.0 HOT 4
- Bump `master` to `0.10.0-SNAPSHOT`.
- [DOCS] Add licences and notices regarding new dashboard module HOT 4
- [Bug] Optimized FileSegmentManagedBuffer.nioByteBuffer to avoid multiple read file
- [Bug] Fix flaky tests
- [DOCS] Update the descriptions and default values of outdated configurations
- [Bug] Assertions will not take effect during production runtime
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from incubator-uniffle.