Comments (20)
profile:
mo_profile0329.tar.gz
from matrixone.
Third test (GC logging enabled):
The mo sysbench bulk_insert test with 1000 threads was OOM-killed at around 380s; the TPS result was roughly the same as in the first run.
Memory usage at 380s:
mo log:
mo_log2.tar.gz
profile:
mo_profile_bulk.tar.gz
from matrixone.
last gc log:
gc 58 @523.221s 2%: 0.25+4656+0.31 ms clock, 16+235/74447/169465+20 ms cpu, 232580->236203->147538 MB, 247313 MB goal, 18 MB stacks, 0 MB globals, 64 P
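As a reading aid for the line above: per the Go runtime documentation for `GODEBUG=gctrace=1`, the `#->#-># MB` field gives the heap size at GC start, the heap size at GC end, and the live heap, and `# MB goal` is the goal heap size. The sketch below pulls those four figures out of a gctrace line. The names `heapSizes` and `parseHeap` are illustrative assumptions, not part of MatrixOne or the Go runtime.

```go
package main

import (
	"fmt"
	"regexp"
	"strconv"
)

// heapSizes holds the heap figures of one gctrace line, in the MB units
// that the runtime prints. (Illustrative type, not a runtime API.)
type heapSizes struct {
	AtStart, AtEnd, Live, Goal int
}

// heapRe matches the "A->B->C MB, D MB goal" part of a gctrace line.
var heapRe = regexp.MustCompile(`(\d+)->(\d+)->(\d+) MB, (\d+) MB goal`)

// parseHeap extracts the heap figures from a single gctrace line.
func parseHeap(line string) (heapSizes, error) {
	m := heapRe.FindStringSubmatch(line)
	if m == nil {
		return heapSizes{}, fmt.Errorf("no heap figures in %q", line)
	}
	var v [4]int
	for i := range v {
		n, err := strconv.Atoi(m[i+1])
		if err != nil {
			return heapSizes{}, err
		}
		v[i] = n
	}
	return heapSizes{AtStart: v[0], AtEnd: v[1], Live: v[2], Goal: v[3]}, nil
}

func main() {
	line := "gc 58 @523.221s 2%: 0.25+4656+0.31 ms clock, 16+235/74447/169465+20 ms cpu, " +
		"232580->236203->147538 MB, 247313 MB goal, 18 MB stacks, 0 MB globals, 64 P"
	h, err := parseHeap(line)
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	fmt.Printf("heap at GC start: %d MB, at GC end: %d MB, live: %d MB, goal: %d MB\n",
		h.AtStart, h.AtEnd, h.Live, h.Goal)
}
```

Read this way, the last GC before the kill left about 147538 MB of live heap, so most of the memory was still reachable rather than garbage awaiting collection.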
from matrixone.
Working on other s-1 issues.
from matrixone.
Working on other s-1 issues.
from matrixone.
No progress; working on other issues.
from matrixone.
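Two tn-related code paths consume a very large amount of memory:
- dequeuePreparing -> MutIndex, which builds the in-memory index
- HandleRowsInsert
A first look suggests the main cause is that flushes are not timely, so data held in memory is not cleaned up promptly.
Increasing the flush frequency can avoid the OOM.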
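The reasoning above (inserted data accumulates in memory until a flush releases it, so flushing sooner lowers the peak) can be illustrated with a toy model. This is a hypothetical sketch, not MatrixOne's tn code: the `buffer` type, its `insert` method, the `peakFor` helper, and the byte threshold are all assumptions made for illustration.

```go
package main

import "fmt"

// buffer is a hypothetical in-memory write buffer that is flushed once the
// buffered bytes reach threshold. It only models the idea, not MatrixOne.
type buffer struct {
	rows      [][]byte
	size      int                 // bytes currently buffered
	threshold int                 // flush when size reaches this value
	peak      int                 // largest size observed so far
	flush     func(rows [][]byte) // sink that persists the rows
}

// insert buffers one row and flushes when the threshold is reached.
func (b *buffer) insert(row []byte) {
	b.rows = append(b.rows, row)
	b.size += len(row)
	if b.size > b.peak {
		b.peak = b.size
	}
	if b.size >= b.threshold {
		b.flush(b.rows)
		b.rows, b.size = nil, 0 // release the buffered data
	}
}

// peakFor inserts n rows of rowSize bytes and returns the peak buffered bytes.
func peakFor(threshold, n, rowSize int) int {
	b := &buffer{threshold: threshold, flush: func([][]byte) {}}
	for i := 0; i < n; i++ {
		b.insert(make([]byte, rowSize))
	}
	return b.peak
}

func main() {
	// Same workload, two flush thresholds: a lower threshold means more
	// frequent flushes and a lower memory peak.
	fmt.Println("peak with rare flushes:    ", peakFor(1<<20, 1000, 1024))
	fmt.Println("peak with frequent flushes:", peakFor(64<<10, 1000, 1024))
}
```

In a real system the flush itself costs I/O and CPU, so the threshold is a trade-off between peak memory and write throughput.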
from matrixone.
Working on other issues.
from matrixone.
Once #15580 is merged, the OOM should be avoidable. @heni02
We will further reduce tn memory consumption later.
from matrixone.
Self-testing in progress.
from matrixone.
SQL statistics:
    queries performed:
        read:                            0
        write:                           28595
        other:                           0
        total:                           28595
    transactions:                        917734260 (981824.22 per sec.)
    queries:                             28595 (30.59 per sec.)
    ignored errors:                      0 (0.00 per sec.)
    reconnects:                          0 (0.00 per sec.)
General statistics:
    total time:                          934.7219s
    total number of events:              917734260
Latency (ms):
    min:                                 0.00
    avg:                                 1.00
    max:                                 55095.98
    95th percentile:                     0.00
    sum:                                 913347614.78
Threads fairness:
    events (avg/stddev):                 917734.2600/34714.55
    execution time (avg/stddev):         913.3476/9.71
In a 15-minute self-test, the OOM no longer occurred.
from matrixone.
Will verify in tonight's regression test.
from matrixone.
main commit:2eeef9246a17f1dfe7265f911976caa68626defc
job:https://github.com/matrixorigin/mo-nightly-regression/actions/runs/9019462365/job/24803638728
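In last night's regression test, memory was almost fully used during this stage, but there was no OOM.
The memory was fully released after three minutes.
During those three minutes, connections to mo failed with errors.
For the mo log and profile, please contact me.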
from matrixone.
Looking at other issues.
from matrixone.
Working on other issues.
from matrixone.
Working on other issues.
from matrixone.
@w-zr please continue to investigate this issue
from matrixone.
The distributed environment also has this problem:
job url:https://github.com/matrixorigin/mo-nightly-regression/actions/runs/9211125528/job/25354293997
commit:e4c504a606585cb373a5bb7c452e804469716610
from matrixone.
In the 5.26 regression run, bulk_insert caused a dn OOM.
job: https://github.com/matrixorigin/mo-nightly-regression/actions/runs/9244059586/job/25434572970
https://grafana.ci.matrixorigin.cn/d/cluster-detail-namespaced/cluster-detail-namespaced?orgId=1&var-namespace=mo-nightly-regression-20240526&var-account=All&var-interval=$__auto_interval_interval&var-cluster=.%2A&var-loki=loki&from=1716734826666&to=1716775847600
Times of the 2 OOMs:
from matrixone.
Memory usage before the two OOMs.
from matrixone.