Comments (21)
上报的数据的timestamp字段也打印一下
另外就是看transfer、tsdb的日志
from nightingale.
timestamp 没有问题,直接使用time.Now().Uinx(),所以绝大多数请求正确。
transfer,无错误日志。
tsdb只有最近2个小时日志,虽然有报错,但是好像无关。
from nightingale.
这个错误有可能是原因,tsdb会把数据落盘存储,sync_disk就是在落盘,落盘失败,数据就断点了。你是本地虚拟机么?硬盘有什么特殊的么?
from nightingale.
是我的测试机器。但是这个报错并不影响所有的数据,看这个指表就是正常的。
from nightingale.
嗯,一个监控指标一个rrd文件,如果部分rrd文件有问题,其他的是不受影响的。从逻辑上来说,代码是一套,如果是代码的问题,有指标不正常应该全部不正常。但是这里只有部分不正常。
所以,坦白讲,我也没有好思路
from nightingale.
OK,没有问题,我抽时间研究下,如果有结果再和你沟通。
from nightingale.
from nightingale.
排查到一些线索
2020-04-10 15:31:24.393253 WARNING rpc/push.go:61 push obj error, obj: <Endpoint:host Metric:key, Tags:, TagsMap:map[], Value:0, TS:1586503740 2020-04-10 15:29:00 DsType:GAUGE, Step:60, Heartbeat:120, Min:U, Max:U>, error: data @1586502900, timestamp old than previous chunk. currentchunk t0: 1586503800
from nightingale.
服务端收到的数据是:先收到了一条新数据,又收到了一条老数据,但是监控数据是要求有时序的。所以报错。
这是自己推送的数据?时间获取的是否有问题?机器时间同步了么?
from nightingale.
是自己推送的数据,时间获取没有问题,机器时间也是同步的。
也就是要找到服务端收到数据无续的原因,就能解决问题了?
from nightingale.
我已经调整了推送代码,如果有进一步结论再同步。
from nightingale.
目前看,问题并没有得到解决。
from nightingale.
你是所有指标都有问题,还是只有部分有问题,有规律么
from nightingale.
哪一个指标有问题,需要去查所有链路上的组件,比如是插件上报的指标abc,就要从collector开始查,到transfer、tsdb,看abc这个指标到底上报对了么,是否在哪个环节出问题
from nightingale.
好的,我按你说的做个全量的review。目前看只是部分指标的问题。
from nightingale.
重启n9e-tsdb后,目前还没有复现过问题。
from nightingale.
你是自己搭建的虚机测试的?还是在正式生产环境的机器测试的?看现象不是软件的问题,像是环境的问题
from nightingale.
是正式环境,用的阿里云ESC
from nightingale.
这个issue先关了,后面如果还有问题,把各块日志都贴出来,重开一个issue再看。这个问题略诡异。
from nightingale.
我这里重启tsdb也不能解决这个问题
from nightingale.
我这里重启tsdb也不能解决这个问题
不行试试3.3.0版本,用M3DB作为存储引擎试试,rrdtool看起来在有些场景下有问题
from nightingale.
Related Issues (20)
- Can we add machine management inside the infrastructure? HOT 1
- n9e和categraf交互可以增加认证方式吗 HOT 1
- Can log analysis be performed using only Elasticsearch and Loki data sources?
- Can the "Alert Management"-"Historical Alerts" page be added with the option to export as an excel file attachment?
- Is there a chart for alerts similar to Zabbix? HOT 2
- The lines of the timing diagram cannot be displayed HOT 2
- es 查询 选择索引 跟默认索引下 可选字段不一致 HOT 1
- tracer support grafana tempo
- A newbie here, how do I monitor a Huawei S5720 switch that has SNMP V2 enabled? HOT 1
- notify the media and contact information that if the "identifier" is changed to the same as the default identifier, there will be a bug.
- mysql deadlock when update target table and many errorlog in edge and n9e log HOT 2
- The default 60-minute interval for repeated notifications is too long. HOT 1
- oauth2/oidc sso enhancement
- ADD MYSQL TYPE OF DATASOURCE HOT 1
- Time offset alarm parameter acquisition HOT 2
- The message field content will not automatically wrap. HOT 2
- The number of processes and threads is inconsistent with what top shows. HOT 2
- I hope nightingale can add a feature to draw quantile lines.
- Hello, I'm currently in the process of making a selection and I wanted to inquire about any upcoming plans to launch tracing. HOT 1
- 触发报警时,发送钉钉消息被艾特的人其实收不到提醒 HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nightingale.