hastic-zzz / hastic-server Goto Github PK

Hastic data management server for analyzing patterns and anomalies from Grafana

License: GNU General Public License v3.0

JavaScript 2.66% TypeScript 94.46% Python 1.26% Dockerfile 0.53% Makefile 0.30% Shell 0.79%

alerting analytics anomaly-detection docker elasticsearch grafana graphite hastic-server influxdb metrics monitor monitoring monitoring-server monitoring-tool pattern-detection pattern-recognition prometheus self-hosted selfhosted timeseries

hastic-server's Issues

Smart automatic correlation searching

We want to send data source where we will find correlated metrics automatically. When just reports than metrics which analytic found.
Sounds futuristic , but lets make subtasks.

One panel - one worker

Based on https://github.com/hastic/hastic-server/issues/53#issuecomment-402556227

Missing pattern detection

Sometimes you want to detect when you don't have a pattern in your data and get notifications about it

Any thoughts?

For example, we need to detect patterns in for few metrics together like in merge sort.
Another example is: https://github.com/hastic/hastic-server/issues/44 where apply model to predict class.
So we need to keep in memory model for few pattern predictors.

Module for multiple metrics analysis
Alerting refactoring

node 6.14 support

We want to support old versions of Node.js
I think we better setup CI for this and update it in docker

Python3 / pip3 installation instructions

I am sorry to say that, but I need to use google to find instructions about how should I install python and pip.

I just want to start to use software. Is there a nice way to install Python/pip on my machine without googling ? :)

For example, with node.js you only need to link to: https://nodejs.org/en/download/package-manager/#debian-and-ubuntu-based-linux-distributions and that's it. Just one command.

@rozetko what do you think?

Learning on deletion

All of your models: drops, peaks, ... don`t give a shit about deletion.

requirements.txt for pip deps

https://pip.readthedocs.io/en/1.1/requirements.html

Persistence only on node.js

We want to make node.js part the service which responsible for data processing.
So python doesn't write to anywhere.

Documentation about entities naming

We have entities like:

anomaly
pattern
segment
and so on
It can be hard to understand them sometimes.
We should create doc with their interpretation.

Release size reduce

Current release-archive size is ~55MB
We should reduce it somehow

Error: type object 'sklearn.tree...'

With usage "General approach", user gets error from python
like follows:

all_anomalies.json gets rewrited after restart

Steps to reproduce:

create anomaly
find it in data/anomalies/all_anomalies.json
restart server
create new anomaly

Now there is only new anomaly in all_anomalies.json
It happens because loadAnomaliesMap() method doesn't get executed on anomaly insert

Filter features

It would be nice to have Kalman filter for example before we do analytics

Analytics / server messaging

We want to use zeromq for messaging between node.js / python

It will allow to separate processes and debug easier

ZeroMQ Node/Python worker refactoring
Docs to new installation
Remove redundant deps after refactoring
Return analytics status
Fix building for production

Messages from analytics instead of files

We update alerts.json to send "notifications" about new detected alerts. It is crazy.
I think it is not the only place.

Release docs

We need to add documentation on how to build release version

NeDB instead of files for analytics

We can't scale with files. Obviously.

Return version of server

We can face a problem where user has an older version of hastic-panel and we need to return version of hastic-server. Also it is necessary for bug reports.

So we need to

keep track of current version of server with is the same as release
return server version when get status (root) url

The first time we get this problem: https://github.com/hastic/hastic-server/pull/65

Learning on creation of pattern from beginning

If you make a new pattern with name "drop" and you already have "drop" in database, learning state would start

Support datasources other than InfluxDB

Currently only InfluxDB datasource is supported
We should find a way to support other Grafana's datasources

Module for messaging on node.js side

The module https://github.com/hastic/hastic-server/blob/8fce88d76a795656e73262704df735f625717978/server/src/services/analytics.ts

has both transport & logic in one code. Lets separate it.

Kill analytics spawned process on stop server process

https://nodejs.org/api/child_process.html#child_process_options_detached

I am not sure that we close it properly @rozetko
let's check it

Leave only node 6.14 build

Guess we don't need 2 types of build because node 6.14 build would work in any 6.14+ version.
Also, documentation with different types of build looks confusing.
@jonyrock what do you think?

Update dataset after every learning cycle

Currently, dataset doesn't get updated after first downloading
We should update it on every learning

Multiple metrics patterns

We now want to have pattern for multiple series.

We need to make

Algorithms with multiple series
Node Server part
UI

Docker file after 0.12-alpha release

@rozetko
I think our docker file is not actual. We use requirements.txt and least

Send debug info when errors occur

We should send stacktrace and some additional data somewhere on errors for easier debugging

Analytics / Server logs improvements

Imagine log which looks like this:

[ANALYTICS] something
[ANALYTICS] something more
[SERVER] oh oh
[SERVER] more logs
[ANALYTICS] learning

@rozetko

ImportError: cannot import name 'isna'

Steps to reproduce:

clone repo
go to analytics/
pip3 install -r requirements.txt
python3 server.py

You would see error:

Traceback (most recent call last):
  File "server.py", line 7, in <module>
    from worker import Worker
  File "/mnt/c/Users/rozetko/git/hastic-server/analytics/worker.py", line 2, in <module>
    from anomaly_model import AnomalyModel
  File "/mnt/c/Users/rozetko/git/hastic-server/analytics/anomaly_model.py", line 2, in <module>
    from data_provider import DataProvider
  File "/mnt/c/Users/rozetko/git/hastic-server/analytics/data_provider.py", line 1, in <module>
    import pandas as pd
  File "/home/rozetko/.local/lib/python3.5/site-packages/pandas/__init__.py", line 42, in <module>
    from pandas.core.api import *
  File "/home/rozetko/.local/lib/python3.5/site-packages/pandas/core/api.py", line 10, in <module>
    from pandas.core.groupby import Grouper
  File "/home/rozetko/.local/lib/python3.5/site-packages/pandas/core/groupby/__init__.py", line 2, in <module>
    from pandas.core.groupby.groupby import (
  File "/home/rozetko/.local/lib/python3.5/site-packages/pandas/core/groupby/groupby.py", line 42, in <module>
    from pandas.core.dtypes.missing import isna, isnull, notna, _maybe_fill
ImportError: cannot import name 'isna'

Push metric data from server to analytics

I believe that analytics should know about how extract data. It is server responsibility.
We need to send data for initial learning and then data about new data from datasource.

There are many benefits from this.

@rozetko lets stop querying grafana from python

deb package

I like how Grafana does it:

https://grafana.com/grafana/download?platform=linux

just installation from source and everything works

Fix known-issues

I think we should rethink this part: https://github.com/hastic/hastic-server#known-bugs--issues

Let's check what is actual and make issues here on github if we have bugs.

"No such file or directory" error on anomaly create

You get this error when you've just created anomaly and there are no segments yet

Use anomaly IDs

We still use anomaly names in some cases instead of IDs
It is really confusing
We should use only IDs

Task model on node.js side

we use any - like definitions for task, but we can do it more accurate

let task = {
    type: 'learn',
    anomaly_id: anomalyId,
    pattern,
    segments: segments
  };

Analytic task doesn't stop when you are deleting anomaly in panel

When server already started learning / prediction task and you delete anomaly - it doesn't know about this fact and continue doing the task => you have to wait for it to finish before it starts analysing another anomaly

Index transform bug

There is a bug when indices get transformed to time

In this screenshot all segments have the same length in indices, but the one on the left has a lot larger length

Case-sensitive anomaly name

Now anomaly name is case-sensitive
If you create anomaly with capital letter(s) in name - you get "Not found" or "Internal error" alert
We should change all names to lower-case

HASTIC_API_KEY to config file

We should get HASTIC_API_KEY from config if it exists. Otherwise, use environment variable.

Overlapping patterns classification

What changes should we do in UI and requirements from it
Algorithm implementation

@rozetko, @VargBurz just add some thoughts at least

Send task execution failure back to server

https://github.com/hastic/hastic-server/blob/0b857996d8e2e7844c3002698d53d32a92b884d5/analytics/server.py#L36

We need to wrap task result payload in res with metadata with taskId and result of execution.
so it's like:

{
"taskId": 123123,
"status": "ok",
"data": {
    ... payload data ...
  } 
}

Bundle babel-polyfill

It's a problem starting without webpack:

Peaks detector algorithm improvement

Make it work only after labeling (like drops)
Make it find peaks similar to labeled
Find / create test dataset
Write tests

hastic-zzz / hastic-server Goto Github PK

hastic-server's Issues

Recommend Projects

Recommend Topics

Recommend Org