Giter VIP home page Giter VIP logo

traas-stack / holoinsight Goto Github PK

View Code? Open in Web Editor NEW
301.0 14.0 62.0 15.76 MB

HoloInsight is a cloud-native observability platform with a special focus on real-time log analysis and AI integration.

License: Apache License 2.0

Shell 0.26% Java 91.02% Dockerfile 0.03% JavaScript 0.09% Python 0.09% TypeScript 8.17% Less 0.35%
alerting devops log-analysis observability ai-ops k8s prometheus traas logging metrics

holoinsight's People

Contributors

archerny avatar dependabot[bot] avatar jiwliu avatar jsy1001de avatar kongluoxing avatar masaimu avatar mooneylert avatar mryuanliu avatar sw1136562366 avatar wangsiyuan-code avatar xiangwanpeng avatar xzchaoo avatar zzhb101 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

holoinsight's Issues

Supports diff with github remote

Describe This Problem

Supports diff with github remote

Proposal

Add a helper script to check diff with github remote

Additional Context

-

Migrate base code

Describe This Problem

Migrate code from internal repositories to open source repositories.
Include the following modules:

  • base code
  • common

Proposal

-

Additional Context

-

Add sucessRate in statistic trace log

Describe This Problem

Add sucessRate in statistic trace log. fix scheduler corn

Proposal

Additional Context

Add any other context about the feature request here.

Adapt to more storage layers refactor

Refactor Request

Refactor codes to adapt to more metrics storages.

Description of the proposed solution

Refactor codes to adapt to more metrics storages.

Additional context

-

Distinguish different types of alert exception

Refactor Request

The alert procedure contains alert calculation and alert notification, the exception of alert notification caused by user configuration should be treated differently.

Description of the proposed solution

HoloinsightAlertIllegalArgumentException represents exception caused by user configuration, while HoloinsightAlertInternalException caused by unexpected process, needs to fix.

Additional context

Supports elect timezone

Describe This Problem

Can not parse the log into metric while missing timezone header of col

Proposal

Additional Context

Migrate gateway

Describe This Problem

Migrate code from internal repositories to open source repositories.
Include the following modules:

  • gateway

Proposal

Additional Context

Add promtheus config debug ability

Describe This Problem

Add promtheus config debug ability, make it easy to use.

Proposal

Add promtheus config debug ability

Additional Context

Add any other context about the feature request here.

Gateway switches to new MetricStorage API

Refactor Request

@jiwliu add an API layer of metric storage (#55 ). It is time for gateway to switches to new MetricStorage API.

Description of the proposed solution

Gateway module to switches to new MetricStorage API.

Additional context

-

Add front-end files

Describe This Problem

Currently holoinsight front-end code is not yet open source. We need to add front-end build result files to holoinsight repository.

Proposal

Add front-end files

Additional Context

No response

Query switches to new MetricStorage API

Refactor Request

@jiwliu add an API layer of metric storage(#55 ). It is time for query to switches to new MetricStorage API.

Description of the proposed solution

Query module to switches to new MetricStorage API.

Additional context

-

Reduce redundant try catch in alert

Refactor Request

There is redundant and frequent try catch in alert code, and the content of error log is too casual.

Description of the proposed solution

Alarms consist of three phrases: loading alarm configuration, alarm calculation, and alarm notification. The code should not just print a simple unstructured log line for an exception, but should define special exception types and then handle them uniformly.

Supports influxdb 1.x metric storage

Describe This Problem

Influxdb is a popular tsdb.
Supports influxdb 1.x metric storage

Proposal

Supports influxdb 1.x metric storage

Additional Context

No response

migrate meta

Describe This Problem

Migrate code from internal repositories to open source repositories.
Include the following modules:

  • meta

Proposal

Additional Context

Add deploy files for k8s

Describe This Problem

Add deploy files for k8s.
For example deployment yaml or Helm.

Proposal

Add deploy files for k8s.
For example deployment yaml or Helm.

Additional Context

No response

'Add to favorites' does not work

Describe this problem

Select a log monitoring, and then click 'Add to favorites'.
There is a success message, but also an error message.

image

To Reproduce

Select a log monitoring , click 'Add to favorites'.

Expected behavior

Collect successfully, and no error

Additional Information

remove Task related codes

Refactor Request

remove task related codes

Description of the proposed solution

Additional context

[BUG]: horizontal overreach problem

Describe this problem

log favorite have a horizontal overreach problem

Steps to reproduce

In the log monitoring function, you can save other users' monitoring rules beyond your authority. Then click delete in my favorites to delete the original user's monitoring rules. Simply walk through the numbers

Expected behavior

No response

Additional Information

No response

migrate query

Describe This Problem

Migrate code from internal repositories to open source repositories.
Include the following modules:

  • query

Proposal

Additional Context

migrate storage

Describe This Problem

Migrate code from internal repositories to open source repositories.
Include the following modules:

  • storage

Proposal

Additional Context

pql parser init problem

Describe this problem

pql parser not init correctly

To Reproduce

init the lexer with pql string

Expected behavior

Additional Information

Supports Ceresdb metric storage

Describe This Problem

CeresDB is a high-performance, distributed, schema-less, cloud native time-series database that can handle both time-series and analytics workloads.
Supports Ceresdb metric storage.

Proposal

Supports Ceresdb metric storage

Additional Context

No response

DDL management solution

Describe This Problem

Currently, we manage DDL manually in an internal document. This is easy to cause problems For example, missing DDL updates in some deployments.
We need a DDL management solution.

Proposal

Consider referring to https://flywaydb.org/

Additional Context

No response

Add QueryService impl

Describe This Problem

Holoinsight opensource version miss a QueryService impl. So when it runs, there is no data in the front page.

Proposal

Add a default QueryService impl

Additional Context

No response

Alarm task statistics

Refactor Request

The computation of the alarm task needs to be monitored and grouped by tenant and rule type.

Description of the proposed solution

Print statistics log with tenant and rule type.

Adapt to more storage layers

Describe This Problem

io.holoinsight.server.home.web.controller.InitFacadeImpl now has a strong dependence on internal version CeresdDB.
The metric storage should be replaceable.

Proposal

Refactor initialization process to adapt to more storage layers.

Additional Context

-

Standalone mode cannot query data between different modules.

Describe this problem

io.holoinsight.server.home.common.service.QueryClientService#queryData fails to invoke in standalone mode.

Steps to reproduce

Startup query by TestHoloinsightQueryBootstrap.java, then startup home by TestHoloinsightHomeBootstrap.java, the invoke in io.holoinsight.server.home.common.service.QueryClientService#queryData will throw exception io.grpc.StatusRuntimeException: UNAVAILABLE: Channel shutdown invoked

Expected behavior

No response

Additional Information

No response

migrate home

Describe This Problem

Migrate code from internal repositories to open source repositories.
Include the following modules:

  • home

Proposal

Additional Context

metric name auto-completion does not work

Describe this problem

I have created a log monitoring config.
image

When I config a dashboard, the metric name auto-completion does not work.
image

To Reproduce

See Description

Expected behavior

metric name auto-completion works

Additional Information

Remove irrelevant task

Refactor Request

server/home/home-task/src/main/java/io/holoinsight/server/home/task/TenantAppMetaSyncTask.java
server/home/home-task/src/main/java/io/holoinsight/server/home/task/CleanAlertDetailTask.java
These tasks have no relation to the function of holoinsight, it's better to delete them.

Description of the proposed solution

Remove related codes.

Additional context

Check config distribution

Describe This Problem

A collect config need to be built and distributed to related agents.
We need a feature to check the status of config distribution.

For example:

ConfigName=xxxx_123
TotalDims=10
BuildSuccessDims: 10
DistributedLatest: 8
DistributedOld: 1
Unknown: 1

Unknown means there may be some error (eg. network error, agent version too old) between server-side and agent-side.

Proposal

Add an api to check the status of config distribution.

Additional Context

-

Migrate registry

Describe This Problem

Migrate code from internal repositories to open source repositories.
Include the following modules:

  • registry

Proposal

Additional Context

LogConfig supports dryrun mode

Describe This Problem

LogConfig supports dryrun mode (debug ability), make it easier to use

Proposal

-

Additional Context

-

Avoid storage tenant inflation

Describe This Problem

As new users are added, a large number of logical tenants are created,
To isolate user data, logical and physical tenants are 1:1

Proposal

Avoid storage tenant inflation, the new tenant use the default storage space

Accurate alarm task scheduling

Describe This Problem

Currently, the alarm task is scheduled at a fixed rate, but the trigger time is random, which may lead to the delay of the alarm notification or the inconsistent execution time of the alarm task on different servers.

Proposal

Alarm scheduling should use cron trigger.

Additional Context

No response

statistic trace

Describe This Problem

Trace statistics are used for self-monitoring

Proposal

Maybe you have considered some ideas or solutions about this feature.

Additional Context

Add any other context about the feature request here.

Mock authentication

Describe This Problem

Currently, front pages depend on backend authentication webapi.
But authentication webapi is missing in opensource version now.
There is an urgent need for a mock version so that the entire system can run.

Proposal

Add a mocked authentication webapi

Additional Context

No response

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.