go-graphite / carbonapi Goto Github PK

View Code? Open in Web Editor NEW

299.0 26.0 139.0 44.99 MB

Implementation of graphite API (graphite-web) in golang

License: Other

Go 98.90% Makefile 0.08% Shell 0.98% Dockerfile 0.05%

go graphite carbon carbonapi carbonzipper zipper timeseries monitoring prometheus graphite-web

carbonapi's People

Contributors

Stargazers

Watchers

Forkers

oriordan cashlo grobian ikruglov atomicstack avar alexandrequinto avereha sergeyignatov godeep reyjrar nnuss markocelan vision-sbm lyft iftekhar25 szibis astral1 dieterbe kerrick-lyft jpoliv gmarkey korservick kanatohodets salekseev jaderdias sathyanarayanant 40a muhazzz devopsbox anderender errx gothrek22 optionalg ctrlok skbkontur ibuclaw borovskyav gksams gksinghjsr kamaev 2scale bnikolaus kipwoker alexeypetrenko nikita-b dmitry-makarov ihard rudw0lf manticorecao tingzhendu gunnihinn msaf1980 zombig bjorand dbzer0 kolobaev hakanf proffust lomik sylvain-beugin jeffsaildrone sonsergei doytsujin hjdr4 mevzosvlad o iliapolo gekmihesg nozomi1773 neszt chaosong misiek08 sinopower faceair hnakamur sourya pitan lexx-bright rodio peter-sh ennorion rtbhouse shasderias shedimon artemchekunov mrodikov alexandv github-vincent-miszczak gelezayka teads rajcspsg oranenj omrilotan ryangsteele tantra35 sourya-deepsource jai-deepsource ashemez ferranda

carbonapi's Issues

Fix diffSeries()'s multiple argument support

Process query targets in parallel

At the moment, each metric for a given target is fetched in parallel, but the targets themselves are processed serially. For queries with a large number of targets, this can can unnecessary slow down.

function mostDeviant

The function mostDeviant is missing/not implemented. The implementation of this function in the original graphite seems to be very slow.

carbonapi handles missing metrics differently to graphite

If you specify a missing metric as part of a larger formula, the query will fail in carbonapi but succeed in Graphite.

A simple example of this is:

data.metric1  # Exists
data.metric2  # Does not exists

diffSeries(data.metric1, data.metric2) # Fails on carbonapi
diffSeries(data.metric1, transformNull(data.metric2, 0)) # Also fails on carbonapi

I would expect the result to be data.metric1 since I'd assume that data.metric1 - [undefined] would equal data.metric1.

A valid use case in this situation is as followed:

# Metric layout
page.[page_name].requests
page.[page_name].errors

# The number of total successful pages served (number of requests - number of errors)
# If we automate this in any form we may hit a page that has never had an error recorded.
diffSeries(page.index_html.requests, page.index_html.errors)

/info?target= not working

If I send a request like this for a valid metric:
/info?target=metric_name
I always get the response:
empty target

If I sent the request like this:
/info/?target=metric_name
It's working.

jsonp kills cache

Many graphing libraries use autogenerated jsonp callback names. This breaking caching heavily, since every request URI is now different.

"summarize" last element mismatch

When aggregating in a fixed time range with from/until, e.g., xxxxxx0 to xxxxxx84, aggregate by 5 seconds, there is sometimes an extra or a missing value, the array length is variable by 1.

Support rawData in render URLs

Apparently, in a render url, the rawData flag, sometimes written as rawData=1 isn't supported. We should map it onto format=raw.

Add aliasByMetric

Support for Pearson Product Moment Correlation Coefficient (PPMCC)

As mentioned in FB Gorilla paper add support for sorting/filtering metrics against a test metric using PPMCC

Support PNG output

Work in progress on https://github.com/dgryski/carbonapi/tree/graph-support

Add stdev()

" is a valid char in metric names

Figure out the full set. Figure out which subset we want to support. Hope we don't have to fully break expression parsing in the meantime.

Don't cache failed requests

On carbonzipper restart, we will have a number of failed requests (~1 / cached idle connection). We don't want to add the results of these queries to our request or find cache.

Add Tukey outlier detection

https://github.com/dgryski/go-onlinestats/blob/master/tukey.go

Support foo.{bar,baz}.qux in names

Name parsing code is too limited

Handle the HTTP request method OPTIONS properly to allow all kinds of cross-domain requests

Sometimes the browser issues a HTTP request of the type OPTIONS when it's cross domain. I know a page that broke when using carbonapi because of this. It worked normally with the python version.

Implement Kolmogorov–Smirnov 2 sample test

( Wikipedia description: Kolmogorov–Smirnov Test )

This would allow us to test, for example, a metric against itself in the past to see if the distribution has changed. Or if two metrics have similar distributions.

Naming might be among the hardest to implement.
My votes are:

kolmogorovSmirnovTest2( seriesList, seriesList )
ksTest2 (as the commonly used alias)

(I dislike the camelCasing standard here - but it maintains consistency)

"2" here because the 1-sample version, which essentially compares against a reference (usually normal) distribution, could also be implemented .. but there seem to be better tests for that.

Clean up evalExpr()

It's getting unwieldy.

Support /metrics/find/

via @atomicstack

No CORS headers in response to OPTIONS (breaks preflighted AJAX requests)

The response to an OPTIONS request doesn't include the CORS headers that an preflight AJAX request expects[1] which makes the browser to refuse to perform the request.

[1] https://developer.mozilla.org/en-US/docs/Web/HTTP/Access_control_CORS#Preflighted_requests

add format=csv

Cache more aggressively

Graphite munges timestamps to minutely resolution to increase the chance of a cache hit. It also stores the data returned by the stores so future queries on the same data can also be pulled from the cache.

Decide which of these we want to implement.

Missing holtWinters* functions

There are people that would like to use with carbonapi the holtWinters functions present in the python version

License

What license is this and carbonzipper under?

movingMedian function

It would be nice to have it. Documentation here:
https://graphite.readthedocs.org/en/0.9.10/functions.html

Switch to https://github.com/gonum/plot

In addition to code.google.com shutting down, plotinum has moved to being part of the gonum project.

This move also comes with a substantial number of breaking API changes.

missing, or incomplete support for stacked()

example request: https://graphite-api.[redacted]/render?target=aliasByNode(stacked(sys.*[redacted]*.[redacted].connections)%2C%201)&from=1439439654&until=1439482974&format=json&maxDataPoints=640

Improve expression parsing error messages

It turns out people have very poor graphite queries that fail to parse. Figuring out where the missing comma needs to go is tedious at the moment. Make this easier.

add multiplySeries()

Or is there a way in there right now to multiply two series together? Can't seem to figure out a good way in grafana with the current api, I think multiplySeries() would do the job.

Support floating point constants with exponents

scale(foo.bar.baz, 1e-2)

Add minSeries()

http://graphite.readthedocs.org/en/latest/functions.html#graphite.render.functions.minSeries

Time parsing code may not match exactly graphite's behaviour

Need to figure out "only minutely resolution" for queries, <= vs <, etc.

Add groupByNode()

Lots of request errors on carbonzipper restart

Because we maintain lots of cached http connections to carbonzipper, when that service is restarted we have a large number of (now invalid) sockets which cause failures during subsequent requests.

Work around is to restart carbonapi when carbonzipper is upgraded.

Add hitcount()

http://graphite.readthedocs.org/en/latest/functions.html#graphite.render.functions.hitcount

tukeyAbove(seriesList, interval, basis, n) : interval is unused

[This issue serves as a reminder / todo of interval's inteneded use.]

interval was conceived as a mechanism where the quantile threshold can be computed on a (leading IIRC) subset to enable this funciton to more efficiently process large seriesList over large intervals.

@dgryski and I discussed that we could move it to the end as an optional parameter.