Giter VIP home page Giter VIP logo

carbonapi's People

Contributors

arodland avatar avereha avatar azhiltsov avatar borovskyav avatar carrieedwards avatar cashlo avatar civil avatar cldellow avatar ctrlok avatar deniszh avatar dependabot[bot] avatar dgryski avatar dieterbe avatar faceair avatar felixoid avatar fionaliao avatar gksinghjsr avatar ibuclaw avatar jaderdias avatar jonasbleyl avatar kamaev avatar leizor avatar lexx-bright avatar lomik avatar mchrome avatar misiek08 avatar msaf1980 avatar nnuss avatar npazosmendez avatar szibis avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

carbonapi's Issues

Process query targets in parallel

At the moment, each metric for a given target is fetched in parallel, but the targets themselves are processed serially. For queries with a large number of targets, this can can unnecessary slow down.

function mostDeviant

The function mostDeviant is missing/not implemented. The implementation of this function in the original graphite seems to be very slow.

carbonapi handles missing metrics differently to graphite

If you specify a missing metric as part of a larger formula, the query will fail in carbonapi but succeed in Graphite.

A simple example of this is:

data.metric1  # Exists
data.metric2  # Does not exists

diffSeries(data.metric1, data.metric2) # Fails on carbonapi
diffSeries(data.metric1, transformNull(data.metric2, 0)) # Also fails on carbonapi

I would expect the result to be data.metric1 since I'd assume that data.metric1 - [undefined] would equal data.metric1.

A valid use case in this situation is as followed:

# Metric layout
page.[page_name].requests
page.[page_name].errors

# The number of total successful pages served (number of requests - number of errors)
# If we automate this in any form we may hit a page that has never had an error recorded.
diffSeries(page.index_html.requests, page.index_html.errors) 

/info?target= not working

If I send a request like this for a valid metric:
/info?target=metric_name
I always get the response:
empty target

If I sent the request like this:
/info/?target=metric_name
It's working.

jsonp kills cache

Many graphing libraries use autogenerated jsonp callback names. This breaking caching heavily, since every request URI is now different.

"summarize" last element mismatch

When aggregating in a fixed time range with from/until, e.g., xxxxxx0 to xxxxxx84, aggregate by 5 seconds, there is sometimes an extra or a missing value, the array length is variable by 1.

Support rawData in render URLs

Apparently, in a render url, the rawData flag, sometimes written as rawData=1 isn't supported. We should map it onto format=raw.

" is a valid char in metric names

Figure out the full set. Figure out which subset we want to support. Hope we don't have to fully break expression parsing in the meantime.

Don't cache failed requests

On carbonzipper restart, we will have a number of failed requests (~1 / cached idle connection). We don't want to add the results of these queries to our request or find cache.

Implement Kolmogorov–Smirnov 2 sample test

( Wikipedia description: Kolmogorov–Smirnov Test )

This would allow us to test, for example, a metric against itself in the past to see if the distribution has changed. Or if two metrics have similar distributions.

Naming might be among the hardest to implement.
My votes are:

  • kolmogorovSmirnovTest2( seriesList, seriesList )
  • ksTest2 (as the commonly used alias)

(I dislike the camelCasing standard here - but it maintains consistency)

"2" here because the 1-sample version, which essentially compares against a reference (usually normal) distribution, could also be implemented .. but there seem to be better tests for that.

Cache more aggressively

Graphite munges timestamps to minutely resolution to increase the chance of a cache hit. It also stores the data returned by the stores so future queries on the same data can also be pulled from the cache.

Decide which of these we want to implement.

License

What license is this and carbonzipper under?

Switch to https://github.com/gonum/plot

In addition to code.google.com shutting down, plotinum has moved to being part of the gonum project.

This move also comes with a substantial number of breaking API changes.

Improve expression parsing error messages

It turns out people have very poor graphite queries that fail to parse. Figuring out where the missing comma needs to go is tedious at the moment. Make this easier.

add multiplySeries()

Or is there a way in there right now to multiply two series together? Can't seem to figure out a good way in grafana with the current api, I think multiplySeries() would do the job.

Lots of request errors on carbonzipper restart

Because we maintain lots of cached http connections to carbonzipper, when that service is restarted we have a large number of (now invalid) sockets which cause failures during subsequent requests.

Work around is to restart carbonapi when carbonzipper is upgraded.

tukeyAbove(seriesList, interval, basis, n) : interval is unused

[This issue serves as a reminder / todo of interval's inteneded use.]

interval was conceived as a mechanism where the quantile threshold can be computed on a (leading IIRC) subset to enable this funciton to more efficiently process large seriesList over large intervals.

@dgryski and I discussed that we could move it to the end as an optional parameter.

Per request rate limiting to carbonzipper

This will throttle bulk-style requests, but not adversely affect smaller API calls from proceding.

Requires a bit more thinking to figure out if this is a problem and if this is the best solution.

fix /info endpoint

Note that the breakage might actually be in carbonzipper or carbonserver.

Cache FetchResponses to speed up subsequent requests

The graphite front-end stores responses in memcache, meaning that subsequent requests for the same metric are much faster and don't hit the stores. This is important when lots of people are looking at the same graph or fetching the same metric.

An in-memory version is fine for starters, but we should move to externalize it so it can be shared across carbonapi instances.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.