Giter VIP home page Giter VIP logo

Comments (7)

jordansissel avatar jordansissel commented on July 20, 2024

We used to have a watchdog that caught runaway filters in these situations.
We can bring it back - good idea :)

On Thursday, December 10, 2015, Pius [email protected] wrote:

We have users sending in occasional large messages of many Mbs, and if
they have not implemented logic in the pipeline to filter out these
messages, the grok filter will run and peg the CPU threads for a very long
time. It will be helpful to have a default (or configurable) timeout for
grok filters so that it will give up at some point, etc..


Reply to this email directly or view it on GitHub
#65.

from logstash-filter-grok.

jordansissel avatar jordansissel commented on July 20, 2024

Related: Joni (as of 2.5-ish years ago) supports being interrupted via Thread.interrupt.

from logstash-filter-grok.

jordansissel avatar jordansissel commented on July 20, 2024

Proposal for this:

  1. Configurable execution timeout. Default 0.5 seconds.
  2. When execution time expires, we interrupt grok via java Thread.interrupt().
  3. On interruption, we tag the event _grokexecutiontimeout and continue on to the next filter in the filter worker.
  4. On interruption, we log a warning indicating that Grok's execution expired and include the following context: the text input, the original pattern being tested, and the execution time.

This may get tricky given the new complexity of the combined filter-and-output worker units as well as having some possibility (we'll need to protect against) of race conditions of grok succeeding vs the timer expiring.

It would be more-ideal to use cpu-time instead of wall clock time to avoid external resource contention for cpu or scheduling causing us to mistakenly trigger an execution timeout, but thread-specific cpu usage is not easily available on any OS (as far as I can tell...)

from logstash-filter-grok.

purbon avatar purbon commented on July 20, 2024

I think your last comment will have a very nice improvement for grok, specially when problematic situation happen.

+1 for them.

from logstash-filter-grok.

wiibaa avatar wiibaa commented on July 20, 2024

@suyograo can we close this as duplicate of #82 ?

from logstash-filter-grok.

magicdude4eva avatar magicdude4eva commented on July 20, 2024

Perhaps related - I have no idea why a simple regex times out - #99

from logstash-filter-grok.

jsvd avatar jsvd commented on July 20, 2024

this was implemented in #88

from logstash-filter-grok.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.