Giter VIP home page Giter VIP logo

Comments (9)

objectuser avatar objectuser commented on July 21, 2024

@athal7 Thanks!

We've not seen this issue ourselves. I wonder if anyone else has. Most of our apps listen to multiple topics, but most (but not all) also specify the max_bytes parameter.

You mention lowering it: was it set but set to the same as the default (1,000,000)?

from kaffe.

athal7 avatar athal7 commented on July 21, 2024

from kaffe.

objectuser avatar objectuser commented on July 21, 2024

Not at all, just clarifying.

from kaffe.

athal7 avatar athal7 commented on July 21, 2024

I'm seeing this happen again, even with the max bytes specification:

$ /opt/kafka/bin/kafka-run-class.sh kafka.admin.ConsumerGroupCommand --bootstrap-server [redacted] --new-consumer --group mygroup --describe | grep mytopic

mytopic           0          -               9782            -          'mygroup@somehost'                 'mygroup@somehost'
mytopic           4          -               9076            -          'mygroup@somehost'                 'mygroup@somehost'
mytopic           8          -               9078            -          'mygroup@somehost'                'mygroup@somehost'
mytopic           1          -               9074            -          'mygroup@somehost'                 'mygroup@somehost'
mytopic           5          -               9076            -          'mygroup@somehost'                 'mygroup@somehost'
mytopic           9          -               9076            -          'mygroup@somehost'                 'mygroup@somehost'
mytopic           2          -               9076            -          'mygroup@somehost'                 'mygroup@somehost'
mytopic           6          9079            9079            0          'mygroup@somehost'                 'mygroup@somehost'
mytopic           3          -               9075            -          'mygroup@somehost'                  'mygroup@somehost'
mytopic           7          -               9075            -          'mygroup@somehost'                  'mygroup@somehost'

from kaffe.

athal7 avatar athal7 commented on July 21, 2024

oddly it eventually processed, about 30 minutes after sending this

from kaffe.

objectuser avatar objectuser commented on July 21, 2024

@athal7 It would be interesting to try to distill this down to a simple test so we could pinpoint the problem: Kaffe, Brod, Kafka, app code ... would be great to get to the bottom of this!

I wonder if you could try to reproduce it in a unit test.

from kaffe.

athal7 avatar athal7 commented on July 21, 2024

@objectuser agreed that that would be useful, curious if you have any ideas on how to go about that? It seems like one of those problems to me that is dependent on timing and amount of data, but could be wrong.

from kaffe.

objectuser avatar objectuser commented on July 21, 2024

It does not seem easy!

My only thought is to run batches of known data to a topic and read from it, noting how long it takes to consume the full set. If we can at least reproduce it after X iterations, maybe we can start bisecting the code to see where it all happens.

Maybe each batch could be a set of random numbers (enough for 10 or 100 keys per partition). Those could be sent through and the consumer could collect the data until all data is accounted for.

What do you think?

from kaffe.

athal7 avatar athal7 commented on July 21, 2024

We haven't seen this recently, closing.

from kaffe.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.