Comments (9)
@athal7 Thanks!
We've not seen this issue ourselves. I wonder if anyone else has. Most of our apps listen to multiple topics, but most (but not all) also specify the max_bytes
parameter.
You mention lowering it: was it set but set to the same as the default (1,000,000)?
from kaffe.
from kaffe.
Not at all, just clarifying.
from kaffe.
I'm seeing this happen again, even with the max bytes specification:
$ /opt/kafka/bin/kafka-run-class.sh kafka.admin.ConsumerGroupCommand --bootstrap-server [redacted] --new-consumer --group mygroup --describe | grep mytopic
mytopic 0 - 9782 - 'mygroup@somehost' 'mygroup@somehost'
mytopic 4 - 9076 - 'mygroup@somehost' 'mygroup@somehost'
mytopic 8 - 9078 - 'mygroup@somehost' 'mygroup@somehost'
mytopic 1 - 9074 - 'mygroup@somehost' 'mygroup@somehost'
mytopic 5 - 9076 - 'mygroup@somehost' 'mygroup@somehost'
mytopic 9 - 9076 - 'mygroup@somehost' 'mygroup@somehost'
mytopic 2 - 9076 - 'mygroup@somehost' 'mygroup@somehost'
mytopic 6 9079 9079 0 'mygroup@somehost' 'mygroup@somehost'
mytopic 3 - 9075 - 'mygroup@somehost' 'mygroup@somehost'
mytopic 7 - 9075 - 'mygroup@somehost' 'mygroup@somehost'
from kaffe.
oddly it eventually processed, about 30 minutes after sending this
from kaffe.
@athal7 It would be interesting to try to distill this down to a simple test so we could pinpoint the problem: Kaffe, Brod, Kafka, app code ... would be great to get to the bottom of this!
I wonder if you could try to reproduce it in a unit test.
from kaffe.
@objectuser agreed that that would be useful, curious if you have any ideas on how to go about that? It seems like one of those problems to me that is dependent on timing and amount of data, but could be wrong.
from kaffe.
It does not seem easy!
My only thought is to run batches of known data to a topic and read from it, noting how long it takes to consume the full set. If we can at least reproduce it after X iterations, maybe we can start bisecting the code to see where it all happens.
Maybe each batch could be a set of random numbers (enough for 10 or 100 keys per partition). Those could be sent through and the consumer could collect the data until all data is accounted for.
What do you think?
from kaffe.
We haven't seen this recently, closing.
from kaffe.
Related Issues (20)
- Defining multiple handlers HOT 1
- worker_per_topic_partition with multiple topics HOT 1
- Examples not compatible with Elixir 1.10 or 1.11 HOT 2
- extract_der is giving error with SSL HOT 2
- Undefined function exponential_backoff HOT 10
- Offset doesn't get updated between runs and runs crash with OOM errors HOT 4
- async ack - lots of duplicate messages until I restart the application HOT 2
- Kaffe.Producer.produce_sync raises on timeout
- How to set kafka headers when publishing message HOT 1
- Invalid call to raise/reraise on brod/kpro error
- Add support for SCRAM mechanism in SASL authentication. HOT 1
- Module to help write ExUnit tests
- It's impossible to create 2 separate consumers for different topics
- Running mix with kaffe deps fails to download pc package from hex
- Wrong place for configuration
- Repeated rebalance cycle with kafka broker 2.3.0 HOT 16
- kaffe cannot recover from unreachable Kafka HOT 18
- Missing documentation HOT 8
- Connecting to a TLS-based Kafka instance under AWS MSK? HOT 18
- Receives notification when rebalance in progress/assignments revoked HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kaffe.