Comments (16)
@dtykocki We tried with a Kafka 2.3.1 broker and the rebalance cycle didn't occur. Thank you for checking!
We will upgrade our brokers soon then :) I think this issue can be closed as the error is caused by the mentioned kafka bug.
from kaffe.
@objectuser
Unfortunately it was just coincidence yesterday, when I changed the config it continued to consume the topic but later it stopped again.
from kaffe.
@nemzsom I wonder if there's an incompatibility at a lower level. Can you try updating the brod
dependency and see if that helps?
from kaffe.
@objectuser I tried out with the latest release from brod 3.8.1
but the issue still persist.
from kaffe.
Very interesting! We're using Kafka 2.0.1 hosted by Heroku and have not experienced those issues.
Are you self hosting?
from kaffe.
Do you have any other clusters? Wondering if you can see if it happens in new/isolated environments. Maybe create a cluster in Docker, etc.?
from kaffe.
Yes, we are self hosting and we encountered this issue after we upgrade to 2.3.0 from 1.0.1.
And it is reproducible with a single broker running in docker, no cluster needed.
from kaffe.
I've been able to reproduce this issue with Kafka 2.3.0, though it doesn't happen on the version we're using, 2.0.1. So that narrows the issue down a bit.
We'll investigate further.
from kaffe.
@nemzsom Looking at the changes in 2.3.0, I am thinking that the new "cooperative" rebalancing strategy might have something to do with this.
Would you be able to try setting connect.protocol=eager
in your Kafka Connect config and see if it improves the rebalancing issue?
This might not be a long term fix, as that protocol is said to be deprecated. However, it might be a stop gap to address the issue until we can update Kaffe to handle that protocol.
EDIT: That looks like it's just for Kafka Connect, so maybe not relevant. 🤔
from kaffe.
Thank you @objectuser for looking into this!
I’m on vacation right now but I’ll try your suggestion when I’m back.
(Although I don’t know where to put the connect config so maybe it is only for kafka connect indeed)
from kaffe.
Colleague of @nemzsom here :)
Unfortunately the connect.protocol=eager
seems not relevant in this case as we are not using kafka connect.
I found a somewhat relevant question on stackoverflow:
https://stackoverflow.com/questions/38394662/error-unknown-member-id-occurred-while-committing-offsets-for-group-xxx
Using the suggestion here I set the max_bytes to 500_000 and the lag that we had was consumed immediately. I wonder if this will last, but I'm really in the dark here :D .
from kaffe.
@CrWEE Interesting. We have a mix of settings for that, anywhere between 100k and 1M I think.
But are you saying that with setting max bytes, the consumers are no longer going through all the rebalacing reported above?
from kaffe.
OK, it's been an "off and on" thing for me as well.
from kaffe.
I've been looking into this one a bit and am able to occasionally reproduce the rebalance on 2.3.0 as well. It does look like the ruby-kafka
gem is also having rebalancing issues as a result of a regression on 2.3.0. See zendesk/ruby-kafka#769 (comment) and https://issues.apache.org/jira/browse/KAFKA-8653.
As time permits, I'll continue to do some research into this but I think we'll know more when 2.3.1 is released.
from kaffe.
I'm running a 2.3.1 broker locally and a sample kaffe apps I built no longer repeatedly rebalances.
@CrWEE / @nemzsom - Could you see if the repeated rebalance issue still exists on Kafka 2.3.1?
from kaffe.
Thanks @nemzsom!
from kaffe.
Related Issues (20)
- Defining multiple handlers HOT 1
- worker_per_topic_partition with multiple topics HOT 1
- Examples not compatible with Elixir 1.10 or 1.11 HOT 2
- extract_der is giving error with SSL HOT 2
- Undefined function exponential_backoff HOT 10
- Offset doesn't get updated between runs and runs crash with OOM errors HOT 4
- async ack - lots of duplicate messages until I restart the application HOT 2
- Kaffe.Producer.produce_sync raises on timeout
- How to set kafka headers when publishing message HOT 1
- Invalid call to raise/reraise on brod/kpro error
- Add support for SCRAM mechanism in SASL authentication. HOT 1
- Module to help write ExUnit tests
- It's impossible to create 2 separate consumers for different topics
- Running mix with kaffe deps fails to download pc package from hex
- Wrong place for configuration
- kaffe cannot recover from unreachable Kafka HOT 18
- Missing documentation HOT 8
- Connecting to a TLS-based Kafka instance under AWS MSK? HOT 18
- Receives notification when rebalance in progress/assignments revoked HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kaffe.