Giter VIP home page Giter VIP logo

Comments (1)

dasch avatar dasch commented on June 12, 2024

There are two basic approaches to getting more parallelism out of Kafka consumers:

  1. Add more partitions and group members. In theory, you can have as many consumers as you have partitions, and each will run in parallel.
  2. Use multiple threads in your consumers.

The second option is a bit dangerous, as you lose a bunch of guarantees that Kafka makes, such as ordering, but if you're okay with possibly reprocessing the same messages quite a few times when there's an error, this is a pretty simple approach:

class ParallelConsumer < Racecar::Consumer
  subscribes_to "work"

  def process_batch(batch)
    threads = batch.messages.map {|message|
      Thread.new { work(message) }
    }

    # Wait for all threads to finish before proceeding.
    threads.each(&:join)

    # Racecar will only checkpoint the consumer position when we return,
    # so if one of the threads fail, all will be retried.
  end
end

Note that this introduces concurrency, so you want to be careful and not share mutable state between the threads.

from racecar.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.