Enterprise Integration Zone is brought to you in partnership with:

Mitch Pronschinske is a Senior Content Analyst at DZone. That means he writes and searches for the finest developer content in the land so that you don't have to. He often eats peanut butter and bananas, likes to make his own ringtones, enjoys card and board games, and is married to an underwear model. Mitch is a DZone Zone Leader and has posted 2573 posts at DZone. You can read more from them at their website. View Full User Profile

Data Models and Consumer Idioms Using Apache Kafka

03.13.2013
| 3310 views |
  • submit to reddit

Apache Kafka is a powerful publish subscribe messaging service that delivers high-volume messages across ad hoc topics to subscribers with message durability for offline consumers. As with any technology, Kafka comes with certain concessions, in this case, concessions that influence consumer behavior in particular at scale. Likewise, achieving optimal throughput when writing to durable storage can be optimized by distilling data storage operations into commutative, idempotent sets of operations.

This talk will detail Urban Airship's experience using Kafka to process billions of messages per day. The talk will begin with an in-depth look at Kafka's core design concepts and how they influence nuisances of writing consumers, both positive and negative. Beyond consumers, this talk will detail how Urban Airship leverages the strengths of different storage engines (Cassandra, HBase and in-house solutions) across a common consumer infrastructure for disparate goals including near real-time message routing, analytics and system measurement.