Apache: Big Data 2016 has ended
Tuesday, May 10 • 3:00pm - 3:50pm
Using Kafka and Kudu for Fast, Low-latency SQL Analytics on Streaming Data - Mike Percy & Ashish Singh, Cloudera


Apache Kudu (incubating) is a fast new columnar data store for the Hadoop ecosystem, designed to enable high-performing, flexible analytic pipelines. In this talk, Mike Percy and Ashish Singh will demonstrate how Apache Kafka can be combined with Kudu to achieve low-latency, high-throughput analytics on streaming data. We will compare various approaches to building such a solution and demonstrate a working system for analyzing tweets in real time by combining Kafka, Kudu, and Apache Impala (incubating).

Speakers

Mike Percy

Software Engineer, Cloudera
Mike Percy is a software engineer at Cloudera and a PMC member on Apache Kudu, an open source distributed column store for the Hadoop ecosystem. He is also a PMC member on Apache Flume. Prior to joining Cloudera, Mike worked at Yahoo! building machine learning infrastructure for Big...

Ashish Singh

Software Engineer, Cloudera
Ashish Singh is a Software Engineer, working with Cloudera to empower the Hadoop ecosystem to answer bigger questions. Ashish studied Computer Science and Engineering at Ohio State University. Before working in the Big Data space, he worked on optimizing MPI collective communications...
Tuesday May 10, 2016 3:00pm - 3:50pm PDT
Georgia A