Apache: Big Data 2016 has ended
Monday, May 9 • 10:40am - 11:30am
A Faster Way for Faster Workflows - Ken Krugler, Scale Unlimited


Cascading is a popular open source project that makes it easier to create workflows for processing big data. In the past these workflows always ran on top of Hadoop, but now there’s a new option: running them on Flink, a fundamentally stream-oriented dataflow engine that takes full advantage of available RAM.

In this presentation, Ken Krugler will briefly describe Flink and then discuss a real-world example of converting a complex workflow (100+ jobs, natural language processing of text, SVM-based classification, etc.) from Hadoop to Flink.

Speakers

Ken Krugler

Scale Unlimited
Ken Krugler is a veteran entrepreneur, developer and instructor. He is the president of Scale Unlimited, a provider of consulting and training services for big data analytics, search, and machine learning using Hadoop, Cascading, Mahout, Cassandra and Solr. Ken is an Apache Tika committer...



Monday May 9, 2016 10:40am - 11:30am PDT
Plaza C