Apache Giraph: start analyzing graph relationships in your bigdata in 45 minutes (or your money back)!

Scale

05/27/2014 - 15:20 to 16:00

Frannz Club

long talk (40 min)

Beginner

Session abstract:

The genesis of Hadoop was in analyzing massive amounts of data with a mapreduce framework. SQL-on-Hadoop has followed shortly after that, paving a way to the whole schemaonread notion. Discovering graph relationship in your data is the next logical step. Apache Giraph (modeled on Google’s Pregel) lets you apply the power of BSP approach to the unstructured data. In this talk we will focus on practical advice of how to get up and running with Apache Giraph in minutes, start analyzing simple data sets with built-in algorithms and finally how to implement your own graph processing using the APIs provided by the project. We will then dive into how Giraph integrates with the Hadoop ecosystem projects (Hive, HBase, Accumulo, etc.) and will also provide a whirlwind tour of Giraph architecture.

Video:

Roman Shaposhnik at #bbuzz 2014

Slide:

roman_shaposhnik_-_giraph_berlin.pdf

Berlin Buzzwords

Apache Giraph: start analyzing graph relationships in your bigdata in 45 minutes (or your money back)!

Session abstract:

Video:

Roman Shaposhnik at #bbuzz 2014

Slide:

roman_shaposhnik_-_giraph_berlin.pdf

Partners

Gold Partner

Past conferences