Every week we introduce new speakers who will be on stage at #bbuzz 2015. Thanks to our program committee, we can present another part of our eclectic program. Presentations range from beginner-friendly introductions to hot data analysis topics to in-depth technical talks about scalable architectures. The conference features more than 50 talks by international speakers, organized under the three tags "search", "store" and "scale".
Jodok Batlogg
Store – 40 min
Understanding databases for distributed Docker applications
Jodok Batlogg is co-founder and CEO of Crate and has a decade of experience with Big Data and Open Source. He has successfully launched several startups and, as CTO, led the tech teams of large web companies, for example StudiVZ, by far the biggest German-language website in 2010.
Jodok will talk at #bbuzz on "Understanding databases for distributed Docker applications". He will focus on the use of Crate alongside Weave in Docker containers, the technical challenges, the best practices learned, and getting a simple web application running alongside it. Jodok will explain why a shared-nothing architecture is so important when deploying large clusters, and how Crate has leveraged Lucene and Elasticsearch and built an optimized distributed SQL planner. Attendees can learn how to deploy a Crate cluster in the cloud within minutes using Docker. The session will also give examples of when to use Crate rather than similar technologies such as MongoDB, Hadoop, Cassandra or FoundationDB. Jodok will close his session by showing how to architect an application that is easy to scale using Crate, Docker, Weave, and a simple web application.
Uwe Schindler
Search - 40 min
Apache Lucene 5 - New Features and Improvements for Apache Solr and Elasticsearch
Uwe works as managing director of SD DataSolutions GmbH in Bremen, Germany, a company that provides consulting and support for Apache Lucene, Elasticsearch, and Apache Solr. His main focus is on the development of Lucene Java. He implemented fast numerical search and maintains the new attribute-based text analysis API. He is a committer and PMC member of Apache Lucene and Solr.
His talk "Apache Lucene 5 - New Features and Improvements for Apache Solr and Elasticsearch" will present the improvements and new features, but also some incompatible changes in the Lucene 5 release.
Lucene now uses the new file system features of Java 7 (NIO.2) to make the indexing process more stable and the resulting indexes durable. Checksums are verified during merging so that corruption caused by bugs in the underlying JVM or by networking errors (e.g., while distributing indexes during recovery in Elasticsearch) does not persist in newly created index segments. In Lucene 5, the API around codecs will be cleaned up and will likely become more stable. There are also new features, like a common FilterCache that can be reused by Solr and Elasticsearch.
This talk will give an overview of the background of these changes and how to make the most of them.
Konark Modi
Scale - 40 min
Designing NRT(NearRealTime) stream processing systems: Using Storm
Konark Modi works as a Software Engineer at Cliqz GmbH (Munich) and previously worked as a Senior Engineer with the DataPlatformTeam at MakeMyTrip.com (India’s leading online travel company). At MakeMyTrip.com he handled multiple roles related to DevOps and DataPlatforms. He is currently one of the organizers of PyCon India, the premier conference in India on using and developing the Python programming language.
Konark will talk about "Designing NRT(NearRealTime) stream processing systems: Using Storm". His session will focus on creating a pipeline for collecting huge volumes of data and processing them in near real time using Storm. Storm is a high-volume, continuous, reliable stream processing system developed at BackType and open-sourced by Twitter. Storm is widely used in a lot of organizations and has a variety of use cases, such as:
* Realtime analytics
* Distributed RPC
* ETL etc.