Schedule: Hadoop Platform sessions

Location: 114
Lars George (Cloudera), Jonathan Hsieh (Cloudera, Inc)
Average rating: *****
(5.00, 5 ratings)
This talk shows how HBase use cases vary significantly, from write-once, read-many workloads that store events to updatable entity workloads that use it as a random read and write backing store. A discussion of how these use cases can be classified, along with examples, concludes the session.
Location: 212
Guy Ernest (Amazon Web Services)
Average rating: ***..
(3.62, 8 ratings)
Learn how to extend your toolbox to solve more big data problems with less effort. AWS provides a set of big data services that are elastic, scalable, and highly available out of the box. Learning best practices and tips for integrating them with each other and with your architecture helps you deliver fast and reliable big data solutions.
Location: 114
Uri Laserson (Cloudera)
Average rating: *****
(5.00, 2 ratings)
The advent of next-generation DNA sequencing technologies is revolutionizing life sciences research by routinely generating extremely large data sets. Big data tools developed to handle large-scale internet data (like Hadoop) will help scientists effectively manage this new scale of data, and will also make it possible to address a host of questions that were previously out of reach.
Location: 114
Garry Turkington (Improve Digital), Gabriele Modena (Improve Digital)
Average rating: **...
(2.29, 7 ratings)
Improve Digital is an ad tech company with large data volumes. This talk explores what we learned from enhancing our established batch infrastructure with streaming, near-real-time capabilities. In addition to discussing the impact on our architecture, we will describe how the work changed our approach to data lifecycle management.
Location: 114
Ameya Kantikar (Groupon)
Average rating: ***..
(3.55, 11 ratings)
Relevance and personalization are crucial to building a personalized local commerce experience at Groupon. This talk gives an overview of the real-time analytics infrastructure, built with open source technologies such as Kafka, Storm, HBase, and Redis, which handles over 1 million data points per second. It also covers various solution choices, techniques, strategies, and more.
Location: 114
Marcel Kornacker (Cloudera)
Average rating: ***..
(3.50, 4 ratings)
Find out how to run real-time analytics over raw data without requiring a manual ETL process targeted at an RDBMS. This talk describes Impala’s approach to on-the-fly data transformation and its support for nested data; examples demonstrate how this can be used to query raw data feeds in formats such as text, JSON and XML, at a performance level commonly associated with specialized engines.
Location: 114
Average rating: ***..
(3.00, 2 ratings)
This session presents details of Cisco’s enterprise Hadoop architecture, including its roadmap, the centralized funding model that helped it get deployed quickly, and its logical and physical views. Prominent use cases already in use at Cisco will also be covered.
Location: 114
Abed Ajraou (Solocal)
Average rating: **...
(2.83, 6 ratings)
Solocal, the French company behind PagesJaunes.fr, recently put Big Data and Hadoop into action to replace its traditional BI infrastructure. In this session, you will learn why and how that was done.
Location: 114
Tod Davis (Children's Healthcare of Atlanta)
Average rating: *****
(5.00, 3 ratings)
Children’s Healthcare of Atlanta in the US implemented Hadoop to capture and analyze vital sign sensor data in the ICU. Its goal is to understand the impact of stressful procedures, to reduce pain, and to improve outcomes in its most fragile patients. This session will highlight the challenges of pediatric healthcare data management and the strategies used to make this project a success.
Location: 114
Neil Martin (comparethemarket.com), Rob Siwicki (comparethemarket.com)
Average rating: ***..
(3.00, 1 rating)
The talk provides insight into how to achieve coordinated technological change in a highly agile IT organisation, a function that supports one of the UK’s most recognisable brands. Discover valuable lessons learned and begin to understand how your organisation might take its first steps in proving and implementing Big Data technology.
Location: 114
Ankit Tharwani (Barclays UK)
Average rating: ****.
(4.00, 7 ratings)
With traditional revenue sources maturing and new entrants at the gate, data can be a powerful differentiator. This session will present the challenges involved in deploying the right technologies and the change management culture at the foundations of new info-led propositions.
Location: 114
Georgos Siganos (Qatar Computing Research Institute)
Average rating: ***..
(3.00, 1 rating)
Graph mining of large, highly dynamic graphs is a challenging algorithmic and programming task that requires custom algorithms. Additionally, existing graph mining architectures are designed for batch workloads. The RT-Giraph open source project simplifies online graph mining by maintaining the programming and algorithmic simplicity of Apache Giraph while supporting dynamic graphs.