Location: B118-119
Jay Kreps (Confluent)
Average rating: ****.
(4.11, 9 ratings)
The last few years have brought a wealth of new data technologies organized around horizontal scalability. This talk will cover the essential infrastructure areas: real-time stream processing, offline data crunching, large-scale data deployments and live serving. The focus will be on how these ingredients come together to enable innovative data-driven products at LinkedIn. Read more.
Location: B118-119
Jared Williams (New York State Senate), Noel Hidalgo (World Economic Forum), Graylin Kim (New York State Senate)
Average rating: ***..
(3.50, 2 ratings)
The story of the development team and what lessons we learned in building Open Legislation - an open government platform. It will detail our transition from a MySQL back end to an application fully powered by Lucene, the data quality and efficiency issues that we’ve had to address, and how we’re now trying to rebuild internal trust after our iterative and initially shaky development process. Read more.
Location: C121/122
Tom Wilkie (Acunu Ltd)
Average rating: ****.
(4.80, 5 ratings)
The standard Linux storage stack wasn't designed for write-heavy big data workloads, nor is it well-suited to modern hardware: large, slow SATA disks, SSDs or many cores. Castle, an open-source project, is a ground-up overhauling of RAID, file systems, and the POSIX interface. Read more.
Location: C123
Kate Matsudaira (SEOmoz)
Average rating: ***..
(3.50, 10 ratings)
Building large data applications can present a unique set of technical challenges because things that often work well in the conventional development environment can become incredibly arduous or expensive when applied on a much bigger scale. This talk will cover some of those challenges and potential solutions for each. Read more.
Location: B118-119
Erik Onnen (Urban Airship)
Average rating: *****
(5.00, 3 ratings)
This talk will cover lessons learned in building Urban Airship's large-scale data warehouse in EC2 including PostgreSQL, Kafka, Cassandra, HBase and Hadoop. Read more.