Presented By O'Reilly and Cloudera
Make Data Work
March 13–14, 2017: Training
March 14–16, 2017: Tutorials & Conference
San Jose, CA

Schedule: Financial services sessions

9:00am5:00pm Tuesday, March 14, 2017
Location: LL20 A
Barbara Eckman (Comcast), Dirk Jungnickel (Emirates Integrated Telecommunications Company (du)), Kishore Papineni (Astellas Pharma), Paul Barth (Podium Data), Carlo Torniai (Pirelli Tyre), Bryan Harrison (American Express), Chris Murphy (Zurich Insurance Group), Martin Lidl (Deloitte), Maura Lynch (Pinterest), Nixon Patel (Kovid Group), Bas Geerdink (Aizonic), Robin Li (Tapjoy), Yohan Chin (Tapjoy), Jim Harrold (NationBuilder), Lana Novikova (Heartbeat AI Technologies)
In a series of 12 half-hour talks aimed at a business audience, you’ll hear data-themed case studies from household brands and global companies, explaining the challenges they wanted to tackle, the approaches they took, and the benefits—and drawbacks—of their solutions. If you want practical insights about applied data, look no further. Read more.
11:50am12:30pm Wednesday, March 15, 2017
Spark & beyond
Location: 230 A Level: Intermediate
Jasjeet Thind (Zillow)
Average rating: ****.
(4.50, 2 ratings)
Zillow pioneered providing access to unprecedented information about the housing market. Long gone are the days when you needed an agent to get comparables and prior sale and listing data. And with more data, data science has enabled more use cases. Jasjeet Thind explains how Zillow uses Spark and machine learning to transform real estate. Read more.
1:50pm2:30pm Wednesday, March 15, 2017
Data engineering and architecture
Location: LL20 A Level: Intermediate
Average rating: *****
(5.00, 2 ratings)
Data warehouses are critical in driving business decisions—with SQL dominantly used to build ETL pipelines. While the technology has shifted from using RDBMS-centric data warehouses to data pipelines based on Hadoop and MPP databases, engineering and quality processes have not kept pace. Avinash Padmanabhan highlights the changes that Intuit's team made to improve processes and data quality. Read more.
2:40pm3:20pm Wednesday, March 15, 2017
Platform Security and Cybersecurity
Location: LL21 B Level: Intermediate
Ajit Gaddam (VISA), Jiphun Satapathy (VISA)
Average rating: ***..
(3.83, 6 ratings)
Apache Kafka is used by over 35% of Fortune 500 companies to store and process some of their most sensitive datasets. Ajit Gaddam and Jiphun Satapathy provide a security reference architecture to secure your Kafka cluster while leveraging it to support your organization's cybersecurity requirements. Read more.
4:20pm5:00pm Wednesday, March 15, 2017
Business case studies, Strata Business Summit
Location: 210 D/H Level: Intermediate
Alan Chaney (Bitvore Corp)
Average rating: ***..
(3.50, 2 ratings)
Bitvore Corp’s Bitvore for Munis personalized news surveillance system is rapidly becoming a must-have for all major fixed-income securities analysts, investors, and brokers working in the three-trillion-dollar municipal bond market in the USA. Alan Chaney explains how Bitvore delivers the few important and relevant articles out of thousands each day, saving users many hours daily. Read more.
4:20pm5:00pm Wednesday, March 15, 2017
Data engineering and architecture
Location: LL20 C Level: Intermediate
Kevin Mao (Capital One)
Average rating: ****.
(4.67, 3 ratings)
Kevin Mao explores the value of and challenges associated with collecting raw security event data from disparate corners of enterprise infrastructure and transforming them into high-quality intelligence that can be used to forecast, detect, and mitigate cybersecurity threats. Read more.
5:10pm5:50pm Wednesday, March 15, 2017
Data science & advanced analytics
Location: 212 A-B Level: Intermediate
Matar Haller (Winton Capital)
Average rating: *****
(5.00, 2 ratings)
With the exploding growth of video and audio content online, there's an increasing need for indexable and searchable audio. Matar Haller demonstrates how to automatically identify who is speaking when in a recorded conversation using machine learning applied to a corpus of audio recordings. Matar shares how she approached the problem, the algorithms used, and steps taken to validate the results. Read more.
2:40pm3:20pm Thursday, March 16, 2017
Real-time applications
Location: 210 A/E Level: Advanced
Jeffrey Yau (Silicon Valley Data Science)
Average rating: ***..
(3.20, 5 ratings)
Thanks to frameworks such as Spark's GraphX and GraphFrames, graph-based techniques are increasingly applicable to anomaly, outlier, and event detection in time series. Jeffrey Yau offers an overview of applying graph-based techniques in fraud detection, IoT processing, and financial data and outlines the benefits of graphs relative to other techniques. Read more.
4:20pm5:00pm Thursday, March 16, 2017
Spark & beyond
Location: LL21 C/D Level: Beginner
Bryan Cheng (BlockCypher), Karen Hsu (BlockCypher)
Average rating: *****
(5.00, 2 ratings)
Bryan Cheng and Karen Hsu describe how they built machine-learning and graph traversal systems on Apache Spark to help government organizations and private businesses stay informed in the brave new world of blockchain technology. Bryan and Karen also share lessons learned combining these two bleeding-edge technologies and explain how these techniques can be applied to private and federated chains. Read more.