Hadoop Analytics in Financial Services

Stefan Groschupf (Datameer)

Once social media and web companies discovered Hadoop as the good enough solution for any data analytics problem that did not fit into mysql, Hadoop was on a rapid rise on the financial industry. The reasons the financial industry is adopting Hadoop very fast are very different than in other industries. Banks typically are not engineering driven organizations and terms like agile development, shared root key or cron tab scheduling are no go’s in a bank but standard around Hadoop.

This entertaining talk for bankers and other financial services managers with technical experience or engineers discusses four business intelligence platform deployments on Hadoop:

1. Long-term storage and analytics of transactions and the huge cost saves Hadoop can provide;

2. Identifying cross and up sell opportunities by analyzing web log files in combination with customer profiles;

3. Value-at-risk analytics; and

4. Understanding the SLA issues and identifying problems in a thousands-of-nodes, big services oriented architecture.

This session discusses the different use cases and the challenges to overcome in building and using BI on Hadoop.

Photo of Stefan Groschupf

Stefan Groschupf


Stefan Groschupf is the co-founder and CEO of Datameer and is well known for his entrepreneurial accomplishments in data management and large-scale distributed computing.

Before Datameer, Stefan was the co-founder and CEO of Scale Unlimited, a leader provider of educational and consulting services for Hadoop and related technologies with proven success in companies such as HP, Sun, Apple, Deutsche Telekom and Nokia. Earlier, Stefan was CEO of 101Tec, a supplier of Hadoop and Nutch-based search and text classification software to industry-leading companies such as DHL and EMI Music. Stefan has also served as CTO at Thinglink, a developer of social objects used in social interaction design, as well as at Sproose, a social search engine company.


  • EMC
  • Microsoft
  • HPCC Systems™ from LexisNexis® Risk Solutions
  • MarkLogic
  • Shared Learning Collaborative
  • Cloudera
  • Digital Reasoning Systems
  • Pentaho
  • Rackspace Hosting
  • Teradata Aster
  • VMware
  • IBM
  • NetApp
  • Oracle
  • 1010data
  • 10gen
  • Acxiom
  • Amazon Web Services
  • Calpont
  • Cisco
  • Couchbase
  • Cray
  • Datameer
  • DataSift
  • DataStax
  • Esri
  • Facebook
  • Feedzai
  • Hadapt
  • Hortonworks
  • Impetus
  • Jaspersoft
  • Karmasphere
  • Lucid Imagination
  • MapR Technologies
  • Pervasive
  • Platform Computing
  • Revolution Analytics
  • Scaleout Software
  • Skytree, Inc.
  • Splunk
  • Tableau Software
  • Talend

For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at sstewart@oreilly.com.

For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

View a complete list of Strata contacts