In 2013 it will be possible to map a person’s entire genome for less than $1,000, which represents a million-fold reduction in cost from the first such process only a few years ago. Consumers attracted by this affordable price and by the growing potential of gene sequencing in areas including personalized medicine, pharmacogenomic testing and family history research will generate vast amounts of searchable, highly personal data.
Much of this new data will be sent to Ancestry.com, a Big Data company that already manages more than 11 billion records (4 petabytes) of searchable structured and unstructured data consisting of birth, death, census, military, immigration and other records. In the past 17 years, Ancestry users have created more than 47 million family trees containing more than 5 billion profiles of relatives. Added to the current mass archive, the new flood of gene-sequencing data generated by Ancestry’s recently-introduced DNA testing product will present Big Data challenges and opportunities of interest to many other companies whose business models are predicated on similar massive data troves.
In this session, Scott Sorensen, Ancestry.com CTO, will present an informative explanation of how the company is leveraging its Big Data capabilities by using Hadoop.
Specifically, Sorensen will provide two broad category overviews of how the company uses Hadoop: 1) with analytics and 2) product features. Details include:
Analytics:
Product Features: Improving the customer experience
Scott Sorensen has served as Ancestry.com’sChief Technology Officer since April 2013. Since joining the family history search giant in 2002, Scott has held multiple positions including Senior Vice President of Engineering, Vice President of Search and Vice President of Commerce and the first piece of code Scott wrote for the company is still used today. Prior to joining Ancestry.com, Scott was co-founder and Vice President of Engineering and then President at Coresoft Technologies. Scott was an engineering manager at WordPerfect / Novell and a software engineer at IBM. He holds a B.S in Computer Science from Brigham Young University.
Comments on this page are now closed.
For exhibition and sponsorship opportunities, contact Susan Stewart at sstewart@oreilly.com
For information on trade opportunities with O'Reilly conferences email mediapartners
@oreilly.com
For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com
View a complete list of Strata + Hadoop World 2013 contacts
Comments
Would it be possible to post the slides here, like the other speakers have?