Presented By O'Reilly and Cloudera
Make Data Work
March 13–14, 2017: Training
March 14–16, 2017: Tutorials & Conference
San Jose, CA

Zillow: Transforming real estate through big data and machine learning

Jasjeet Thind (Zillow)
11:50am12:30pm Wednesday, March 15, 2017
Spark & beyond
Location: 230 A Level: Intermediate
Secondary topics:  Data Platform, Financial services, Geospatial
Average rating: ****.
(4.50, 2 ratings)

Who is this presentation for?

  • Data scientists, software developers, and executives

Prerequisite knowledge

  • A basic understanding of building big data platforms and machine learning

What you'll learn

  • Learn best practices for scaling platforms for distributed data processing in Spark
  • Explore key machine-learning algorithms for real estate


Zillow, the nation’s number-one real estate website and mobile app, pioneered providing access to unprecedented information about the housing market. Long gone are the days when you needed an agent to get comparables and prior sale and listing data. Enter Zillow, the nation’s number-one real estate website and mobile app. With more data, data science has enabled more use cases. Jasjeet Thind explores Zillow’s big data platform, discusses some of its core machine-learning algorithms, and outlines best practices for scaling streaming data ingestion and data processing in Spark.

Topics include:

  • How Zillow predicts the owners of 100+ million homes and distinguishes between a buyer, seller, homeowner, and renter
  • How Zillow makes the Zestimate more accurate via text mining
  • How Zillow implemented its own collaborative filtering algorithm to provide personalized real estate recommendations
  • The best time to sell your home
Photo of Jasjeet Thind

Jasjeet Thind


Jasjeet Thind is the vice president of data science and engineering at Zillow. His group focuses on machine-learned prediction models and big data systems that power use cases such as Zestimates, personalization, housing indices, search, content recommendations, and user segmentation. Prior to Zillow, Jasjeet served as director of engineering at Yahoo, where he architected a machine-learned real-time big data platform leveraging social signals for user interest signals and content prediction. The system powers personalized content on Yahoo, Yahoo Sports, and Yahoo News. Jasjeet holds a BS and master’s degree in computer science from Cornell University.