Attendees: Please read the instructions & prerequisites before arriving to the tutorial.
This tutorial will teach attendees about the key aspects of scalable web mining, via six modules:
2. Focused Web Crawling
3. Structured Data Extraction
4. Analyzing the Data
5. Barriers to Success
6. Examples and Summary
Veteran developer and entrepreneur, 25+ years experience. Founder and President of TransPac Software, a 20 year leader in internationalization, mobile devices, and search consulting. Founder and CTO of Krugle, a vertical search engine and enterprise appliance for code and technical information. Co-founder of Bixo web mining project. Committer for the Apache Tika project. Author and speaker on vertical search and web mining.
Comments on this page are now closed.
For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at firstname.lastname@example.org.
For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners
For media-related inquiries, contact Maureen Jennings at email@example.com
View a complete list of Strata contacts