With more than 300 million members, LinkedIn is the largest professional social network in the world. This data set gives LinkedIn the necessary elements to build the world’s first Economic Graph. The Economic Graph is a valuable map of the connections between people, companies, jobs, skills, educational institutions, and professional knowledge in the global economy.
This task is first and foremost a data problem. In order to achieve this colossal effort, we need to standardize many different entities across many languages and many cultures. In this talk, Vitaly Gordon, a Senior Data Scientist on LinkedIn’s Product Data Science team, will speak about how his team have used some of the most advanced tools like Apache Hadoop, Apache Spark, and Apache Samza to start building the economic graph out of its massive data set of professional identities.
Vitaly Gordon is a senior data scientist on the LinkedIn Product Data Science team where he develops data products that most of you use every day. Prior to LinkedIn, Vitaly founded the data science team at LivePerson and worked in the elite 8200 unit, leading a team of researchers in developing algorithms to fight terrorism. His contributions have been recognized through a number of awards including the “Life Source” award, an award given each year deemed most high-impact in saving lives. Vitaly holds a B.Sc in Computer Science and an MBA from the Israeli Institute of Technology.