The number of data centers providing data services around the world keeps growing. If each data center builds and maintains its own Hadoop cluster for analytics, cooperation on data processing among these isolated clusters becomes complicated and inefficient. It is therefore useful to build a single Hadoop cluster that supports secure access control and fair resource management across distributed data centers.
This session addresses multi-tenancy in geographically distributed Hadoop. We describe the technical issues and implementations that allow multiple users (data owners, service developers, data scientists, and end users) to share data securely, together with a flexibly controlled resource management mechanism that gives each tenant an isolated execution environment across geographically distributed data centers.
The main features of our platform are:
The session will reveal the main requirements and challenges in building our platform, explain details of the design, and compare it with existing approaches.
Heesun Won is a principal researcher at the Electronics and Telecommunications Research Institute (ETRI), where she has been developing an open data reference model and a data distribution system with a semantic data map, SODAS (Smart Open Data as a System). Her research interests include software architecture for big data processing in cloud environments.
Minh Chau Nguyen is a researcher in the smart data platform research department at the Electronics and Telecommunications Research Institute (ETRI). His research interests include big data management, software architecture, and distributed systems.
©2015, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.