Over the past three decades, the prevalence of dedicated advanced monitoring systems on aircraft has grown tremendously. These systems commonly include a large number of sensors monitoring everything from outside air temperature to gearbox pressures to the health of drivetrain components (via onboard calculated condition indicators). The motivation for monitoring and collecting this data is to enhance safety and enable intelligent decision making about the operation of the aircraft, including optimizing maintenance, maximizing availability, focusing troubleshooting, and enabling proactive and timely support.
The Health & Usage Monitoring System (HUMS) onboard most Sikorsky rotorcraft collects several different types of data (parametric, event, usage, regime, mechanical diagnostics, etc.). When this data is aggregated across an entire fleet over all time, the size of the data alone can be difficult to manage, let alone be tractable for extraction of useful and actionable insights in a reasonable time frame or interactive analysis and visualization. When dealing with datasets of this size (which can be hundreds of terabytes and millions of files for a single fleet), there are several difficult problems with ingesting and storing files, batch processing algorithms and analytics, and serving the data to end users. In addition, Sikorsky uses several other datasets, including flight test data, supply chain, maintenance, safety, operator information, design specs, logistics, and other external data sources, including global weather and economic information (oil and gas prices), to drive decision making in the enterprise.
Mike Koelemay explores the data platform that Sikorsky has built in recent years using massively scalable tools such as Hadoop, Spark, and Cassandra to enable the ingestion, storage, serving, and processing of aircraft data and shares several use cases showcasing how this technology has enabled decision support using historical fleet data that was previously impractical, as well as the challenges encountered along the way.
Mike Koelemay runs the data science team within advanced analytics at Sikorsky, where he is responsible for bringing state-of-the-art analytics and algorithm technologies to support the ingestion, processing, and serving of data collected onboard thousands of aerospace assets around the world. Drawing on his 10+ years of experience in applied data analytics for integrated system health management technologies, Mike works with other software engineers, data architects, and data scientists to support the execution of advanced algorithms, data mining, signal processing, system optimization, and advanced diagnostics and prognostics technologies, with a focus on rapidly generating information from large, complex datasets.
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.