Metodo

International Studies in Phenomenology and Philosophy

Book | Chapter

196909

Abstract

Of the core challenges originally associated with Big Data, namely Volume, Velocity, and Variety, the Variety aspect is the one that is least addressed by the standard analytics architectures. In this chapter, we analyze types and sources of variety and describe data- and metadata management principles for organizing data lakes. We discuss how semantic metadata can help describe and manage variety in structure, provenance, visibility and permitted use. Moreover, ontologies and metadata catalogs can aid discovery, navigation, exploration, and interpretation of heterogeneous data lakes, and can simplify interpretation, lift data quality, and simplify integration of multiple data sets. We present an application of these principles in a data architecture for the Law Enforcement domain in Australia.

Publication details

Published in:

Hoppe Thomas, Humm Bernhard, Reibold Anatol (2018) Semantic applications: methodology, technology, corporate use. Dordrecht, Springer.

Pages: 47-62

DOI: 10.1007/978-3-662-55433-3_4

Full citation:

Mayer Wolfgang, Grossmann Georg, Selway Matt, Stanek Jan, Stumptner Markus (2018) „Variety management for big data“, In: T. Hoppe, B. Humm & A. Reibold (eds.), Semantic applications, Dordrecht, Springer, 47–62.