A data lake is “a massive, easily accessible data repository built on (relatively) inexpensive computer hardware for storing ‘big data.’ ” The term was invented by James Dixon of Pentaho to describe the vast data repositories used in modern Big Data applications. Enterprises often use data lakes as repositories for reduced-order, structured data sets. Post-processing […]
Continue ReadingWhat is Hadoop? Why do I need to know about it? Suppose your company collects a lot of data—not just gigabytes but terabytes or petabytes. To make that data useful, you need a system to store all that data reliably and retrieve and manipulate it quickly. Hadoop is a distributed architecture and infrastructure for storing and […]
Continue ReadingFree-form unstructured data, which represents over 80 percent of all new data created today, is the new frontier for business insights. Much of it can be considered dark data because it is collected, processed, and stored but not analyzed or used. For background, see our previous post, What is Unstructured Data? What kinds of opportunities […]
Continue ReadingWhat is unstructured data? Unstructured data is a catch-all term used to describe free-form information—text, images, audio, videos—that is not organized inside a well-defined storage structure, such as a relational database management system or a financial application. Unstructured data comes in many forms. How is it different from structured data? Structured data has a predefined […]
Continue ReadingThe Internet of Things (IoT)—the network of physical objects that sense, interact, or communicate with the external environment—is growing exponentially. As those data-emanating devices proliferate, so does the total amount of data your company may have to put through an analytics framework. A data access platform like Aureum will prove essential to keeping enterprises from […]
Continue ReadingThe Peaxy Executive Summary Series is designed to explain quickly and simply what business leaders need to know about big data and data access systems. The world is being “datafied” and the result is Big Data—too large, complex, and dynamic for conventional data tools to capture, store, manage, and analyze. For background on the proliferation of […]
Continue ReadingThe Peaxy Executive Summary Series is designed to explain quickly and simply what business leaders need to know about big data and data access systems. The world is being “datafied” and the result is Big Data—too large, complex, and dynamic for conventional data tools to capture, store, manage, and analyze. For background on why we are […]
Continue ReadingThe Peaxy Executive Summary Series is designed to explain quickly and simply what business leaders need to know about big data and data access systems. The world is being “datafied” and the result is Big Data—too large, complex, and dynamic for conventional data tools to capture, store, manage, and analyze. For background on the proliferation of […]
Continue ReadingI’M CONSTANTLY HEARING ABOUT “BIG DATA,” BUT THE TERM SEEMS VAGUE. WHAT IS IT? It is vague, but it’s real. A good working definition is: data that is too large, complex, and dynamic for conventional data tools to capture, store, manage, and analyze. WHAT’S THE BIG DEAL? IS THERE REALLY THAT MUCH DATA? Every minute […]
Continue Reading