How big is Big Data? A Comprehensive Survey of Data Production, Storage, and Streaming in Science and Industry
- 1National Institute of Nuclear Physics of Bologna, Italy
- 2University of Bologna, Italy
- 3European Organization for Nuclear Research (CERN), Switzerland
The contemporary surge in data production is fueled by diverse factors, with contributions from numerous stakeholders across various sectors. Comparing the volumes at play among different big data entities is challenging due to the scarcity of publicly available data. This survey aims to offer a comprehensive perspective on the orders of magnitude involved in yearly data generation by some public and private leading organizations, using an array of online sources for estimation. These estimates are based on meaningful, individual data production metrics and plausible per-unit sizes. The primary objective is to offer insights into the comparative scales of major big data players, their sources, and data production flows, rather than striving for precise measurements or incorporating the latest updates. The results are succinctly conveyed through a visual representation of the relative data generation volumes across these entities.
Keywords: big data, data production, Data volumes, data storage, Streaming data
Received: 02 Aug 2023;
Accepted: 20 Sep 2023.
Copyright: © 2023 Clissa, Lassnig and Rinaldi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Dr. Luca Clissa, National Institute of Nuclear Physics of Bologna, Bologna, Italy