IDC: IoT expected to generate 79.4ZB of data in 2025
A new forecast from IDC estimates that there will be 41.6 billion connected IoT devices, or ‘things,’ generating 79.4 zettabytes (ZB) of data in 2025.
As the number of connected IoT devices grows, the amount of data generated by these devices will also grow.
Some of this data is small and bursty, indicating a single metric of a machine's health, while large amounts of data can be generated by video surveillance cameras using computer vision to analyse crowds of people, for example.
There is an obvious direct relationship between all the ‘things’ and the data these things create.
"Mankind is on a quest to digitise the world and a growing global DataSphere is the result. The world around us is becoming more 'sensorised,' bringing new levels of intelligence and order to personal and seemingly random environments, and Internet of Things devices are an integral part of this process," says IDC Global DataSphere senior vice president David Reinsel.
"However, with every new connection comes a responsibility to navigate and manage new security vulnerabilities and privacy concerns. Companies must address these data hazards as they advance new levels of efficiency and customer experience."
IDC projects that the amount of data created by these connected IoT devices will see a compound annual growth rate (CAGR) of 28.7% over the 2018-2025 forecast period.
Most of the data is being generated by video surveillance applications, but other categories such as industrial and medical will increasingly generate more data over time.
"As the market continues to mature, IoT increasingly becomes the fabric enabling the exchange of information from 'things', people, and processes,” says IDC IoT, 5G and mobility group vice president Carrie MacGillivray.
“Data becomes the common denominator – as it is captured, processed, and used from the nearest and farthest edges of the network to create value for industries, governments, and individuals' lives. Understanding the amount of data created from the myriad of connected devices allows organisations and vendors to build solutions that can scale in this accelerating data-driven IoT market.”
While it's not surprising to see industrial and automotive equipment represent the largest opportunity of connected "things," IDC expects to see strong adoption of household (e.g., smart home) and wearable devices in the near term.
Over the longer term, however, with public safety concerns, decreasing camera costs, and higher bandwidth options available (including the deployment of 5G networks offering low latency, dense coverage, and high bandwidth), video surveillance will grow in adoption at a rapid rate.
Drones, while still early in adoption today, show great potential to access remote or hard to reach locations and will also be a big driver of data creation using cameras.
While the video surveillance category will drive a large share of the IoT data created, the industrial and automotive category will see the fastest data growth rates over the forecast period with a CAGR of 60%.
This is the result of the increasing number of ‘things’ (other than video surveillance cameras) that are capturing data continuously as well as more advanced sensors capturing more (and richer) metrics or machine functions.
This rich data includes audio, image, and video. And, where analytics and artificial intelligence are magnifying data creation beyond just the data capture, data per device is growing at a faster pace than data per video surveillance camera.
It should also be noted that the IoT metadata category is a growing source of data to be managed and leveraged.
IoT metadata is, essentially, all the data that is created about other IoT data files. While not having a direct operational or informational function in a specific data category (like industrial or video surveillance), metadata provides the information about the data files captured or created by the IoT device.
Metadata, compared with original source files like a video image, is very small, sometimes by orders of magnitude.
In other cases, however, metadata can mimic the size of the source file, such as in a manufacturing environment.
In all cases, metadata is valuable data that can be leveraged to inform intelligent systems, drive personalisation, or bring context to seemingly random scenarios or data sets.
In other words, metadata is a prime candidate to be fed into NoSQL databases like MongoDB to bring structure to unstructured content or fed into cognitive systems to bring new levels of understanding, intelligence, and order to outwardly random environments.
The IDC report, Worldwide Global DataSphere IoT Device and Data Forecast, 2019-2023, provides a forecast of the number of IoT "things" that are connected as well as the data generated by these things.
This forecast study also categorises the devices and the data across several categories including video surveillance, industrial, household, medical, industry terminals, and other.
IDC defines an IoT device as a uniquely identifiable ‘thing’ or endpoint that can autonomously connect bidirectionally using connectivity to exchange data via the internet.
As these things become connected, they generate data that allows for monitoring, management, and analysis to be done on the state of these devices or the surrounding environment.
IDC's Global DataSphere sizes and forecasts data creation, capture, and replication across 70 categories of content-creating things — including non-IoT devices as well as IoT devices.
The data is then categorised into the types of data being created to understand various trends in data usage, consumption, and storage.