Definition

petabyte

By Rodney Brown, TechTarget, and Erin Sullivan, Senior Site Editor

Published: Dec 16, 2022

What is a petabyte?

A petabyte (PB) is a measure of memory or data storage capacity equal to 2 to the 50th power bytes, or 1,125,899,906,842,624 bytes. There are 1,024 terabytes (TB) in a petabyte, and 1,024 PB make up one exabyte (EB).
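The relationships between these units can be checked with a few lines of arithmetic. The sketch below assumes the binary (power-of-two) convention used in this definition; the constant names are illustrative and do not come from any library.

```python
# Binary (power-of-two) storage units, as used in this definition.
# The constant names are illustrative, not part of any standard library.
KB = 2 ** 10
MB = 2 ** 20
GB = 2 ** 30
TB = 2 ** 40
PB = 2 ** 50
EB = 2 ** 60

print(PB)        # bytes in one petabyte: 1125899906842624
print(PB // TB)  # terabytes per petabyte: 1024
print(EB // PB)  # petabytes per exabyte: 1024
```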

Traditional network-attached storage (NAS) is scalable and can handle petabytes of data, but at that scale, working through the system's organized storage index to find a file can take considerable time and resources.

In terms of memory, a typical laptop or desktop computer contains 16 GB of random access memory (RAM). A top-end server can contain as much as 6 TB of RAM. That means it would take 171 top-end servers, or 65,536 desktops, to add up to a single petabyte of RAM.
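Working the division through with binary units gives the machine counts directly. This is a minimal sketch: the 16 GB and 6 TB figures come from the text, and the helper name `machines_needed` is hypothetical.

```python
GB = 2 ** 30
TB = 2 ** 40
PB = 2 ** 50

DESKTOP_RAM = 16 * GB  # typical laptop or desktop, per the text
SERVER_RAM = 6 * TB    # top-end server, per the text


def machines_needed(target_bytes: int, ram_per_machine: int) -> int:
    """Smallest whole number of machines whose combined RAM reaches the target."""
    # Integer ceiling division: round up so the total is never short.
    return -(-target_bytes // ram_per_machine)


print(machines_needed(PB, SERVER_RAM))   # 171
print(machines_needed(PB, DESKTOP_RAM))  # 65536
```

Rounding up matters for the server count: 170 servers at 6 TB each would total 1,020 TB, which is 4 TB short of a petabyte.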

For another example of how large a petabyte is, a typical single-layer DVD holds 4.7 GB of data. That means a single terabyte of storage could hold about 217 DVD-quality movies, while a single petabyte of storage could hold about 223,101 DVD-quality movies.
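The DVD comparison is the same kind of division. The sketch below assumes, as the text does, that one movie fills one 4.7 GB disc and that the 4.7 GB figure can be treated as binary gigabytes; the function name `whole_dvds` is illustrative.

```python
import math

GB = 2 ** 30
TB = 2 ** 40
PB = 2 ** 50

# Assumption: 4.7 GB is treated as binary gigabytes, matching the article's arithmetic.
DVD_BYTES = 4.7 * GB


def whole_dvds(capacity_bytes: int) -> int:
    """Number of complete 4.7 GB discs that fit in the given capacity."""
    return math.floor(capacity_bytes / DVD_BYTES)


print(whole_dvds(TB))  # 217
print(whole_dvds(PB))  # 223101
```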


Petabyte storage vendors

Barely a decade ago, data storage vendors would boast of selling a petabyte or two in aggregate across all of their storage systems. Because data storage capacity requirements have continued to grow rapidly, it's now common to see individual companies, and even single storage systems, with more than a petabyte of storage capacity.

Storage vendors that offer petabyte-level storage include the following:

Fujitsu
Qnap
Spectra Logic
StoneFly
Vast Data

Petabyte backups and storage

Beyond traditional NAS, several data storage technologies can back up and archive data at petabyte scale:

Snapshots and other disk-based backup technologies provide a local copy of the data, enabling a rapid restore.
Tape and the cloud provide relatively low-cost backup options for petabytes of data, but they are more often used for off-site archival storage than for primary storage.
Solid-state storage can scan petabytes of data at a much higher speed than hard disk drives without sacrificing data integrity.
Object storage assigns each object a unique identifier, enabling the system to search large amounts of data in a flat space as opposed to examining a complete storage index to find a specific file.
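The difference between a flat object namespace and a hierarchical index can be sketched in a few lines. This is a toy, in-memory illustration only: `FlatObjectStore` and `find_in_tree` are hypothetical names, a nested dictionary stands in for a directory tree, and real object stores are distributed systems with far more machinery.

```python
import uuid
from typing import Optional


class FlatObjectStore:
    """Toy in-memory object store: one flat namespace keyed by unique ID."""

    def __init__(self) -> None:
        self._objects: dict = {}

    def put(self, data: bytes, metadata: Optional[dict] = None) -> str:
        # Assign the object a unique identifier and store it with its metadata.
        object_id = str(uuid.uuid4())
        self._objects[object_id] = (data, metadata or {})
        return object_id

    def get(self, object_id: str) -> bytes:
        # A single keyed lookup; no directory hierarchy is traversed.
        return self._objects[object_id][0]


def find_in_tree(tree: dict, name: str, path: str = "") -> Optional[str]:
    """Depth-first walk of a nested dict standing in for a directory tree."""
    for key, value in tree.items():
        current = f"{path}/{key}"
        if isinstance(value, dict):
            found = find_in_tree(value, name, current)
            if found is not None:
                return found
        elif key == name:
            return current
    return None


store = FlatObjectStore()
object_id = store.put(b"sensor readings", {"source": "demo"})
print(store.get(object_id))

tree = {"projects": {"2022": {"report.txt": b"quarterly numbers"}}}
print(find_in_tree(tree, "report.txt"))  # /projects/2022/report.txt
```

The object store answers a request with one lookup by identifier, while the tree search may visit many directories before it finds the file, and that cost grows with the size of the hierarchy.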


Petabytes and big data

There is no specific quantity of data that qualifies as big data, but the term often refers to information in the petabyte, or even exabyte, range. Mining for information across petabytes of data is a time-consuming task. Organizations working with big data often use the Hadoop Distributed File System because it facilitates rapid data transfer and enables a system to operate uninterrupted while working with petabytes of data.

To get a sense of how large some data stores have become, in July 2017, the European research center CERN announced that its data center had 200 PB archived in its tape library.

With the increased use of 4K video and the advent of the internet of things, IDC predicted that by 2025 there will be 175 zettabytes -- or approximately 175,000,000 PB -- of data that needs storage.
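The "approximately" in that conversion depends on which unit convention is applied. The sketch below works the figure out both ways; which convention the forecast itself uses is not stated here, so both results are shown as assumptions rather than as the forecaster's own numbers.

```python
ZETTABYTES = 175  # forecast figure quoted in the text

# Decimal (SI) convention: 1 ZB = 10**21 bytes, 1 PB = 10**15 bytes.
pb_decimal = ZETTABYTES * 10 ** 21 // 10 ** 15

# Binary convention used elsewhere in this definition: 2**70 and 2**50 bytes.
pb_binary = ZETTABYTES * 2 ** 70 // 2 ** 50

print(f"{pb_decimal:,}")  # 175,000,000
print(f"{pb_binary:,}")   # 183,500,800
```

The two conventions differ by about 4.9% at this scale, which is why the decimal figure of 175,000,000 PB is only approximate under the binary definition.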

Editor's note: This article was revised in 2022 by TechTarget editors to improve the reader experience.
