Digital Storage Explained: From Bits to Terabytes
You buy a 1 TB hard drive, plug it in, and your computer says it holds 931 GB. Not 1,000 GB. Not even 999 GB. You check the box, check the receipt, and wonder if someone made a mistake. Nobody made a mistake. The drive contains exactly as many bytes as advertised. The problem is that the word "gigabyte" means two different things depending on who is using it.
This gap between what you expected and what your screen shows traces back to a conflict between two counting systems that has persisted for decades, produced lawsuits, and confused millions of people. Here is how it works.
The Bit, the Byte, and Where They Came From
Every piece of data on a computer reduces to binary digits: ones and zeros. A single binary digit is a bit. In 1956, Werner Buchholz, a computer scientist working on the IBM Stretch supercomputer, coined the term byte to describe a group of bits processed as one unit. He deliberately misspelled "bite" so nobody would confuse it with "bit" in written documentation. Early bytes varied in size (some systems used 6-bit bytes), but the 8-bit byte eventually won out and became universal.
Eight bits give you 256 possible values (2 to the power of 8). That is enough to represent a single text character, a small number, or one pixel's worth of color data. Everything else in digital storage builds upward from that 8-bit foundation.
Two Ways to Count: Decimal vs. Binary
Here is where the confusion starts. In everyday math, "kilo" means 1,000. A kilogram is 1,000 grams. A kilometer is 1,000 meters. So a kilobyte should be 1,000 bytes, right?
In the decimal (base-10) system, it is. But computers operate in binary (base-2), and the nearest clean power of 2 to 1,000 is 1,024 (which is 2 to the power of 10). Early computer engineers used "kilobyte" to mean 1,024 bytes because it was close enough to 1,000 and mapped neatly onto binary architecture. The same pattern repeated up the scale: "megabyte" meant 1,024 ร 1,024 (1,048,576) bytes, and "gigabyte" meant 1,024 ร 1,024 ร 1,024 (1,073,741,824) bytes.
For small amounts, the difference is tiny. At the kilobyte level, it is only 2.4%. But the gap compounds. By the time you reach terabytes, the binary value (1,099,511,627,776 bytes) is nearly 10% larger than the decimal value (1,000,000,000,000 bytes). That 10% is your "missing" 69 GB on a 1 TB drive.
| Unit | Decimal (SI) | Binary | Difference |
|---|---|---|---|
| Kilobyte | 1,000 bytes | 1,024 bytes | 2.4% |
| Megabyte | 1,000,000 bytes | 1,048,576 bytes | 4.9% |
| Gigabyte | 1,000,000,000 bytes | 1,073,741,824 bytes | 7.4% |
| Terabyte | 1,000,000,000,000 bytes | 1,099,511,627,776 bytes | 10.0% |
Need to convert between these units? The gigabytes to megabytes converter handles the decimal math, and the terabytes to gigabytes converter works for larger capacities.
The IEC Fix: KiB, MiB, GiB
In December 1998, the International Electrotechnical Commission (IEC) attempted to solve this mess. They published standard IEC 60027-2, which introduced a new set of prefixes for binary multiples: kibibyte (KiB) for 1,024 bytes, mebibyte (MiB) for 1,048,576 bytes, gibibyte (GiB) for 1,073,741,824 bytes, and tebibyte (TiB) for 1,099,511,627,776 bytes. The names combine the first two letters of the SI prefix with "bi" for binary: kibi, mebi, gibi, tebi.
Under this system, the old SI prefixes (kilo, mega, giga, tera) would mean exactly what they mean everywhere else in science: powers of 1,000. A gigabyte would always be 1,000,000,000 bytes. A gibibyte would always be 1,073,741,824 bytes. Clean, unambiguous.
The reality is that adoption has been slow. Linux distributions adopted the IEC prefixes years ago. macOS switched in 2009, which is why a Mac reports a 1 TB drive as "1 TB" rather than "931 GB." Windows still uses the old binary interpretation while labeling it "GB," which keeps the confusion alive for the largest share of desktop users. Most people have never heard the word "gibibyte."
Why Your Drive Shows Less Than Advertised
Drive manufacturers use the decimal definition. When Western Digital, Seagate, or Samsung prints "1 TB" on a box, they mean 1,000,000,000,000 bytes. This is technically correct by SI standards. Your operating system, however, may divide that byte count by 1,024 at each level instead of 1,000. The result: 1,000,000,000,000 divided by 1,024, then by 1,024, then by 1,024 again, equals approximately 931 GB as displayed by the OS.
This discrepancy has not been purely academic. In the early 2000s, class action lawsuits targeted hard drive makers including Western Digital and Seagate. Plaintiffs argued that advertising drives in decimal gigabytes was misleading because operating systems reported less space. Western Digital settled in 2006, offering free backup software to roughly one million affected customers. Seagate reached a similar settlement, providing cash refunds or free software. The drives always contained every byte printed on the label. The argument was about consumer expectations, not missing data.
What Actually Fits on a Drive?
Abstract byte counts do not mean much without context. Here is a rough guide to what real files look like, using standard decimal megabytes to kilobytes and gigabytes to terabytes measurements.
| File Type | Typical Size | Per 1 GB | Per 1 TB |
|---|---|---|---|
| Text document (5 pages) | 50 KB | ~20,000 docs | ~20 million |
| MP3 song (4 min) | 4 MB | ~250 songs | ~250,000 songs |
| JPEG photo (12 MP) | 3-5 MB | ~250 photos | ~250,000 photos |
| RAW photo | 25-50 MB | ~25 photos | ~25,000 photos |
| 1080p movie (2 hr) | 1.5-4 GB | ~1 movie | ~300 movies |
| 4K movie (2 hr) | 8-15 GB | partial movie | ~80 movies |
These numbers shift with compression, quality settings, and codecs. A heavily compressed 1080p stream from a streaming service might be 1.5 GB, while a Blu-ray rip of the same film could be 30 GB or more.
The Hierarchy: Bits to Petabytes
For reference, here is the full ladder of common storage units in decimal notation:
- 1 byte = 8 bits
- 1 kilobyte (KB) = 1,000 bytes
- 1 megabyte (MB) = 1,000 KB = 1 million bytes
- 1 gigabyte (GB) = 1,000 MB = 1 billion bytes
- 1 terabyte (TB) = 1,000 GB = 1 trillion bytes
- 1 petabyte (PB) = 1,000 TB = 1 quadrillion bytes
Consumer devices currently top out at a few terabytes for SSDs and up to about 30 TB for enterprise hard drives. Cloud providers like Google, Amazon, and Microsoft manage storage measured in exabytes (1,000 PB). The gigabytes to megabytes converter is handy when you need to compare file sizes across different scales.
The Short Version
Drive makers count in powers of 1,000. Some operating systems count in powers of 1,024. The IEC created new prefixes (KiB, MiB, GiB) to eliminate the ambiguity, but the world has been slow to adopt them. Your 1 TB drive holds exactly 1 trillion bytes. Whether that reads as "1 TB" or "931 GB" on screen depends entirely on which definition your OS applies to the word "gigabyte."
Nobody is cheating you. The bytes are all there. The labels just have not caught up with the math.
Sources: IEC 60027-2, NIST, IEEE, Werner Buchholz / IBM (1956)