TL;DR – 2021 was the year file formats finally caught up with the web‑first, mobile‑first world: royalty‑free, HDR‑ready, and AI‑friendly standards displaced many legacy codecs. PDF 2.0 and DOCX/ODF dominate documents, AVIF/WebP and HEIF win on images, Opus takes over real‑time audio, AV1 starts to replace HEVC for video, and columnar formats like Parquet + Arrow become the backbone of big‑data pipelines.
1. Documents & Publishing – Two Pillars, One Ecosystem
| Format | 2021 status | Why you should care |
|---|---|---|
| PDF 2.0 (ISO 32000‑2) | Mature, still the universal static‑document format. | Embeds 3‑D, rich media, digital signatures, and improved accessibility tags. Great for contracts, e‑books, and government forms. |
| PDF/A‑3 | Growing in regulated sectors (finance, pharma). | Lets you bundle any file (XML, CSV, etc.) inside the PDF for audit trails – perfect for invoicing with attached data. |
| DOCX / Office Open XML | > 85 % of corporate docs (Statista 2021). | ZIP‑based container separates text, styles, and media; extensible via custom XML parts. Ideal for collaborative editing. |
| ODF (OpenDocument Format) | Niche but required in many EU public‑sector contracts. | Fully open, royalty‑free, strong spreadsheet & formula support. |
| EPUB 3 | > 30 % of new titles (Publishers Weekly). | HTML5/CSS3‑based, supports audio/video, MathML, and fixed‑layout for graphic‑heavy books. |
| MOBI / AZW3 | Still the Kindle’s workhorse, but being phased out. | Proprietary DRM, limited CSS – good for legacy Kindle e‑books. |
Takeaway: 2021 cemented the static vs. editable divide. PDF 2.0 handles secure, unchangeable distribution, while DOCX/ODF keep the edit‑in‑place workflow alive. EPUB 3, meanwhile, is the go‑to for multimedia‑rich publishing that needs to reflow on any screen.
2. Images – From JPEG to AVIF, HDR, and Beyond
| Format | 2021 adoption | Key strengths |
|---|---|---|
| AVIF (AV1 Image File Format) | Supported in Chrome 90+, Firefox 93+, Android 12; ~15 % of web images on major news sites. | 50 % smaller than WebP, 10‑/12‑bit HDR, alpha channel – royalty‑free. |
| WebP | ~30 % of images served by the top‑10 websites. | Lossy & lossless, animation, fast GPU decode on mobile. |
| HEIF/HEIC | Dominant on iOS 14 (≈60 % of iPhone photos). | 2× JPEG compression, depth‑map & burst‑mode support. |
| JPEG XL | Early‑adopter browsers (Chrome/Firefox Nightly). | Lossless + lossy, 35 % smaller than JPEG at equal quality, HDR, animation. |
| PNG | Still the default for lossless UI assets. | Universal support, lossless transparency. |
| SVG | 100 % browser support; the de‑facto format for icons. | Vector, scriptable, CSS‑stylable – file size scales with complexity, not resolution. |
| PSD | Essential in creative pipelines (1‑2 % of web images). | Layered, adjustment maps, smart objects – widely readable via libraries. |
Why the shift matters
Lossy vs. lossless is no longer a binary decision. AVIF gives you lossy compression that rivals JPEG while still offering a lossless mode for archival. HDR and wide‑color (10‑/12‑bit) are now a baseline requirement for modern displays, and both AVIF and HEIF deliver it without the licensing baggage of JPEG‑XR or proprietary formats.
3. Audio & Video – The Royalty‑Free Wave
Audio
| Format | 2021 market share | Why it’s winning |
|---|---|---|
| AAC | ~55 % of streaming (Spotify, Apple Music). | Better quality than MP3 at the same bitrate; universal device support. |
| Opus | Fast‑growing; default in WebRTC, Discord, many podcasts. | Hybrid speech‑music codec, 6 kbps‑510 kbps, low latency, adaptive bitrate – delivers higher perceived quality than AAC at lower bitrates. |
| FLAC | ~12 % of high‑fidelity streaming (Tidal HiFi, Amazon Music HD). | True lossless, rich metadata, fast decoding. |
| MP3 | Still > 30 % of legacy libraries but on the decline. | Ubiquitous, but limited to 320 kbps and no surround. |
| ALAC | Primary for Apple ecosystem. | Lossless, native iOS/macOS support. |
Key point: 2021 saw Opus become the standard for real‑time communication in browsers (Chrome 89+, Firefox 86+). Its psychoacoustic model gives you “studio‑quality” sound at 64 kbps, which is a game‑changer for low‑bandwidth video calls and podcasts.
Video
| Format | 2021 usage | Why it matters |
|---|---|---|
| AV1 | ~10 % of YouTube streams (first large‑scale rollout). | Royalty‑free, 30‑50 % better compression than HEVC, hardware decode on Intel Xe, Nvidia RTX 30, Apple A14. |
| HEVC (H.265) | Still dominant for 4K/8K OTT (Netflix, Prime). | Patented, excellent compression, but licensing fees limit web adoption. |
| H.264/AVC | > 80 % of all video traffic. | Ubiquitous hardware acceleration; the “lowest common denominator”. |
| VP9 | Used by YouTube for 4K (~30 % of 4K streams). | Open, royalty‑free, now being eclipsed by AV1. |
| WebM (VP9/AV1 + Opus) | Default for HTML5 video on Chrome/Firefox. | Container that avoids licensing headaches. |
Why AV1 matters – It’s the first royalty‑free codec that can consistently beat HEVC on 4K HDR content while being decoded in real time on mainstream mobile SoCs. That’s why Google, Netflix, and even Microsoft are betting on it for the next generation of streaming.
4. Data, Archives & Compression – Speed, Size, and Security
| Category | Popular formats (2021) | Highlights |
|---|---|---|
| Document containers | PDF 2.0, DOCX, ODF | Encryption (AES‑256), digital signatures, long‑term validation. |
| Image containers | AVIF, WebP, HEIF | HDR, alpha, lossless‑lossy dual mode. |
| Audio containers | MP4 (AAC), Ogg (Opus), FLAC | Streaming‑friendly (HLS/DASH) wrappers. |
| Video containers | MP4 (AV1/HEVC), WebM (AV1) | Adaptive streaming, DRM integration. |
| Archive & compression | ZIP, 7z, tar.xz, Zstandard (zstd), Brotli | 7z/LZMA2 for maximum ratio; zstd for fast cloud‑side compression; Brotli for HTTP text assets. |
| Big‑data interchange | Parquet, Apache Arrow, JSON‑LD, CSV | Columnar storage + dictionary encoding cuts a 1 TB table from ~300 GB (CSV) to ~45 GB (Parquet). Arrow enables zero‑copy sharing between Python, Java, and Rust. |
Practical tip: For any workflow that moves data between services (e.g., ETL pipelines), store the source of truth in Parquet with encryption (Parquet 1.12, 2021) and use Arrow for in‑memory analytics. For web assets, Brotli‑compress HTML/CSS/JS and serve images as AVIF or WebP to shave bandwidth dramatically.
5. Emerging & Niche Formats Worth Watching
- JPEG XL – Still experimental but promises lossless + lossy in one file, plus animation.
- HEIC/HEIF – Already the default on iOS 14; expect Android to follow suit.
- GLTF 2.0 – The “JPEG of 3‑D”, now the standard for web‑based AR/VR and game asset exchange.
- USD (Universal Scene Description) – Adopted by Pixar and entering Unity’s preview pipeline; ideal for complex, layered scenes.
- Zstandard – Fast, tunable compression gaining ground in container images (Docker) and Linux package managers.
- PDF 2.0 – New digital‑signature and accessibility features make it the go‑to for secure, compliant PDFs.
Quick Takeaways for Your 2021 (and beyond) Workflow
- Web images: Serve AVIF first, fall back to WebP, then JPEG. Expect a 30‑50 % reduction in bandwidth.
- Audio streaming: Use Opus for live or podcast content; keep AAC for legacy music libraries.
- Video delivery: Start experimenting with AV1‑encoded MP4 files; browsers already decode them on most modern devices.
- Data pipelines: Store raw analytics in Parquet + Arrow; compress intermediate files with Zstandard for speed.
- Document exchange: Adopt PDF 2.0 for any contract or form that needs signatures, and keep DOCX/ODF for collaborative drafts.
Final thought – 2021 wasn’t just another year of incremental updates; it was the moment the industry collectively said “enough with proprietary, bandwidth‑hungry formats.” The rise of royalty‑free, HDR‑ready, and AI‑friendly standards means smaller files, faster loads, and more secure data—all without the headache of licensing negotiations. If you align your toolchain with the formats above, you’ll be ready for the next wave of web, mobile, and data‑intensive applications. Happy converting!