TL;DR – 2021 was the year file formats finally caught up with the web‑first, mobile‑first world: royalty‑free, HDR‑ready, and AI‑friendly standards displaced many legacy codecs. PDF 2.0 and DOCX/ODF dominate documents, AVIF/WebP and HEIF win on images, Opus takes over real‑time audio, AV1 starts to replace HEVC for video, and columnar formats like Parquet + Arrow become the backbone of big‑data pipelines.


1. Documents & Publishing – Two Pillars, One Ecosystem

Format2021 statusWhy you should care
PDF 2.0 (ISO 32000‑2)Mature, still the universal static‑document format.Embeds 3‑D, rich media, digital signatures, and improved accessibility tags. Great for contracts, e‑books, and government forms.
PDF/A‑3Growing in regulated sectors (finance, pharma).Lets you bundle any file (XML, CSV, etc.) inside the PDF for audit trails – perfect for invoicing with attached data.
DOCX / Office Open XML> 85 % of corporate docs (Statista 2021).ZIP‑based container separates text, styles, and media; extensible via custom XML parts. Ideal for collaborative editing.
ODF (OpenDocument Format)Niche but required in many EU public‑sector contracts.Fully open, royalty‑free, strong spreadsheet & formula support.
EPUB 3> 30 % of new titles (Publishers Weekly).HTML5/CSS3‑based, supports audio/video, MathML, and fixed‑layout for graphic‑heavy books.
MOBI / AZW3Still the Kindle’s workhorse, but being phased out.Proprietary DRM, limited CSS – good for legacy Kindle e‑books.

Takeaway: 2021 cemented the static vs. editable divide. PDF 2.0 handles secure, unchangeable distribution, while DOCX/ODF keep the edit‑in‑place workflow alive. EPUB 3, meanwhile, is the go‑to for multimedia‑rich publishing that needs to reflow on any screen.


2. Images – From JPEG to AVIF, HDR, and Beyond

Format2021 adoptionKey strengths
AVIF (AV1 Image File Format)Supported in Chrome 90+, Firefox 93+, Android 12; ~15 % of web images on major news sites.50 % smaller than WebP, 10‑/12‑bit HDR, alpha channel – royalty‑free.
WebP~30 % of images served by the top‑10 websites.Lossy & lossless, animation, fast GPU decode on mobile.
HEIF/HEICDominant on iOS 14 (≈60 % of iPhone photos).2× JPEG compression, depth‑map & burst‑mode support.
JPEG XLEarly‑adopter browsers (Chrome/Firefox Nightly).Lossless + lossy, 35 % smaller than JPEG at equal quality, HDR, animation.
PNGStill the default for lossless UI assets.Universal support, lossless transparency.
SVG100 % browser support; the de‑facto format for icons.Vector, scriptable, CSS‑stylable – file size scales with complexity, not resolution.
PSDEssential in creative pipelines (1‑2 % of web images).Layered, adjustment maps, smart objects – widely readable via libraries.

Why the shift matters
Lossy vs. lossless is no longer a binary decision. AVIF gives you lossy compression that rivals JPEG while still offering a lossless mode for archival. HDR and wide‑color (10‑/12‑bit) are now a baseline requirement for modern displays, and both AVIF and HEIF deliver it without the licensing baggage of JPEG‑XR or proprietary formats.


3. Audio & Video – The Royalty‑Free Wave

Audio

Format2021 market shareWhy it’s winning
AAC~55 % of streaming (Spotify, Apple Music).Better quality than MP3 at the same bitrate; universal device support.
OpusFast‑growing; default in WebRTC, Discord, many podcasts.Hybrid speech‑music codec, 6 kbps‑510 kbps, low latency, adaptive bitrate – delivers higher perceived quality than AAC at lower bitrates.
FLAC~12 % of high‑fidelity streaming (Tidal HiFi, Amazon Music HD).True lossless, rich metadata, fast decoding.
MP3Still > 30 % of legacy libraries but on the decline.Ubiquitous, but limited to 320 kbps and no surround.
ALACPrimary for Apple ecosystem.Lossless, native iOS/macOS support.

Key point: 2021 saw Opus become the standard for real‑time communication in browsers (Chrome 89+, Firefox 86+). Its psychoacoustic model gives you “studio‑quality” sound at 64 kbps, which is a game‑changer for low‑bandwidth video calls and podcasts.

Video

Format2021 usageWhy it matters
AV1~10 % of YouTube streams (first large‑scale rollout).Royalty‑free, 30‑50 % better compression than HEVC, hardware decode on Intel Xe, Nvidia RTX 30, Apple A14.
HEVC (H.265)Still dominant for 4K/8K OTT (Netflix, Prime).Patented, excellent compression, but licensing fees limit web adoption.
H.264/AVC> 80 % of all video traffic.Ubiquitous hardware acceleration; the “lowest common denominator”.
VP9Used by YouTube for 4K (~30 % of 4K streams).Open, royalty‑free, now being eclipsed by AV1.
WebM (VP9/AV1 + Opus)Default for HTML5 video on Chrome/Firefox.Container that avoids licensing headaches.

Why AV1 matters – It’s the first royalty‑free codec that can consistently beat HEVC on 4K HDR content while being decoded in real time on mainstream mobile SoCs. That’s why Google, Netflix, and even Microsoft are betting on it for the next generation of streaming.


4. Data, Archives & Compression – Speed, Size, and Security

CategoryPopular formats (2021)Highlights
Document containersPDF 2.0, DOCX, ODFEncryption (AES‑256), digital signatures, long‑term validation.
Image containersAVIF, WebP, HEIFHDR, alpha, lossless‑lossy dual mode.
Audio containersMP4 (AAC), Ogg (Opus), FLACStreaming‑friendly (HLS/DASH) wrappers.
Video containersMP4 (AV1/HEVC), WebM (AV1)Adaptive streaming, DRM integration.
Archive & compressionZIP, 7z, tar.xz, Zstandard (zstd), Brotli7z/LZMA2 for maximum ratio; zstd for fast cloud‑side compression; Brotli for HTTP text assets.
Big‑data interchangeParquet, Apache Arrow, JSON‑LD, CSVColumnar storage + dictionary encoding cuts a 1 TB table from ~300 GB (CSV) to ~45 GB (Parquet). Arrow enables zero‑copy sharing between Python, Java, and Rust.

Practical tip: For any workflow that moves data between services (e.g., ETL pipelines), store the source of truth in Parquet with encryption (Parquet 1.12, 2021) and use Arrow for in‑memory analytics. For web assets, Brotli‑compress HTML/CSS/JS and serve images as AVIF or WebP to shave bandwidth dramatically.


5. Emerging & Niche Formats Worth Watching

  • JPEG XL – Still experimental but promises lossless + lossy in one file, plus animation.
  • HEIC/HEIF – Already the default on iOS 14; expect Android to follow suit.
  • GLTF 2.0 – The “JPEG of 3‑D”, now the standard for web‑based AR/VR and game asset exchange.
  • USD (Universal Scene Description) – Adopted by Pixar and entering Unity’s preview pipeline; ideal for complex, layered scenes.
  • Zstandard – Fast, tunable compression gaining ground in container images (Docker) and Linux package managers.
  • PDF 2.0 – New digital‑signature and accessibility features make it the go‑to for secure, compliant PDFs.

Quick Takeaways for Your 2021 (and beyond) Workflow

  1. Web images: Serve AVIF first, fall back to WebP, then JPEG. Expect a 30‑50 % reduction in bandwidth.
  2. Audio streaming: Use Opus for live or podcast content; keep AAC for legacy music libraries.
  3. Video delivery: Start experimenting with AV1‑encoded MP4 files; browsers already decode them on most modern devices.
  4. Data pipelines: Store raw analytics in Parquet + Arrow; compress intermediate files with Zstandard for speed.
  5. Document exchange: Adopt PDF 2.0 for any contract or form that needs signatures, and keep DOCX/ODF for collaborative drafts.

Final thought – 2021 wasn’t just another year of incremental updates; it was the moment the industry collectively said “enough with proprietary, bandwidth‑hungry formats.” The rise of royalty‑free, HDR‑ready, and AI‑friendly standards means smaller files, faster loads, and more secure data—all without the headache of licensing negotiations. If you align your toolchain with the formats above, you’ll be ready for the next wave of web, mobile, and data‑intensive applications. Happy converting!