VidPickr glossary
YouTube video terms, in plain language.
What does “1080p AV1” actually mean? Why do some downloads end up bigger than others? What’s the difference between a codec and a container? 72 short entries explaining the words you keep seeing in YouTube quality menus and downloader output.
Codecs
Codec
A codec is the algorithm that encodes (compresses) and decodes raw audio or video into a smaller stream.
AV1
AV1 is a royalty-free, open-source video codec from the Alliance for Open Media.
VP9
VP9 is a royalty-free video codec developed by Google, used widely on YouTube and in WebM files.
H.264 (AVC)
H.
H.265 (HEVC)
H.
Opus (audio codec)
Opus is the modern royalty-free audio codec used by YouTube, WhatsApp voice, Discord, Zoom, and most modern web audio.
File formats
Container (file format)
A container is the file format that wraps one or more audio and video streams into a single file.
Fragmented MP4
Fragmented MP4 (fMP4) is an MP4 variant where the file is split into many short chunks ("fragments"), each containing its own header.
M4A
M4A is an audio-only file format that wraps AAC-encoded audio in an MP4 container.
WebM
WebM is an open container format developed by Google for the web.
MKV (Matroska)
MKV (Matroska) is an open, flexible container format that holds video, audio, subtitles, chapters, and metadata in a single file.
MP3
MP3 is the most widely supported audio format.
GIF (Graphics Interchange Format)
GIF is an early image format (1987) that supports lossless compression, transparency, and short animation loops.
MP4 (container, deep dive)
MP4 is the universal video container format — every device, browser, and editor handles it.
Caption / subtitle
Captions (also called subtitles) are text overlays that transcribe spoken content for accessibility, translation, or sound-off viewing.
Streaming protocols
HLS (HTTP Live Streaming)
HLS (HTTP Live Streaming) is a streaming protocol developed by Apple.
DASH (MPEG-DASH)
DASH (Dynamic Adaptive Streaming over HTTP) is an open streaming protocol.
Range request (HTTP byte range)
A range request is an HTTP request that asks for a specific byte range of a file rather than the full thing.
Manifest (streaming)
A manifest is a small text file that lists every segment of a streaming video, plus available qualities, codecs, and timing.
PoToken (Player Token)
PoToken is a signed token YouTube's player generates in the browser as proof that the request comes from a real player session, not a scraper.
BotGuard
BotGuard is the obfuscated JavaScript anti-bot system YouTube's web player executes in every viewer's browser.
itag (YouTube format identifier)
itag is YouTube's internal numeric identifier for a specific format combination — resolution, codec, container, and audio/video mix.
Adaptive streaming (ABR)
Adaptive streaming (Adaptive Bitrate, ABR) is a protocol where the player switches between multiple quality variants of the same video during playback based on real-time network conditions.
Canvas fingerprint
A canvas fingerprint is a unique signature derived from how a specific browser renders graphics — different GPUs, drivers, fonts, and OSes produce subtly different pixel output for the same rendering instructions.
MSE (Media Source Extensions)
MSE (Media Source Extensions) is a browser API that lets JavaScript feed video / audio data to the HTML5 video element segment by segment.
JWT (JSON Web Token)
A JWT (JSON Web Token) is a signed token format used to authenticate requests without server-side session storage.
CORS (Cross-Origin Resource Sharing)
CORS (Cross-Origin Resource Sharing) is the browser security rule that prevents JavaScript on one domain from making requests to a different domain unless the target explicitly allows it.
Service worker
A service worker is a script that runs in the background of a website, separate from the main page, intercepting network requests and providing offline support.
WebCodecs
WebCodecs is a modern browser API that gives JavaScript direct access to the browser's video and audio codecs — encode, decode, frame-by-frame manipulation.
Chunked transfer (HTTP)
Chunked transfer encoding is an HTTP feature that lets a server stream a response without knowing its total length up front.
WebRTC
WebRTC is the browser API for real-time peer-to-peer audio, video, and data.
HTTP status codes (203, 403, 410, 429)
HTTP status codes are 3-digit numbers servers return to indicate request outcomes.
DRM (Digital Rights Management)
DRM is the umbrella term for content-protection systems that restrict copying or redistributing media.
Rate limiting
Rate limiting is the practice of capping how many requests a client can make in a given time window.
HDR10 vs Dolby Vision
HDR10 is the open HDR standard (royalty-free, used by YouTube).
Quality metrics
Bitrate
Bitrate is the amount of data a video or audio stream carries per second, measured in bits per second (bps) or kilobits (kbps) and megabits (Mbps).
FPS (frames per second)
FPS (frames per second) is the number of distinct still images displayed per second of video.
Resolution
Resolution is the pixel count of a video frame, expressed as width × height.
Aspect ratio
Aspect ratio is the proportion of a video frame's width to its height.
VBR (variable bitrate)
VBR (variable bitrate) is an encoding mode where the encoder spends more bits on complex frames and fewer on simple ones, instead of allocating the same budget to every second.
Bandwidth
Bandwidth is the rate of data transfer per second between two endpoints, typically measured in Mbps (megabits per second).
Frame rate (advanced)
Frame rate is the number of frames displayed per second.
Audio normalization (LUFS, loudness)
Audio normalization is the process of adjusting volume levels so all content plays at roughly the same loudness.
Sample rate (audio)
Sample rate is the number of audio samples per second, measured in Hz or kHz.
Aspect ratio (advanced)
Aspect ratio is the proportional width-to-height of a video frame.
GOP length (keyframe interval)
GOP length is the duration between keyframes in a video.
Resolution + pixel density
Resolution is the pixel count of a video frame (1920×1080 = "1080p").
Core concepts
HDR (high dynamic range)
HDR (high dynamic range) is a video signaling standard that carries a wider range of brightness and color than standard SDR video.
GOP (group of pictures)
A GOP (group of pictures) is a sequence of video frames that starts with a keyframe and continues with frames described as differences from earlier frames.
Signed URL
A signed URL is a download link with cryptographic parameters that authenticate the request and expire after a set time.
Keyframe (I-frame)
A keyframe (I-frame) is a video frame that stores a complete picture, independent of any other frames.
Lossy compression
Lossy compression discards some data permanently to make files smaller.
Lossless compression
Lossless compression makes files smaller without discarding any data — the original is perfectly reconstructed when decoded.
Metadata (video file metadata)
Metadata is the information about a video file that isn't the audio or video data itself — title, artist, duration, resolution, codec used, encoding date, GPS location, thumbnail.
Geo-block (region restriction)
A geo-block is a server-side restriction that refuses to serve content to viewers based on their IP address's country.
CDN (Content Delivery Network)
A CDN (Content Delivery Network) is a globally distributed network of servers that caches content close to viewers, reducing latency and offloading traffic from origin servers.
Proxy (HTTP / SOCKS)
A proxy is an intermediary server that forwards your requests to the target — making the target see the proxy's IP instead of yours.
VPN (Virtual Private Network)
A VPN is a service that encrypts your internet traffic and routes it through a server in your chosen country.
Watermark (video overlay)
A watermark is a visible mark — usually a logo, text, or pattern — overlaid on a video to identify its source or owner.
Thumbnail (video preview image)
A thumbnail is the static preview image shown for a video before it plays — in search results, on the homepage, in subscription feeds.
Playback buffer
A playback buffer is data downloaded ahead of the playhead, ready to play if the network slows down.
Cookie (browser session)
A cookie is a small piece of data a website stores in your browser to track session state — whether you're logged in, which preferences you've set, what country you appear to be in.
Color space
A color space is the mathematical model that describes which colors a video file represents.
Streaming (vs download)
Streaming is the delivery model where the client starts consuming the file before it's fully downloaded.
Audio channel (mono, stereo, 5.1, etc.)
An audio channel is a single track of sound that gets routed to a specific speaker.
IP fingerprinting
IP fingerprinting is the practice of evaluating an IP address against its history, ASN, country, type (residential vs datacenter), and behavioral patterns to decide how to treat requests.
Workflow terms
Mux (muxing)
Muxing is the process of combining separate audio and video streams into a single container file (MP4, MKV, WebM).
Transcoding
Transcoding is the process of decoding a compressed media file and re-encoding it into a different codec, container, bitrate, or resolution.
FFmpeg
FFmpeg is the open-source command-line tool that handles essentially all video / audio processing on the internet.
yt-dlp
yt-dlp is the open-source command-line YouTube downloader that powers most of the YouTube-downloading ecosystem in 2026.
Transcription (speech-to-text)
Transcription is the process of converting spoken audio into written text.
Screen recording (vs downloading)
Screen recording captures whatever is shown on your display as a video file.
Want a term added? Tell us — we read every message and add useful entries quickly.