What is Sample Rate?
Sample rate (measured in Hz or kHz) defines how many times per second the audio waveform is captured. Think of it like frames in a video — more samples per second means a more accurate representation of the original sound.
According to the Nyquist-Shannon theorem, a sample rate must be at least twice the highest frequency you want to capture. Human hearing tops out at roughly 20 kHz, so 44.1 kHz (used by CDs) captures everything we can hear.
| Sample Rate | Max Frequency | Common Use |
|---|---|---|
| 22.05 kHz | ~11 kHz | AM radio, telephone, low-bandwidth voice |
| 44.1 kHz | ~22 kHz | CD audio (Red Book), music distribution, consumer playback |
| 48 kHz | ~24 kHz | Video production (DVD, Blu-ray, YouTube), broadcast TV |
| 96 kHz | ~48 kHz | Hi-res audio mastering, studio recording, archival |
Rule of thumb: Use 44.1 kHz for music and general audio. Use 48 kHz if the audio is for video. Use 96 kHz only for professional mastering where you need headroom for effects processing.
What is Bit Depth?
Bit depth determines the number of possible amplitude values for each audio sample. Higher bit depth means a larger dynamic range — the difference between the quietest and loudest sound the file can represent without distortion.
The formula is simple: dynamic range ≈ 6 dB × bit depth. So 16-bit audio has ~96 dB of dynamic range, 24-bit has ~144 dB, and 32-bit float has over 1500 dB (effectively unlimited).
| Bit Depth | Dynamic Range | PCM Codec | Best For |
|---|---|---|---|
| 16-bit | ~96 dB | PCM S16LE | CD quality, music playback, general distribution |
| 24-bit | ~144 dB | PCM S24LE | Studio recording, mixing, professional editing |
| 32-bit float | >1500 dB | PCM F32LE | Audio mastering, effects processing, no-clip workflows |
Key insight: 24-bit is overkill for listening (human hearing has ~120 dB range) but essential for recording and editing — the extra headroom prevents clipping during processing. 32-bit float is used by DAWs internally and can represent values above 0 dBFS without distortion.
44.1 kHz vs 48 kHz: Which Sample Rate?
This is the most common question when choosing WAV settings, and the answer depends entirely on your destination format.
| Aspect | 44.1 kHz | 48 kHz |
|---|---|---|
| Origin | CD standard (Red Book, 1980) | Video/broadcast standard (AES) |
| Frequency ceiling | ~22.05 kHz | ~24 kHz |
| File size (1 min, 16-bit stereo) | ~10.1 MB | ~11.0 MB |
| Music distribution | Standard — Spotify, Apple Music, CDs | Requires sample rate conversion |
| Video production | Requires sample rate conversion | Standard — YouTube, DVD, Blu-ray |
| Audible difference | No audible difference — both exceed human hearing range | |
Bottom line: Use 44.1 kHz for music-only projects. Use 48 kHz if the audio will be synced with video. Avoid mixing the two in one project — sample rate conversion introduces tiny (though usually inaudible) artifacts. Convertio uses the SoXr audiophile-grade resampler to minimize these artifacts.
16-bit vs 24-bit: Which Bit Depth?
For playback and distribution, 16-bit is all you need. For recording and editing, 24-bit gives you crucial headroom.
| Aspect | 16-bit | 24-bit |
|---|---|---|
| Dynamic range | 96 dB | 144 dB |
| Noise floor | −96 dBFS | −144 dBFS |
| File size per minute (44.1 kHz, stereo) | ~10.1 MB | ~15.1 MB |
| CD compatibility | Yes — Red Book standard | Requires dithering to 16-bit |
| DAW editing | Works, but limited headroom | Preferred — more room for effects |
| Recording | Risk of clipping | Standard — 48 dB extra headroom |
Bottom line: If you're converting audio for listening, sharing, or burning CDs — 16-bit is perfect. If you're editing in a DAW, recording live audio, or running effects chains — choose 24-bit to avoid clipping and preserve the full dynamic range during processing.
Stereo vs Mono: When to Use Each
Stereo (2 channels) is the standard for music and most audio. Mono (1 channel) cuts file size in half and is ideal for voice-only content.
| Use Case | Recommended | Why |
|---|---|---|
| Music | Stereo | Preserves stereo imaging and panning |
| Podcasts | Mono | Voice is centered; halves file size |
| Voiceover / narration | Mono | Single voice source; no spatial info needed |
| Video soundtrack | Stereo | Matches video player expectations |
| Phone system / IVR | Mono | Telephony systems use mono audio |
| Sound effects | Mono | Positioned in 3D by the game/app engine |
WAV File Size Reference
WAV files are uncompressed, so size is exactly predictable. The formula is:
File size = sample rate × (bit depth ÷ 8) × channels × duration + 44 bytes header
Here's how common settings compare for a 3-minute stereo file:
| Settings | Per Minute | 3 Minutes | Use Case |
|---|---|---|---|
| 22.05 kHz / 16-bit / Mono | 2.5 MB | 7.6 MB | Voice memo, IVR |
| 44.1 kHz / 16-bit / Stereo | 10.1 MB | 30.3 MB | CD quality (default) |
| 48 kHz / 16-bit / Stereo | 11.0 MB | 33.0 MB | Video production |
| 48 kHz / 24-bit / Stereo | 16.5 MB | 49.4 MB | Professional video audio |
| 96 kHz / 24-bit / Stereo | 33.0 MB | 98.9 MB | Hi-res mastering |
| 96 kHz / 32-bit float / Stereo | 43.9 MB | 131.8 MB | Maximum quality mastering |
For comparison, the same 3-minute song as MP3 (VBR V2) would be approximately 4–5 MB — about 6–7x smaller than CD-quality WAV.
Which Settings Should You Use?
Here's a quick-reference guide based on what you're doing with the audio:
| Scenario | Sample Rate | Bit Depth | Channels |
|---|---|---|---|
| General music playback | 44.1 kHz | 16-bit | Stereo |
| CD burning | 44.1 kHz | 16-bit | Stereo |
| YouTube / video editing | 48 kHz | 16-bit | Stereo |
| Podcast editing | 44.1 kHz | 16-bit | Mono |
| Music production (DAW) | 44.1 kHz | 24-bit | Stereo |
| Film / broadcast audio | 48 kHz | 24-bit | Stereo |
| Hi-res audio mastering | 96 kHz | 24-bit | Stereo |
| Voice recording / IVR | 22.05 kHz | 16-bit | Mono |
Not sure? Choose 44.1 kHz, 16-bit, Stereo. This is the CD-quality standard that works with every player, editor, and platform. It's the default setting in Convertio.com's converter above.
Converting MP3 to WAV Doesn't Improve Quality
This is the most common misconception. When you convert a 128 kbps MP3 to a 44.1 kHz / 24-bit WAV, the file gets much larger but the audio quality stays exactly the same.
MP3 compression permanently removes audio data. Converting to WAV unpacks the remaining data into an uncompressed container, but it cannot restore what was already discarded. Think of it like unzipping a photo that was already resized — you get more pixels, but they're interpolated, not recovered.
So why convert at all? Because WAV is a better working format:
- Audio editors (Audacity, Pro Tools, Logic) work natively with WAV — no decoding overhead
- Re-saving a WAV doesn't degrade quality (unlike re-encoding MP3)
- CD burning requires PCM/WAV input
- Some hardware and broadcast systems only accept uncompressed audio
If you need the highest possible quality, always start from the original lossless source (CD, FLAC, studio master) rather than a lossy MP3.