Convertio.com

SoXr Resampler: Audiophile-Grade Sample Rate Conversion for WAV

Learn how Convertio.com uses the SoXr resampler library with 28-bit precision and Shibata dithering to deliver artifact-free WAV conversions — the same engine trusted by audiophile tools like SoX, foobar2000, and JRiver.

Convert to WAV with SoXr

SoXr resampling is applied automatically to every conversion

Audio WAV

Tap to choose your file

or

Supports M4A, WAV, FLAC, OGG, AAC, WMA, AIFF, OPUS • Max 100 MB

SoXr resampling applied automatically. Files auto-deleted within 2 hours.

What is Audio Resampling?

When you convert audio from one sample rate to another (e.g. a 44.1 kHz MP3 → 48 kHz WAV for video editing), every single sample must be recalculated on a new time grid. This process is called resampling.

A naïve approach — simply dropping or duplicating samples — creates audible clicks and aliasing. Professional resamplers use mathematical interpolation (typically polyphase FIR filters) to reconstruct a continuous signal from discrete samples, then re-sample it at the new rate. The quality of this interpolation determines whether your audio stays transparent or picks up artifacts.

Key concept: According to the Nyquist-Shannon theorem, any band-limited signal sampled above twice its highest frequency can be perfectly reconstructed. Resampling leverages this theorem — a high-quality resampler can change rates with zero audible degradation.

What is SoXr?

The SoXr (SoX Resampler Library) is an open-source, audiophile-grade resampling engine originally developed for the SoX (Sound eXchange) command-line audio tool. It uses an FFT-based polyphase algorithm that produces results virtually indistinguishable from the original signal.

SoXr is used by professional audio software including foobar2000, JRiver Media Center, MPV, and VLC. Convertio.com integrates SoXr through FFmpeg's aresample filter, applying it to every WAV conversion automatically.

Parameter Value What It Does
EngineSoXr (CR64)64-bit double-precision floating-point computation
Precision28-bit~168 dB signal-to-noise ratio — far beyond audible noise floor
DitheringShibataPsychoacoustically-shaped noise that pushes quantization artifacts away from the 1–5 kHz hearing sensitivity peak
Anti-aliasingAutomaticSteep low-pass filter prevents aliasing when downsampling

SoXr vs FFmpeg's Default Resampler

FFmpeg includes two resampling backends: the default swresample (SWR) and the optional soxr. Here's how they compare:

Aspect swresample (default) SoXr
AlgorithmKaiser-windowed sinc (linear phase)FFT-based oversampled polyphase
Internal precision16-bit (default) or 32-bit float64-bit double (CR64 engine)
Aliasing rejectionGood (−100 dB typical)Excellent (−168 dB with precision=28)
DitheringTriangular (flat spectrum)Shibata (noise-shaped, less audible)
SpeedFasterSlightly slower (~10–15% more CPU)
Passband rippleMeasurable near NyquistNegligible
Best forReal-time streaming, video playbackMastering, archival, distribution

Bottom line: swresample is optimized for speed and is perfectly fine for real-time playback. SoXr is optimized for quality and is the right choice when you're producing a file that will be kept, distributed, or edited further — exactly what a converter does.

Shibata Dithering Explained

When audio is converted between bit depths (e.g. 32-bit float internal processing → 16-bit WAV output), rounding errors create quantization noise. Dithering adds a tiny amount of noise before rounding to eliminate the more unpleasant distortion patterns.

Not all dithering is equal. Standard triangular dithering (TPDF) distributes noise evenly across the frequency spectrum. Shibata dithering uses psychoacoustic noise shaping to push that noise into frequency ranges where human hearing is least sensitive:

Dither Type Noise Distribution Audibility
None (truncation)No noise addedWorst — audible harmonic distortion
Rectangular (RPDF)Flat, randomRemoves distortion, flat noise floor
Triangular (TPDF)Flat, uncorrelatedBetter — no modulation noise
Shibata (noise-shaped)Shifted away from 1–5 kHzLeast audible — exploits hearing curve

Why it matters: Human hearing is most sensitive between 1–5 kHz (the Fletcher-Munson curve). Shibata dithering pushes quantization noise into the less sensitive high-frequency region above 10 kHz, making it effectively inaudible even on high-end monitoring equipment.

When Does Resampling Happen?

SoXr is applied automatically to every WAV conversion on Convertio.com, but its impact is most significant in these scenarios:

Scenario Example SoXr Impact
Downsampling hi-res96 kHz FLAC → 44.1 kHz WAVCritical — prevents aliasing artifacts
Music → video rate44.1 kHz MP3 → 48 kHz WAVImportant — clean rate conversion
Voice downsampling48 kHz podcast → 22.05 kHz WAVImportant — preserves speech clarity
Same rate conversion44.1 kHz MP3 → 44.1 kHz WAVMinimal — dithering still applies for bit-depth changes

The biggest quality difference is during downsampling — when the target rate is lower than the source. Without proper anti-aliasing (which SoXr handles automatically), frequencies above the new Nyquist limit fold back into the audible range as distortion.

28-bit Precision: What It Means

SoXr's precision=28 parameter sets the internal computation to 28 effective bits using the CR64 (constant-rate, 64-bit) engine. This translates to approximately 168 dB of signal-to-noise ratio.

For context:

  • 16-bit audio has ~96 dB dynamic range
  • 24-bit audio has ~144 dB dynamic range
  • SoXr at precision=28 computes at ~168 dB — 24 dB below the noise floor of even 24-bit audio

This means the resampling process itself introduces zero audible noise, even for 24-bit masters. The resampler's internal computation is quieter than the quietest sound any real-world recording can capture.

Why not precision=32? Higher precision values increase CPU time with diminishing returns. At precision=28, SoXr already operates 24 dB below 24-bit audio's noise floor — increasing further would be inaudible and impractical. This is the sweet spot used by most professional audio tools.

How Convertio Uses SoXr

Every WAV conversion on Convertio.com runs through this pipeline:

  1. Upload — your audio file is received over HTTPS
  2. Decode — FFmpeg reads the source format (MP3, FLAC, M4A, OGG, etc.)
  3. Resample — SoXr converts to your chosen sample rate and bit depth
  4. Dither — Shibata noise shaping is applied during bit-depth conversion
  5. Encode — clean PCM samples are written to the WAV container
  6. Download — your WAV file is ready

The entire process is automatic. You just pick your target settings (sample rate, bit depth, channels) and Convertio handles the rest using SoXr under the hood. No configuration required, no "quality mode" toggle — every conversion gets the same studio-grade resampling.

Ready to Convert?

Convert your audio to WAV with SoXr resampling

Audio WAV

Tap to choose your file

or

Supports M4A, WAV, FLAC, OGG, AAC, WMA, AIFF, OPUS • Max 100 MB

Frequently Asked Questions

SoXr (SoX Resampler Library) is an audiophile-grade resampling engine that uses FFT-based polyphase algorithms at 28-bit precision. FFmpeg's default swresample uses a simpler Kaiser-windowed sinc filter that can introduce measurable aliasing near the Nyquist frequency. SoXr eliminates these artifacts entirely while using Shibata dithering for psychoacoustically-optimized noise shaping.

Shibata dithering is a psychoacoustically-optimized noise shaping method that pushes quantization noise away from the 1–5 kHz range where human hearing is most sensitive (the Fletcher-Munson curve). The result is dither noise that is less perceptible than standard triangular or rectangular dithering, even though the total noise energy is similar.

For most casual listening, the difference is inaudible. SoXr matters most when downsampling hi-res audio (e.g. 96 kHz to 44.1 kHz) where aliasing from a lesser resampler could become audible on high-end monitoring equipment. It also ensures bit-perfect transparency for professional mastering workflows where cumulative processing errors matter.

No. SoXr is applied automatically to every WAV conversion on Convertio.com. Simply upload your file, choose your WAV settings (sample rate, bit depth, channels), and the SoXr resampler handles the rest. No special toggle or configuration needed.

Resampling occurs whenever the target sample rate differs from the source — for example, converting a 44.1 kHz MP3 to 48 kHz WAV for video, or downsampling a 96 kHz FLAC to 44.1 kHz for CD burning. Even when sample rates match, SoXr's Shibata dithering ensures clean bit-depth transitions (e.g. internal 32-bit float → 16-bit output).

More MP3 to WAV Guides

WAV Sample Rate & Bit Depth Explained
44.1 vs 48 kHz, 16-bit vs 24-bit, stereo vs mono — which WAV settings should you use?
Back to MP3 to WAV Converter