How Pitch-Preserving Speed Change Works
Changing audio speed without affecting pitch requires a time-stretching algorithm. Convertio uses WSOLA (Waveform Similarity Overlap-Add), which divides the audio into small overlapping segments, repositions them in time, and crossfades at the new positions. The result is a tempo change with the original pitch intact.
The algorithm works best within the 0.5x–2.0x range. Within this window, the crossfade regions align naturally, producing smooth output with minimal artifacts. Beyond 2.0x, gaps between segments become audible; below 0.5x, excessive overlap can introduce a chorus-like effect.
Speed Settings Guide
Choose a speed multiplier based on your intended use:
| Speed | Duration Change | Use Case |
|---|---|---|
| 0.5x | 2× longer | Transcribing fast solos, detailed audio analysis |
| 0.75x | 1.33× longer | Learning moderate-tempo passages, language study |
| 1.0x | No change | Original speed (format conversion only) |
| 1.25x | 20% shorter | Faster podcast listening, lecture review |
| 1.5x | 33% shorter | Speed-listening audiobooks, spoken content |
| 2.0x | 50% shorter | Quick review, scanning long recordings |
FLAC Speed Change: Lossless Advantage
FLAC files preserve the complete original recording with zero compression loss. This matters for speed processing because the WSOLA algorithm works with bit-perfect input — no compression artifacts to amplify, no frequency gaps to expose. 24-bit FLAC files have 48 dB more dynamic range than 16-bit, giving the algorithm even more clean data to work with.
DJs with lossless collections use speed change for BPM matching between tracks. A 128 BPM track adjusted to 130 BPM is only a 1.6% change — completely transparent. For musicians, slowing a FLAC recording to 0.5x reveals every nuance of fast passages, making it the ideal format for transcription work.
Quality note: The quality advantage of FLAC is most noticeable at extreme speed changes. At 0.5x, compressed sources may reveal warbling artifacts that are absent with lossless input.