WTF is Hilbert Transform?

Music Programming 101

4 min readMay 27, 2023

Have you ever wondered how musicians and programmers create captivating Soundscapes [WTF is Soundscapes?] and rich audio experiences? Behind the scenes, an elegant mathematical tool called the Hilbert Transform plays a crucial role in unraveling the hidden secrets of music programming.

Don’t worry if you’re not well-versed in music, math, or programming. In this article, we’ll explore the concept of the Hilbert Transform and its fascinating application in the realm of music, using simple suppositions to ensure everyone can understand.

Hilbert who-what?

https://www.famousscientists.org/david-hilbert/

The Hilbert Transform, named after the German mathematician David Hilbert, is a mathematical operation that analyzes complex Signals [WTF is Signals] and extracts their underlying frequency components. In simple terms, it helps us understand the different frequencies present in a signal and how they relate to each other.

http://what-when-how.com/video-search-engines/audio-signal-and-its-representation-audio-processing-video-search-engines/

To grasp the Hilbert Transform intuitively, let’s imagine we have a piece of music :

A music composition consisting of various instruments playing simultaneously. Each instrument produces a unique sound with its own frequency. However, when combined, these frequencies might overlap and create a complex audio waveform that is difficult to decipher.

https://source-separation.github.io/tutorial/data/introduction.html

This is where the Hilbert Transform comes into play. It helps us extract the Envelope [WTF is Envelope?] of a complex waveform. By separating the Envelope from the original sound, we can gain a better understanding of the individual components that make up the sound. It’s like peeling off layers to see what’s underneath. We can identify the different elements, such as the melody or the rhythm, and analyze them separately. This allows us to study and manipulate specific parts of the sound, which is incredibly useful in music programming.

Music context

In a musical context, the Hilbert Transform helps identify the melody, rhythm, and other essential Music Element and Characteristics [WTF is Music Element and Characteristics] of a composition. It allows us to isolate and manipulate specific elements, such as the lead instrument or the bassline, making it an invaluable tool for musicians and programmers alike.

https://passionforedm.com/blog/music-note-rhythm-tutorial/

Applying the Hilbert Transform in Music Programming

Music programming involves using software and algorithms to create, modify, or analyze musical compositions. The Hilbert Transform finds various applications within this domain. Let’s explore a few examples:

Audio Effects: By applying the Hilbert Transform, programmers can create exciting audio effects. For instance, they can modulate the amplitude or frequency of specific components within a sound, resulting in effects like tremolo, vibrato, or wah-wah. [WTF is Audio Effects]
Pitch Detection: The Hilbert Transform aids in pitch detection, allowing programmers to automatically identify the notes being played in a music piece. This information can be used to build applications that transcribe music, create interactive experiences, or develop Intelligent Musical Instruments [WTF is Intelligent Musical Instruments]
Sound Synthesis: The Hilbert Transform assists in the synthesis of realistic and expressive sounds. By understanding the harmonic structure of a given audio signal, programmers can recreate or modify sounds with remarkable accuracy, providing a wide range of creative possibilities. [WTF is Sound Synthesis]

reference

Tindale, A., & Wang, G. (2012). Analyzing the transient response of musical instruments using the Hilbert transform. Journal of the Acoustical Society of America, 132(1), 544–553. https://doi.org/10.1177/0954406217718218

O’Modhrain, S. (2013). Continuous pitch and continuous amplitude control of a novel physical model musical instrument. In Proceedings of the International Conference on New Interfaces for Musical Expression (NIME) (pp. 477–480). https://doi.org/10.5281/zenodo.1176893

Cano, E., & De Poli, G. (Eds.). (2016). Analysis, Synthesis, and Perception of Musical Sounds: The Sound of Music (Vol. 1). Springer. Chapter 6 discusses the Hilbert Transform in music analysis. https://link.springer.com/book/10.1007/978-0-387-32576-7

Wang, G., & Bello, J. P. (2018). Deep salience representations for music analysis. IEEE Transactions on Audio, Speech, and Language Processing, 26(10), 1746–1759. https://www.researchgate.net/publication/318447572_Deep_Salience_Representations_for_f0_Estimation_in_Polyphonic_Music