GaMaDHaNi: Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani Classical Music

5.1 Primed Generation

Note: All audio samples, both ground truth and generated, are resynthesized by passing their pitch contours through our Spectrogram Generator and vocoder. All generated samples are produced by the autoregressive variant of GaMaDHaNi.

1. Sample with fast movement in a high voice

Our proposed method continues with a melodic idea similar to that of the input prime.

Pitch contours of the Input Prime, Ground Truth Continuation, and Generated Continuation, plotted on a log-frequency scale. The y-axis is normalized to the tonic frequency of the input prime, i.e., 0 corresponds to the tonic.
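The tonic normalization described in the caption can be illustrated with a short sketch. It is not the authors' plotting code: the function name `hz_to_tonic_relative`, the choice of cents (1200 units per octave) as the log unit, and the convention of marking unvoiced frames with a non-positive frequency are all assumptions made for this example.

```python
import math

def hz_to_tonic_relative(f0_hz, tonic_hz, units_per_octave=1200.0):
    """Map pitch values in Hz to a log scale on which the tonic sits at 0.

    Hypothetical helper: the unit (cents by default) and the treatment of
    unvoiced frames are assumptions, not taken from the paper.
    """
    if tonic_hz <= 0:
        raise ValueError("tonic_hz must be positive")
    relative = []
    for f in f0_hz:
        if f is None or f <= 0:
            # Assumed convention: non-positive values mark unvoiced frames.
            relative.append(None)
        else:
            # log2 of the ratio to the tonic: 0 at the tonic, +1 octave above.
            relative.append(units_per_octave * math.log2(f / tonic_hz))
    return relative

# Example: a tonic of 220 Hz, with frames at the tonic, one octave above,
# one octave below, and one unvoiced frame.
contour = hz_to_tonic_relative([220.0, 440.0, 110.0, 0.0], tonic_hz=220.0)
```

Under this normalization, contours from singers with different tonics share a common y-axis, since only the interval from the tonic is plotted.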

2. Sample with a slow movement in a low voice

Pitch contours of the Input Prime, Ground Truth Continuation, and Generated Continuation, plotted on a log-frequency scale. The y-axis is normalized to the tonic frequency of the input prime, i.e., 0 corresponds to the tonic.
