In this talk I will describe an approach for automatic extraction of global and local patterns of pitch(F0) contours taking into account the overall trends of these phenomena in the presented data. We propose an iterative algorithm to optimally extract these components to minimize the reconstruction error of the F0 contour. Furthermore, we present a constraint specification strategy to incorporate known constraints on these phenomena to converge on better realizations of the components (like the Phrase and Accent commands of the physiologically motivated Fujisaki Model of F0). The extracted components are shown to be correlated to established theoritical notions of declination, metrical feet and accent tones.
An Iterative, Constrained Approach for Pitch Component Extraction
November 30, 2010
1:00 pm
Gopala Anumanchipalli
Gopala is a PhD student in the LTI, Carnegie Mellon University and INESC-ID Lisboa, IST. He is advised by Dr. Alan W Black and Dr. Luis Oliveira. He is currently at INESC-ID. He is interested broadly in everything to do with language, but specifically works on building models and transformation approaches for prosody in Speech synthesis. He is working in the PT-Star project aiming to do Speech-to-Speech machine translation of video lectures.INESC-ID, LTI/CMUSeminários
Últimos seminários
Cost-Sensitive Learning to Defer to Multiple Experts
March 2, 2026Large language models (LLMs) have emerged as strong contenders in machine translation. Yet, they often fall behind specialized neural machine…
Fair Federated Learning under Group-Specific Distributed Concept Drift
February 24, 2026Machine learning models can become unfair when different groups experience changes in data over time, a phenomenon called group-specific concept…
Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
June 17, 2025Large language models (LLMs) have emerged as strong contenders in machine translation. Yet, they often fall behind specialized neural machine…
Speech as a Biomarker for Disease Detection
May 20, 2025Today’s overburdened health systems face numerous challenges, exacerbated by an aging population. Speech emerges as a ubiquitous biomarker with strong…

