Speech Translation: Modeling and Conversion of speaking style across languages

March 12, 2013

1:00 pm

In this talk I will describe our recent efforts within the PT-STAR project on speech translation across languages. I will begin with brief descriptions of the component systems for speech recognition, machine translation, and speech synthesis, and then talk in greater detail about modeling and converting the prosodic aspects of speech across languages, which form a major part of speaking style. Using example demos for English<->Portuguese translation, I will comment on the bottlenecks in current speech translation technology and list some future challenges that may be of interest to the ML/Speech/NLP research community.

Gopala Krishna Anumanchipalli

Gopala is a PhD candidate in the CMU|Portugal program, jointly advised by Prof. Alan Black at LTI/CMU and Prof. Luis Oliveira at INESC-ID/IST. His interests span all aspects of speech and language processing, and his PhD thesis is on prosody modeling for speech synthesis and voice conversion within and across languages. He holds a Bachelor's in Engineering (CS/AI) and a Master's in Science (CS), both from IIIT-Hyderabad, India.

LTI, CMU and L2F, INESC-ID

Seminars

Latest seminars

Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
June 17, 2025
Large language models (LLMs) have emerged as strong contenders in machine translation. Yet, they often fall behind specialized neural machine…
Speech as a Biomarker for Disease Detection
May 20, 2025
Today’s overburdened health systems face numerous challenges, exacerbated by an aging population. Speech emerges as a ubiquitous biomarker with strong…
Enhancing Uncertainty Estimation in Neural Networks
May 6, 2025
Neural networks are often overconfident about their predictions, which undermines their reliability and trustworthiness. In this presentation, I will present…
Improving Evaluation Metrics for Vision-and-Language Models
April 22, 2025
Evaluating image captions is essential for ensuring both linguistic fluency and accurate semantic alignment with visual content. While reference-free metrics…
