We are interested in many areas at the intersection of sophisticated linguistic
analysis and modern algorithms for speech recognition and synthesis.
Recent work includes CRF-based acoustic models for speech recognition,
prosody (prediction of pitch accents from text, and detection of pitch accents from speech),
disfluencies, and linguistic error analysis. Earlier work focused on pronunciation modeling and on
syntactically and semantically enriched language models.
Jiahong Yuan, now Assistant Professor, Linguistics, University of Pennsylvania
Below is a selection of publications in speech recognition and synthesis.
Sharon Goldwater, Dan Jurafsky, and Christopher D. Manning. 2010.
Which words are hard to recognize? Prosodic, lexical, and disfluency factors that increase speech recognition error rates. Speech Communication 52, 181-200.
Yun-Hsuan Sung and Dan Jurafsky. 2009.
Hidden Conditional Random Fields for Phone Recognition.
ASRU 2009.