We present a neural network model that computes embeddings of words using recurrent network based on long short-term memories to read in characters. As an alternative to word lookup tables that require a set of parameters for every word type in the vocabulary, our models only require a look up table for characters and a fixed number of parameters for the compositional model, independent of the vocabulary size. As a consequence, our model uses fewer parameters and is also sensitive to lexical content, such as morphology, making it more adequate for tasks where morphological information is required. In part-of-speech tagging, we can perform competitively with state-of-the-art systems, without explicitly engineering lexical features, and using a relatively small number of parameters.
NLP with characters
April 28, 2015
1:00 pm
Wang Ling
Wang Ling is a student of the dual Ph.D. program in Computer Science between Carnegie Mellon University and Instituto Superior Técnico, where he also received his master degree in 2009. His Ph.D. work focuses on Machine Translation in noisy domains, such as Twitter, and Deep Learning for NLP.-Seminários
Últimos seminários
Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
June 17, 2025Large language models (LLMs) have emerged as strong contenders in machine translation. Yet, they often fall behind specialized neural machine…
Speech as a Biomarker for Disease Detection
May 20, 2025Today’s overburdened health systems face numerous challenges, exacerbated by an aging population. Speech emerges as a ubiquitous biomarker with strong…
Enhancing Uncertainty Estimation in Neural Networks
May 6, 2025Neural networks are often overconfident about their predictions, which undermines their reliability and trustworthiness. In this presentation, I will present…
Improving Evaluation Metrics for Vision-and-Language Models
April 22, 2025Evaluating image captions is essential for ensuring both linguistic fluency and accurate semantic alignment with visual content. While reference-free metrics…



