Exploring Medical Records with Machine Learning and Natural Language Processing

April 3, 2018

1:00 pm

We will present an ongoing project at Priberam that covers aspects that go from the definition of ontologies, corpus gathering and annotation criteria to the concrete annotation task, and finally the development of an engine that automatically extracts, labels and enables the exploration of textual data obtained from medical records. We will report on the resulting nested named-entity recognizer based on stack-LSTMs that achieved state-of-the-art results in standard evaluation datasets.

Pedro Balage and Pedro Mendes

Pedro Balage holds a PhD in computer science (2017) from University of São Paulo, with background in computational linguistics (MA) and computer science (BSc). He has been working in the field of Natural Language Processing for about 8 years, of which 2 years as research scientist at Priberam. Pedro Mendes started his lexicographer career 18 years ago in Academia das Ciências de Lisboa and has been working in the field of computational linguistics for the last 15 years as part of the linguistics department in Priberam.Priberam

Seminários

Últimos seminários

Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
June 17, 2025
Large language models (LLMs) have emerged as strong contenders in machine translation. Yet, they often fall behind specialized neural machine…
Speech as a Biomarker for Disease Detection
May 20, 2025
Today’s overburdened health systems face numerous challenges, exacerbated by an aging population. Speech emerges as a ubiquitous biomarker with strong…
Enhancing Uncertainty Estimation in Neural Networks
May 6, 2025
Neural networks are often overconfident about their predictions, which undermines their reliability and trustworthiness. In this presentation, I will present…
Improving Evaluation Metrics for Vision-and-Language Models
April 22, 2025
Evaluating image captions is essential for ensuring both linguistic fluency and accurate semantic alignment with visual content. While reference-free metrics…

Exploring Medical Records with Machine Learning and Natural Language Processing

Pedro Balage and Pedro Mendes

Seminários

Últimos seminários

Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding

Speech as a Biomarker for Disease Detection

Enhancing Uncertainty Estimation in Neural Networks

Improving Evaluation Metrics for Vision-and-Language Models