In this talk I will present an overview of some of the past and current lines of research in reinforcement learning (RL), as well as some of the challenges that research in this area has faced in the last decades. I will describe a range of recent results that may bring significant advances on some of these fundamental research challenges, and yet rely on the “simplest” optimization approach – gradient search. The ultimate goal of this talk is to provide a high-level perception of RL while hint on current active avenues of research in this area.
Gradient Approaches to Reinforcement Learning
May 25, 2010
1:00 pm
Francisco Melo
Francisco S. Melo received his PhD in Electrical and Computer Engineering at Instituto Superior Técnico, in Lisbon, Portugal. During 2007 he held an appointment as a short-term researcher in the Computer Vision Lab, at the Institute for Systems and Robotics (Lisbon, Portugal) and in 2008 he joined the Computer Science Department of Carnegie Mellon University as a Post-Doctoral Fellow. Since June 2009 he is a Researcher at the Intelligent Agents and Synthetic Characters Group of INESC-ID, where he develops research within reinforcement learning, planning under uncertainty, multiagent and multi-robot systems, developmental robotics, and sensor networks.INESC-IDSeminários
Últimos seminários
Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
June 17, 2025Large language models (LLMs) have emerged as strong contenders in machine translation. Yet, they often fall behind specialized neural machine…
Speech as a Biomarker for Disease Detection
May 20, 2025Today’s overburdened health systems face numerous challenges, exacerbated by an aging population. Speech emerges as a ubiquitous biomarker with strong…
Enhancing Uncertainty Estimation in Neural Networks
May 6, 2025Neural networks are often overconfident about their predictions, which undermines their reliability and trustworthiness. In this presentation, I will present…
Improving Evaluation Metrics for Vision-and-Language Models
April 22, 2025Evaluating image captions is essential for ensuring both linguistic fluency and accurate semantic alignment with visual content. While reference-free metrics…