Large-scale models are trained on massive amounts of data, yet the secrecy surrounding training datasets makes it difficult to determine whether specific content was included. In this talk, I introduce two novel approaches for addressing this challenge in the context of large language and vision-language models.
First, I present DE-COP, a method designed to detect whether copyrighted text has been included in a language model’s training data. By leveraging multiple-choice questions that contrast verbatim text with its paraphrases, DE-COP effectively exposes memorization, significantly outperforming prior methods. Unlike most existing training data detectors, it does not rely on access to token probabilities, making it fully applicable to black-box models.
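To make the probe concrete, here is a minimal sketch of a DE-COP-style multiple-choice trial. Everything here is illustrative: the black-box `query_model` callable, the prompt wording, the four-option format, and the answer parsing are assumptions for exposition, not the paper's exact protocol.

```python
import random
from typing import Callable, Sequence

def decop_trial(query_model: Callable[[str], str],
                verbatim: str,
                paraphrases: Sequence[str],
                seed: int = 0) -> bool:
    """One DE-COP-style multiple-choice probe (illustrative sketch).

    Shuffles the verbatim passage among its paraphrases, asks the model
    which option is an exact quote, and returns True if the model picks
    the verbatim one. Only the generated answer text is needed, so this
    works on black-box models without token probabilities.
    """
    options = [verbatim, *paraphrases]
    random.Random(seed).shuffle(options)
    letters = "ABCD"[: len(options)]
    prompt = (
        "Which of the following passages is an exact quote from the book?\n"
        + "\n".join(f"{l}. {o}" for l, o in zip(letters, options))
        + "\nAnswer with a single letter."
    )
    answer = query_model(prompt).strip().upper()[:1]
    if answer not in list(letters):  # empty or malformed reply counts as a miss
        return False
    return options[letters.index(answer)] == verbatim

def detection_rate(query_model: Callable[[str], str],
                   trials: Sequence[tuple[str, Sequence[str]]]) -> float:
    """Fraction of trials where the verbatim option is chosen. A rate well
    above chance (1/k for k options) is evidence of memorization."""
    hits = sum(decop_trial(query_model, v, ps, seed=i)
               for i, (v, ps) in enumerate(trials))
    return hits / len(trials)
```

In practice, this would be repeated over many passages from a suspect work, with text the model cannot have seen (e.g., published after its training cutoff) serving as a chance-level baseline.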
Second, I extend this investigation to vision-language models with DIS-CO, a new approach for identifying copyrighted visual content in training data. DIS-CO queries models with frames from movies and evaluates whether they can correctly guess the corresponding titles in free-form text generation. Using our MovieTection benchmark, built from 14,000 frames across various films, we find that many popular VLMs display clear signs of memorization, raising broader concerns about AI training practices and copyright compliance.
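A minimal sketch of the DIS-CO-style evaluation loop follows. The `query_vlm` callable (taking an image path and a prompt), the prompt wording, and the exact-match scoring are assumptions for illustration; the benchmark's actual title-matching may be more forgiving.

```python
from typing import Callable

def disco_frame_accuracy(query_vlm: Callable[[str, str], str],
                         frames: list[tuple[str, str]]) -> float:
    """DIS-CO-style probe (illustrative sketch): show each movie frame,
    ask for the title in free-form generation, and score matches.

    `frames` holds (image_path, true_title) pairs. Exact case-insensitive
    matching is an assumption here, chosen for simplicity. A high hit rate
    on frames that occur only in the film itself suggests the movie was
    part of the model's training data.
    """
    prompt = "What movie is this frame from? Answer with the title only."
    hits = 0
    for image_path, true_title in frames:
        guess = query_vlm(image_path, prompt)
        if guess.strip().lower() == true_title.strip().lower():
            hits += 1
    return hits / len(frames)
```

Because the model must produce the title unprompted rather than rank candidates, the test needs no access to logits and applies to fully black-box VLM APIs, mirroring the design of DE-COP.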