Publications

You can also find my articles on my Google Scholar profile.

* indicates equal contribution.

Models That Know How Evaluations Are Designed Score Safer

Katharina Deckenbach*, Haritz Puerto*, Jonas Geiping, Sahar Abdelnabi

arXiv preprint 2026

AI EvaluationsAI Safety

Controllable Reasoning Models are Private Thinkers

Haritz Puerto, Haonan Li, Xudong Han, Timothy Baldwin, Iryna Gurevych

arXiv preprint 2026

ReasoningTrustworthy AI

C-SEO Bench: Does Conversational SEO Work?

Haritz Puerto, Martin Gubri, Tommaso Green, Seong Joon Oh, Sangdoo Yun

NeurIPS D&B 2025

Trustworthy AIAI Evaluations

Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers

Tommaso Green, Martin Gubri, Haritz Puerto, Seong Joon Oh, Sangdoo Yun

EMNLP 2025

ReasoningTrustworthy AI

Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs

Haritz Puerto, Tilek Chubakov, Xiaodan Zhu, Harish Tayyar Madabushi, Iryna Gurevych

ACL 2025

Reasoning

Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models

Haritz Puerto, Martin Gubri, Sangdoo Yun, Seong Joon Oh

Findings of NAACL 2025

Trustworthy AIAI Evaluations

Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs

Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych

EMNLP 2024

Reasoning

UKP-SQuARE: An Interactive Tool for Teaching Question Answering

Haishuo Fang, Haritz Puerto, Iryna Gurevych

BEA Workshop @ ACL 2023

Question AnsweringAgents

UKP-SQuARE v3: A Platform for Multi-Agent QA Research

Haritz Puerto, Tim Baumgärtner, Rachneet Sachdeva, Haishuo Fang, Hao Zhang, Sewin Tariverdian, Kexin Wang, Iryna Gurevych

ACL 2023 Demo Track

Question AnsweringAgents

Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research

Ji-Ung Lee, Haritz Puerto, Betty van Aken, Yuki Arase, Jessica Zosa Forde, Leon Derczynski, Andreas Rücklé, Iryna Gurevych, Roy Schwartz, Emma Strubell, Jesse Dodge

arXiv preprint 2023

Trustworthy AI

MetaQA: Combining Expert Agents for Multi-Skill Question Answering

Haritz Puerto, Gözde Gül Şahin, Iryna Gurevych

EACL 2023

Question AnsweringAgents

UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA

Rachneet Sachdeva*, Haritz Puerto*, Tim Baumgärtner, Sewin Tariverdian, Hao Zhang, Kexin Wang, Hossain Shaikh Saadi, Leonardo F. R. Ribeiro, Iryna Gurevych

AACL 2022 Demo Track

Question AnsweringTrustworthy AIAgents

UKP-SQUARE: An Online Platform for Question Answering Research

Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych

ACL 2022 Demo Track

Question AnsweringAgents

Analysis of the Semantic Answer Types to Understand the Limitations of MRQA Models

Doyeon Lim*, Haritz Puerto*, Sung-Hyon Myaeng

Journal of KIISE, 2020

Question Answering

Regularization of Distinct Strategies for Unsupervised Question Generation

Junmo Kang*, Giwon Hong*, Haritz Puerto*, Sung-Hyon Myaeng

Findings of EMNLP, 2020

Question Answering

Let Me Know What to Ask: Interrogative-Word-Aware Question Generation

Junmo Kang*, Haritz Puerto*, Sung-Hyon Myaeng

MRQA @ EMNLP, 2019

Question Answering

Analysis of Answer Type Application Ability of State-of-the-Art Reading Comprehension Models for Question Answering Task

Haritz Puerto*, Doyeon Lim*, Sung-Hyon Myaeng

Korean Computer Congress, 2019 (Best Paper Award)

Question Answering