Models That Know How Evaluations Are Designed Score Safer
Katharina Deckenbach*, Haritz Puerto*, Jonas Geiping, Sahar Abdelnabi
arXiv preprint 2026
AI Evaluations, AI Safety
Controllable Reasoning Models are Private Thinkers
Haritz Puerto, Haonan Li, Xudong Han, Timothy Baldwin, Iryna Gurevych
arXiv preprint 2026
Reasoning, Trustworthy AI
C-SEO Bench: Does Conversational SEO Work?
Haritz Puerto, Martin Gubri, Tommaso Green, Seong Joon Oh, Sangdoo Yun
NeurIPS D&B 2025
Trustworthy AI, AI Evaluations
Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers
Tommaso Green, Martin Gubri, Haritz Puerto, Seong Joon Oh, Sangdoo Yun
EMNLP 2025
Reasoning, Trustworthy AI
Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs
Haritz Puerto, Tilek Chubakov, Xiaodan Zhu, Harish Tayyar Madabushi, Iryna Gurevych
ACL 2025
Reasoning
Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models
Haritz Puerto, Martin Gubri, Sangdoo Yun, Seong Joon Oh
Findings of NAACL 2025
Trustworthy AI, AI Evaluations
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych
EMNLP 2024
Reasoning
UKP-SQuARE: An Interactive Tool for Teaching Question Answering
Haishuo Fang, Haritz Puerto, Iryna Gurevych
BEA Workshop @ ACL 2023
Question Answering, Agents
UKP-SQuARE v3: A Platform for Multi-Agent QA Research
Haritz Puerto, Tim Baumgärtner, Rachneet Sachdeva, Haishuo Fang, Hao Zhang, Sewin Tariverdian, Kexin Wang, Iryna Gurevych
ACL 2023 Demo Track
Question Answering, Agents
Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research
Ji-Ung Lee, Haritz Puerto, Betty van Aken, Yuki Arase, Jessica Zosa Forde, Leon Derczynski, Andreas Rücklé, Iryna Gurevych, Roy Schwartz, Emma Strubell, Jesse Dodge
arXiv preprint 2023
Trustworthy AI
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
Haritz Puerto, Gözde Gül Şahin, Iryna Gurevych
EACL 2023
Question Answering, Agents
UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA
Rachneet Sachdeva*, Haritz Puerto*, Tim Baumgärtner, Sewin Tariverdian, Hao Zhang, Kexin Wang, Hossain Shaikh Saadi, Leonardo F. R. Ribeiro, Iryna Gurevych
AACL 2022 Demo Track
Question Answering, Trustworthy AI, Agents
UKP-SQUARE: An Online Platform for Question Answering Research
Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych
ACL 2022 Demo Track
Question Answering, Agents
Analysis of the Semantic Answer Types to Understand the Limitations of MRQA Models
Doyeon Lim*, Haritz Puerto*, Sung-Hyon Myaeng
Journal of KIISE 2020
Question Answering
Regularization of Distinct Strategies for Unsupervised Question Generation
Junmo Kang*, Giwon Hong*, Haritz Puerto*, Sung-Hyon Myaeng
Findings of EMNLP 2020
Question Answering
Let Me Know What to Ask: Interrogative-Word-Aware Question Generation
Junmo Kang*, Haritz Puerto*, Sung-Hyon Myaeng
MRQA @ EMNLP 2019
Question Answering