ARC@ORU Misleading Large Language Models used (or misused) in Scientific Peer-Reviewing via Hidden Prompt-Injection Attacks

24 juni 2026 09:00 – 10:00 Visual Lab, ARC

ARC Signalbild

ARC bjuder in till ett forskningsseminarium med Giovanni Apruzzese, universitetslektor i datavetenskap vid Reykjavik University.

About the talk

Large Language Models (LLMs) have revolutionized many aspects of our society. Many tasks encompassing document summarization or autonomous content generation can now benefit from the capabilities of LLMs. Among these, a domain in which LLMs are receiving incresing attention is that of scientific peer reviewing. Yet, usage of LLMs in this context must be done with due care: LLMs have certain blind spots which, if exploited, can lead to detrimental effects to the human requesting the service of an LLM.

In this talk, I will outline the reasons why the author of a scientific paper may want to mislead an LLM tasked to review a given paper. Based on these reasons, I will then explain ways in which one can reach their goal via "hidden prompt injections". Finally, I will discuss the results of a large-scale systematic analysis wherein we studied the impact of prompt-injection attacks against commercial LLMs (e.g., ChatGPT, Gemini). In doing so, I will also outline potential countermeasures---as well as counter-countermeasures. The takeaway is that blind reliance on LLMs for peer-review duties is strongly discouraged, and human oversight is still necessary.

Speaker

Giovanni Apruzzese is an Assistant Professor within the Department of Computer Science at Reykjavik University, Iceland; and also within the Hilti Chair of Data and Application Security at the University of Liechtenstein. He obtained the PhD in Information and Communication Technologies at the University of Modena and Reggio Emilia (Italy) in 2020. He authored over 50 peer-reviewed papers at internationally-recognized research venues. His research interests encompass a variety of themes, most of which revolve around cybersecurity and artificial intelligence, but he also appreciates topics within human-computer interaction.

His primary expertise lies in network security and in phishing detection. Giovanni also puts a lot of effort in servicing the scientific community, and he was awarded numerous recognitions for his reviewing duties in leading computer-science venues. Due to this, he has also recently engaged in researching the usage of large-language models for scientific peer reviewing.

Registration for ARC@ORU: Misleading Large Language Models used (or misused) in Scientific Peer-Reviewing via Hidden Prompt-Injection Attacks

I am participating:

GDPR

By submitting, I consent to Örebro University processing the personal data I have entered in the registration form, for the purpose of event administration and for as long as the event is being administered. You may request that any data provided be changed or deleted by contacting carolina.wittenfelt@oru.se.

More information on how Örebro University handles personal data can be found on the Processing of personal data page at Örebro University.

I understand and give my consent to Örebro University processing my personal data in line with the purpose of the event and in accordance with the data protection legislation in force.