Dr. Jan Philip Wahle2025-12-16T13:42:04+01:00

Dr. Jan Philip Wahle

Project Leader

write email
schedule appointment

Jan Philip Wahle in the Göttingen offices doing paraphrase plagiarism and natural language processing research

SHORT BIO

Dr. Jan Philip Wahle is a faculty member at the University of Göttingen. He has been a visiting researcher at the National Research Council Canada (NRC) working with Dr. Saif M. Mohammad. Before his PhD, he worked as a software engineer for the autonomous driving company Aptiv PLC. His main research interests lie in computational linguistics and natural language processing with a focus on reasoning methods via reinforcement learning and agentic systems and AI safety via interpretability. His research has been presented at various conferences, including ACL and EMNLP, and won the ACL Best Resource Paper Award and the SemEval Best Task Award.

PROJECTS

I am open to student projects in the areas of computational linguistics and natural language processing, particularly involving reasoning methods via reinforcement learning and agentic systems and AI safety via interpretability. The slides here are examples of projects that I offer. Don’t hesitate to get in touch with me if you are interested.

SHORT CV

09/2025 – present

Project Leader & Research Fellow
Chair for Scientific Information Analytics, University of Göttingen

10/2022 – 11/2025

Scientific Staff Member
Chair for Scientific Information Analytics, University of Göttingen

09/2021 – 11/2025

Dr. rer. nat, Computer Science
Chair for Scientific Information Analytics, University of Göttingen

07/2020 – 11/2025

Lab Engineer
Chair for Data & Knowledge Engineering, University of Wuppertal

10/2018 – 10/2020

M.Sc., Computer Science (Data Analytics)
Chair for Data & Knowledge Engineering, University of Wuppertal

03/2017 – 10/2018

Junior Software Engineer
Aptiv PLC Wuppertal, Germany

10/2015 – 10/2018

B. Sc., Information Technology (Information Science)
University of Wuppertal

SELECTED PUBLICATIONS

A complete list of my publications is available here


TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent
D. Meier, J. P. Wahle, P. Röttger, T. Ruas, B. Gipp
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP)
(PDF DOI BibTeX)


SPaRC: A Spatial Pathfinding Reasoning Challenge
L.B. Kaesberg, J. P. Wahle, T. Ruas, B. Gipp
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP)
(PDF DOI BibTeX)


Paraphrase Types Elicit Prompt Engineering Capabilities
J. P. Wahle, T. Ruas, Y. Xu, B. Gipp
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP)
(PDF DOI BibTeX)


We are Who We Cite: Bridges of Influence Between Natural Language Processing and Other Academic Fields
J. P. Wahle, T. Ruas, M. Abdalla, B. Gipp, S. M. Mohammad
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP)
(PDF DOI BibTeX)


Paraphrase Types for Generation and Detection
J. P. Wahle, T. Ruas, B. Gipp
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP)
(PDF DOI BibTeX)


How Large Language Models are Transforming Machine-Paraphrase Plagiarism

J. P. Wahle, T. Ruas, F. Kirstein, B. Gipp
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP)
(PDF DOI BibTeX)


Identifying Machine-Paraphrased Plagiarism

J. P. Wahle, T. Ruas, T. Foltynek, N. Meuschke, B. Gipp
Information for a Better World: Shaping the Global Future – 17th International Conference, iConference 2022.
(PDF  DOI  BibTeX)


Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection

J. P. Wahle, T. Ruas, N. Meuschke, B. Gipp
ACM/IEEE Joint Conference on Digital Libraries, JCDL 2021, Champaign, IL, USA, September 27-30, 2021
(PDF  DOI  BibTeX)


D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research

J. P. Wahle, T. Ruas, Saif M. Mohammad. Meuschke, B. Gipp
Proceedings of The 13th Language Resources and Evaluation Conference, LREC 2022, Marseille, France, June 20-25, 2022
(PDF  DOI  BibTeX)

MEDIA COVERAGE

Lower Saxony Ministry of Science and Culture Funds Göttingen Project “EDIKILEX” with €4.5 Million

How can the literary treasures found in texts from the Early New High German period (ca. 1350–1650) be made available and useful for future research in all their breadth and depth? The interdisciplinary joint project EDIKILEX – Edition, Artificial Intelligence, [...]

GippLab wins ACL Best Paper Awards

We're excited to share that our work on multilingual resources has won two major awards at ACL 2025: Best Resource Paper and Best SemEval Task!  This project was the most popular task ever hosted on Codabench, with over 700 participants, [...]

Go to Top