Beesaretheinsectspollenfeedsnsubjobjdetacl:relclnsubj

Hi! I see you've stumbled across my little web page.

I'm a Research Associate in Natural Language Processing at the Institute for Applied Informatics in Leipzig, Germany, as part of the CORAL project. We're building privacy-preserving language models, developing methods of data obfuscation, as well as making models 'forget'. I recently released Grew-TSE, a Python package for the generation of minimal-pair tests (using treebanks) for the evaluation of language-model syntactic performance.

I hail from Dublin, Ireland. I received a BSc Computer Science with Data Science from University College Dublin in 2022 and an MA Linguistics from University Leipzig in 2025, the latter thanks to a DAAD Scholarship. I have also worked on building machine translation models for the EUComMeet project, tools for low-resource languages, and a database for the Institute of Linguistics at Leipzig University. You can find my Master's thesis here, titled "Cross-Linguistic Syntactic Evaluation of Transformers via Treebank Querying".

Feel free to send me an email: gallagher at infai dot org

Recent Publications & Tools


  • 2026
    Targeted Syntactic Evaluation of Language Models on Georgian Case Alignment
    Best Paper Nomination Award @ LoResLM Workshop, co-located with EACL 2026
    ACL Anthology · arXiv
  • 2026
    text-mallet: A Python Package to Smash Text into Obfuscated Formats
    Tool / Python Package (Work-In-Progress)
    Docs · GitHub
  • 2025
    Grew-TSE: Minimal-Pair Test Generation for Syntactic Evaluation
    Tool / Python Package
    Docs · GitHub