I am a first-year PhD student in the CoAStaL natural language processing research group led by Anders Søgaard at the University of Copenhagen. My main areas of interest are natural language processing and machine learning, particularly in multilingual, cross-lingual, multi-modal (language & vision), and low-resource settings. I am funded by a grant from the Novo Nordisk Foundation. As part of the funded project, I develop robust recognition models to digitize historical handwritten records that will be used to establish a multigenerational registry of Danish family relationships.
Previously, I completed my master’s in computer science at the Technical University of Darmstadt, where I wrote my thesis under the supervision of Jonas Pfeiffer and Iryna Gurevych. The results of my thesis are summarized in the paper “How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models”.