I am a Staff Research Scientist at Meta Superintelligence Labs, Llama Core Team, where I work on pre-training data curation for large language models. I have contributed to major language model releases including Llama 4, Llama 3, Llama 2, and OPT.
I completed my PhD in Computational Linguistics at the Natural Language Processing Group at Heidelberg University, under the supervision of Prof. Dr. Anette Frank.