My research interest lie in text mining and data analytics for research questions from sociolinguistics, register/ language variation as well as language change. I’m particularly interested in variation of language use considering linguistic as well as other possible variables that might be at play by using probabilistic models.
I’m currently a PostDoc in the Collaborative Research Center (SFB 1102) Information Density and Linguistic Encoding at Saarland University working in Project B1 on Information Density and Scientific Literacy in English: Synchronic and Diachronic Perspectives, where we investigate linguistic densification in the evolution of scientific writing in English (17th century to present).
I received funding from Saarland University to work on Linguistic Profiles of Social Variables in Diachrony (SLingPro). To observe possible linguistic profiles of social variables in diachrony my team and I use the Old Bailey Corpus (Huber et al., 2016) — a digital collection of spoken texts based on the proceedings of the London’s central criminal court from the 18th and 19th century, which is annotated for social variables such as age, gender, and social class.
In my PhD I focused on combining macro- and micro-analytical methods for register analysis (PhD Thesis).