My research interests lie in text mining and data analytics for research questions from sociolinguistics, register and language variation, and language change. I’m particularly interested in using probabilistic models to study variation in language use, taking into account linguistic as well as other variables that might be at play.
I’m currently a PostDoc in the Collaborative Research Center (SFB 1102) Information Density and Linguistic Encoding at Saarland University, working in Project B1 on Information Density and Scientific Literacy in English: Synchronic and Diachronic Perspectives, where we investigate linguistic densification in the evolution of scientific writing in English (17th century to present).
I received funding from Saarland University to work on Linguistic Profiles of Social Variables in Diachrony (SLingPro). For this, we use the Old Bailey Corpus (Huber et al., 2016), a digital collection of spoken texts based on the proceedings of London’s central criminal court from the 18th and 19th centuries, which is annotated for social variables such as age, gender, and social class. This will allow us to observe possible linguistic profiles of social variables in diachrony.
In my PhD I focused on combining macro- and micro-analytical methods for register analysis.