B1: Information Density in English Scientific Writing: A Diachronic Perspective

The project investigates the diachronic development of written scientific English, focusing on information density. Based on relevant data sets (e.g. the Royal Society Corpus), computational language models are built to calculate information density/surprisal for different linguistic units (morphemes, words, syntactic phrases/constructions). Selected phenomena of diachronic variation are investigated with respect to the role of information density, along with other factors potentially involved in usage change. Both the syntagmatic conditions and the paradigmatic effects of change are studied.
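As an illustration of what calculating surprisal on words can look like, here is a minimal sketch, assuming a bigram language model with add-one smoothing trained on a toy sentence. It is not the project's actual model; the function names and the toy data are hypothetical and chosen only for this example. Surprisal is taken as the negative base-2 logarithm of a word's probability given the preceding word.

```python
import math
from collections import Counter

def train_bigram(tokens):
    """Collect context counts, bigram counts and the vocabulary size."""
    contexts = Counter(tokens[:-1])             # every token that has a successor
    bigrams = Counter(zip(tokens, tokens[1:]))  # (previous word, word) pairs
    vocab_size = len(set(tokens))
    return contexts, bigrams, vocab_size

def surprisal(word, prev, contexts, bigrams, vocab_size):
    """Surprisal in bits: -log2 P(word | prev), with add-one smoothing."""
    p = (bigrams[(prev, word)] + 1) / (contexts[prev] + vocab_size)
    return -math.log2(p)

def average_surprisal(tokens, contexts, bigrams, vocab_size):
    """Mean surprisal per word over a token sequence (first token skipped)."""
    pairs = list(zip(tokens, tokens[1:]))
    total = sum(surprisal(w, p, contexts, bigrams, vocab_size) for p, w in pairs)
    return total / len(pairs)

# Toy training text (hypothetical, not taken from the Royal Society Corpus).
train = "the air is heated and the air expands".split()
contexts, bigrams, vocab_size = train_bigram(train)

seen = surprisal("air", "the", contexts, bigrams, vocab_size)   # frequent continuation
unseen = surprisal("is", "the", contexts, bigrams, vocab_size)  # never observed after "the"
mean = average_surprisal(train, contexts, bigrams, vocab_size)
```

A continuation that is frequent in the training data ("the air") receives lower surprisal than one that was never observed ("the is"). Averaging surprisal over the texts of a time period gives one simple way to compare periods; applying the same idea to morphemes or syntactic constructions would require a different tokenization.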

Publications

2018

Menzel, Katrin

Using diachronic corpora of scientific journal articles for complementing English corpus-based dictionaries and lexicographical resources for specialized languages Inproceedings

Proceedings of EURALEX2018, Ljubljana, Slovenia, 2018.

Fischer, Stefan; Knappen, Jörg; Teich, Elke

Using Topic Modelling to Explore Authors’ Research Fields in a Corpus of Historical Scientific English Inproceedings

Proceedings of DH 2018, Mexico City, Mexico, 2018.

Degaetano-Ortlieb, Stefania

Stylistic Variation over 200 Years of Court Proceedings According to Gender and Social Class Inproceedings

Proceedings of the 2nd Workshop on Stylistic Variation co-located with NAACL HLT 2018, June 1-6. ACL, pp. 1-10, New Orleans, USA, 2018.

Degaetano-Ortlieb, Stefania; Strötgen, Jannik

Diachronic variation of temporal expressions in scientific writing through the lens of relative entropy Inproceedings

Rehm, Georg; Declerck, Thierry (Eds.): Language Technologies for the Challenges of the Digital Age: 27th International Conference, GSCL 2017, September 13-14, Proceedings. Lecture Notes in Computer Science, pp. 250-275, Springer International Publishing, Berlin, Germany, 2018.

2017

Degaetano-Ortlieb, Stefania; Teich, Elke

Modeling intra-textual variation with entropy and surprisal: Topical vs. stylistic patterns Inproceedings

pp. 68-77, LaTeCH-CLfL Workshop, ACL, Vancouver, Canada, 2017.

Kermes, Hannah; Teich, Elke

Average surprisal of parts-of-speech Inproceedings

Corpus Linguistics 2017, Birmingham, UK, 2017.

Knappen, Jörg; Fischer, Stefan; Kermes, Hannah; Teich, Elke; Fankhauser, Peter

The making of the Royal Society Corpus Inproceedings

pp. 7-11, 21st Nordic Conference on Computational Linguistics (NoDaLiDa) Workshop on Processing Historical Language, Gothenburg, Sweden, 2017.

Degaetano-Ortlieb, Stefania; Scholman, Merel; Fischer, Stefan; Teich, Elke; Demberg, Vera

An information-theoretic account on the diachronic development of discourse connectors in scientific writing Inproceedings

39th DGfS AG1, Saarbrücken, Germany, 2017.

Menzel, Katrin; Degaetano-Ortlieb, Stefania

The diachronic development of combining forms in scientific writing Journal Article

Lege Artis. Language yesterday, today, tomorrow. The Journal of University of SS Cyril and Methodius in Trnava. Warsaw: De Gruyter Open, vol. 2 (2), pp. 185-249, 2017.

Degaetano-Ortlieb, Stefania; Menzel, Katrin; Teich, Elke

The course of grammatical change in scientific writing: Interdependency between convention and productivity Inproceedings

Proceedings of the Corpus and Language Variation in English Research Conference (CLAVIER), Bari, Italy, 2017.

Degaetano-Ortlieb, Stefania

Variation in language use across social variables: a data-driven approach Inproceedings

Proceedings of the Corpus and Language Variation in English Research Conference (CLAVIER), Bari, Italy, 2017.

Degaetano-Ortlieb, Stefania; Strötgen, Jannik

Diachronic variation of temporal expressions in scientific writing through the lens of relative entropy Inproceedings

Proceedings of the German Society for Computational Linguistics and Language Technology Conference (GSCL), Berlin, Germany, 2017.

2016

Degaetano-Ortlieb, Stefania; Teich, Elke

Information-based modeling of diachronic linguistic change: from typicality to productivity Inproceedings

Proceedings of Language Technologies for the Socio-Economic Sciences and Humanities (LATECH'16), Association for Computational Linguistics (ACL), Berlin, Germany, 2016.

Kermes, Hannah; Knappen, Jörg; Khamis, Ashraf; Degaetano-Ortlieb, Stefania; Teich, Elke

The Royal Society Corpus. Towards a high-quality resource for studying diachronic variation in scientific writing Inproceedings

Proceedings of Digital Humanities (DH'16), Krakow, Poland, 2016.

Degaetano-Ortlieb, Stefania; Kermes, Hannah; Khamis, Ashraf; Teich, Elke

An Information-Theoretic Approach to Modeling Diachronic Change in Scientific English Journal Article

Selected Papers from Varieng - From Data to Evidence (d2e), Helsinki, Finland, 2016.

Fankhauser, Peter; Knappen, Jörg; Teich, Elke

Topical Diversification over Time in the Royal Society Corpus Inproceedings

Proceedings of Digital Humanities (DH'16), Krakow, Poland, 2016.

Kermes, Hannah; Degaetano-Ortlieb, Stefania; Khamis, Ashraf; Knappen, Jörg; Teich, Elke

The Royal Society Corpus: From Uncharted Data to Corpus Inproceedings

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'16), Portoroz, Slovenia, 2016.

2015

Degaetano-Ortlieb, Stefania; Kermes, Hannah; Khamis, Ashraf; Ordan, Noam; Teich, Elke

The taming of the data: Using text mining in building a corpus for diachronic analysis Inproceedings

Varieng - From Data to Evidence (d2e), University of Helsinki, 2015.

Degaetano-Ortlieb, Stefania; Kermes, Hannah; Khamis, Ashraf; Knappen, Jörg; Ordan, Noam; Teich, Elke

Information Density in Scientific Writing: A Diachronic Perspective Inproceedings

"Challenging Boundaries" - 42nd International Systemic Functional Congress (ISFCW2015), RWTH Aachen University, 2015.

Khamis, Ashraf; Degaetano-Ortlieb, Stefania; Kermes, Hannah; Knappen, Jörg; Ordan, Noam; Teich, Elke

A resource for the diachronic study of scientific English: Introducing the Royal Society Corpus Inproceedings

Corpus Linguistics 2015, Lancaster, 2015.

Elke Teich

PI

Stefania Degaetano-Ortlieb

Postdoc

Katrin Menzel

Postdoc

Pauline Krielke

PhD
