C4: Mutual Intelligibility and Surprisal in Slavic Intercomprehension (INCOMSLAV)

Project C4 is concerned with the differential encodings of linguistic categories in a cross-linguistic perspective (here: Slavic languages) focusing on density. In particular, the project will investigate the relation of grammaticalisation, encoding density and information density. As a relevant application, intercomprehension within the family of Slavic languages will be explored. The project will bring together results from the analysis of parallel corpora and from a variety of experiments with native speakers of Slavic languages and will compare them with insights of comparative historical linguistics on the relationship between Slavic languages. A statistical language model is used as a measure of surprisal and as a tool to gauge how language users master high degrees of surprisal, due to partial incomprehensibility. The key idea here is that comprehension of an unknown, but related, language should be better, when the language model adapted for understanding the unknown language exhibits relatively low average surprisal, or density.

Publications

2015

Avgustinova, Tania; Fischer, Andrea; Jágrová, Klára; Stenger, Irina

The Empirical Basis of Slavic Intercomprehension Inproceedings

REMU, Joensuu, Finland, 2015.

Links | BibTeX

Stenger, Irina

"Reading Polish with Czech Eyes" or "How Russian Can a Bulgarian Text Be?": Orthographic Differences as an Experimental Variable in Reading Comprehension Inproceedings

11th European Conference on Formal Description of Slavic Languages (FDSL-11), Potsdam, Germany, 2015.

Links | BibTeX

Fischer, Andrea; Jágrová, Klára; Stenger, Irina; Avgustinova, Tania; Klakow, Dietrich; Marti, Roland

An Orthography Transformation Experiment with Czech-Polish and Bulgarian-Russian Parallel Word Sets Inproceedings

Sharp, Bernadette ; Lubaszewski, Wies{ł}aw ; Delmonte, Rodolfo (Ed.): Natural Language Processing and Cognitive Science, pp. pp. 115–126, Ca Foscarina Editrice, Venezia, 2015.

Links | BibTeX

Fischer, Andrea; Demberg, Vera; Klakow, Dietrich

Towards Flexible, Small-Domain Surface Generation: Combining Data-Driven and Grammatical Approaches Proceeding

Association for Computational Linguistics Brighton, 2015.

Links | BibTeX

Fischer, Andrea; Jágrová, Klára; Stenger, Irina; Avgustinova, Tania; Klakow, Dietrich; Marti, Roland

Models for Mutual Intelligibility Inproceedings

Data Mining and its Use and Usability for Linguistic Analysis, Universität des Saarlandes Saarbrücken, 2015.

Links | BibTeX

2014

Dietrich, Klakow; Avgustinova, Tania; Stenger, Irina; Fischer, Andrea; Jágrová, Klára

The INCOMSLAV Project Inproceedings

Seminar in formal linguistics at ÚFAL, Charles University Prague, 2014.

Links | BibTeX

Tania Avgustinova

PI

Mail
Website

Roland Marti

PI

Mail
Website

Dietrich Klakow

PI

Mail
Website

Klára Jágrová

PhD

Mail
Website

Irina Stenger

PhD

Mail
Website

Andrea K. Fischer

PhD

Mail