Linguistic Modeling and its Interfaces
Oberseminar, Detmar Meurers, Summer Semester 2015
This series features presentations and discussions of current issues in linguistic modeling and its interfaces. This includes linguistic modeling in computational linguistics, language acquisition research, Intelligent Computer-Assisted Language Learning, and education, as well as theoretical linguistic research with a focus on the interfaces of syntax and information structure. It is open to anyone interested in this interdisciplinary enterprise.
Development of a Cross-Platform Serious Game for Children with Dyslexia
Abstract: One of the major causes of dyslexia, which affects about 4-8% of the population, is deficient phonological awareness: the ability to deal with the sound system of a language and to detect, distinguish, and manipulate segments of a language, such as syllables, rimes, or even single sounds. Although research has shown that a shortcoming in detecting syllable stress in the context of words or sentences is a very strong predictor of dyslexia, no mobile serious games are currently known that explicitly focus on improving this deficit. In this work, I present the iterative, user-centered development of a prototype of a mobile serious game for iOS and Android. The game is designed to support primary-school-aged children in improving their phonological awareness outside the classroom or learning therapy. It is a novelty in the area of mobile serious games for children with dyslexia, as it focuses on the stress of single syllables. By integrating an intelligent tutoring system based on principles of the cognitive architecture ACT-R, the game can adapt dynamically to a child's performance, which makes it possible to optimize the learning curve while maintaining motivation and fun.
A Readable Read: Automatic Assessment of Language Learning Materials based on Linguistic Complexity
Abstract: Corpora and web texts can become a rich language learning resource if we have a means of assessing whether they are linguistically appropriate for learners at a given proficiency level. In this paper, we address this issue by presenting the first approach for predicting the linguistic complexity of Swedish second language learning material on a 5-point scale. After showing that the traditional Swedish readability measure, Läsbarhetsindex (LIX), is not suitable for this task, we propose a supervised machine learning model, based on a range of linguistic features, that can reliably classify texts according to their difficulty level. Our model obtained an accuracy of 81.3% and an F-score of 0.8, which is comparable to the state of the art in English and is considerably higher than previously reported results for other languages. We further studied the utility of our features on single sentences instead of full texts, since sentences are a common linguistic unit in language learning exercises. We trained a separate model on sentence-level data with five classes, which yielded 63.4% accuracy. Although this is lower than the document-level performance, we achieved an adjacent accuracy of 92%. Furthermore, we found that using a combination of different features, compared to using lexical features alone, resulted in a 7% improvement in classification accuracy at the sentence level, whereas at the document level, lexical features were more dominant. Our models are intended for use in a freely accessible web-based language learning platform for the automatic generation of exercises, and they will also be available in the form of web services.
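To make the modeling setup concrete, here is a minimal sketch of a feature-based readability classifier in the spirit of the abstract; the surface features, toy data, and level labels are illustrative assumptions, not the authors' actual feature set or corpus.

```python
# Minimal sketch of feature-based readability classification; features and
# data are invented for illustration, not the authors' actual setup.
import numpy as np
from sklearn.linear_model import LogisticRegression

def surface_features(text):
    """Toy surface features: mean sentence length, mean word length, type-token ratio."""
    sentences = [s for s in text.split(".") if s.strip()]
    words = text.split()
    return [
        len(words) / max(len(sentences), 1),                   # mean sentence length
        sum(len(w) for w in words) / max(len(words), 1),       # mean word length
        len({w.lower() for w in words}) / max(len(words), 1),  # type-token ratio
    ]

# Tiny invented training set: Swedish texts labeled with difficulty levels on a 1-5 scale.
texts = [
    "Jag heter Anna. Jag bor i Lund.",
    "Vi ses i morgon. Det blir roligt.",
    "Regeringens proposition diskuterades ingående i utskottet under onsdagen.",
    "Utredningens slutsatser ifrågasattes av flera remissinstanser.",
]
levels = [1, 1, 4, 4]

X = np.array([surface_features(t) for t in texts])
clf = LogisticRegression(max_iter=1000).fit(X, levels)
print(clf.predict([surface_features("Jag gillar katter. De är fina.")]))  # expect a low level
```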
Personalized, Adaptive Learning based on Cognitive Models Increases Learning Efficiency
Abstract: It is well known that the schedule of practice partly determines the efficiency of learning sessions, and thus the retention of the learned materials. Even the classical Leitner method for learning factual information is based on this principle, as better-encoded items are practiced less often. By calculating the optimal distance between repetitions of a to-be-learned item, large learning gains can be obtained. Most methods that aim to optimize the schedule of practice adapt it based on whether a learner provides a correct or an incorrect answer. However, even when an item is answered correctly, the speed with which the answer is given can be used to assess how well the item is encoded in memory. In this talk, I will present an adaptive learning system that is based on computational cognitive models of the human long-term memory system. This system keeps track of the internal activation of each to-be-learned item and updates that activation after each presentation. Based on this activation value, the system determines which item needs to be practiced at what point in time, or whether the learner is ready for the presentation of new items. We have tested this system in multiple experiments, demonstrating typical learning gains of 10%, and it is now used by a large Dutch publishing house in the online systems associated with all their secondary-education learning materials. I will also discuss recent work with this system suggesting that the internal parameters of the system are better predictors of how well an item is mastered than the score on a test, that these parameters are stable over time and relatively stable across materials, and how these parameters correlate with other, more traditional measures of learning aptitude.
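As a concrete illustration of activation-based scheduling, here is a minimal sketch using the standard ACT-R base-level activation equation; the decay parameter, the timestamps, and the selection rule are textbook simplifications assumed for the example, not the presenter's actual system.

```python
# Sketch of ACT-R-style base-level activation and item selection; a simplification
# of the idea described above, not the presenter's actual system.
import math

DECAY = 0.5  # standard ACT-R decay parameter d (assumed here)

def activation(presentation_times, now):
    """Base-level activation: A = ln(sum over presentations of (now - t)^-d)."""
    return math.log(sum((now - t) ** -DECAY for t in presentation_times))

def item_to_practice(history, now):
    """Practice the item whose activation is lowest, i.e. closest to being forgotten."""
    return min(history, key=lambda item: activation(history[item], now))

# Presentation timestamps (in seconds) per vocabulary item.
history = {"cheval": [10.0, 90.0], "maison": [5.0], "arbre": [100.0]}
now = 120.0
for item, times in history.items():
    print(f"{item}: A = {activation(times, now):+.2f}")
print("practice next:", item_to_practice(history, now))
```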
Automatic Identification of Translationese: Highlighting the nature of translation
Abstract: Translated texts differ from texts originally written in the same (target) language. Several Translation Studies hypotheses aim at explaining these differences. We use computational methodology, specifically supervised and unsupervised classification, to distinguish between translated and original texts. This facilitates a close inspection of the specific features along which the two types of texts differ.
This enterprise yields several findings. We show that the import of these results is not only theoretical; they have implications for natural language processing applications, in particular statistical machine translation.
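As an illustration of the methodology, the following toy sketch separates texts by their function-word profiles using unsupervised clustering, a classic signal for translationese; the word list and texts are invented for the example, not the speaker's actual features or corpora.

```python
# Toy sketch of translationese detection via function-word frequencies and
# unsupervised clustering; word list and data are invented for illustration.
import numpy as np
from sklearn.cluster import KMeans

FUNCTION_WORDS = ["the", "of", "and", "to", "in", "that", "it", "is"]

def profile(text):
    """Relative frequency of each function word in the text."""
    tokens = text.lower().split()
    return [tokens.count(w) / max(len(tokens), 1) for w in FUNCTION_WORDS]

texts = [
    "it is the case that the committee approved the proposal in the end",
    "it is important that the decision of the board is made in the open",
    "committee members quickly approved several controversial proposals",
    "board members debated late and reached no decision whatsoever",
]
X = np.array([profile(t) for t in texts])
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
print(labels)  # texts with similar function-word profiles end up in one cluster
```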
On the heterogeneity of projective content
Abstract: Tonhauser et al. (2013) argue, based on data from English and Paraguayan Guaraní, that projective content is heterogeneous: their work distinguished four classes of projective content based on two properties, namely whether the content imposes a strong constraint on the prior context and whether the content shows local effect. Our goal in this presentation is to provide further empirical evidence for the heterogeneity of projective content. We present preliminary results from experiments that compare the degree of projectivity for a wide range of types of projective content, including the prejacent of "only", the pre-state of "stop", the contents of the complements of a range of "factive" verbs, and conventional implicatures. We show that the degree of projectivity of a projective content is correlated with the degree of not-at-issueness of that content: the more the content is not-at-issue, the more projective it is.
Reference: Tonhauser, J., D. Beaver, C. Roberts & M. Simons (2013). Toward a taxonomy of projective content. Language 89(1), 66-109.
Matching Text Features and Reader Skills via Conjoint IRT
Abstract: Computational linguistics and psychometrics choose different pathways to explain text difficulty: while the former focuses on factors at the text level (e.g., surface features like sentence length, …), the latter emphasizes the individual determinants of reading comprehension (e.g., vocabulary, reading fluency, …). Both aspects are necessary to explain and forecast the performance of groups and individuals. Item Response Theory (IRT) models have the potential to better interlink both theoretical backgrounds within a common approach by representing text difficulty and reader aptitude on the same scale and modelling the probability of success with logistic regressions. This talk presents an advanced IRT methodology, so-called Conjoint IRT (Klein Entink, 2009), that can incorporate both accuracy and reading speed. It further shows how to match reader competencies, text difficulty, and time consumption, as well as how to model these parameters with finite Taylor polynomials on the basis of surface features of the texts.
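To make the core idea concrete, here is a minimal sketch of the Rasch-style success probability that places reader ability and text difficulty on one scale; the parameter values are invented, and the response-time component that Conjoint IRT adds is only noted in a comment.

```python
# Minimal sketch of the IRT building block described above: success probability
# as a logistic function of (ability - difficulty). Parameter values are invented.
# Conjoint IRT (Klein Entink, 2009) additionally models (log) response times,
# which is omitted here.
import math

def p_correct(theta, b):
    """Rasch model: P(correct) = 1 / (1 + exp(-(theta - b)))."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

theta = 1.0  # reader ability on the shared scale
for b in (-1.0, 0.0, 1.0, 2.0):  # text difficulties on the same scale
    print(f"difficulty {b:+.1f}: P(correct) = {p_correct(theta, b):.2f}")
```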
Measuring Linguistic Complexity for Adaptive Generation
Abstract: In this talk I will share the results of some early experiments on ranking sentences by
their difficulty using psycholinguistic metrics which correlate with reading times. Preliminary
results indicate that measures like surprisal (Hale 2001; Levy 2008) and dependency length (Gibson 2000; Gildea & Temperley 2007) do improve model accuracy beyond simple (but frustratingly good) baseline systems. I will also discuss ongoing work that uses better feature extraction and corpora (and compares to other systems, e.g., Vajjala & Meurers 2014), as well as the relation of this work to the new project "Adapting Information Density to Changing Situations and Individual Users" at Universität des Saarlandes (SFB 1102).
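For readers unfamiliar with the dependency length metric, here is a minimal sketch of how it is computed from a dependency parse; the example parse is hand-supplied, whereas in practice it would come from a parser.

```python
# Minimal sketch of total dependency length (Gibson 2000; Gildea & Temperley 2007):
# the summed linear distance between each word and its syntactic head.
def dependency_length(heads):
    """heads[i] is the 0-based index of word i's head, or None for the root."""
    return sum(abs(i - h) for i, h in enumerate(heads) if h is not None)

# "the dog chased the cat"
# the -> dog, dog -> chased, chased = root, the -> cat, cat -> chased
sentence_heads = [1, 2, None, 4, 2]
print(dependency_length(sentence_heads))  # 1 + 1 + 1 + 2 = 5
```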
Phonetic representations for short-answer scoring in language assessment
Abstract: Content-oriented scoring of short responses is a meaning-focused task in which, for the most part, evaluation should not be affected by low-level input flaws. However, the responses that scoring systems process are often ill-formed. This is especially true in the context of computer-based foreign language assessment, in which form errors such as misspellings or ungrammaticality are likely to occur in the productions of learners at low proficiency levels.
In this presentation I’ll talk about ongoing work on addressing misspellings in the computer-based evaluation of learners’ responses to listening comprehension items in DaF (German as a foreign language) placement tests. Based on a corpus of responses by test takers of different language proficiencies, I’ll illustrate the extent of the misspelling problem and present an evaluation of an off-the-shelf spell-checking tool. As a way of addressing misspellings, I’ll introduce work in progress on scoring based on phonetic representations, specifically encodings borrowed from historical linguistics, and discuss prospects for scoring based on string similarity computed over phonetically transcribed strings.
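To illustrate the general idea, here is a toy sketch that compares a learner response to a target answer on a crude phonetic key rather than on raw characters; the key below is a simplification invented for this example, not the historical-linguistics encodings used in the actual work.

```python
# Toy sketch of phonetically informed matching: compare a learner response and a
# target on a crude phonetic key instead of raw characters. The key is an invented
# simplification, not the encodings from historical linguistics used in the work.
MERGE = str.maketrans("bdgvzw", "ptkfsf")  # collapse some German voicing contrasts

def phonetic_key(word):
    """Map voiced to voiceless consonants, drop vowels, collapse doubled letters."""
    w = word.lower().translate(MERGE)
    out = []
    for c in w:
        if c in "aeiouäöüy":
            continue
        if out and out[-1] == c:
            continue
        out.append(c)
    return "".join(out)

def levenshtein(a, b):
    """Standard dynamic-programming edit distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

target, response = "Fahrrad", "Farrat"  # misspelled learner answer
print(levenshtein(target.lower(), response.lower()))              # raw distance: 2
print(levenshtein(phonetic_key(target), phonetic_key(response)))  # phonetic distance: 1
```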
Pilán, I., S. Vajjala & E. Volodina (2015). A Readable Read: Automatic Assessment of Language Learning Materials based on Linguistic Complexity. In Proceedings of CICLing 2015, Research in Computing Science Journal Issue (to appear).
_________________________________________________________________________________
Last updated: July 14, 2015