Sabine Schulte im Walde: Inducing German Semantic Verb Classes from Subcategorisation Information (joint work with Chris Brew)

A long-standing linguistic hypothesis asserts a tight connection between the syntactic behaviour of a verb and its meaning components: to a certain extent, the lexical meaning of a verb determines its behaviour, particularly with respect to the choice of its arguments. The theoretical foundation has been established in extensive work on semantic verb classes such as Levin (1993) for English and Vazquez et al. (2000) for Spanish.

From a practical point of view, a verb classification supports Natural Language Processing tasks, since it provides a principled basis for filling gaps in available lexical knowledge. For example, the English verb classification has been used for applications such as machine translation (Dorr, 1997), word sense disambiguation (Dorr and Jones, 1996), and document classification (Klavans and Kan, 1998).

In the talk, I will describe the application of k-Means, a standard clustering technique, to the task of inducing semantic classes for German verbs. A robust statistical parser was used to obtain lexical verb information, referring to purely syntactic frame definitions as well as selectional preferences. The goal of a series of cluster analyses based on the subcategorisation information was (i) to find good values for the linguistically uninteresting parameters of the clustering process, and (ii) to explore the role of the frame descriptions in verb classification. The automatic clustering was evaluated against independently motivated, hand-constructed semantic verb classes.

Back to Clippers page


For questions or comments regarding this page, please contact: Detmar Meurers
Last modified: Tue Apr 2 14:33:44 EST 2002