Gerard Kempen

Publications

  • Kempen, G., & Harbusch, K. (2017). Frequential test of (S)OV as unmarked word order in Dutch and German clauses: A serendipitous corpus-linguistic experiment. In H. Reckman, L. L. S. Cheng, M. Hijzelendoorn, & R. Sybesma (Eds.), Crossroads semantics: Computation, experiment and grammar (pp. 107-123). Amsterdam: Benjamins.

    Abstract

    In a paper entitled “Against markedness (and what to replace it with)”, Haspelmath argues “that the term ‘markedness’ is superfluous”, and that frequency asymmetries often explain structural (un)markedness asymmetries (Haspelmath 2006). We investigate whether this argument applies to Object and Verb orders in main (VO, marked) and subordinate (OV, unmarked) clauses of spoken and written German and Dutch, using English (without VO/OV alternation) as control. Frequency counts from six treebanks (three languages, two output modalities) do not support Haspelmath’s proposal. However, they reveal an unexpected phenomenon, most prominently in spoken Dutch and German: a small set of extremely high-frequent finite verbs with unspecific meanings populates main clauses much more densely than subordinate clauses. We suggest these verbs accelerate the start-up of grammatical encoding, thus facilitating sentence-initial output fluency.
  • Kuiper, K., Bimesl, N., Kempen, G., & Ogino, M. (2017). Initial vs. non-initial placement of agent constructions in spoken clauses: A corpus-based study of language production under time pressure. Language Sciences, 64, 16-33. doi:10.1016/j.langsci.2017.06.001.

    Abstract

    In this exploratory study we test the hypothesis that the retrieval from memory of proper noun Agents (PNAs) under processing pressure causes a greater proportion of such semantic arguments to be placed to the right of the initial position in a clause than would be the case if such retrieval from memory were not necessary. This effect is manifest in sports commentary. Processing pressure on sports commentators is modulated by the speed at which the sport is played and reported. Non-initial placement is also facilitated by formulae which have slots in non-initial position. It follows that the non-initial placement of PNAs is not always semantically or pragmatically motivated. This finding therefore runs counter to a strong form of the functionalist hypothesis that syntactic choices available in the systemic structure of the syntax of a language offer solely semantic or pragmatic choices. It is an open question in a weak functionalist account of language and language use how processing and communicative functions interact in general.
  • Harbusch, K., & Kempen, G. (2011). Automatic online writing support for L2 learners of German through output monitoring by a natural-language paraphrase generator. In M. Levy, F. Blin, C. Bradin Siskin, & O. Takeuchi (Eds.), WorldCALL: International perspectives on computer-assisted language learning (pp. 128-143). New York: Routledge.

    Abstract

    Students who are learning to write in a foreign language often want feedback on the grammatical quality of the sentences they produce. The usual NLP approach to this problem is based on parsing student-generated text. Here, we propose a generation-based approach aiming at preventing errors ("scaffolding"). In our ICALL system, the student constructs sentences by composing syntactic trees out of lexically anchored "treelets" via a graphical drag & drop user interface. A natural-language generator computes all possible grammatically well-formed sentences entailed by the student-composed tree. It provides positive feedback if the student-composed tree belongs to the well-formed set, and negative feedback otherwise. If so requested by the student, it can substantiate the positive or negative feedback based on a comparison between the student-composed tree and its own trees (informative feedback on demand). In case of negative feedback, the system refuses to build the structure attempted by the student. Frequently occurring errors are handled in terms of "malrules." The system we describe is a prototype (implemented in JAVA and C++) which can be parameterized with respect to L1 and L2, the size of the lexicon, and the level of detail of the visually presented grammatical structures.
  • Kempen, G. (1970). Ideaalbeelden van de Europese jeugd: Weerwoord op methodologische kritiek. Dux, 37, 54-56.
  • Kempen, G. (1970). Memory for word and sentence meanings: A set-feature model. PhD Thesis, Katholieke Universiteit Nijmegen, Nijmegen.
