Gerard Kempen

Publications

Displaying 1 - 12 of 12
  • Kempen, G., & Harbusch, K. (2016). Verb-second word order after German weil ‘because’: psycholinguistic theory from corpus-linguistic data. Glossa: a journal of general linguistics, 1(1): 3. doi:10.5334/gjgl.46.

    Abstract

    In present-day spoken German, subordinate clauses introduced by the connector weil ‘because’ occur with two orders of subject, finite verb, and object(s). In addition to weil clauses with verb-final word order (“VF”; standard in subordinate clauses) one often hears weil clauses with SVO, the standard order of main clauses (“verb-second”, V2). The “weil-V2” phenomenon is restricted to sentences where the weil clause follows the main clause, and is virtually absent from formal (written, edited) German, occurring only in extemporaneous speech. Extant accounts of weil-V2 focus on the interpretation of weil-V2 clauses by the hearer, in particular on the type of discourse relation licensed by weil-V2 vs. weil-VF: causal/propositional or inferential/epistemic. Focusing instead on the production of weil clauses by the speaker, we examine a collection of about 1,000 sentences featuring a causal connector (weil, da or denn) after the main clause, all extracted from a corpus of spoken German dialogues and annotated with tags denoting major prosodic and syntactic boundaries, and various types of disfluencies (pauses, hesitations). Based on the observed frequency patterns and on known linguistic properties of the connectors, we propose that weil-V2 is caused by miscoordination between the mechanisms for lexical retrieval and grammatical encoding: Due to its high frequency, the lexical item weil is often selected prematurely, while the grammatical encoder is still working on the syntactic shape of the weil clause. Weil-V2 arises when pragmatic and processing factors drive the encoder to discontinue the current sentence, and to plan the clause following weil in the form of the main clause of an independent, new sentence. Thus, the speaker continues with a V2 clause, seemingly in violation of the VF constraint imposed by the preceding weil. We also explore implications of the model regarding the interpretation of sentences containing causal connectors.
  • Kempen, G. (2004). Terug naar Wundt: Pleidooi voor integraal onderzoek van taal, taalkennis en taalgedrag. In Koninklijke Nederlandse Akademie van Wetenschappen (Ed.), Gij letterdames en gij letterheren': Nieuwe mogelijkheden voor taalkundig en letterkundig onderzoek in Nederland. (pp. 174-188). Amsterdam: Koninklijke Nederlandse Akademie van Wetenschappen.
  • Kempen, G., & Harbusch, K. (2004). A corpus study into word order variation in German subordinate clauses: Animacy affects linearization independently of grammatical function assignment. In T. Pechmann, & C. Habel (Eds.), Multidisciplinary approaches to language production (pp. 173-181). Berlin: Mouton de Gruyter.
  • Kempen, G., & Harbusch, K. (2004). Generating natural word orders in a semi-free word order language: Treebank-based linearization preferences for German. In A. Gelbukh (Ed.), Computational Linguistics and Intelligent Text Processing (pp. 350-354). Berlin: Springer.

    Abstract

    We outline an algorithm capable of generating varied but natural sounding sequences of argument NPs in subordinate clauses of German, a semi-free word order language. In order to attain the right level of output flexibility, the algorithm considers (1) the relevant lexical properties of the head verb (not only transitivity type but also reflexivity, thematic relations expressed by the NPs, etc.), and (2) the animacy and definiteness values of the arguments, and their length. The relevant statistical data were extracted from the NEGRA–II treebank and from hand-coded features for animacy and definiteness. The algorithm maps the relevant properties onto “primary” versus “secondary” placement options in the generator. The algorithm is restricted in that it does not take into account linear order determinants related to the sentence’s information structure and its discourse context (e.g. contrastiveness). These factors may modulate the above preferences or license “tertiary” linear orders beyond the primary and secondary options considered here.
  • Kempen, G., & Harbusch, K. (2004). How flexible is constituent order in the midfield of German subordinate clauses? A corpus study revealing unexpected rigidity. In S. Kepser, & M. Reis (Eds.), Pre-Proceedings of the International Conference on Linguistic Evidence (pp. 81-85). Tübingen: Niemeyer.
  • Kempen, G. (2004). Interactive visualization of syntactic structure assembly for grammar-intensive first- and second-language instruction. In R. Delmonte, P. Delcloque, & S. Tonelli (Eds.), Proceedings of InSTIL/ICALL2004 Symposium on NLP and speech technologies in advanced language learning systems (pp. 183-186). Venice: University of Venice.
  • Kempen, G., & Harbusch, K. (2004). How flexible is constituent order in the midfield of German subordinate clauses?: A corpus study revealing unexpected rigidity. In Proceedings of the International Conference on Linguistic Evidence (pp. 81-85). Tübingen: University of Tübingen.
  • Kempen, G. (2004). Human grammatical coding: Shared structure formation resources for grammatical encoding and decoding. In Cuny 2004 - The 17th Annual CUNY Conference on Human Sentence Processing. March 25-27, 2004. University of Maryland (pp. 66).
  • Kempen, G. (1983). Het artificiële-intelligentieparadigma. Ervaringen met een nieuwe methodologie voor cognitief-psychologisch onderzoek. In J. Raaijmakers, P. Hudson, & A. Wertheim (Eds.), Metatheoretische aspekten van de psychonomie (pp. 85-98). Deventer: Van Loghum Slaterus.
  • Kempen, G. (1983). Natural language facilities in information systems: Asset or liability? In J. Van Apeldoorn (Ed.), Man and information technology: Towards friendlier systems (pp. 81-86). Delft University Press.
  • Kempen, G., & Huijbers, P. (1983). The lexicalization process in sentence production and naming: Indirect election of words. Cognition, 14(2), 185-209. doi:10.1016/0010-0277(83)90029-X.

    Abstract

    A series of experiments is reported in which subjects describe simple visual scenes by means of both sentential and non-sentential responses. The data support the following statements about the lexicalization (word finding) process. (1) Words used by speakers in overt naming or sentence production responses are selected by a sequence of two lexical retrieval processes, the first yielding abstract pre-phonological items (Ll -items), the second one adding their phonological shapes (L2-items). (2) The selection of several Ll-items for a multi-word utterance can take place simultaneously. (3) A monitoring process is watching the output of Ll-lexicalization to check if it is in keeping with prevailing constraints upon utterance format. (4) Retrieval of the L2-item which corresponds with a given LI-item waits until the Ld-item has been checked by the monitor, and all other Ll-items needed for the utterance under construction have become available. A coherent picture of the lexicalization process begins to emerge when these characteristics are brought together with other empirical results in the area of naming and sentence production, e.g., picture naming reaction times (Seymour, 1979), speech errors (Garrett, 1980), and word order preferences (Bock, 1982).
  • Kempen, G. (1983). Wat betekent taalvaardigheid voor informatiesystemen? TNO project: Maandblad voor toegepaste wetenschappen, 11, 401-403.

Share this page