Rinus Verdonschot

Publications

Displaying 1 - 17 of 17
  • Mishra, C., Verdonschot, R. G., Hagoort, P., & Skantze, G. (2023). Real-time emotion generation in human-robot dialogue using large language models. Frontiers in Robotics and AI, 10: 1271610. doi:10.3389/frobt.2023.1271610.

    Abstract

    Affective behaviors enable social robots to not only establish better connections with humans but also serve as a tool for the robots to express their internal states. It has been well established that emotions are important to signal understanding in Human-Robot Interaction (HRI). This work aims to harness the power of Large Language Models (LLM) and proposes an approach to control the affective behavior of robots. By interpreting emotion appraisal as an Emotion Recognition in Conversation (ERC) tasks, we used GPT-3.5 to predict the emotion of a robot’s turn in real-time, using the dialogue history of the ongoing conversation. The robot signaled the predicted emotion using facial expressions. The model was evaluated in a within-subjects user study (N = 47) where the model-driven emotion generation was compared against conditions where the robot did not display any emotions and where it displayed incongruent emotions. The participants interacted with the robot by playing a card sorting game that was specifically designed to evoke emotions. The results indicated that the emotions were reliably generated by the LLM and the participants were able to perceive the robot’s emotions. It was found that the robot expressing congruent model-driven facial emotion expressions were perceived to be significantly more human-like, emotionally appropriate, and elicit a more positive impression. Participants also scored significantly better in the card sorting game when the robot displayed congruent facial expressions. From a technical perspective, the study shows that LLMs can be used to control the affective behavior of robots reliably in real-time. Additionally, our results could be used in devising novel human-robot interactions, making robots more effective in roles where emotional interaction is important, such as therapy, companionship, or customer service.
  • Tamaoka, K., Sakai, H., Miyaoka, Y., Ono, H., Fukuda, M., Wu, Y., & Verdonschot, R. G. (2023). Sentential inference bridging between lexical/grammatical knowledge and text comprehension among native Chinese speakers learning Japanese. PLoS One, 18(4): e0284331. doi:10.1371/journal.pone.0284331.

    Abstract

    The current study explored the role of sentential inference in connecting lexical/grammatical knowledge and overall text comprehension in foreign language learning. Using structural equation modeling (SEM), causal relationships were examined between four latent variables: lexical knowledge, grammatical knowledge, sentential inference, and text comprehension. The study analyzed 281 Chinese university students learning Japanese as a second language and compared two causal models: (1) the partially-mediated model, which suggests that lexical knowledge, grammatical knowledge, and sentential inference concurrently influence text comprehension, and (2) the wholly-mediated model, which posits that both lexical and grammatical knowledge impact sentential inference, which then further affects text comprehension. The SEM comparison analysis supported the wholly-mediated model, showing sequential causal relationships from lexical knowledge to sentential inference and then to text comprehension, without significant contribution from grammatical knowledge. The results indicate that sentential inference serves as a crucial bridge between lexical knowledge and text comprehension.
  • Tamaoka, K., Zhang, J., Koizumi, M., & Verdonschot, R. G. (2023). Phonological encoding in Tongan: An experimental investigation. Quarterly Journal of Experimental Psychology, 76(10), 2197-2430. doi:10.1177/17470218221138770.

    Abstract

    This study is the first to report chronometric evidence on Tongan language production. It has been speculated that the mora plays an important role during Tongan phonological encoding. A mora follows the (C)V form, so /a/ and /ka/ (but not /k/) denote a mora in Tongan. Using a picture-word naming paradigm, Tongan native speakers named pictures containing superimposed non-word distractors. This task has been used before in Japanese, Korean, and Vietnamese to investigate the initially selected unit during phonological encoding (IPU). Compared to control distractors, both onset and mora overlapping distractors resulted in faster naming latencies. Several alternative explanations for the pattern of results - proficiency in English, knowledge of Latin script, and downstream effects - are discussed. However, we conclude that Tongan phonological encoding likely natively uses the phoneme, and not the mora, as the IPU..

    Additional information

    supplemental material
  • Wang, M., Shao, Z., Verdonschot, R. G., Chen, Y., & Schiller, N. O. (2023). Orthography influences spoken word production in blocked cyclic naming. Psychonomic Bulletin & Review, 30, 383-392. doi:10.3758/s13423-022-02123-y.

    Abstract

    Does the way a word is written influence its spoken production? Previous studies suggest that orthography is involved only when the orthographic representation is highly relevant during speaking (e.g., in reading-aloud tasks). To address this issue, we carried out two experiments using the blocked cyclic picture-naming paradigm. In both experiments, participants were asked to name pictures repeatedly in orthographically homogeneous or heterogeneous blocks. In the naming task, the written form was not shown; however, the radical of the first character overlapped between the four pictures in this block type. A facilitative orthographic effect was found when picture names shared part of their written forms, compared with the heterogeneous condition. This facilitative effect was independent of the position of orthographic overlap (i.e., the left, the lower, or the outer part of the character). These findings strongly suggest that orthography can influence speaking even when it is not highly relevant (i.e., during picture naming) and the orthographic effect is less likely to be attributed to strategic preparation.
  • Kinoshita, S., & Verdonschot, R. G. (2021). Phonological encoding is free from orthographic influence: evidence from a picture variant of the phonological Stroop task. Psychological Research, 85, 1340-1347. doi:10.1007/s00426-020-01315-2.

    Abstract

    The phonological Stroop task, in which the participant names the color of written distractors, is being used increasingly to study the phonological encoding process in speech production. A brief review of experimental paradigms used to study the phonological encoding process indicated that currently it is not known whether the onset overlap benefit (faster color naming when the distractor shares the onset segment with the color name) in a phonological Stroop task is due to phonology or orthography. The present paper investigated this question using a picture variant of the phonological Stroop task. Participants named a small set of line drawings of animals (e.g., camel) with a pseudoword distractor printed on it. Picture naming was facilitated when the distractor shared the onset segment with the picture name regardless of orthographic overlap (CUST–camel = KUST–camel < NUST–camel). We conclude that the picture variant of the phonological Stroop task is a useful tool to study the phonological encoding process, free of orthographic influence.

    Additional information

    426_2020_1315_MOESM1_ESM.docx
  • Kinoshita, S., Yu, L., Verdonschot, R. G., & Norris, D. (2021). Letter identity and visual similarity in the processing of diacritic letters. Memory & Cognition, 49(4), 815-825. doi:10.3758/s13421-020-01125-2.

    Abstract

    Are letters with a diacritic (e.g., a) recognized as a variant of the base letter (e.g., a), or as a separate letter identity? Two recent masked priming studies, one in French and one in Spanish, investigated this question, concluding that this depends on the language-specific linguistic function served by the diacritic. Experiment 1 tested this linguistic function hypothesis using Japanese kana, in which diacritics signal consonant voicing, and like French and unlike Spanish, provide lexical contrast. Contrary to the hypothesis, Japanese kana yielded the pattern of diacritic priming like Spanish. Specifically, for a target kana with a diacritic (e.g., (sic), /ga/), the kana prime without the diacritic (e.g., (sic), /ka/) facilitated recognition almost as much as the identity prime (e.g., (sic) = (sic)), whereas for a target kana without a diacritic, the kana prime with the diacritic produced less facilitation than the identity prime (e.g., (sic) < (sic)). We suggest that the pattern of diacritic priming has little to do with linguistic function, and instead it stems from a general property of visual object recognition. Experiment 2 tested this hypothesis using visually similar letters of the Latin alphabet that differ in the presence/absence of a visual feature (e.g., O and Q). The same asymmetry in priming was observed. These findings are consistent with the noisy channel model of letter/word recognition (Norris & Kinoshita, Psychological Review, 119, 517-545, 2012a).
  • Konishi, M., Verdonschot, R. G., & Kakimoto, N. (2021). An investigation of tooth loss factors in elderly patients using panoramic radiographs. Oral Radiology, 37(3), 436-442. doi:10.1007/s11282-020-00475-6.

    Abstract

    Objectives The aim of this study was to observe the dental condition in a group of elderly patients over a period of 10 years in order to clarify important risk factors. Materials and methods Participants were elderly patients (in their eighties) who took panoramic radiographs between 2015 and 2016, and for whom panoramic radiographs taken around 10 year earlier were also available. The number of remaining and lost teeth, the Eichner Index, the presence or absence of molar occlusion, the respective condition of dental pulp, dental crowns, alveolar bone resorption, as well as periapical lesions were investigated through the analysis of panoramic radiographs. Additionally, other important variables were collected from patients' medical records. From the obtained panoramic radiograph sets, the patients' dental condition was investigated, and a systematic comparison was conducted. Results The analysis of the panoramic radiographs showed that the number of remaining teeth decreased from an average of 20.8-15.5, and the percentage of patients with 20 or more teeth decreased from 69.2 to 26.9%. A factor analysis investigating tooth loss risk suggested that tooth loss was associated with the bridge, P2 or greater resorption of the alveolar bone, and apical lesions, and gender (with males having a higher risk compared to females). Conclusions Teeth showing P2 or greater alveolar bone resorption, bridge, and apical lesions on panoramic radiographs are most likely to be lost in an elderly patient's near future. Consequently, this group should be encouraged to visit their dental clinics regularly and receive comprehensive instruction on individual self-care methods.
  • Konishi, M., Fujita, M., Shimabukuro, K., Wongratwanich, P., Verdonschot, R. G., & Kakimoto, N. (2021). Intraoral ultrasonographic features of tongue cancer and the incidence of cervical lymph node metastasis. Journal of Oral and Maxillofacial Surgery, 79(4), 932-939. doi:10.1016/j.joms.2020.09.006.

    Abstract

    Purpose: The purpose of this study was to investigate the relationship between the visual characteristics of tongue lesion images obtained through intraoral ultrasonographic examination and the occurrence of late cervical lymph node metastasis in patients with tongue cancer.
    Patients and Methods: This study investigated patients with primary tongue cancer who were examined using intraoral ultrasonography at Hiroshima University Hospital between January 2014 and December 2017. The inclusion criteria were squamous cell carcinoma, curative treatment administration, lateral side of tongue, surgery or brachytherapy alone, no cervical lymph node or distant metastasis as primary treatment, and treatment in our hospital. The exclusion criteria were carcinoma in situ, palliative treatment, dorsum of tongue, and multiple primary cancers. The follow-up period was more than 1 year. The primary endpoint was the occurrence of late cervical lymph node metastasis, and the primary predictor variables were age, gender, longest diameter, thickness, margin or border shapes of the lesion, and treatment methods. The relationship between the occurrence of late cervical lymph node metastasis and the longest diameter, thickness, margin types, and border types as evaluated through intraoral ultrasonography were assessed. The data were collected through a retrospective chart review.
    Results: Fifty-four patients were included in this study. The analysis indicated that irregular lesion margins were significantly associated with the occurrence of late cervical lymph node metastasis (P < .0001). The cutoff value for late cervical lymph node metastasis was 21.2 mm for the longest diameter and 3.9 mm for the thickness.
    Conclusions: The results of this study indicates that the irregular lesion margin assessed using intraoral ultrasonography may serve as an effective predictor of late cervical lymph node metastasis in N0 cases. (C) 2020 American Association of Oral and Maxillofacial Surgeons
  • Konishi, M., Fujita, M., Takeuchi, Y., Kubo, K., Imano, N., Nishibuchi, I., Murakami, Y., Shimabukuro, K., Wongratwanich, P., Verdonschot, R. G., Kakimoto, N., & Nagata, Y. (2021). Treatment outcomes of real-time intraoral sonography-guided implantation technique of 198Au grain brachytherapy for T1 and T2 tongue cancer. Journal of Radiation Research, 62(5), 871-876. doi:10.1093/jrr/rrab059.

    Abstract

    It is often challenging to determine the accurate size and shape of oral lesions through computed tomography (CT) or magnetic resonance imaging (MRI) when they are very small or obscured by metallic artifacts, such as dental prostheses. Intraoral ultrasonography (IUS) has been shown to be beneficial in obtaining precise information about total tumor extension, as well as the exact location and guiding the insertion of catheters during interstitial brachytherapy. We evaluated the role of IUS in assessing the clinical outcomes of interstitial brachytherapy with 198Au grains in tongue cancer through a retrospective medical chart review. The data from 45 patients with T1 (n = 21) and T2 (n = 24) tongue cancer, who were mainly treated with 198Au grain implants between January 2005 and April 2019, were included in this study. 198Au grain implantations were carried out, and positioning of the implants was confirmed by IUS, to ensure that 198Au grains were appropriately placed for the deep border of the tongue lesion. The five-year local control rates of T1 and T2 tongue cancers were 95.2% and 95.5%, respectively. We propose that the use of IUS to identify the extent of lesions and the position of implanted grains is effective when performing brachytherapy with 198Au grains.
  • Verdonschot, R. G., Han, J.-I., & Kinoshita, S. (2021). The proximate unit in Korean speech production: Phoneme or syllable? Quarterly Journal of Experimental Psychology, 74, 187-198. doi:10.1177/1747021820950239.

    Abstract

    We investigated the “proximate unit” in Korean, that is, the initial phonological unit selected in speech production by Korean speakers. Previous studies have shown mixed evidence indicating either a phoneme-sized or a syllable-sized unit. We conducted two experiments in which participants named pictures while ignoring superimposed non-words. In English, for this task, when the picture (e.g., dog) and distractor phonology (e.g., dark) initially overlap, typically the picture target is named faster. We used a range of conditions (in Korean) varying from onset overlap to syllabic overlap, and the results indicated an important role for the syllable, but not the phoneme. We suggest that the basic unit used in phonological encoding in Korean is different from Germanic languages such as English and Dutch and also from Japanese and possibly also Chinese. Models dealing with the architecture of language production can use these results when providing a framework suitable for all languages in the world, including Korean.
  • Wongratwanich, P., Shimabukuro, K., Konishi, M., Nagasaki, T., Ohtsuka, M., Suei, Y., Nakamoto, T., Verdonschot, R. G., Kanesaki, T., Sutthiprapaporn, P., & Kakimoto, N. (2021). Do various imaging modalities provide potential early detection and diagnosis of medication-related osteonecrosis of the jaw? A review. Dentomaxillofacial Radiology, 50: 20200417. doi:10.1259/dmfr.20200417.

    Abstract


    Objective: Patients with medication-related osteonecrosis of the jaw (MRONJ) often visit their dentists at advanced stages and subsequently require treatments that greatly affect quality of life. Currently, no clear diagnostic criteria exist to assess MRONJ, and the definitive diagnosis solely relies on clinical bone exposure. This ambiguity leads to a diagnostic delay, complications, and unnecessary burden. This article aims to identify imaging modalities' usage and findings of MRONJ to provide possible approaches for early detection.

    Methods: Literature searches were conducted using PubMed, Web of Science, Scopus, and Cochrane Library to review all diagnostic imaging modalities for MRONJ.

    Results: Panoramic radiography offers a fundamental understanding of the lesions. Imaging findings were comparable between non-exposed and exposed MRONJ, showing osteolysis, osteosclerosis, and thickened lamina dura. Mandibular cortex index Class II could be a potential early MRONJ indicator. While three-dimensional modalities, CT and CBCT, were able to show more features unique to MRONJ such as a solid type periosteal reaction, buccal predominance of cortical perforation, and bone-within-bone appearance. MRI signal intensities of vital bones are hypointense on T1WI and hyperintense on T2WI and STIR when necrotic bone shows hypointensity on all T1WI, T2WI, and STIR. Functional imaging is the most sensitive method but is usually performed in metastasis detection rather than being a diagnostic tool for early MRONJ.

    Conclusion: Currently, MRONJ-specific imaging features cannot be firmly established. However, the current data are valuable as it may lead to a more efficient diagnostic procedure along with a more suitable selection of imaging modalities.
  • Yoshihara, M., Nakayama, M., Verdonschot, R. G., Hino, Y., & Lupker, S. J. (2021). Orthographic properties of distractors do influence phonological Stroop effects: Evidence from Japanese Romaji distractors. Memory & Cognition, 49(3), 600-612. doi:10.3758/s13421-020-01103-8.

    Abstract

    In attempting to understand mental processes, it is important to use a task that appropriately reflects the underlying processes being investigated. Recently, Verdonschot and Kinoshita (Memory & Cognition, 46,410-425, 2018) proposed that a variant of the Stroop task-the "phonological Stroop task"-might be a suitable tool for investigating speech production. The major advantage of this task is that the task is apparently not affected by the orthographic properties of the stimuli, unlike other, commonly used, tasks (e.g., associative-cuing and word-reading tasks). The viability of this proposal was examined in the present experiments by manipulating the script types of Japanese distractors. For Romaji distractors (e.g., "kushi"), color-naming responses were faster when the initial phoneme was shared between the color name and the distractor than when the initial phonemes were different, thereby showing a phoneme-based phonological Stroop effect (Experiment1). In contrast, no such effect was observed when the same distractors were presented in Katakana (e.g., "< ") pound, replicating Verdonschot and Kinoshita's original results (Experiment2). A phoneme-based effect was again found when the Katakana distractors used in Verdonschot and Kinoshita's original study were transcribed and presented in Romaji (Experiment3). Because the observation of a phonemic effectdirectly depended on the orthographic properties of the distractor stimuli, we conclude that the phonological Stroop task is also susceptible to orthographic influences.
  • Kajihara, T., Verdonschot, R. G., Sparks, J., & Stewart, L. (2013). Action-perception coupling in violinists. Frontiers in Human Neuroscience, 7: 349. doi:10.3389/fnhum.2013.00349.

    Abstract

    The current study investigates auditory-motor coupling in musically trained participants using a Stroop-type task that required the execution of simple finger sequences according to aurally presented number sequences (e.g., "2," " 4," "5," "3," "1"). Digital remastering was used to manipulate the pitch contour of the number sequences such that they were either congruent or incongruent with respect to the resulting action sequence. Conservatoire-level violinists showed a strong effect of congruency manipulation (increased response time for incongruent vs. congruent trials), in comparison to a control group of non-musicians. In Experiment 2, this paradigm was used to determine whether pedagogical background would influence this effect in a group of young violinists. Suzuki-trained violinists differed significantly from those with no musical background, while traditionally-trained violinists did not. The findings extend previous research in this area by demonstrating that obligatory audio-motor coupling is directly related to a musicians' expertise on their instrument of study and is influenced by pedagogy.
  • Starreveld, P. A., La Heij, W., & Verdonschot, R. G. (2013). Time course analysis of the effects of distractor frequency and categorical relatedness in picture naming: An evaluation of the response exclusion account. Language and Cognitive Processes, 28(5), 633-654. doi:10.1080/01690965.2011.608026.

    Abstract

    The response exclusion account (REA), advanced by Mahon and colleagues, localises the distractor frequency effect and the semantic interference effect in picture naming at the level of the response output buffer. We derive four predictions from the REA: (1) the size of the distractor frequency effect should be identical to the frequency effect obtained when distractor words are read aloud, (2) the distractor frequency effect should not change in size when stimulus-onset asynchrony (SOA) is manipulated, (3) the interference effect induced by a distractor word (as measured from a nonword control distractor) should increase in size with increasing SOA, and (4) the word frequency effect and the semantic interference effect should be additive. The results of the picture-naming task in Experiment 1 and the word-reading task in Experiment 2 refute all four predictions. We discuss a tentative account of the findings obtained within a traditional selection-by-competition model in which both context effects are localised at the level of lexical selection.
  • Stewart, L., Verdonschot, R. G., Nasralla, P., & Lanipekun, J. (2013). Action–perception coupling in pianists: Learned mappings or spatial musical association of response codes (SMARC) effect? Quarterly Journal of Experimental Psychology, 66(1), 37-50. doi:10.1080/17470218.2012.687385.

    Abstract

    The principle of common coding suggests that a joint representation is formed when actions are repeatedly paired with a specific perceptual event. Musicians are occupationally specialized with regard to the coupling between actions and their auditory effects. In the present study, we employed a novel paradigm to demonstrate automatic action–effect associations in pianists. Pianists and nonmusicians pressed keys according to aurally presented number sequences. Numbers were presented at pitches that were neutral, congruent, or incongruent with respect to pitches that would normally be produced by such actions. Response time differences were seen between congruent and incongruent sequences in pianists alone. A second experiment was conducted to determine whether these effects could be attributed to the existence of previously documented spatial/pitch compatibility effects. In a “stretched” version of the task, the pitch distance over which the numbers were presented was enlarged to a range that could not be produced by the hand span used in Experiment 1. The finding of a larger response time difference between congruent and incongruent trials in the original, standard, version compared with the stretched version, in pianists, but not in nonmusicians, indicates that the effects obtained are, at least partially, attributable to learned action effects.
  • Verdonschot, R. G., La Heij, W., Tamaoka, K., Kiyama, S., You, W.-P., & Schiller, N. O. (2013). The multiple pronunciations of Japanese kanji: A masked priming investigation. Quarterly Journal of Experimental Psychology, 66(10), 2023-2038. doi:10.1080/17470218.2013.773050.

    Abstract

    English words with an inconsistent grapheme-to-phoneme conversion or with more than one pronunciation (homographic heterophones; e.g., lead-/l epsilon d/, /lid/) are read aloud more slowly than matched controls, presumably due to competition processes. In Japanese kanji, the majority of the characters have multiple readings for the same orthographic unit: the native Japanese reading (KUN) and the derived Chinese reading (ON). This leads to the question of whether reading these characters also shows processing costs. Studies examining this issue have provided mixed evidence. The current study addressed the question of whether processing of these kanji characters leads to the simultaneous activation of their KUN and ON reading, This was measured in a direct way in a masked priming paradigm. In addition, we assessed whether the relative frequencies of the KUN and ON pronunciations (dominance ratio, measured in compound words) affect the amount of priming. The results of two experiments showed that: (a) a single kanji, presented as a masked prime, facilitates the reading of the (katakana transcriptions of) their KUN and ON pronunciations; however, (b) this was most consistently found when the dominance ratio was around 50% (no strong dominance towards either pronunciation) and when the dominance was towards the ON reading (high-ON group). When the dominance was towards the KUN reading (high-KUN group), no significant priming for the ON reading was observed. Implications for models of kanji processing are discussed.
  • Verdonschot, R. G., Nakayama, M., Zhang, Q., Tamaoka, K., & Schiller, N. O. (2013). The proximate phonological unit of Chinese-English bilinguals: Proficiency matters. PLoS One, 8(4): e61454. doi:10.1371/journal.pone.0061454.

    Abstract

    An essential step to create phonology according to the language production model by Levelt, Roelofs and Meyer is to assemble phonemes into a metrical frame. However, recently, it has been proposed that different languages may rely on different grain sizes of phonological units to construct phonology. For instance, it has been proposed that, instead of phonemes, Mandarin Chinese uses syllables and Japanese uses moras to fill the metrical frame. In this study, we used a masked priming-naming task to investigate how bilinguals assemble their phonology for each language when the two languages differ in grain size. Highly proficient Mandarin Chinese-English bilinguals showed a significant masked onset priming effect in English (L2), and a significant masked syllabic priming effect in Mandarin Chinese (L1). These results suggest that their proximate unit is phonemic in L2 (English), and that bilinguals may use different phonological units depending on the language that is being processed. Additionally, under some conditions, a significant sub-syllabic priming effect was observed even in Mandarin Chinese, which indicates that L2 phonology exerts influences on L1 target processing as a consequence of having a good command of English.

    Additional information

    English stimuli Chinese stimuli

Share this page