EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 2000, 12 (3), 413± 430

The DRC model of visual word recognition and reading aloud: An extension to German Johannes C. Ziegler Laboratoire de Psychologie Cognitive, CNRS, Aix-en-Provence, France, and Macquarie Centre for Cognitive Science, Macquarie University, Sydney, Australia

Conrad Perry and Max Coltheart Macquarie Centre for Cognitive Science, Macquarie University, Sydney, Australia

The dual-route cascaded (DRC) model of word recognition and reading aloud was implemented in German. In this paper, we describe crosslinguistic diŒerences and similarities between the German and the English DRC. The German DRC was evaluated with respect to its ability to correctly pronounce all German monosyllabic words and to simulate the loan word (regularity) eŒect. Furthermore, we obtained DRC predictions concerning a number of benchmark eŒects previously investigated in English, namely eŒects of word frequency, word length, and neighbourhood size. A person who has not studied German can form no idea of what a perplexing language it is. Surely there is not another language that is so slip-shod and systemless, and so slippery and elusive to the grasp. Ð Mark Twain, 1880/1979

Requests for reprints should be addressed to J. Ziegler, Laboratoire de Psychologie Cognitive, CNRS and Universite de Provence, 29, av. R. Schuman, F-13621 Aix-en-Provence, France. Email: [email protected] This research was supported by a special investigator grant to M. Coltheart. J. Ziegler was supported by an ARC postdoctoral fellowship. We wish to thank Chris Herdman, Jonathan Grainger, Ken Paap, and Marco Zorzi for helpful comments on an earlier version of this paper.

€ 2000 Psychology Press Ltd http://www.tandf.co.uk/journals/pp/09541446.html

414

ZIEGLER, PERRY, COLTHEART

More than a decade ago, Eckart Scheerer (1987) concluded his review on visual word recognition in German by stating that ``if certain amendments are made to it, dual route theory (Coltheart, 1978) is a viable candidate for an orthography-invariant model of visual word processing’ ’ (p. 242). Since this time, computational modelling has become a major tool for developing and testing theories in word recognition (e.g., Grainger & Jacobs, 1993, 1994, 1996; Jacobs & Grainger, 1992, 1994; Jacobs, Rey, Ziegler, & Grainger, 1998; Norris, 1994; Plaut, McClelland, Seidenberg, & Patterson, 1996; Seidenberg & McClelland, 1989; Ziegler, Rey, & Jacobs, 1998; Zorzi, Houghton, & Butterworth, 1998). In the spirit of this movement towards computational models of reading, the previously mentioned dual-route theory was implemented in English in the form of a localist connectionist model, the dual route cascaded model (DRC; Coltheart, Curtis, Atkins, & Haller, 1993). Since then, the DRC stood its test against a variety of empirical data (Coltheart & Rastle, 1994; Rastle & Coltheart, 1998, 1999a, 1999b). After recent strong inference testing against a number of competitive models, the DRC seems to provide a better quantitative account of reading performance than its competitors (Coltheart, Rastle, Perry, Langdon, & Ziegler, submitted). However, up to now, the DRC has not been implemented in any other language than English. Thus, Scheerer’ s intuition that the DRC (as an implementation of dual-route theory) would be suitable as an orthography-invariant model of word recognition has never been explicitly tested. Extending the DRC to German was therefore the main focus of the present research. To anticipate some of the conclusions, the DRC could be easily extended to German using the same parameter values as its English counterpart. DiŒerences in linguistic structure between German and English were re¯ ected in the nonlexical rule system. Orthography-to-phonology relations in German could be captured by a fairly general rule system. The implemented version of the model makes quantitative predictions that should motivate future research in psycholinguistics and neuropsychology in German. The present article is organised as follows. First, we brie¯ y describe some relevant features of German orthography and phonology. Second, we summarise the architecture and basic mechanisms of the English DRC. Third, we describe the implementation of the German DRC and some structural diŒerences between the German and the English DRC. Fourth, we test the model’ s ability to correctly translate all German monosyllabic words from orthography to phonology and to simulate the loan word (regularity) eŒect. Finally, we present predictions for some benchmark eŒects in reading aloud, namely word frequency, length, and neighbourhood size eŒects.

THE DRC MODEL

415

GERMAN ORTHOGRAPHY AND PHONOLOGY As with English, the German writing system is considered to be phonologically deep because it adheres to the principle of morpheme constancy (see Scheerer, 1987; for a comprehensive analysis of German phonology see Wiese, 1996). Morpheme constancy means that the spelling of morphemes is kept constant in spite of context-induced changes in their pronunciation (e.g., HEAL vs. HEALTH). As summarised in Scheerer (1987), German has a complicated set of phonographic regularities that can be exploited during reading. For example, vowel length is marked either by putting an ``h’ ’ after a vowel or by vowel gemination. For example, the vowel is long in both JAHR (year) and HAAR (hair). Lengthening by ``h’ ’ is more frequent than gemination but occurs only in front of liquid consonants. Vowel length is never marked in front of b, d, f, and g. For example, the German word TAG (day) has a long vowel but this is not marked in orthography neither by gemination nor by inserting the letter ``h’ ’ after the vowel. Vowels are typically shortened by geminating the following consonants. For example, the vowel in the German word BALL (ball) is short because it is followed by two consonants. Some consonants are never geminated, however (c, h, j, v, w, x). With respect to the overall degree of regularity, German is said to have highly regular orthography-to-phonology correspondences. Irregularity seems to come mainly from loan words, proper names, and geographical terms. For any alphabetically written language, the degree of irregularity of that language can be measured once a set of grapheme± phoneme correspondence rules (GPCs) is speci® ed for the language. The degree of irregularity is then de® ned as the percentage of words for which the rule pronunciation disagrees with the lexical pronunciation. THE ENGLISH DRC The architecture of the English DRC (Coltheart et al., 1993, submitted) is depicted in Figure 1. It can be broken into two main parts, the lexical and the nonlexical route. In the lexical route, the orthographic levels (i.e., visual feature detectors, letter units, orthographic lexicon) are almost identical to those in the interactive activation model of McClelland and Rumelhart (1981). Each unit in the feature level represents one of a letter’ s features, each unit in the letter level represents one letter of the alphabet, and each unit in the word level represents one word in the lexicon. The interactive activation model implements two major principles: The ® rst principle is interactive

Figure 1. Overall architecture of the DRC.

416

THE DRC MODEL

417

activationÐ that is, activation spreads along bidirectional connections between units belonging to diŒerent levels. Compatible units have excitatory connections; they mutually activate one another. Incompatible units have inhibitory connections; they mutually inhibit. The second principle is within-level inhibitionÐ that is, units within the same level mutually inhibit one another through inhibitory lateral connections. The phonological levels (phonological output lexicon, phoneme system) are equivalent to the orthographic levels, except that instead of letters, they contain phonemes, and instead of whole-word orthographic units they contain whole-word phonological units. There are only excitatory connections between the orthographic and phonological lexicon. That is, a given word in the orthographic lexicon does not inhibit all incompatible words in the phonological lexicon and vice versa. Note, however, that lateral inhibition within each level does a very similar job. The nonlexical route deals with assembling phonology. This route operates as a serial mechanism that converts individual letters or letter groups into single phonemes via a grapheme± phoneme conversion procedure (the GPC rules). Letter information becomes progressively available to the nonlexical route. At the ® rst cycle, no letters are available. After a constant number of cycles (10 in the simulations reported here), the ® rst letter is assembled into a phoneme. After this, every 17 cycles another letter becomes available to the system. It has been shown that changing this parameter produces changes in the speed of the nonlexical route and thus in the impact of assembled phonology on lexical processing (see Rastle & Coltheart, 1999b). The dynamics which govern the activation ¯ ow of the model and more speci® c architectural details are explained in greater detail in a number of other papers (Coltheart et al., 1993, submitted; Coltheart & Rastle, 1994; Rastle & Coltheart, 1998, 1999a, b).

IMPLEMENTATION OF THE GERMAN DRC The ® rst step in implementing the German DRC was to change DRC’s vocabulary from English to German. This involved changing the content of the orthographic input and phonological output lexicon as well as the phoneme and letter units. As the model’ s vocabulary, we used all monosyllabic and monomorphemic German words given in the CELEX database (Baayen, Piepenbrock, & van Rijn, 1993). Loan words that were not in the German dictionary Duden (1982) as well as proper names and geographical terms were excluded. This selection resulted in 1448 word units in the orthographic lexicon and 1408 word units in the phonological

418

ZIEGLER, PERRY, COLTHEART

output lexicon.1 The 1448 orthographic word forms ranged from two to eight letters with an average word length of 4.5 letters. The phonological forms ranged from two to six phonemes with an average of 3.7 phonemes. Although monosyllabic words are rare in some European languages (e.g., Spanish, Italian), they are fairly common in German. Monosyllabic words in German account for 53.8 per cent of all word occurrences. Thus, they provide a fairly representative sample of all German words. Word frequency in the model aŒects the rise of activation of each word unit. For the word units in the orthographic lexicon, this rise-of-activation was calculated on the basis of printed frequencies given in the CELEX database, for the word units in the phonological lexicon, spoken word frequencies from the same database were used for this computation. Frequencies of homographs were summed in the orthographic lexicon, frequencies of homophones were summed in the phonological output lexicon. Other than that, no changes in lexical processes were made compared to the English DRC. We expected no problems for crosslinguistic generalization of the DRC’s lexical processes because these processes are mainly governed by the dynamics of interactive activation which have been shown to produce stable eŒects in languages other than English, such as Spanish or French (e.g., Carreiras, Perea, & Grainger, 1997; Ziegler et al., 1998). However, the major question was whether a rule set could be obtained that would be able to assemble phonology by converting graphemes into phonemes. GERMAN GPC RULES In the original DRC, there are three major types of rules: single-letter, multiletter, and context-sensitive rules (in the present paper, we shall ignore phonotactic output rules). These three types of rules are graphemeto-phoneme rules, i.e., none of the rules is bigger than the individual grapheme or phoneme. All rules are position speci® c, i.e., they apply to the beginning, the middle, or the end of the string. As for English, the German rule set was generated in the following way: (1) An appropriate database was selected that included the orthographic and phonological form of all monosyllabic words. As for the English model, CELEX was 1

The number of whole-word phonological units is smaller than the number of whole-word orthographic units because of heterographic homophones that can have more than one orthographic representation for a single phonological representation. The inverse case, in which a single orthographic representation has more than one phonological representation (heterophonic homographs), is much rarer.

THE DRC MODEL

419

used. (2) All words were segmented into their individual phonemes and their corresponding graphemes. (3) For each grapheme in a given position, the corresponding grapheme± phoneme correspondence was extracted semi-automatically. In case a single grapheme mapped onto more than one phoneme, the most frequently occurring grapheme± phoneme correspondence was used. All rules are listed in Appendix A. Single-letter rules Single letter rules apply when a single letter maps into a single phoneme. There are 45 single letter rules in German and 39 in English. In fact, all single letters of the alphabet are also graphemes, i.e., they map onto one phoneme. In German, most of the one-letter graphemes map consistently into one and only one phoneme. However, there are exceptions. For example, the letter ``b’ ’ maps into /b/ or /p/ depending on where in the string it occurs. The same is true for ``d’ ’ , ``g’ ’, and ``y’ ’. By allowing rules to be position-speci® c, this kind of irregularity can be eliminated. The general rule behind this disambiguation is that voiced consonants at the end of a word become unvoiced. However, there are a few cases of true irregularities at this level that cannot be eliminated by position speci® city. The most prominent is the grapheme ``v’ ’ that sometimes maps into /v/ and sometimes into /f/. Because the v ± /f/ mapping is the most frequent, it is de® ned as the rule in the German DRC. Accordingly, words like VERS (/f rs/) are regular while words like VERB (/v rp/) are irregular. Multiletter-rules There are only 48 multiletter rules in German as compared to 146 in English. More strikingly, the number of multiletter rules in German can further be reduced to a few very general rules (super rules). For example, the German DRC has currently separate rules for vowels that are lengthened by putting an ``h’ ’ after the vowel (i.e., eh, ah, ih, oh, uh). However, in principle, all these individual rules could be replaced by a single super rule, which indicates that vowels are lengthened when followed by ``h’ ’. Similarly, using separate rules for vowel gemination (i.e., aa, ee, oo) overestimates the number of multiletter rules that are really needed to do grapheme-to-phoneme conversion in German. In sum, one of the major diŒerences between German and English seems to lie in the fact that the nonlexical computation of phonology in English requires more and bigger multiletter rules than German. We shall come back to the implications of this ® nding in the discussion.

420

ZIEGLER, PERRY, COLTHEART

Context-sensitive rules Context-sensitive rules are needed when preceding or following letters consistently determine the pronunciation of a particular grapheme. For example, ``s’ ’ in the beginning of a word is mainly pronounced /s/. However, when followed by a vowel, it is pronounced /z/ and when followed by the plosives ``p’ ’ or ``t’’, it is pronounced / /. Such regularities are accounted for by adding context-sensitive rules, such as s[V] ± /z/ and s[p, t] ± / /, where [V] stands for ``when followed by any vowel’’ and s[p, t] stands for ``when followed by the letters ``p’ ’ or ``t’’. German has 38 context-sensitive rules, whereas English has only 14. Thus, on the ® rst glance, German has more than twice as many contextsensitive rules as English. However, a closer look shows that a large number of the context-sensitive rules could again be replaced by a few super rules. For example, a vowel followed by two or more consonants tends to be short (e.g., V[C][C] ± short V), a vowel followed by one consonant tends to be long (e.g., V[C] ± long V). This general rule re¯ ects a tendency in languages towards isochrony, that is, the attempt to keep the duration of the phonological rime of the syllable constant. Thus, when a vowel is followed by several consonants in spoken language, it is shortened to keep the overall syllable duration constant (for a discussion see Lass, 1984, chap. 10). This general rule includes consonant gemination as a special case, that is, when a vowel is short in spoken language but only followed by a single consonant, the consonant is geminated in written language to indicate the shortening of the vowel. If we count this general rule as a single rule, the number of context-sensitive rules in German would be reduced to six. MODEL EVALUATION The German DRC was tested in two ways. First, we investigated the model’ s capacity to pronounce all German monosyllabic words. This data allowed us to evaluate some general aspects of the model’ s behaviour, such as its overall error rate, the percentage of correct rule applications, and the number of rule violations (i.e., the number of irregular words). Second, we tested the model’ s ability to capture one of the major naming eŒects that has previously been investigated in German: the regularity or loan word eŒect. Overall error rate The model was presented with all 1448 monosyllabic words; it mispronounced only 16 of them (1.10 per cent). All errors were actually errors

THE DRC MODEL

421

to loan words, such as mispronunciations of words like JOB, CHIC, and CHARME. As a comparison, the overall error rate of the English DRC for all monosyllabic English words was 1.17 per cent. This percentage includes errors to nonhomophonic homographs for which at least one pronunciation must, by de® nition, be wrong. The remaining errors were almost always on low-frequency words with exceptional correspondences in early letter positions (see Rastle & Coltheart, 1999b). How often are rules correct? To evaluate the e ciency of the nonlexical route, it is useful to compare German and English with respect to the percentage of correct rule applications. This information is presented in Table 1. These statistics were generated by comparing the pronunciation generated from the model’ s nonlexical route (via the set of GPC rules) with the pronunciation given in the CELEX database. If the two pronunciations were the same, all of the rules used to generate the word pronunciation were considered correct; if the two pronunciations were diŒerent, all of the rules used to generate the word pronunciation were considered incorrect. The percentage of correct rule applications also gives a ® rst index about how regular German is compared to English. The more rules are correctly applied, the more regular the orthography ± phonology mapping. As can be seen in Table 1, German single-letter rules were used correctly about 88 per cent of the time, whereas English single-letter rules were used correctly about 80 per cent of the times. The biggest diŒerence between German and English was with respect to multiletter rules. Not only does German have fewer multiletter rules than English, but the German rules are also more often correct than their English counterparts TABLE 1 Total number of German and English rules of each type and percentage of correct rule applications (% correct)

Rule type Single-letter German English Multiletter German English Context-sensitive German English

No. rules

% Correct

45 39

88.5 81.8

48 146

95.2 80.6

38 14

87.5 75.5

422

ZIEGLER, PERRY, COLTHEART

(95 per cent versus 81 per cent for German and English, respectively). A closer comparison between German and English multiletter rules also shows that German has virtually no graphemes larger than two letters while English has many. Thus, it seems that German aŒords fewer, smaller, and more reliable multiletter rules. Finally, German has more context-sensitive rules than English. Again, they were correctly applied more often than their English counterparts. Averaged across the three rule types, German rules were correctly applied 90.4 per cent of the time, whereas English rules were correctly applied only 79.3 per cent of the time. How many words are irregular in German? Having speci® ed the German GPC rules (see Appendix A), we can now give a rule-set-de® ned answer as to how many monosyllabic words are irregular in German. This was done in a simulation in which the German DRC’ s lexical route was switched oŒ. Thus, words were read by the nonlexical route only. Any word that is named incorrectly by the nonlexical route is, by de® nition, irregular; it violates the GPC rules. From the 1448 words that were submitted to the German DRC with its lexical route switched oŒ, 150 were read incorrectly by the nonlexical route. Therefore, irregularity in the mapping between orthography and phonology in German is 10.3 per cent according to this rule set. Interestingly, there were a number of systematic subtypes of irregularity. The largest type of irregularity (53 per cent) was due to subtle diŒerences in vowel length. For example, the O in LOCH (hole) is short while it is long in HOCH (high). HOCH is irregular because it violates the rule that vowels followed by two consonants should be short. This group also includes a number of highly frequent function words. For example, the function words BIS (until) or MIT (with) should be pronounced with a long vowel according to rule but they are pronounced with a short vowel. The second largest type of irregularity (44 per cent) came from irregular loan words. Most of them are listed in Appendix B. They include words like JAZZ, JOB, FAIR, CREME and so on. Finally, a fairly small percentage of words (3 per cent) did not fall in any of these categories (e.g., VOLT, ZIG). As a comparison, the English DRC mispronounced 17.94 per cent of all English monosyllabic words when reading was purely by the model’ s nonlexical route. In summary, contrary to common belief, the present analysis shows a considerable degree of irregularity in German orthography-to-phonology mapping. This is mainly due to intrusions of loan words into the German vocabulary and to unpredictable changes in vowel length (see earlier example). Irregularity in vowel length could be considered as subtle

THE DRC MODEL

423

irregularity because only a single phonetic feature (i.e., vowel length) is aŒected by this type of irregularity. It remains an empirical question whether this type of irregularity has the same eŒect as irregularity in English. The loan word effect Unlike English, for which a large empirical database concerning a variety of word recognition eŒects is available, the existing data on German word naming is rather meagre. However, one eŒect, the loan word eŒect, has previously been investigated in a naming task in German (Scheerer, 1987). Given that many loan words are irregular, the loan word eŒect is the German equivalent of the regularity eŒect. The regularity eŒect re¯ ects the ® nding that low-frequency words with regular GPCs are named faster than those with irregular GPCs (e.g., Andrews, 1982; Baron & Strawson, 1976). The regularity eŒect has played a critical role in the development of word naming models in English (e.g., Coltheart et al., 1993; Plaut et al., 1996). To test for the existence of regularity eŒects in German, Scheerer (1987) selected three groups of loan words: (1) regular loan words that did not contain any violation of German orthography (e.g., FILM, TEST); (2) regular loan words that violated German orthography; they contained at least one bigram that does not occur among native words (e.g., SKAT, TYP); (3) loan words with irregular GPCs2 (e.g., CHEF, TEAM). As shown in Table 2, Scheerer (1987) found that regular loan words (group 1) were not signi® cantly diŒerent from native German words. However, regular loan words with orthographic violations (group 2) produced signi® cantly longer latencies than native words. Finally, loan words with irregular GPCs (group 3) produced longer latencies and more naming errors (19.5 per cent) than any of the other groups. The German DRC was tested in its ability to capture this eŒect while using the original parameter values of the English DRC. For this purpose, we selected among all German monosyllabic words those loan words that fall into the three categories described above. This resulted in a selection of 51 words in group 1, 35 words in group 2, and 66 words in group 3. These words are listed in Appendix B. All words were of fairly low frequency and there was no signi® cant diŒerences between the three groups in terms of word frequency, F(2, 149) = .12, p .10. In

2 Scheerer de® ned loan words as being GPC irregular when their pronunciation was listed in the Duden dictionary. In the Duden, pronunciations are given only for those words that would be incorrectly pronounced according to German rule.

424

ZIEGLER, PERRY, COLTHEART

TABLE 2 The loan word effect in German. Mean latencies (in ms), errors (in %), and number of cycles for native words and three classes of loan words. Human data taken from Scheerer (1 9 8 7 , Table 2 )

Human data Ð Ð Ð Ð Ð Ð Ð Ð Ð RTs Errors

Word classes Native words Loan words Reg GPC/Reg Orth (Test, Film) Reg GPC/Irreg Orth (Skat, Typ) Irreg GPCs (Team, Chef)

Ð Ð Ð Ð Cycles

DRC Ð Ð Ð Ð Ð Errors

573

0.3

80.3

0.0

566 591 634

0.9 1.6 19.5

81.3 85.6 96.4

0.0 0.0 24.2

Reg = Regular; Irreg = Irregular; Orth = Orthography.

addition, a group of 45 native words was selected that matched these loan words with respect to word frequency and word length. The simulation results are presented in Table 2. The model accurately predicts the pattern obtained by Scheerer (1987). Planned comparisons showed that, in the model, no diŒerence was obtained between regular loan words (group 1) and native German words, F(1, 95) = .14, p .10. As in the human data, however, regular loan words with irregular orthography (group 2) and irregular loan words (group 3) diŒered signi® cantly from native words, F(1, 79) = 3.4, p .05 and F(1, 110) = 42.3, p .0001, respectively. Also, the model accurately predicted the high error rate for loan words with irregular GPCs as observed in the human data. Together, it seems that the model does a good job in simulating the loan word eŒect. SOME PREDICTIONS FOR FUTURE RESEARCH As mentioned previously, little research has been done in German with respect to some of the classic eŒects of word naming in English (e.g., word frequency, word length, or neighbourhood size eŒects). This makes it impossible to fully test the model on these variables. However, it may be useful to present the model’ s predictions concerning these eŒects in an attempt to motivate future research in German. Predictions for these eŒects are presented in Table 3. As can be seen in Table 3, when using four frequency classes (low, logF 1; medium, 1 logF 2; high, 2 logF 3; very high, 3 logF 4), the model predicts a graded, log-linear, frequency eŒect for German

THE DRC MODEL

425

TABLE 3 DRC latencies (in cycles) and standard errors (in parentheses) for a number of benchmark effects previously investigated in English, namely effects of word frequency (F), length, and neighbourhood size (N)

F-eŒect Ð Ð Ð Ð Ð Ð Ð Ð Ð Ð Groups DRC Ð

Low Medium High Very high

84.2 81.9 77.5 72.8

(.34) (.28) (.24) (.36)

Ð

Length-eŒect Ð Ð Ð Ð Ð Ð Ð Ð Groups DRC three four ® ve six

76.6 79.4 81.3 85.0

(.94) (.18) (.25) (.51)

Ð

Ð

Ð

Ð Ð Ð Groups

Ð

N-eŒect Ð Ð Ð Ð Ð Ð Ð DRC

low F± small N low F± large N high F± small N high F± large N

85.0 81.5 77.1 74.7

(.46) (.69) (.38) (.38)

naming. Such eŒects have previously been obtained in English (e.g., Forster & Chambers, 1973; Monsell, 1991). The DRC also predicts an inhibitory length eŒect for German. Naming latencies increase in a seemingly linear fashion with increasing word length. In English, length eŒects have been reported for naming nonwords and low-frequency words. No length eŒects were found for high-frequency words (Weekes, 1997). The German DRC predicts a facilitatory neighbourhood size eŒect3 for both high- and low-frequency words. Facilitatory neighbourhood size eŒects in naming have been consistently reported in English, although they seem to be restricted to low-frequency words (for a review, see Andrews, 1997). Finally, as shown previously, the German DRC predicts a regularity eŒect in German. Future research should investigate whether normal readers would show regularity costs for what we have called subtle irregularities, such as unpredictable vowel prolongations in minimal pairs (e.g., HOCH/DOCH, DACH/NACH, BART/HART). GENERAL DISCUSSION In the present paper, we have generalised the DRC to German using the same parameter values as the English DRC. The model was evaluated for its ability to correctly read all monosyllabic German words and to simulate the loan word eŒect of Scheerer (1987). The model obtained satisfactory performance on both tests. These results support Scheerer’ s

3 Neighbourhood size in German varies between 0 and 15 neighbours with a mean of 2.8 neighbours per word. Accordingly, in the simulations, words were de® ned low N if they had less than two neighbours and high N if they had more than three neighbours.

426

ZIEGLER, PERRY, COLTHEART

(1987) claim that the ``dual route theory is a viable candidate for an orthography-invariant model of visual word processing’ ’ (p. 242). One interesting aspect of the present model is its potential to motivate new research by providing a tool which allows researchers to anticipate the model’ s quantitative and qualitative predictions before running an experiment. A number of predictions for some benchmark eŒects of word recognition were provided. In summary, according to the model, naming latencies in German should exhibit a facilitatory frequency eŒect, an inhibitory word length eŒect, and a facilitatory neighbourhood size eŒect. There are a number of challenging questions with respect to the present implementation. One is whether the German and English versions of the model are sensitive to crosslinguistic diŒerences despite the fact that the overall architecture and dynamics are identical. One diŒerence between the German and the English DRC is the structure of the nonlexical route. German seems to require fewer rules than English and many of the individual rules seem to be reducible to a relatively small number of fairly general super rules. More speci® cally, German has fewer and smaller multiletter rules than English. This makes the interesting prediction that the grain size of the nonlexical computation is smaller in German than English. Thus, German readers can successfully use smaller chunks to compute phonology than English readers. One consequence may be that German readers exhibit a stronger length eŒect than English readers. In turn, if Germans use smaller chunks, it could be expected that eŒects of bigger grain sizes (e.g., body-rime eŒects) are weaker in German than in English. Furthermore, our rule statistics support the possibility that German readers not only use smaller chunks than English readers but that these smaller chunks are more reliable (i.e. less irregular). Irregularity in German was about 10 per cent as compared to 18 per cent in English. For example, the English words ball, cat, and hand all have diŒerent pronunciations of the vowel ``a’ ’ while their German equivalents Ball, Katze, and Hand have the same vowel pronunciation. The obvious prediction is that if German readers use smaller chunks to compute phonology and if these smaller chunks are more reliable (i.e., regular), then nonword reading or learning-to-read should be easier in German than in English. In fact, a recent crosslinguistic study has shown that word and nonword reading in children was faster and less error-prone in German than in English (Landerl, Wimmer, & Frith, 1997). A more fundamental question is whether the DRC should be generalised to German at all. Coltheart et al.’ s (1993) major justi® cation for a dual-route architecture in English was the fact that neither a lexical nor a nonlexical system alone seemed su cient to explain both nonword and exception word reading, as well as the double dissociation between

THE DRC MODEL

427

surface and phonological dyslexia. However, if a major part of German words (about 95 per cent if loan words are excluded) can be pronounced correctly using a limited set of general rules, one might argue that there is little need for a separate lexical system. The advantage of a simulation approach is that such a possibility can directly be tested by comparing the original model with a version of the model in which the lexical route is ``switched oŒ’’ (the relevant parameters are set to zero). Similarly, one could argue that because German has a small set of general rules that reliably predict pronunciations, the nonlexical route may operate more quickly for the German rule set than is the case for the English rule set. Again, this possibility can directly be tested by modifying the speed of the nonlexical route.4 To summarise, the present paper showed that the DRC could be implemented for the German language. The model successfully simulates the loan word eŒect that can be seen as the equivalent of the regularity eŒect in English. More empirical data are now needed to explore whether the German DRC can capture word recognition and reading in as much detail as its English counterpart. Furthermore, the implemented DRC should provide a useful tool for investigating potential diŒerences or similarities between German and English, such as the importance of a lexical system or the relative speed of nonlexical and lexical routes. Manuscript received February 1999 Revised manuscript received October 1999

REFERENCES Andrews, S. (1982). Phonological recoding: Is the regularity eŒect consistent? Memory and Cognition, 10, 565± 575. Andrews, S. (1997). The eŒect of orthographic similarity on lexical retrieval: Resolving neighborhood con¯ icts. Psychonomic Bulletin and Review, 4, 439± 461. Baayen, R.H., Piepenbrock, R., & van Rijn, H. (1993). The CELEX lexical database CDROM. Philadelphia: Linguistic Data Consortium, University of Pennsylvania. Baron, J., & Strawson, C. (1976). Use of orthographic and word-speci® c knowledge in reading words aloud. Journal of Experimental Psychology: Human Perception and Performance, 2, 383± 393. Carreiras, M., Perea, M., & Grainger, J. (1997). Orthographic neighborhood eŒects on visual word recognition in Spanish: Cross-task comparisons. Journal of Experimental Psychology: Learning, Memory, and Cognition, 23, 857± 871.

4 Keep in mind that the serial route operates from left to right, processing one letter at a time, with a ® xed number of cycles in between each letter. Thus, reliability of the rules alone does not naturally speed up the nonlexical route.

428

ZIEGLER, PERRY, COLTHEART

Coltheart, M. (1978). Lexical access in simple reading tasks. In G. Underwood (Ed.), Strategies of information processing (pp. 151± 216). London: Academic Press. Coltheart, M., Curtis, B., Atkins, P., & Haller, M. (1993). Models of reading aloud: Dual-route and parallel-distributed-processing approaches. Psychological Review, 100, 589± 608. Coltheart, M., & Rastle, K. (1994). Serial processing in reading aloud: Evidence for dualroute models of reading. Journal of Experimental Psychology: Human Perception and Performance, 20, 1197± 1211. Coltheart, M., Rastle, K., Parry, C., Langdon, R., & Ziegler, J.C. (submitted). DRC: A computational model of visual word recognition and reading aloud. Manuscript submitted for publication. Duden: Das FremdwoÈrterbuch. (1982). Mannheim, Germany: Bibliographisches Institut. Forster, K.I., & Chambers, S.M. (1973). Lexical access and naming time. Journal of Verbal Learning and Verbal Behavior, 12, 627± 635. Grainger, J., & Jacobs, A.M. (1993). Masked partial-word priming in visual word recognition: EŒects of positional letter frequency. Journal of Experimental Psychology: Human Perception and Performance, 19, 951± 964. Grainger, J., & Jacobs, A.M. (1994). A dual-read out model of word context eŒects in letter perception: Further investigations of the word superiority eŒect. Journal of Experimental Psychology: Human Perception and Performance, 20, 1158± 1176. Grainger, J., & Jacobs, A.M. (1996). Orthographic processing in visual word recognition: A multiple read-out model. Psychological Review, 103, 518± 565. Jacobs, A.M., & Grainger, J. (1992). Testing a semistochastic variant of the Interactive Activation Model in diŒerent word recognition experiments. Journal of Experimental Psychology: Human Perception and Performance, 18, 1174± 1188. Jacobs, A.M., & Grainger, J. (1994). Models of visual word recognition: Sampling the state of the art. Journal of Experimental Psychology: Human Perception and Performance, 20, 1311± 1334. Jacobs, A.M., Rey, A., Ziegler, J.C., & Grainger, J. (1998). MROM-P: An interactive activation, multiple read-out model of orthographic and phonological processes in visual word recognition. In J. Grainger & A.M. Jacobs (Eds.), Localist connectionist approaches to human cognition (pp. 147± 188). Mahwah, NJ: Lawrence Erlbaum Associates Inc. Landerl, K., Wimmer, H., & Frith, U. (1997). The impact of orthographic consistency on dyslexia: A German± English comparison. Cognition , 63, 315± 334. Lass, R. (1984). Phonology: An introduction to basic concepts. Cambridge, UK: Cambridge University Press. McClelland, J.L., & Rumelhart, D.E. (1981). An interactive activation model of context eŒects in letter perception: Part I. An account of basic ® ndings. Psychological Review, 88, 375± 407. Monsell, S. (1991). The nature and locus of word frequency eŒects in reading. In D. Besner & G.W. Humphreys (Eds.), Basic processes in reading: Visual word recognition (pp. 148± 197). Hillsdale, NJ: Lawrence Erlbaum Associates Inc. Norris, D. (1994). A quantitative multiple-levels model of reading aloud. Journal of Experimental Psychology: Human Perception and Performance, 20, 1212± 1232. Plaut, D.C., McClelland, J.L., Seidenberg, M.S., & Patterson, K. (1996). Understanding normal and impaired word reading: Computational principles in quasi-regular domains. Psychological Review, 103, 56± 115. Rastle, K., & Coltheart, M. (1998). Whammies and double whammies: The eŒect of length on nonword reading. Psychonomic Bulletin and Review, 5, 277± 282. Rastle, K., & Coltheart, M. (1999a). Lexical and nonlexical phonological priming in reading

THE DRC MODEL

429

aloud. Journal of Experimental Psychology: Human Perception and Performance, 25, 461± 481. Rastle, K., & Coltheart, M. (1999b). Serial and strategic eŒects in reading aloud. Journal of Experimental Psychology: Human Perception and Performance, 25, 482± 503. Scheerer, E. (1987). Visual word recognition in German. In A. Allport, D. MacKay, W. Prinz, & E. Scheerer (Eds.), Language perception and production: Shared mechanism in listening, speaking, reading and writing (pp. 227± 244). London: Academic Press. Seidenberg, M.S., & McClelland, J.L. (1989). A distributed developmental model of word recognition and naming. Psychological Review, 96, 523± 568. Twain, M. (1979). The awful German language. In The unabridged Mark Twain. Philadelphia: Running Press. (Original work published 1880.) Weekes, B.S. (1997). DiŒerential eŒects of number of letters on words and nonword naming latency. Quarterly Journal of Experimental Psychology, 50A, 439± 456. Wiese, R. (1996). The phonology of German. New York: Oxford University Press. Ziegler, J.C., Rey, A., & Jacobs, A.M. (1998). Simulating individual word identi® cation thresholds and errors in the fragmentation task. Memory and Cognition, 26, 490± 501. Zorzi, M., Houghton, G., & Butterworth, B. (1998). Two routes or one in reading aloud? A connectionist dual-process model. Journal of Experimental Psychology: Human Perception and Performance, 24, 1131± 1161.

APPENDICES Appendix A German GPC rules. Each rule presents a grapheme and its corresponding phoneme together with the position at which the rule applies (b = beginning; m = middle; e = end; a = all). For example, the rule b ± /b/ (b) indicates that the grapheme b maps into the phoneme /b/ at the beginning (b). In the context sensitive rules, the symbols [V] or [C] stand for any vowel or consonant, respectively (see also text). Single-letter rules b± /b/ (b); b ± /p/ (me); c ± /k/ (a); d ± /d/ (b); d ± /t/ (me); f ± /f/ (a); g ± /g/ (b); g ± /k/ (me); h ± /h/ (a); j ± /j/ (a); k ± /k/ (a); l ± /l/ (a); m ± /m/ (a); n ± /n/ (a); p ± /p/ (a); q ± /k/ (a); r ± /r/ (a); s ± /s/ (a); t ± /t/ (a); v ± /f/ (a); w ± /v/ (a); x ± /x/ (a); y ± /j/ (b); y ± /y/ (me); z ± /ts/ (a); a ± / / (be); ± / / (b); e ± /e / (be); o ± /o / (be); ± / / (be); u ± /u / (be); ± /y / (be); i ± /I/ (be)

Multiletter rules aa ± au ±

/a / (a); ah ± /a / (a); h ± / / (a); ai ± /ai/ (a); auh ± /au/ (me); /au/ (a); u ± / y/ (a); ch ± /k/ (b); ch ± /x/ (me); ck ± /k/ (a); dt ± /t/ (a); ea ± /i / (a); ee ± /e / (a); eh ± /e / (a); ei ± /ai/ (a); eu ± / y/ (a); Œ± /f/ (a); ie ± /i / (a); ieh ± /i / (me); ih ± /i / (a); kh ± /k/ (a); ll ± /l/ (a); mm ± /m/ (a); ng ± / / (a); nn ± /n/ (a); oa ± /o /

430

ZIEGLER, PERRY, COLTHEART

(a); oh ± (a); pf ± tch ± /t tz ± /ts/

/o / (a); h ± / / (a); oo ± /o / (a); ou ± /u / (a); ph ± /f/ /pf/ (a); pp ± /p/ (a); rr ± /r/ (a); sch ± / / (a); ss ± /s/ (s); / (a); th ± /t/ (a); ts ± /ts/ (e); tsch ± /t / (a); tt ± /t/ (a); (a); uh ± /u / (a); h ± /y / (a); zz ± /ts/ (a)

Context-sensitive rules s[V] ± /z/ (b); s[p,t] ± / / (b); n[k,x] ± / / (a); [q]u ± /v/ (a); ch[s] ± /k/ (a); u[c][h] ± /u /; [C] ± /e / (me); [C][C] ± / / (me); a[C] ± /a / (me); a[C][C] ± /a/ (me); e[C] ± /e / (a); e[C][C] ± / / (a); i[C] ± /i / (me); i[C][C] ± /I/ (me); [C] ± / / (me); [C][C] ± / / (me); o[C] ± o / (me); o[C][C] ± / / (me); [C] ± /y / (me); [C][C] ± /Y/ (me); u(C) ± /u / (me); u[C][C] ± / / (me)

Appendix B Loan words in the CELEX database according to Scheerer’ s (1987) classi® cation. Regular loan words (no violation of orthography) AKT, BASS, BLUFF, BOSS, BRIE, BROM, DRINK, DROPS, FESCH, FILM, FLIRT, FLOR, JOD, JUST, KILT, KLAN, KLON, KLUB, KOKS, KULT, KUR, LOG, LORD, MOL, NORM, PAKT, PER, PIER, PINK, PLAN, PLUS, POMP, PORT, PRO, PUCK, PUR, QUARK, QUART, QUARZ, QUINT, RE, RUM, SCHAH, SCHEICH, SOL, SONG, SPRIT, TAFT, TRAM, TREND, VERS

Regular loan words (with violations of orthography, i.e., illegal bigrams) BOX, CHLOR, CHOR, CHRIST, CHROM, CLAN, CLUB, FIX, FJORD, GEN, GNOM, JUX, KHAN, KREM, LAX, LUX, MOP, NERV, PAX, PHON, PNEU, PSALM, SKALP, SKAT, SKETCH, SKRIPT, SMART, SNOB, SPHINX, STOP, SWING, THAI, TWIST, TYP

Irregular loan words (violations of GPCs) AIR, BAND, BEAT, BIT, BLUES, BOB, BON, BOOM, BOY, BRIDGE, CAMP, CHARME, CHEF, CHIC, CLOWN, CODE, CREME, FAIR, FAN, FLAIR, GAG, GIN, GROS, JAZZ, JEEP, JET, JOB, KODE, KORPS, LIVE, LUNCH, MATCH, PUMPS, QUAI, RANCH, REN, ROUGE, SAFE, SCHI, SET, SEX, SHOW, SKI, SLANG, SLIP, SLUM, SPIN, SPLEEN, SPOT, SPRAY, SQUAW, STEAK, TAB, TEAK, TEAM, TIP, TOAST, TOUR, TRIP, TWEED, TWEN, VAMP, VERB, YARD, YEN, ZEN

The DRC model of visual word recognition and reading ...

Email: [email protected]. This research was supported by a ... this paper. 2000 Psychology Press Ltd http://www.tandf.co.uk/journals/pp/09541446.html.

213KB Sizes 1 Downloads 147 Views

Recommend Documents

Reexamining the word length effect in visual word recognition ... - crr
yielded a mixture of null effects and inhibitory length ef- fects. .... although the effect was stronger in German than in En- ..... Perry, C., & Ziegler, J. C. (2002). A ...

Syllabic length effects in visual word recognition and ...
was used to describe how one might get syllable effects without explicit syllable units. ... in terms of orthography-to-phonology conversion rules and that the pronunciation ... The last characteristic concerns the presence/absence of a syllabic ...

Syllabic length effects in visual word recognition ... - Semantic Scholar
Previous studies on the syllable-length effect in visual word recognition and naming ..... as subjects for course credit, 16 in the Experiment 1 (naming) and 40 in ... presented in isolation on the center of the display screen of a Pentium computer.

Model-based DRC for design and process integration
Enhanced checks, however, for non-lithographic failure classes (metal slotting rules, for example) .... *See Fig 4 (frames E-G) for an illustration. 5.2. Extracting ...

Model-based DRC for design and process integration
Model-based DRC for design and process integration. Chi-Yuan Hung*a, .... provides the DBclassify function to perform this data reduction. It allows users to ...

Developmental Word Recognition
... but rather one of restructuring (improvement with increased auto- .... Greece and had used English in school for at least 10 .... Procedure. Pretest and Training.

Relating Natural Language and Visual Recognition
Grounding natural language phrases in im- ages. In many human-computer interaction or robotic scenar- ios it is important to be able to ground, i.e. localize, ref-.

Model in Word - Microsoft
ground domain), for which large amounts of train- ing data are available, to a different domain (the adaptation domain), for which only small amounts of training ...

Model in Word - Microsoft
training data effectively. However, such training data is not always available for many search do- mains, such as non-English search markets or per- son name ...

Opposite effects of visual and auditory word-likeness on ... - Frontiers
Aug 29, 2013 - The present fMRI study investigated the effects of word-likeness of visual and auditory stimuli on activity along the ventral visual stream. In the context of a one-back task, we presented visual and auditory words, pseudowords, and ar

Opposite effects of visual and auditory word-likeness on ... - Frontiers
Aug 29, 2013 - auditory processing are expected in visual regions. Keywords: fMRI .... million (based on the CELEX database; Baayen et al., 1993). The pseudowords matched the ...... an open-access article distributed under the terms of the.

Everyday politics and practices of family planning in the DRC
Exceptions for the doctoral degree can be granted to applicants who are able to ... and believes this will contribute to our mission and vision of being the best.

Functional specificity of the visual word form area
Available online 24 May 2007. Abstract ... Available online at www.sciencedirect.com. Brain and Language .... In addition, three 20 s blocks of a blank screen were presented at the .... condition, which contributed a disproportionate share of.

offline handwritten word recognition using a hybrid neural network and ...
network (NN) and Hidden Markov models (HMM) for solving handwritten word recognition problem. The pre- processing involves generating a segmentation ...

Visual Recognition - Vision & Perception Neuroscience Lab - Stanford ...
memory: An alternative solution to classic problems in perception and recognition. ... of faces, objects and scenes: Analytic and holistic processes (pp. 269–294).

Face Tracking and Recognition with Visual Constraints in Real-World ...
... constrain term can be found at http://seqam.rutgers.edu/projects/motion/face/face.html. ..... [14] Y. Li, H. Ai, T. Yamashita, S. Lao, and M. Kawade. Tracking in.