社団法人 電子情報通信学会 信学技報 THE INSTITUTE OF ELECTRONICS, IEICE Technical Report INFORMATION AND COMMUNICATION ENGINEERS TL2011-20 (2011-8)

Why object clefts are easier to process than subject clefts in Japanese: Frequency or expectation? Barış KAHRAMAN† † ‡

Atsushi SATO‡

Hajime ONO‡†

and

Hiromu SAKAI‡‡

Faculty of Education, Çanakkale Onsekiz Mart University, Anafartalar Campus C1-217, Çanakkale, 17100 Turkey

Graduate School of Letters, Hiroshima University 1-1-1 Kagamiyama, Higashi-Hiroshima, Hiroshima, 739-8524 Japan ‡† ‡‡

Faculty of Science & Engineering, Kinki University 3-4-1 Kowakae, Higashi-Osaka, Osaka, 577-8502 Japan

Graduate School of Education, Hiroshima University 1-1-1 Kagamiyama, Higashi-Hiroshima, Hiroshima, 739-8524

E-mail:†[email protected],‡[email protected],‡†[email protected],‡‡[email protected] Abstract

Previous studies have shown that both frequency and expectation for upcoming structures play an important role on sentence

processing. However, it is still unclear which one of these factors has a stronger impact on sentence processing. In order to explore the possible effects of frequency and expectation; we conducted a corpus analysis in Japanese in the current study. We first calculated the distribution and transitional probabilities of subject and object clefts, and then compared them with reading time data in our previous study. The results showed the number of subject clefts was higher than object clefts, whereas transitional probability of object clefts was higher than subject clefts at the embedded verb position. The results indicate that expectation can account for the processing difficulty of clefts in Japanese, whereas the simple frequency failed to explain the processing difficulty.

Keyword

Expectation, Frequency, Corpus Analysis, Sentence processing, Japanese, Subject Clefts, Object Clefts

なぜ日本語目的語分裂文は主語分裂文よりも処理しやすいのか ~頻度と予期の観点からの考察~ カフラマン バルシュ

佐藤 淳

小野 創

酒井 弘

あ ら ま し 文処理を扱った先行研究では「頻度」と後続する構造に対する「予期」が重要な役割を果たしていると言われてい る。しかし,これらの要因のうちどちらの方がより強い影響力をもつかは明らかではない。本研究では,「頻度」と「予期」の 影響について検討するために,日本語でコーパス分析を行い,主語分裂文と目的語分裂文の分布及び遷移的確率を調べた。結果, 主語分裂文の頻度の方が目的語分裂文の頻度よりも高いのに対して,埋め込み動詞の位置における目的語分裂文の遷移的確率の 方が主語分裂文よりも高いことがわかった。これらの結果を先行研究で観察された読み時間のデータと照らし合わせることで, 日本語における分裂文処理の難しさを「予期」で捉えられるのに対して「頻度」では捉えられないことがわかった。

キーワード

予期,頻度,コーパス分析,文処理,日本語,主語分裂文,目的語分裂文 some structures more easily than other structures, because

1. Introduction Previous studies have pointed out that experience is

they are more familiar with frequent structures. For

one of the most important factors that govern the human

example, in English, subject relative clauses (SRs) are

sentence processing (e.g., Gennari & MacDonald, 2008,

easier to process than object relative clauses (ORs) (e.g.,

2009

&

King & Just, 1991 [8]; Staub, 2010 [9]). Reali &

Christiansen [4]; Wells, et al., 2009 [5]). According to

Christiansen (2007) reported that SRs are more frequent

these studies, experience is shaped by distributional

than ORs. Moreover, they found that SRs occur more

pattern of linguistic input we are exposed to. However, it

frequently with proper nouns while ORs frequently occur

is still not fully understood what kind of linguistic input

with pronouns. Taking these distributional patterns of SRs

forms our sentence comprehension system.

and ORs into consideration, Reali & Christiansen (2007)

[1],[2];

MacDonald,

1999

[3];

MacDonald

According to one view, the frequency of particular

conducted a series of experiments, and showed that ORs

structure is one of the most important factors (e.g.,

were processed more easily than SRs when pronouns were

Gennari & MacDonald, 2008, 2009 [1], [2]; Mak, et al.,

used within relative clauses [7]. Similarly, previous

2002 [6]; Reali & Christiansen, 2007 [7]). People process

studies have shown that SRs frequently occur with

― 67 ― This article is a technical report without peer review, and its polished and/or extended version may be published elsewhere. Copyright ©2011 by IEICE

animate head-nouns, whereas ORs occur more frequently

[12]. In Chinese, Wu, et al. (2009) conducted a corpus

with inanimate head-nouns (e.g., Gennari & MacDonald,

analysis and self-paced reading experiments. However,

2008 [1]; Mak, et al., 2002 [6]). These studies have

unlike Japanese, their results were in line with the

shown that when ORs were presented with animate

distributional patterns of SRs and ORs [13]. In Korean on

head-nouns, they were processed more easily. Taken

the other hand, Yun, et al. (2010) argued that the

together,

conditional probabilities of SRs and ORs account for the

these

studies

imply

that

the

processing

asymmetry between SRs and ORs stems from their distributional difference.

processing difficulty of ORs. In Japanese, relative clauses are not marked explicitly.

From a different viewpoint, it has been argued that

In Chinese, on the other hand, explicit relative clauses

distributional patterns of linguistic input affect people’s

marker (i.e. DE) is used. In the case of Korean, relative

expectation for upcoming constituents and structures.

clause verb is marked by adnominal form of predicates.

According to this view, entropy (uncertainty) about

These cross-linguistic differences indicate that ambiguity

upcoming structures, namely conditional probabilities of

of relative clauses might be higher in Japanese than in

upcoming structures which are derived from combinations

Chinese and Korean. Since the possibility of occurrence

of linguistic input is decisive factor in the sentence

of a relative clause is quite higher in Chinese and Korean,

processing (e.g., Hale, 2003 , 2006 [10][11]. For example,

the impact of the frequency or expectation might have

Hale (2006) showed that conditional probability of SRs is

been stronger in these languages. In Japanese, on the

higher than ORs at relative pronoun. In other words, SRs

other hand, since the possibilities of occurrence of other

are more likely to be expected than ORs at relative

structures are also high at relative clause verb, the

pronoun in English [11]. Therefore, SRs are easier to

possible

process than ORs.

sentence processing might have been masked. In order to

Overall, both frequency and expectation seem to play

influences

of

frequency

or

expectation

on

provide convincing evidence from Japanese, regarding the

an important role in sentence processing, and account for

possible

the difficulty of sentence processing. However, it is still

investigation of unambiguous structures would be more

unclear which one of these factors has a stronger impact

helpful. For this purpose, cleft sentences provide a good

on sentence processing. In other words, we do not fully

test case.

understand whether a particular structure is processed

effects

of

frequency

and

expectation,

the

In Japanese, cleft sentences are very similar to relative

more easily because it is more frequent or it is easier to

clauses (see

expect upcoming structures from linguistic input.

identical, but the embedded verb is marked with no-wa in

Most of the claims regarding the importance of

cleft

1). Particularly

sentences.

The

particle

their word no

is

orders are

taken

as

a

frequency and expectation were made from European

complementizer or nominalizer, and wa is a binding

languages, whereas contributions from Asian languages

particle (Hiraiwa & Ishihara, 2002 [15]). We assume that

are comparatively limited (e.g., Japanese: Sato, 2011

the complex of the particle no-wa is a cleft marker.

[12]; Chinese: Wu, et al., 2009 [13]; Korean: Yun, et al.,

Unlike relative clauses, structural ambiguity can be

2010

cleft

resolved at the embedded verb, due to the use of no-wa.

sentences, and attempts to provide insights on how

[14]).

The

Moreover, Kahraman, et al (2011) have already compared

frequency

the reading times of subject clefts (SCs) and object clefts

and/or

current

study

expectation

uses relates

Japanese to

sentence

processing, and how our experience is shaped. In the next section, we will explain why cleft sentences in Japanese were chosen as the target structure.

(OCs) as shown in (1) [16]. 1 (1) a. SC condition: Kyonen sobo-o inaka-de kaihoushita-nowa shinseki-da. Last year grandma-acc village-loc nursed-NOWA relative-cop

2. Why Japanese Clefts?

‘It is the relative who nursed my grandmother last year at

In Japanese, Sato (2011) conducted a corpus analysis

the village.’

and compared the distribution of SRs and ORs. The

(1) b. OC condition:

results showed that frequencies of SRs and ORs did not

Kyonen sobo-ga inaka-de kaihoushita-nowa shinseki-da.

differ significantly. Based on these results, Sato (2011)

Last year grandma-nom village-loc nursed-NOWA relative-cop

argued that the simple frequency of SRs and ORs cannot account for the processing difficulty of ORs in Japanese

1

Due to the space limitations, examples were simplified. See Kahraman (2011) for complete list of examples [17].

― 68 ―

‘It is the relative who my grandmother nursed last year at the village.’

(4) Other clefts: Kaisatsuguchi-o deta-nowa ticket gate-acc

The results showed that OCs were read faster than SCs

(5) Non-clefts 2

such as Structural Distance Hypothesis (O’Grady, 2007)

Tairyoku-ga

[18] and Dependency Locality Theory (Gibson, 1998)

physical power-nom existed-NOWA

[19] cannot account for the processing difficulty of clefts

SCs and OCs or expectation difference for upcoming constituents at the embedded verb position might have affected the results. Therefore, the investigation of distributional patterns of cleft sentences in Japanese would provide insights on how frequency and expectation are related to sentence processing. In the current study, in order to explore the relation between

the

frequency,

expectation

and

sentence

atta-nowa

iumademonai needless to say

‘It is needless to say that there was a physical power.’

and relative clauses at the same time [16]. Furthermore, Kahraman, et al. (2011) speculated that the frequency of

9:15-cop.

‘It was 9:15 when I got out of the ticket gate.’

at the embedded verb position. Kahraman, et al. (2011) argued that in Japanese, the proximity-based accounts

9:15-da.

got out-NOWA

In SCs, the subject NP appears in the focused position, while in OCs, the object NP does so. In the case of other clefts, the focused element is adverbial phrases such as time, place, reason. In the case of non-clefts, although the verb is marked with no-wa, there is no copular, and the element appears after no-wa has a predicative relation with the original clause.

3.1. Results 3.1.1. Corpus Analysis 1: General Frequency

processing, we conducted a corpus analysis. The results

From our 3 million-word-corpus, 2085 sentences were

indicated that expectation can account for the difficulty

extracted. We then manually counted these sentences. In

of SCs, whereas the simple frequency of SCs and OCs

total, the number of clefts was 1756 (84%), and the

cannot account for the observed difficulty pattern.

number of non-clefts was 329 (16%). Of these sentences, 656 sentences were tagged as SCs (31%) while 170 sentences were tagged as OCs (8%), and 930 sentences

3. Corpus Analysis We first compared the simple frequency of SCs and

were tagged as other clefts (45%). A test of Chi-square

OCs, and calculated their transitional probabilities at the

showed that there was a significant difference among 4

embedded verb position.

types of sentences [χ 2 (3) = 662.97, p < .01]. Ryan’s

In the current study, we used a part (3 million words)

procedure showed that all of the 4 conditions significantly

of the corpus KOTONOHA (10 million words from

differed from each other (p < .01). This indicates that the

written Japanese developed by the National Institute for

number of SCs was statistically higher than OCs.

Japanese Language). Due to the absence of a large-scale

These results indicate that the distributions of SCs and

parsed corpus in Japanese, we conducted an automated

OCs are inconsistent with the reading time data observed

morpheme analysis by Mecab 0.98 (developed by Taku

in Kahraman, et al. (2011) [16]. Although OCs were

Kubo). In Japanese, cleft sentences are marked with the

easier to process than SCs in the self-paced reading

morpheme no-wa. In order to extract cleft sentences, we

experiment, the number of SCs was higher than that of

used ChaKi.NET (developed by NAIST), and selected

OCs in the corpora. Therefore, the processing asymmetry

them by pulling out NOWA. Extracted sentences were

between SCs and OCs cannot be explained by their

manually classified into SCs, OCs, other clefts and

distributions. However, in their reading experiment, Kahraman, et al.

non-clefts. Examples are as follows:

(2011) only used transitive verbs. In Corpus Analysis 1,

(2) SCs: Teki-o

taoshita-nowa

enemy-acc

Salamanca-da-tta.

overthrew-NOWA Salamanca-cop-past

‘It was Salamanca who overthrew the enemy’ (3) OCs:

verb types. Therefore, the inconsistency between the reading time data and frequency may be due to the verb types. In order to test this possibility, we conducted Corpus Analysis 2 that takes verb types into consideration.

Ore-ga aitenishiteita-nowa I-nom

we counted all kinds of cleft sentences regardless of their

deal with-NOWA

hanzaisha-da-tta.

Specifically, we compared the number of SCs and OCs

criminal-cop-past

‘It was the criminal that I dealt with.’

2

We included these sentences in the analysis, because they will be necessary while computing transitional probabilities.

― 69 ―

with

transitive

verbs

which

take

nominative

and

to conditional probability statistics such as conditional

accusative NPs as their arguments.

entropy [20]. In other words, transitional probabilities

3.1.2. Corpus Analysis 2: Cleft sentences with

and conditional entropy make similar predictions for the processing difficulty. Transitional probability is defined

Transitive Verbs In total 752 sentences were extracted excluding passive,

as a conditional probability measuring the predictability

causative, ditransitive and intransitive verbs. Of these,

of adjacent elements (e.g, Aslin, et al., 1998 [20];

231 sentences were tagged as SCs (31%) while 131

Pelucchi, et al., 2009 [21]; Saffran, et al., 1996 [22];

sentences were tagged as OCs (17%). The number of

Thompson & Newport, 2007 [23]). The formulation of

other clefts was 296 (39%), and the number of non-clefts

transitional probability is as shown in (6).

was 94 (13%). A test of Chi-square showed that there was

(6) probability of Y|X = (frequency of XY) / (frequency of X)

a significant difference among 4 types of sentences. [χ 2 (3) = 136.16, p < .01]. Ryan’s procedure again showed that all of the 4 conditions significantly differed from each other (p < .01). In other words, distributional tendencies of cleft sentences with transitive verb are very similar to their general tendencies in Corpus Analysis 1. This indicates that the inconsistency between the reading time data in Kahraman, et al. (2011) and frequency cannot be attributed to the verb types. Overall, the results of the two analyses revealed that the simple frequencies of SCs and OCs cannot account for their processing asymmetry in Japanese. This suggests that other factors such as expectation for upcoming constituents might have a stronger impact on sentence processing. In the next subsection, we will attempt to explore

some

possible

effects

of

expectation

by

calculating transitional probabilities of SCs and OCs at the embedded verb. In order to calculate the processing difficulty of a particular structure, Hale (2003, 2006) proposed Entropy Reduction Hypothesis [10], [11]. According to Entropy Reduction Hypothesis, if the entropy, namely uncertainty about upcoming structures is greater, the processing harder.

When

there

are

On the other hand, if structure X is followed by structure Y or Z, and if the frequencies of Y and Z are equal, their transitional

probabilities

probability

of

a

are

particular

0.5.

When

structure

transitional

increases,

its

expectation becomes higher. In

the

present

analysis,

we

applied

transitional

probability formula to SCs and OCs with transitive verbs. In other words, we calculated transitional probabilities of SCs and OCs at the embedded verb position. In order to compute transitional probability of SCs, we divided the total

frequency

of

SCs

into

total

frequency

of

[accusative-NP + verb-NOWA] sequence. In the case of OCs, total frequency of OCs was divided into total frequency of [nominative-NP + verb-NOWA] sequence. 3 The results are as follows. Transitional probability of SCs within [accusative-NP + verb-NOWA] sequence

3.1.3. Transitional Probabilities

becomes

For example, if structure X is always followed by structure Y, transitional probability of Y at X is 1 (100%).

many

possible

continuations, entropy is high, and when the possibilities of upcoming continuations decrease, entropy is reduced. In other words, if the predictability of an upcoming structure is higher than another structure, it is processed more easily. In order to calculate the amount of entropy reduction, conditional entropies of possible structures should be computed. In order to do this, formalized grammar is necessary [10], [11]. However, unlike English, there is no

was

.57

(205/357).

On

the

other

hand

transitional

probability of OCs within [nominative-NP + verb-NOWA] sequence was .75 (86/114). 4 The results showed that transitional probability of OCs within [nominative-NP + verb-NOWA] sequence was higher than that of SCs within [accusative-NP + verb-NOWA]. These results indicate that although the frequency of OCs is lower than SCs, the Japanese parser’s certainty about OCs is higher than SCs. In other words, proportion of expectation for OCs at the embedded verb position is higher than that for SCs. If we assume that this kind of distributional pattern is learned and used for making predictions in the sentence processing, the processing asymmetry between OCs and SCs in Japanese can be explained by expectation. In the next section, we will

fully parsed corpus in Japanese. Therefore, calculation of entropy reduction of a structure seems quite hard in Japanese. Nevertheless, Aslin, et al. (1998) pointed out that transitional probabilities are functionally equivalent

3 Sentences in which an element intervened between NP and verb were also included in the analysis. 4 The number of SCs and OCs differ from the Corpus Analysis 2, because we only included clefts in which subject or object explicitly expressed before the embedded verb.

― 70 ―

discuss these results in more detail.

within cleft sentences. In order to examine their possible

4. General Discussion

effects we need to conduct more detailed analyses. As we will discuss below, our results are share some

In the current study, we investigated into the question how the frequency and expectation are related to sentence

similarities

processing, and how human experience is shaped. In order

Previous studies in child language acquisition have

with

child

to answer these questions we investigated the frequency

reported that transitional probabilities have a strong

and transitional probabilities of subject and object clefts,

impact on the acquisition of word segmentation by young

and compared these results with previous processing data

children (e.g. Aslin, et al., 1998 [20]; Pelucchi, et al.,

of Japanese clefts [16]. Main findings of the current study

2009 [21]; Saffran, et al., 1996 [22]; Thompson &

can be summarized as follows. The simple frequency of

Newport, 2007 [23]). For example, Aslin et al. (1998)

SCs was higher than that of OCs, whereas transitional

used an artificial language to test the possible effects of

probability of OCs was higher than that of SCs at the

transitional probabilities and frequency in acquisition of

embedded verb position.

word

segmentation

by

language

acquisition

8-month-old

studies.

children.

They

Reali and Christiansen (2007) showed that subject

manipulated the frequency and transitional probability of

relative clauses (SRs) are more frequent than object

words. The artificial words were matched in frequency,

relative clauses in English, and argued that the processing

but differed in their transitional probabilities. The results

asymmetry between SRs and ORs can be attributed to

showed

their distribution [7]. In the case of Japanese clefts, the

probabilities to segment words, whereas they did not use

results

and

the frequency information. This suggests that transitional

Christiansen. Even though OCs were easier to process

probabilities are likely to play a more important role on

than SCs [16], SCs were more frequent than OCs.

the language acquisition than frequency does.

are

considerably

different

from

Reali

that

8-month-old

children

used

transitional

Although our study is not about learning per se, and the

Therefore, the processing asymmetry between SCs and their

linguistic unit is quite different from child language

distributional pattern in a corpus. Sato (2011) has also

acquisition studies, the results are in line with respect to

shown that the distributions of SRs and ORs are not

stronger

reflected in their processing difficulty in Japanese [12].

indicates in both language acquisition and adult sentence

Taken together our results suggest that the simple

processing, expectation for upcoming continuations would

frequency

processing

have stronger impact than the simple frequencies. It thus

difficulty of relative clauses and cleft sentences in

can be said that transitional (conditional) probabilities are

Japanese (Roland et al., 2007 [24]).

likely to play a more crucial role than the frequency in

OCs

in

Japanese

alone

cannot

cannot

be

account

attributed

for

the

to

While the simple frequencies of SCs and OCs are not in

the

effects

formation

of

of

transitional

our

probabilities.

experience

line with their processing difficulty, their transitional

comprehension system.

probabilities are consistent with the difficulty pattern

4.1. Limitations and Future Studies

and

This

language

observed in Kahraman et al (2011) [16]. At the embedded

As we discussed above, in the present study, we could

verb position, transitional probability of OCs was higher

not analyze the distribution of proper noun, pronouns,

than that of SCs. In other words, the uncertainty about

animate and inanimate nouns within cleft sentences.

SCs was greater than OCs, indicating that the processing

However, previous studies showed that these kinds of

difficulty of SCs can be explained by expectation [10],

lexical items are also related to sentence processing (e.g.,

[11], [14]. Our study indicates an important possibility

Gennari & MacDonald, 2008 [1]; Mak, et al., 2002 [6];

that the effects of frequency observed in previous studies

Wu, et al., 2009 [13]). In future studies we need to

might be attributed to transitional probabilities. In other

explore the distribution of these noun types and examine

words, the distribution of noun types such as proper

their possible influence on transitional probabilities of

noun–pronoun or animate–inanimate might have changed

upcoming structures and sentence processing.

expectation of upcoming constituents and this would have

Another limitation of the current study is the gap

affected the results (e.g., Reali & Chiristiansen, 2007 [7];

between the corpus analyses and reading time data in

Gennari & MacDonald, 2008, 2009 [1], [2]; Mak, et al.,

Kahraman, et al. (2011) study. In the current study, we

2002 [6]; Wu, et al., 2009 [13]). However, in the present

did not manipulate the test sentences nor run any

study we could not analyze the distribution of noun types

experiment, based on the distributional patterns of cleft

― 71 ―

sentences. In order to draw more conclusive conclusions regarding to effects of probabilistic factors such as frequency and expectation, we need to conduct follow up experiments. We leave these issues for future studies.

5. Conclusions In order to explore possible effects of frequency and expectation on sentence processing, we conducted corpus analyses, and compared the distribution and transitional probabilities of subject and object clefts in Japanese. The results

showed

that

the

transitional

probabilities

successfully account for the processing difficulty of SCs, whereas the simple frequency cannot, indicating that expectation has a stronger impact than

the simple

frequency on sentence processing. Overall, our results suggest that, to integrate the sentence processing and probabilistic factors, we need to investigate corpora from different dimensions (Roland, et al., 2007 [24]).

References [1] S.P. Gennari and M.C. MacDonald, Semantic indeterminacy and relative clause comprehension, Journal of Memory and Language vol.58, pp.161-187, 2008. [2] S.P. Gennari and M.C. MacDonald, Linking production and comprehension process: The case of relative clauses, Cognition, vol.111, pp.1-23, 2009. [3] M.C. MacDonald, Distributional information in language comprehension, production, and acquisition: Three puzzles and a moral, The emergence of language, eds.,. B. MacWhinney et al., pp.177-196, Lawrence Erlbaum, Mahwah, NJ, 1999. [4] M.C. MacDonald and M.H. Christiansen, Reassessing working memory: Comment on Just and Carpenter (1992) and Waters and Caplan (1996), Psychological Review, vol.109, pp.35-54, 2002. [5] J.B. Wells, M.H. Christiansen, D.S. Race, D.J. Acheson and M.C. MacDonald, Experience and sentence processing: Statistical learning and relative clause processing, Cognitive Psychology, vol.58, pp.250-271, 2009. [6] W.M. Mak, W. Vonk and H. Schriefers, The influence of animacy on relative clauses processing. Journal of Memory and Language, vol.47, pp.50–68, 2002. [7] F. Reali and M.H. Christiansen, Processing of relative clauses is made easier by frequency of occurence. Journal of Memory and Language, vol.57, pp.1-23, 2007. [8] J. King and M.A. Just, Individual differences in syntactic processing: The role of working memory, Journal of Memory and Language, vol.30, pp.580-602, 1991 [9] A. Staub, Eye movements and processing difficulty in object relative clauses, Cognition, vol.116, pp.71-86, 2010. [10] J. Hale, The information conveyed by words in sentences, Journal of Psycholinguistic Research,

vol.32, pp.101–123, 2003. [11] J. Hale, Uncertainty about the rest of the sentence, Cognitive Science, vol.30, pp.643-672, 2006 [12] A. Sato, Nihongo kankeisetsu no shorifuka o kettei suru youin no kentou: Koopasu ni okeru shiyouhindo o chuushin ni, PhD Thesis, Hiroshima Univ., 2011. [13] F. Wu, E. Kaiser and E. Andersen, Animacy effects in Chinese relative clause processing, Proc. WECOL 2009, pp. 318-329, 2009. [14] J. Yun, J. Whitman and J. Hale, Subject-object asymmetries in Korean sentence comprehension, Proc. CogSci, Vol.32, pp. 2010. [15] K. Hiraiwa and S. Ishihara, Missing links: Cleft, sluicing and ‘no da’ construction in Japanese, Proc. HUMIT 2001, vol.43, pp.35-54, MITWPL, Cambridge: MA, 2002. [16] B. Kahraman, A. Sato, H. Ono and H. Sakai, Incremental processing of gap-filler dependencies: Evidence from the processing of subject and object clefts in Japanese, Proc. TCP 2011, vol.12, in press 2011. [17] B. Kahraman, Processing “gap-filler dependencies” in Japanese and Turkish: Regarding the incrementality of sentence processing, PhD Thesis, Hiroshima Univ., 2011. [18] W. O’Grady, Syntactic Development, University of Chicago Press, Chicago, 1997. [19] E. Gibson, Linguistic complexity: Locality of syntactic dependencies, Cognition, vol.68, no1, pp.1-76, 1998. [20] R.N. Aslin, J.R. Saffran and E.L. Newport, Computation of conditional probability statistics by 8-month-old infants, Psychological Science, vol.9, pp.321–324, 1998. [21] B. Pelucchi, J.F. Hay and J.R. Saffran, Statistical learning in a natural language by 8-month-old infants, Child Development, vol.80, pp.674-685, 2009 [22] J.R. Saffran, R.N. Aslin and E.L. Newport, Statistical learning by 8-month-old infants, Science, Vol.274, pp.1926-1928, 1996. [23] S.P. Thompson and E.L. Newport, Statistical learning of syntax: The Role of transitional probability, Language Development and Learning, vol.3, pp.1-42, 2007. [24] D. Roland, F. Dick and J.L. Elman, Frequency of basic English grammatical structures: A corpus analysis, Journal of Memory and Language, vol.57, pp.349-379, 2007.

Acknowledgments This research was supported by (1) Grant-in-Aid for Scientific

Research

(B)

“Neurocognitive

basis

for

language learning through the processing of input and output (PI: Hiromu Sakai, #20320060)” by JSPS; (2) Grant-in-Aid for Scientific Research (B) (PI: Kentaro Nakatani, #21320083) by JSPS; (3) Grant-in Aid for Young Scientists from the MEXT (PI: Hajime Ono, # 21720152). We would like to thank Rosalynn Chiu for her help. All remaining errors are our own.

― 72 ―

Frequency or expectation?

Keyword Expectation, Frequency, Corpus Analysis, Sentence processing, Japanese, Subject Clefts, .... Kyonen sobo-ga inaka-de kaihoushita-nowa shinseki-da.

234KB Sizes 1 Downloads 246 Views

Recommend Documents

Frequency or Composition?
E-Mail address: [email protected] ... E-Mail address: [email protected] ... Deterrence effects are essential for effective antitrust policy as authorities cannot.

Frequency or Composition? Pedro P. Barros , Joseph A ...
our successful enforcement efforts” (U.S. Department of Justice Antitrust Division, ... policy have received a good bit of scholarly attention (e.g., Feinberg, 1980; ... We use the convention that a higher η means a higher degree of ..... Page 10

CONDITIONAL MEASURES AND CONDITIONAL EXPECTATION ...
Abstract. The purpose of this paper is to give a clean formulation and proof of Rohlin's Disintegration. Theorem (Rohlin '52). Another (possible) proof can be ...

A-Frequency-Dictionary-Of-Japanese-Routledge-Frequency ...
A FREQUENCY DICTIONARY OF PORTUGUESE (ROUTLEDGE FREQUENCY DICTIONARIES). Read On the internet and Download Ebook A Frequency Dictionary Of Portuguese (Routledge Frequency Dictionaries). Download Mark Davies ebook file at no cost and this file pdf ava

Expectation-Maximization Algorithm in Image ...
Mar 15, 2007 - Specification. Estimate. Maximize. Gaussian. E-Step. M-Step. Results. Reference. Expectation-Maximization Algorithm in Image. Segmentation.

pdf-1479\groundhog-day-expectation-management-examining ...
... apps below to open or edit this item. pdf-1479\groundhog-day-expectation-management-exam ... lkans-by-us-army-command-and-general-staff-col.pdf.

Extended Expectation Maximization for Inferring ... - Semantic Scholar
uments over a ranked list of scored documents returned by a retrieval system has a broad ... retrieved by multiple systems should have the same, global, probability ..... systems submitted to TREC 6, 7 and 8 ad-hoc tracks, TREC 9 and 10 Web.

Repetition Suppression and Expectation Suppression ...
Sep 26, 2012 - data analysis that may arise from having an unequal number of trials per condition ... were performed using SPSS Statistics v16. (IBM).

Repetition Suppression and Expectation Suppression Are Dissociable ...
Sep 26, 2012 - we observed ES (but no RS) during an intermediate (100–200 .... domain. More specifically, for the early time window, we chose to focus on the P50 ..... Friston K (2009) The free-energy principle: a rough guide to the brain?

Expectation, Disappointment, and Exit: Reference ... - Wharton Marketing
Dec 29, 2016 - way we make policy, manage firms, and design markets. ... and endogenously within the context of the application.1 A practical theory of reference- .... Importantly, the BIN option may disappear in the course of the auction.

Extended Expectation Maximization for Inferring ... - Semantic Scholar
Given a user's request, an information retrieval system assigns a score to each ... lists returned by multiple systems, and (b) encoding the aforementioned con-.

Generalized Expectation Criteria for Semi-Supervised ... - Audentia
Generalized Expectation Criteria for Semi-Supervised Learning of. Conditional Random Fields. Gideon S. Mann. Google Inc. 76 Ninth Avenue. New York, NY ...

Enhancing a tone by shifting its frequency or intensity
shall subsequently call it “frequency enhancement”) may have exactly the same ...... 3These data were also submitted to a three-way analysis of variance, in.

Enhancing a tone by shifting its frequency or intensity
(i.e., any of the three “inner” components). In the Freq ... semitone (i.e., 1/12 octave). ...... Kidd, G., Mason, C. R., Richards, V. M., Gallun, F. J., and Durlach, N. I..

Band Output Frequency Input Frequency Tone Call Coverage Area ...
Frequency. Input. Frequency. Tone. Call. Coverage Area. State. Special. Purpose. Owner. Notes. 6M. 53.35000. 52.35000. K4VCM. Signal Mtn /. Chattanooga.

Multi-frequency identification device
Aug 15, 2006 - Rietzler, Marktoberdorf (DE). FOREIGN PATENT DOCUMENTS. EP. 1209615. 5/2002. (73) Assignee: Neology, Inc., PoWay, CA (US). FR.

Low frequency words
Spectral analysis. 1. ... novel spectral analysis techniques. .... Mr. Jitender and Ms Sumathi for fMRI acquisition; Ms Megha Sharda for help with data analysis and.

TrueSkill - Updating player skills in tennis with Expectation ... - GitHub
Mar 21, 2013 - and matching players on Xbox Online Games, it is a general rating model ... The expected skill value m is accompanied by a level of uncertainty v, which ... tennis match and computing marginal distributions for skill variables of both

asymmetric expectation effects of regime shifts ... - Emory Economics
output as the monetary policy switches from the bad regime to the good regime. ... We call this difference the expectation effect of regime shifts ... The model features nominal rigidities in the form of staggered price setting and dynamic ...... and