Relative clause extraction complexity in Japanese 1 2

Tomoko Ishizuka , Kentaro Nakatani , Edward Gibson3 Boston University1, Harvard University2, Massachusetts Institute of Technology3

1

Relative Clauses (RCs) Object extraction vs. Subject extraction (1) English Object-extracted RCs (OERCs)⇒harder to process The reporter who the statesman attacked had a bad reputation. (2) English Subject-extracted RCs (SERCs) The reporter who attacked the statesman had a bad reputation.

Higher complexity of object extraction is widely acknowledged in SVO languagesÅ\such as English, French, German but cf. Chinese (Hsiao & Gibson, 2003)

2

Three Main Accounts for higher complexity of OERCs 1) Resources ⇒ e.g. Dependency Locality Theory (Gibson, 2000) 2) Depth of Embedding ⇒ e.g. Structural Distance Hypothesis (O’Grady et al.,2000) 3) Accessibility ⇒ e.g. Accessibility Hierarchy (Keenan & Comrie,1977; Keenan & Hawkins, 1987)

3

The Dependency Locality Theory (DLT) —Gibson (2000) Basic idea: Sentence comprehension involves two components of computational resource use: (1) INTEGRATION resources: connecting an incoming word into the current structure (2) STORAGE resources: keeping track of the incomplete structural dependencies in the current structure • Relevant aspect of this theory => A cost associated with performing an integration increases with the distance between the heads of two projections being integrated together. • Integration cost consists of 1) discourse processing cost 2) structural integration cost

4

Illustration of the cost function: (1) Object-extracted RC 3 0 2 0 1 0 1 1 1 The reporter who the statesman attacked had a bad reputation. 0 1 0 0 1 1+2 3 0 0 1 (2) Subject-extracted RC 3 0 0 1 0 1 0 1 1 The reporter who attacked the statesman had a bad reputation. 0 1 0 1 0 1 30 0 1 Integration is longer at the embedded verb for the object extraction. Thus, the DLT predicts that object extractions should be harder.

5

The Structural Distance Hypothesis (SDH) —O’Grady et al., 2000 The distance traversed by a syntactic operation, calculated in terms of the number of nodes crossed, determines a structure’s relative complexity.

(1) OERC Structural distance: 2 nodes (S, VP) The reporter who [the statesman [attacked e]] had a bad reputation. (2) SERC Structural distance: 1 node (S) The reporter who [e attacked the statesman] had a bad reputation.

The Accessibility Hierarchy (AH) — Keenan & Comrie, 1977 Keenan & Hawkins, 1987 The AH universally determines the degree of accessibility to RC formation. Subject position is more accessible than object position in the AH. Thus, SDH & AH predict that Object extractions should be harder. 6

Main Question: What makes some extractions more difficult than others? • Resources • Depth of embedding • Accessibility

To answer this question, we did self-paced reading study of Japanese RCs. 44 native Japanese speakers participated in the study.

7

Other potential sources of RC extraction difficulty 1) Canonical Word Order (MacDonald & Christiansen, 2002) Canonical word orders are easier to process than noncanonical word orders. Therefore, SERCs are easier to process than OERCs in English. It is not clear what the canonical word order theory predicts for languages like Japanese. Because Japanese allows subject and object null pronouns, both SV and OV word orders are canonical in Japanese. 2) Perspective Shift (MacWhinney, 1982) There is complexity associated with shifting the perspective of a clause, where the perspective of a clause is taken from the subject of the clause.

8

3) Case Matching (Sauerland & Gibson, 1998) There is complexity associated with RCs in which the case of the extracted element does not match the head noun of the RC. In this study, the effects of perspective shift and case matching are controlled. The RC always involves a perspective shift, and the extracted element in the RC never matches its head noun (case clash).

4) Ambiguity. In Japanese, there is always a temporary ambiguity in RCs, such that the first NP can be taken as a main clause. For the predictions of ambiguity resolution in our materials, see below.

9

Experimental method: self-paced word-by-word moving window with comprehension questions after each item. 24 targets (4 each of 6 conditions), 65 fillers. Plausibility norming study: A questionnaire with 25 participants Items: simple transitive clauses that made up each RC Results: 4 items were found to be more plausible in one version. => These were omitted from analyses

10

RC-extraction conditions with dative topic (-niwa) head nouns: a. Singly embedded OERC [ N1-nom e V1] N2-dat-top b. Singly embedded SERC [ e N1-acc V1] N2-dat-top c. Doubly embedded OERC [ [ N1-nom e V1] N2-nom e V2] N3-dat-top d. Doubly embedded SERC [ e [ eN1-acc V1] N2-acc V2] N3-dat-top RC-extraction conditions with case matching head nouns (nom and acc): e. Case matching SERC [ e N1-acc V1] N2-nom V2 N3-dat-top f. Case matching OERC [ N1-nom e V1] N2-acc V2 N3-dat-top

11

Predictions of ambiguity resolution in conditions (a)-(d). In the object-extracted versions, the first segment NP-ga (nom) might initially be analyzed to be part of a matrix clause. In the subject-extracted versions, the first segment NP-o (acc) is more likely to be analyzed as part of a relative clause, because of the absence of a lexical subject. (All items, including fillers, had lexical subjects.) The object-extracted RCs are therefore potentially more ambiguous than the subject-extracted RCs at the first NP. This ambiguity is resolved at the verb, which was unambiguously transitive (e.g., “chase”, “respect”). Thus, if there is an ambiguity effect, it should occur at the verb.

12

Predictions: The DLT predicts that SERCs (b, d, e) should be harder. Since Japanese is the head-final, SOV language, RCs always precede their head nouns, unlike RCs in English and French. Thus, there are always more intervening discourse referents between the gap and the extracted head for SERC than OERC. The SDH & AH predict that OERCs (a, c, f) should be harder. Object extractions always cross two nodes, S and VP. In contrast, Subject extractions always cross one node, VP. In addition, Perspective Shift Theory and Case Matching Principle predict, (e) and (f) should be processed faster than (c) and (d) at region N2.

13

Examples: a. Singly embedded OERC 車が追跡したオートバイにはまだ免許を取得していない高校生が乗っ ていた。

[kuruma-ga e tuisekisita] ootobai-ni-wa mada menkyo-o syutoku sitenai kookoosei-ga notteita. [car-nom e chased] motorbike-dat-top yet license-acc got has-not high_school_student-nom rode. ‘A high school student who has not got a license was on the motorbike which the car chased.’ b. Singly embedded SERC 車を追跡したオートバイにはまだ免許を取得していない高校生が乗っていた。

[e kuruma-o tuisekisita] ootobai-ni-wa mada menkyo-o syutoku sitenai kookoosei-ga notteita. [e car-acc chased] motorbike-dat-top yet license-acc got has-not high_school_student-nom rode. 14

‘A high school student who has not got a license was on the motorbike which chased the car.’ c. Doubly embedded OERC トラックが追い越した車が追跡したオートバイには無免許の高校生が 乗っていた。

[[torakku-ga e oikosita] kuruma-ga e tuisekisita] ootobai-ni-wa mumenkyo-no kookoosei-ga notteita. [[truck-nom e passed] car-nom e chased] motorbike-dat-top without_license -gen high_school_student-nom rode. ‘A high school student without a license was on the motorbike which the car which the truck passed chased.’ d. Doubly embedded SERC

15

トラックを追い越した車を追跡したオートバイには無免許の高校生が 乗っていた。

[e [e torakku-o oikosita] kuruma-o tuisekisita] ootobai-ni-wa mumenkyo-no kookoosei-ga notteita. [e [e truck-acc passed] car-acc chased] motorbike-dat-top without_license-gen high_school_student-nom rode. ‘A high school student without a license was on the motorbike which chased the car which passed the truck.’ e. Case matching SERC トラックを追い越した車が追跡したオートバイには無免許の高校生が 乗っていた。

[e torakku-o oikosita] kuruma-ga tuisekisita ootobai-ni-wa mumenkyono kookoosei-ga notteita. [e truck-acc passed] car-nom chased motorbike-dat-top without_license-gen high_school_student-nom rode. 16

‘A high school student without a license was on the motorbike which the car which passed the truck chased.’ f. Case matching OERC トラックが追い越した車を追跡したオートバイには無免許の高校生が 乗っていた。

[torakku-ga e oikosita] kuruma-o tuisekisita ootobai-ni-wa mumenkyono kookoosei-ga notteita. [truck-nom e passed] car-acc chased motorbike-dat-top without_license-gen high_school_student-nom rode. ‘A high school student without a license was on the motorbike which chased the car which the truck passed.’

17

Main Results Figure 1: Mean residual reading times for singly embedded conditions OERC (a): 550.74 ms. SERC (b): 339.31 ms. Nearly Significant by subjects, F1(1,44) = 2.58 p = 0.115 Significant by items F2(1,19) = 4.60 p < 0.05 Shows: OERCs are harder to process than SERCs.

18

Figure 2: Mean residual reading times for doubly embedded conditions OERC (c): 389.66 ms. SERC (d): 178.04 ms. Significant by subj F1(1,44) = 8.78 p = 0.005 Significant by items F2(1,19) =12.19 p =0.002 Shows: OERCs are harder to process than SERCs.

19

Figure 3: Mean residual reading times for case matching conditions OERC (f): mean 295.67 ms. SERC (e): mean 226.22 ms. Shows: OERCs tended to be read slower than SERCs but not significant (Fs < 0.9, ps > 0.3

20

Supporting results Numerically better comprehension performance for SERCs suggests that SERCs are easier to process than OERCs. OERC SERC

Singly embedded 71.3% 78.5%

Doubly embedded 65.8% 71.7%

Prediction from Ambiguity There were no slow down observed at the verb of object extractions. Therefore, we consider the results obtained above are not due to the ambiguity effect.

21

Figure 4: Mean residual reading times for case matching conditions CLASH: 451.47 ms. MATCH: 260.95 ms. Significant by subjects F1(1,44) = 7.69 p < 0.01 Significant by items F2(1,19) = 6.93 p < 0.02

Shows: Case matching conditions are easier than case clashing conditions.

22

Conclusions:

• Object extractions are harder to process than Subject extractions in Japanese, matching results previously reported for in SVO languages. • In case clash conditions, the reading times for object extractions were much slower. • In case match conditions, object extractions tended to be slower, but the difference was not significant. • Comprehension performance was better for subject extractions than for object extractions. The evidence is compatible with the Structural Distance Hypothesis, and the Accessibility Hierarchy, and not compatible with the Dependency Locality Theory.

23

Nevertheless… The Hsiao and Gibson (2003) results run counter to what we would expect based on the discussion of Japanese so far. How do we reconcile the two views? Further Questions arise: Are there other reasons object extractions are harder in Japanese other than depth of embedding or the accessibility?

24

References Gibson, E. (2000). The dependency locality theory: A distancebased theory of linguistic complexity. In Y. Miyashita, A. Marantz & W. O’Neil (Eds.), Image, language, brain, (pp. 95126). Cambridge, MA: MIT Press. Keenan, E. L., & Comrie, B. (1977). Noun phrase accessibility and universal grammar. Linguistic Inquiry 8, 1. 63-99. Keenan, E. L., & Hawkins, S. (1987). The psychological validity of the accessibility hierarchy. In E. Keenean, Universal Grammar: 15 essays (pp. 60-85). Routledge. London. MacWhinney, B. (1982). Basic syntactic processes. In S. Kuczaj (Ed.), Language development: volume 1, syntax and semantics (pp. 73-136). Erlbaum, Hillsdale, NJ. MacDonald, M. & Christiansen, M. (2002). Reassessing working memory: Comment on Just and Carpenter (1992) and Waters and Caplan (1999). Psychological Review, 109, 35-54.

25

O’Grady, W., Yamashita, Y., Lee, M., Choo, M., & Cho, S. (2000). Computational factors in the acquisition of relative clauses. Paper presented at the International Conference on the Development of the Mind, Tokyo. Sauerland, U., & Gibson, E. (1998). How to predict the relative clause attachment preference. Paper presented at the 11th CUNY sentence processing conference, Rutgers University, New Brunswick NJ.

26

Relative clause extraction complexity in Japanese - CiteSeerX

(1) INTEGRATION resources: connecting an incoming word into the ... 2) structural integration cost ..... Computational factors in the acquisition of relative clauses ...

655KB Sizes 0 Downloads 291 Views

Recommend Documents

Relative clause extraction complexity in Japanese - CiteSeerX
Illustration of the cost function: (1) Object-extracted ... Items: simple transitive clauses that made up each RC. Results: 4 items ... effect, it should occur at the verb.

Relative-Clause Processing in Typologically Distinct ...
ORCs were found in ERP studies by Mecklinger, Schriefers, Steinhauer, and Friederici. (1995). ..... Development of Language, ed. by John R. Hayes, 279-362.

RELATIVE CLAUSE 1.pdf
Sign in. Loading… Whoops! There was a problem loading more pages. Retrying... Whoops! There was a problem previewing this document. Retrying.

Case markers as clause boundary inducers in Japanese
May 7, 2002 - Nara Institute of Science and Technology. Graduate School of Information Science ... Local information allowing, the boundary between.

Processing Japanese Relative Clauses in Context
Taro: “Which reporter stands as a candidate for the election?” Hanako: “It seems to be the reporter who {the writer interviewed / interviewed the writer }.” ...

Processing Relative Clauses in Japanese with Two ...
during the online experiment (e.g., LH corresponds to: when reading the high RC, how natural it is for the ..... in-Aid for COE Research (08CE1001) from the Japanese Ministry of Education, Science ... Parsing modifiers: special purpose routines in th

Reduced Complexity Solution for Weight Extraction in QRD-LSL ...
Jan 21, 2009 - Nonetheless, this approximate solution requires algorithm convergence and in- finite memory support (forgetting factor equal to one). To obtain.

Processing Japanese Relative Clauses in Context
VP. (2) Sub-Ext. Structural distance: 1 XP (TP). The reporter who [e attacked the ... C horse j. -Nom T'. VP kicked. V'. NP V e i. NP. CP. NP mule i. Op. C'. TP. C e.

Processing Japanese Relative Clauses in Different ...
The production rate of relative clauses and other type of sentences .... subjects and 1 item were excluded from the data analysis because of low accuracy ...

Extraction and Search of Chemical Formulae in Text ... - CiteSeerX
trade-off between recall and precision for imbalanced data are proposed to improve the .... second set of issues involve data mining, such as mining fre- ... Documents PDF ...... machines for text classification through parameter-free threshold ...

Challenges for Discontiguous Phrase Extraction - CiteSeerX
Any reasonable program must start by putting the exponentially many phrases into ..... Web page: www.cs.princeton.edu/algs4/home (see in particular:.

Textline Information Extraction from Grayscale Camera ... - CiteSeerX
INTRODUCTION ... our method starts by enhancing the grayscale curled textline structure using ... cant features of grayscale images [12] and speech-energy.

Challenges for Discontiguous Phrase Extraction - CiteSeerX
useful for many purposes, including the characterization of learner texts. The basic problem is that there is a ..... Master's thesis, Universität. Tübingen, Germany.

Reducing TCB Complexity for Security-Sensitive ... - CiteSeerX
Apr 18, 2006 - action client, VPN gateway and digital signatures in an e- mail client), we achieved a considerable reduction in code size and complexity.

Semantic Property Grammars for Knowledge Extraction ... - CiteSeerX
available for this task, which only output a parse tree. In addition, we ... to a DNA domain or region, while sometimes it refers to a protein domain or region.

Semantic Property Grammars for Knowledge Extraction ... - CiteSeerX
source of wanted concept extraction, we can directly apply the same method- ... assess a general theme in the (sub)text: since the parser retrieves the seman-.

Reducing TCB Complexity for Security-Sensitive ... - CiteSeerX
Apr 18, 2006 - mail client), we achieved a considerable reduction in code size and .... functionality and security of the service. The security re- ... window manager uses a few pixels at the top of the screen, ...... Crypto-. Gram Newsletter.

The Extraction and Complexity Limits of Graphical Models for Linear ...
graphical model for a classical linear block code that implies a de- ..... (9) and dimension . Local constraints that involve only hidden variables are internal ...

Relative efficiency of biparental mating, single capsule ... - CiteSeerX
TMV 3 × IS 387-2 .... selection procedures (SSD, Random Bulk and ... 1 and 2). Desirable mean values of the BIP could largely be attributed to the predominance ...

gender identity and relative income within households - CiteSeerX
We argue that this pattern is best ..... is white,21 72% of wives with a high school degree or less are married to a husband with similar .... Trade; Retail Trade; Finance, Insurance, and Real Estate; Business, Personal, and Repair Services; ...

A Random Field Model for Improved Feature Extraction ... - CiteSeerX
Center for Biometrics and Security Research & National Laboratory of Pattern Recognition. Institute of ... MRF) has been used for solving many image analysis prob- lems, including .... In this context, we also call G(C) outlier indicator field.