Modeling long distance dependencies in double r july 2008
1 / 27

Modeling Long-Distance Dependencies in Double R July 2008 - PowerPoint PPT Presentation

  • Uploaded on

Modeling Long-Distance Dependencies in Double R July 2008. Jerry Ball Human Effectiveness Directorate Air Force Research Laboratory. Double R Model.

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
Download Presentation

PowerPoint Slideshow about 'Modeling Long-Distance Dependencies in Double R July 2008' - azura

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Modeling long distance dependencies in double r july 2008

Modeling Long-Distance Dependencies in Double R July 2008

Jerry Ball

Human Effectiveness Directorate

Air Force Research Laboratory

Double r model
Double R Model

  • Goal: Model the basic grammatical patterns of English to support development of cognitively plausible and functional language comprehension systems

    • Declaratives – “The man hit the ball”

    • Questions

      • Yes-No Questions – “Did the man hit the ball?”

      • Wh Questions – “Where did the man hit the ball?”

    • Imperatives – “Hit the ball!”

    • Relative Clauses – “The ball that the man hit”

    • Wh Clauses – “I know where the man hit the ball”

    • Passive constructions – “The ball was hit”

Empirical evidence
Empirical Evidence

  • Basic grammatical patterns have been most extensively studied in generative grammar

    • The focus in generative grammar has been on studying the syntactic form of linguistic expressions in isolation frommeaningandprocessing

  • The “Simpler Syntax”of Culicover and Jackendoff (2005) is redressing the consideration of meaning and simplifying syntax as a side effect

  • O’Grady’s “Syntactic Carpentry” (2005) integrates processing as well (see also Hawkins, 2004)

  • Reference grammars (Huddleston & Pullum, 2002; Quirk et al., 1985) provide a wealth of examples which integrate form, function and meaning

Long-Distance Dependencies

  • Long-distance dependencies are the sin qua non of modern linguistic theorizing

    • An important motivation for Chomsky’s transformational grammar – deep structures with arguments in place are mapped to surface structures with arguments “moved” by various transformations

      • Introduction of traces supported the collapsing of deep and surface structure – traces mark the original location

      • Construction specific transformations were generalized to Move asubject to universal, parameterized constraints

    • Many basic grammatical constructions involve long-distance dependencies

      • Wh questions, relative clauses, passive constructions…

    • Require retention of grammatical information for extended stretches of input

Long-Distance Dependencies

  • Binding of pronouns and anaphors:

    • Anaphors (“himself”) vs. pronouns (“him”)

      • Johni kicked himselfi ( i = i) (Principle A of GB Theory)

      • Johnikicked himj ( i not = j) (Principle B of GB Theory)

    • Proper binding often requires use of semantic information (but considered syntactic in generative grammar)

      • Johni and Maryjwere talking. Shejtold himi… (gender)

      • Johni is reading a bookj. Itjis about… (animacy)

      • Johni is reading the comicsj. Theyjare… (number)

Long distance dependencies
Long-Distance Dependencies

  • Verb Control

    • Object Control: “HeipersuadedmejPROj to go”

      • PROj is an “implicit” pronoun (a trace without movement)

    • Subject Control: “HeipromisedmejPROito go”

  • Raising Verbs

    • “Heiseemsti to like me”

      • ti is a trace of a “raised” argument

Long distance dependencies1
Long-Distance Dependencies

  • Passive Constructions

    • “The balliwas kickedti by the man”

      • The object is “raised” out of its normal position and the subject is pushed into an oblique complement position “by the man”

  • Wh Questions

    • “Whoi did JohnjdecidePROj to see ti”

  • Relative Clauses

    • “The ballithat the man kicked ti”

Modeling long distance dependencies
Modeling Long-Distance Dependencies

  • An ontology of DM chunk types supports the grammatical distinctions

  • Productions match buffer elements at the appropriate level of the ontology given the function of the production, e.g.

    • Production matches pronoun “he…”  project nominal and put in subject buffer

    • Production matches predicate specifier (e.g. “…is…”)  project a declarative clause

    • Production matches declarative clause and a nominal in subject buffer (e.g. “he is…”)  integrate the nominal as the subject of the clause

    • Production matches transitive verb (e.g. “hitting”) functioning as clausal head (e.g. “he is hitting…”) and a nominal (e.g. “…the ball”)  integrate the nominal as the object of the verb

Ontology of situation referring expressions
Ontology of Situation Referring Expressions

  • Decl-sit-refer-expr

  • Yes-no-quest-sit-refer-expr

    • “Is he going?”

  • Wh-quest-sit-refer-expr

    • “Where did he go?”

  • Imp-sit-refer-expr

    • “Don’t go!”

  • Wh-sit-refer-expr

    • “I know where he went”

  • Rel-sit-refer-expr

    • “The book that you like”


Situation Referring Expression

corresponds to Clause in other


What are the grammatical cues

that trigger recognition of

an expression type? These cues

need to be accessible!

Slots in referring expressions
Slots in Referring Expressions

  • Bind-indx (all referring expression types)

    • Identifier for referring expression

  • Parent (all chunk types)

    • Links child to parentchunk

    • Used to avoid multiply integrating chunk into other chunks

  • Token (all chunk types)

    • Distinguishes types from tokens (and type-tokens)

  • Grammatically relevant semantic info

    • Animate (all object referring expression types)

    • Gender (all animate referring expression types)

    • Number (all object referring expression types)

    • Person (all object referring expression types)

Recognizing wh quest and wh situation referring expressions
Recognizing Wh-Quest and Wh-Situation Referring Expressions

…where he went

Where did he…?

(p cog-process-obj-refer-expr-->



isa process-obj-refer-expr


isa wh-refer-expr ;; “where”


isa operator-pred-spec ;; “did”


isa obj-refer-expr ;; “he”

=subject> isa nothing


isa context

- sit-context "wh-quest-sit-refer-expr“

==> project wh-quest-sit-refer-expr

(p cog-process-pred-type



isa process-pred-type


isa wh-refer-expr ;; “where”


isa refer-expr ;; “he”


isa pred-type ;; “went”


isa context

- sit-context "wh-sit-refer-expr"

- sit-context "wh-quest-sit-refer-expr"

==> project wh-sit-refer-expr








Note: the more grammatical cues, the greater the likelihood of being correct!

“Who kicked…?” “Where the heck is...?” “Why is there…?

Modeling long distance dependencies1
Modeling Long-Distance Dependencies

  • Model needs simultaneous access to multiple grammatical elements

    • Serial retrieval from DM is not a viable option

    • Buffers support simultaneous access – buffers on left-hand side of productionconstitute focus of attention – limited to ~4(Cowan, 2000) besides goal and context buffers

    • Can’t predict in advance of production selection which grammatical elements will be needed

    • Buffers and productions are functionally motivated – they are needed in the processing of various constructions

  • A model with fewer buffers (and productions) that handles a similar set of phenomena might be a better model, but a model with fewer buffers that handles fewer phenomena is not comparable (Ball, in preparation)

Double r buffers single chunk
Double R Buffers – Single Chunk

  • Subject – stores the subject

  • Wh-focus – stores the fronted wh expression

  • Rel-focus – stores the relative clause marker

  • Context – stores contextual information

  • Construct – buffer for constructing DM chunks

    • Dual path processing –construct chunk vs. retrieve chunk

  • Retrieval-2 – buffer for storing retrieved or constructed DM chunks

    • Retrieval buffer only used temporarily, retrieved chunk is copied into retrieval-2 for subsequent processing

  • Most-recent-loc-refer-expr –just the most recent

    • Supports locative fronting “On the table is the book”

Double r buffers multiple chunk

Most-recent-child- obj-refer-expr

Most-recent-parent- obj-refer-expr

Most-recent-grandparent- obj-refer-expr

Most-recent-child- obj-refer-expr-head

Most-recent-parent- obj-refer-expr-head

Most-recent-grandparent- obj-refer-expr-head

Double R Buffers – Multiple Chunk

Obj-Refer-Expr buffers

Obj-Refer-Expr-Head buffers

  • St-wm-1

  • St-wm-2

  • St-wm-3

  • St-wm-4


object referring expression

corresponds to nominal

in other approaches

Four generic


Working Memory


Double r buffers multiple chunk1

Most-recent-child- sit-refer-expr

Most-recent-parent- sit-refer-expr

Most-recent-grandparent- sit-refer-expr

Most-recent-child- sit-refer-expr-head

Most-recent-parent- sit-refer-expr-head

Most-recent-grandparent- sit-refer-expr-head

Double R Buffers – Multiple Chunk

Sit-Refer-Expr buffers

Sit-Refer-Expr-Head buffers

Note 1: with the introduction of obj-refer-expr and sit-refer-expr

specific buffers, the short-term working memory buffers are

infrequently used (primarily for conjunctions and adverbs)

Note 2: child, parent and grandparent buffers are all directly accessible,

whereas only st-wm-1 is directly accessible

Long-Distance Dependencies

I want to go

Note: entire representation

is not accessible at once!

Infinitive sit-refer-expr has

implied subj with trace

bound to matrix subj

Traces only

occur in



Combination of“bind-indx” and “trace” needed to

indicate long-distance dependency

Long-Distance Dependencies

He promised me to go

Alternative view: antecedent & trace both bind to same object in situation model

Subject control (verb): matrix clause subject binds to subject

of infinitive situation complement – subject must be accessible

Long-Distance Dependencies

He persuaded me to go

Objectcontrol (verb): matrix clause (indirect) object binds to subject

of infinitive situation complement – object must be accessible

Long-Distance Dependencies

Who did he kick the ball to?

Object of preposition

is bound to fronted who-obj-refer-expr – wh-focus must be accessible

Long-Distance Dependencies

The man that I gave the book

I-Obj Trace to Obj-Refer-Expr

with animate or humanhead

Rel-focus co-indexed with Obj-Refer-Expr

rel-focus and subject must be accessible

(rel-focus is optional)

Long-Distance Dependencies

The book that I gave the man

Obj Trace to Obj-Refer-Expr

with inanimate head

Rel-focus co-indexed with Obj-Refer-Expr

rel-focus and subjectmust be accessible

(rel-focus is optional)

Architectural constraints
Architectural Constraints

  • No hard architectural limit on the number of buffers

  • Buffers provide the context for production selection and execution

    • Highly context sensitive

  • Productions limited to accessing ~4 buffers on left-hand side (beside goal and context buffers)

    • Focus of attention (Cowan, 2000)

    • “Conscious activity corresponds to the manipulation of the contents of these buffers by production rules” (Anderson, 2007)

  • Can humans learn to buffer useful information?

    • Fronted Wh-expression buffer very useful in English, but not needed in in situ languages like Chinese

Processing constraints
Processing Constraints

  • A “mildly” deterministic, serial processing mechanism (selection and integration) operating over a parallel, probabilistic substrate (activation)

  • Interactive and non-autonomous processing (no distinctly syntactic representations exist)

  • Incremental processing with immediate determination of meaning – word by word

  • No algorithmic backtracking or lookahead – a mechanism of context accommodation (Ball et al. 2007) used instead

  • Forward chaining only

  • Declarative and explicit linguistic representations generated via implicit execution of productions

  • Operates in real-time on Marr’s algorithmic level (serial and parallel processing are relevant)

    • No slow down with length of input


  • Additions to model are

    • motivated by functional considerations

    • driven by empirical evidence

    • constrained by well-established cognitive constraints on language processing

  • Goal is a large-scale, functional language comprehension system implemented in the ACT-R cognitive architecture

  • Model currently handles a fairly wide-range of grammatical constructions including numerous forms of long-distance dependency


Anderson, J. (2007). How Can the Human Mind Occur in the Physical Universe. Oxford: Oxford University Press.

Ball, J. (in preparation). A Naturalistic Functional Approach to Modeling Language Comprehension.

Ball, J., Heiberg, A. & Silber, R. (2007). Toward a Large-Scale Model of Language Comprehension in ACT-R 6. Proceedings of the 8th International Conference on Cognitive Modeling.

Cowan, N. (2000). The magical number 4 in short-term memory: A reconsideration of mental storage capacity. Behavioral and Brain Sciences, 24, 87-185.

Culicover, P. & Jackendoff, R. (2005). SimplerSyntax. Oxford: Oxford University Press.

Hawkins, J. (2004). Efficiency and Complexity in Grammars. Oxford: Oxford University Press.

Huddleston, R. & Pullum G.(2002). The Cambridge Grammar of the English Language. NY: Cambridge Unversity Press.

O’Grady, William (2005). Syntactic Carpentry, an Emergentist Approach to Syntax. Mahway, NJ:LEA.

Quirk, R., Greenbaum, S., Leech, G. & Svartvik, J. (1985). A Comprehensive Grammar of the English Language. Essex, UK: Pearson Education Limited.

Long distance dependencies2
Long-Distance Dependencies

The ball by the table was kickedby the man

passive cue

(be + V-ed or V-en)

Subject co-indexed with Object

subject must be accessible