
The ARC Project:

The ARC Project: Creating Logical Models of Gothic Cathedrals Using Natural Language Processing. Charles Hollingsworth (cholling@gmail.com), Stefaan Van Liefferinge (svlieffe@uga.edu), Rebecca A. Smith (rsmith17@uga.edu), Michael A. Covington (mc@uga.edu), Walter D. Potter (potter@uga.edu).


Presentation Transcript


  1. The ARC Project: Creating Logical Models of Gothic Cathedrals Using Natural Language Processing • Charles Hollingsworth (cholling@gmail.com) • Stefaan Van Liefferinge (svlieffe@uga.edu) • Rebecca A. Smith (rsmith17@uga.edu) • Michael A. Covington (mc@uga.edu) • Walter D. Potter (potter@uga.edu) • This research benefited from the generous support of a Digital Humanities Start-Up Level 1 Grant from the National Endowment for the Humanities (Grant Number HD5110110), a University of Georgia Research Foundation Grant, and The University of Georgia President's Venture Fund.

  2. About ARC • ARC (Architecture Represented Computationally) is a collaborative project between architectural historians and artificial intelligence researchers • Our goal is to assist architectural historians (and others) with the task of gathering and using information from architectural descriptions • Specifically, we aim to create a logical representation for Gothic cathedrals, closely tied to the semantics of natural language, that reflects the mental model historians have of the "typical" Gothic cathedral. • This model can then be used to create representations of specific cathedrals based on verbal descriptions

  3. Why Gothic? • Gothic cathedrals are major monuments of cultural heritage • Gothic is particularly suited for logical analysis • Structure follows a logical form • Many typical features, such as pointed arches and a cruciform floor plan • Much repetition of elements, such as columns and vaulting units

  4. Basic Outline of ARC • Superuser Mode: A small set of superusers create and edit a generic model of a Gothic cathedral, consisting of the features that all or most Gothic cathedrals have in common • Administrator Mode: Administrators input information about specific buildings and need only describe how each building differs from the generic model • User Mode: Users cannot add new information, but they can submit queries about the model and test models for completeness and consistency
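The split between a generic model and per-building differences can be pictured as a layered lookup. The following is only a minimal Python sketch of that idea, not the ARC implementation: the attribute names, the `describe_building` helper, and the value 4 are hypothetical and chosen purely for illustration.

```python
from collections import ChainMap

# Hypothetical generic model: features most Gothic cathedrals share.
# Attribute names are assumptions made for this sketch.
GENERIC_CATHEDRAL = {
    "floor_plan": "cruciform",
    "arch_type": "pointed",
    "elevation_stories": None,  # left open by the generic model
}


def describe_building(differences):
    """Layer a building's stated differences over the generic defaults."""
    return ChainMap(differences, GENERIC_CATHEDRAL)


# An administrator only states what differs from the generic model.
example = describe_building({"elevation_stories": 4})  # illustrative value
print(example["floor_plan"])         # falls through to the generic model
print(example["elevation_stories"])  # supplied for this building
```

Lookups consult the building's own entries first and fall back to the generic model, so an administrator's input stays as small as the slide suggests.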

  5. ARC English:  An Architectural Description Language • At the superuser level, ARC is an exercise in natural language programming • Rather than enter information using Prolog or other programming language syntax, the superuser will enter information in "ARC English" • This is a true subset of English that is expressive enough to describe the necessary architectural entities, their properties, and their relationships (spatial and functional) to each other. • It should allow for multiple ways of expressing the same idea, rather than enforcing a strict syntax in the manner of programming languages

  6. Example of ARC English A column is a type of support.  Every column has a base, a shaft, and a capital.  Most columns have a plinth.  The base is above the plinth, the shaft is above the base, and the capital is above the shaft.  Some columns have a necking.  The necking is between the shaft and the capital.
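One way to picture the logical content of the passage above is as a small set of typed facts. This is a hedged sketch in Python rather than the project's actual representation: the class names `HasPart` and `Above`, the `BETWEEN` tuple layout, and the `parts_with` helper are all assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HasPart:
    whole: str
    part: str
    quantifier: str  # "every", "most", or "some", as in the ARC English text


@dataclass(frozen=True)
class Above:
    upper: str
    lower: str


# "A column is a type of support."
IS_A = {"column": "support"}

# "Every column has a base, a shaft, and a capital. Most columns have a
# plinth. Some columns have a necking."
PARTS = [
    HasPart("column", "base", "every"),
    HasPart("column", "shaft", "every"),
    HasPart("column", "capital", "every"),
    HasPart("column", "plinth", "most"),
    HasPart("column", "necking", "some"),
]

# "The base is above the plinth, the shaft is above the base, and the
# capital is above the shaft."
ABOVE = [Above("base", "plinth"), Above("shaft", "base"), Above("capital", "shaft")]

# "The necking is between the shaft and the capital."
BETWEEN = [("necking", "shaft", "capital")]


def parts_with(quantifier, whole="column"):
    """List the parts of `whole` that were introduced with `quantifier`."""
    return [p.part for p in PARTS if p.whole == whole and p.quantifier == quantifier]


print(parts_with("every"))
print(parts_with("most"))
```

Keeping the quantifier on each fact preserves the difference between "every", "most", and "some", which matters for the defeasible reasoning discussed on the next slide.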

  7. Some challenges • Referring to unnamed entities:  Skolem functions are used in place of proper nouns, allowing us to describe properties of hypothetical or nonspecific entities such as "each column's base" • Context sensitivity:  When we say "the nave" or "the capital", which one are we referring to?  This depends on what was said in previous sentences.  Analysis takes place at the level of discourse, not at the sentence level. • Defeasible reasoning:  "Most columns have a necking" makes no definite universal claim; it allows for the possibility that a particular column has no necking • Partial ordering:  If we are told only that the capital is above the shaft, we do not know whether it is immediately above
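Two of these challenges, partial ordering and defeasible reasoning, can be illustrated with a short Python sketch. It is an assumption-laden illustration rather than ARC's method: `is_above` and `has_part` are hypothetical helpers, and the "above" facts are taken from the column example on the previous slide.

```python
# "Above" facts as (upper, lower) pairs. They state relative order only;
# nothing here says that one part is *immediately* above another.
ABOVE = {("base", "plinth"), ("shaft", "base"), ("capital", "shaft")}


def is_above(upper, lower, facts=ABOVE):
    """True if `upper` is above `lower` directly or through a chain of facts."""
    frontier, seen = [upper], set()
    while frontier:
        current = frontier.pop()
        for hi, lo in facts:
            if hi == current and lo not in seen:
                if lo == lower:
                    return True
                seen.add(lo)
                frontier.append(lo)
    return False


def has_part(column_facts, part, default):
    """Defeasible lookup: an explicit statement about a column overrides the default."""
    return column_facts.get(part, default)


print(is_above("capital", "plinth"))          # derived through shaft and base
print(is_above("plinth", "capital"))          # not derivable from the facts
print(has_part({}, "plinth", True))           # "most columns" default applies
print(has_part({"plinth": False}, "plinth", True))  # explicit exception wins
```

Because only relative order is stored, a necking can later be placed between the shaft and the capital without contradicting "the capital is above the shaft".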

  8. From ARC English to real-world descriptions • No matter how carefully we design ARC English, it will never capture the full range of English as used in scholarly articles about architecture • Real-world descriptions frequently contain information irrelevant to ARC, for example historical background • The task of the Administrator Mode software is more information extraction than natural language programming • The generic model tells us what hasn't been specified, and the software can search real-world descriptions to fill in the gaps (e.g. how many vaulting units are in the nave, whether the columns have a necking, how many stories in the elevation)

  9. Querying ARC • User mode interaction with ARC recalls natural-language database querying • Sample queries might include "How many vaulting units are in the nave at Saint-Denis?" or "Show me all cathedrals with a four-story elevation." • Whereas web searches only look for strings of characters, the ARC software will be able to process queries on a semantic level, resulting in more relevant information • ARC queries can also tell us whether a given description is underspecified (does not tell us all relevant information) or contradictory (contains incompatible information)
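The underspecified and contradictory cases can be sketched as a simple check over attribute statements extracted from a description. This is a minimal Python illustration under stated assumptions: the `REQUIRED` attribute names and the `check_description` function are hypothetical, and the sample values are invented for the example rather than taken from any real cathedral.

```python
# Attributes the generic model leaves open and expects a description to fill.
# These names are assumptions made for this sketch.
REQUIRED = ("nave_vaulting_units", "elevation_stories")


def check_description(statements):
    """Classify a description given as (attribute, value) pairs.

    Returns a pair: (attributes never stated, attributes stated with
    incompatible values).
    """
    values = {}
    for attr, value in statements:
        values.setdefault(attr, set()).add(value)
    missing = [a for a in REQUIRED if a not in values]
    conflicting = sorted(a for a, vs in values.items() if len(vs) > 1)
    return missing, conflicting


# Invented sample: two incompatible story counts, no vaulting-unit count.
missing, conflicting = check_description(
    [("elevation_stories", 3), ("elevation_stories", 4)]
)
print("underspecified:", missing)
print("contradictory:", conflicting)
```

A description with an empty first list is complete with respect to `REQUIRED`, and one with an empty second list is consistent.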
