
Textual-Geospatial Integration Project

JAMES FREW

University of California, Santa Barbara

Geospatially-Augmented Search
  • What’s here?
    • Find library objects associated with a given location:
      • Place name(s)
      • “Footprint” (geographic extent)
  • Where’s this?
    • Find the location(s) associated with a given library object
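
The two query directions can be sketched over a toy catalog. This is only an illustration: the `BBox` type, the `CATALOG` contents, the object ids, and the function names are hypothetical, and the coordinates are rough.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BBox:
    """Axis-aligned footprint in decimal degrees (hypothetical type)."""
    west: float
    south: float
    east: float
    north: float

    def intersects(self, other: "BBox") -> bool:
        return (self.west <= other.east and other.west <= self.east
                and self.south <= other.north and other.south <= self.north)


# Hypothetical catalog: object id -> (place names, footprint).
CATALOG = {
    "doc-1": ({"Yreka"}, BBox(-123.0, 41.0, -122.0, 42.0)),
    "doc-2": ({"Newport Beach"}, BBox(-118.0, 33.5, -117.8, 33.7)),
}


def whats_here(region: BBox) -> list:
    """'What's here?': ids of objects whose footprint overlaps the region."""
    return sorted(oid for oid, (_, fp) in CATALOG.items() if fp.intersects(region))


def wheres_this(object_id: str):
    """'Where's this?': place names and footprint recorded for an object."""
    return CATALOG[object_id]
```

A real service would query a spatial index rather than scan a dictionary; the scan is used here only to keep the sketch short.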
Examples (from TREC-9)
  • Find documents that contain residential real estate listings within New Jersey.
  • Find reports on automobile traffic in the Washington, DC metropolitan area.
  • What forms of entertainment are available in Newport Beach, California?
Why Is GAS® Difficult?
  • Few library objects have explicit locations
    • Assigned reliably
    • Identified in object’s metadata
  • Many objects (especially text documents) have implicit locations
    • Present in, or inferable from, object’s content
    • Not necessarily identified as locations
“Where’s This” Service
  • PARSE
    • Input: text document
    • Output: potential names, types, coordinates
  • LOOKUP
    • Resources: type thesaurus, gazetteer
    • Output: gazetteer entries (known places)
  • ANALYZE
    • Output: ranked footprints and placenames
  • EVALUATE
    • Output: “best” name(s), composite footprint

Geo-parsing
  • Extract “geographic facts” from text
  • Characterize by
    • Potential place component
      • name, type, footprint
    • Related fact (with preposition)
      • “in …”, “northeast of …”, etc.
    • Frequency
    • Importance
    • Context
Geo-parsing Example (1/2)

(California,,,,1,K)

(Callahan,,,(in,California),1,K)

(Callahan-Yreka,,,(area of,),1,T)

(Early Cambrian,,,,1,B)

(Klamath Mountains,,,(eastern,),1,T)

(Klamath Mountains,,,(within,),1,B)

(Klamath Mountains,,,,1,K)

(Northern California,,,,1,T)

(Ordovician,,,,1,B)

(Ordovician,,,,1,K)

(Paleozoic,,,(in,California),1,B)

(Paleozoic,,,,1,K)

Geo-parsing Example (2/2)

(Silurian,,,,1,K)

(Siskiyou County,,,(in,California),1,K)

(Skookum Gulch,,,,1,K)

(Skookum Gulch,,,,1,T)

(Skookum Gulch,,,,2,B)

(United States,,,,1,K)

(Yreka,,,(in,California),1,K)

(,fault,,,2,B)

(,rocks,,,6,B)

(,,N410000N420000W1220000W1230000,,1,C)

(,,,(in,North America),1,B)
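
The tuples above can be read mechanically. The sketch below assumes the field order is (name, type, footprint, related fact, frequency, code); that order is inferred from the examples, not stated on the slides. The meaning of the final code (K, T, B, C) is also not stated, so it is kept as an opaque string. `GeoFact`, `split_top_level`, and `parse_fact` are hypothetical names.

```python
from typing import NamedTuple, Optional, Tuple


class GeoFact(NamedTuple):
    name: str
    feature_type: str
    footprint: str
    related: Optional[Tuple[str, str]]  # (preposition or qualifier, related place)
    frequency: int
    code: str  # K, T, B or C; meaning not given on the slides


def split_top_level(text: str) -> list:
    """Split on commas that are not inside nested parentheses."""
    parts, depth, current = [], 0, []
    for ch in text:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        if ch == "," and depth == 0:
            parts.append("".join(current))
            current = []
        else:
            current.append(ch)
    parts.append("".join(current))
    return parts


def parse_fact(line: str) -> GeoFact:
    """Parse one tuple such as '(Callahan,,,(in,California),1,K)'."""
    inner = line.strip()[1:-1]  # drop the outer parentheses
    name, ftype, footprint, related, freq, code = split_top_level(inner)
    rel = None
    if related:
        prep, _, place = related[1:-1].partition(",")
        rel = (prep, place)
    return GeoFact(name, ftype, footprint, rel, int(freq), code)
```

For example, `parse_fact("(Callahan,,,(in,California),1,K)")` yields a record whose `related` field is `("in", "California")`.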

Lookup Example: Feature Type
  • Fault: partial match: fault zones
  • Rocks: use: natural rock formations
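
A minimal sketch of this kind of type lookup, assuming a thesaurus with preferred terms plus "use" references from non-preferred terms. The two entries come from the slide; the data layout, the word-level partial match, and the name `lookup_type` are assumptions.

```python
# Preferred feature-type terms (the two shown on the slide).
PREFERRED = {"fault zones", "natural rock formations"}

# "Use" references: non-preferred term -> preferred term.
USE = {"rocks": "natural rock formations"}


def lookup_type(term: str):
    """Return (match kind, preferred term) for a candidate type word."""
    t = term.lower()
    if t in PREFERRED:
        return ("exact", t)
    if t in USE:
        return ("use", USE[t])
    # Partial match: the word appears as a whole word in a preferred term.
    partial = sorted(p for p in PREFERRED if t in p.split())
    if partial:
        return ("partial match", partial[0])
    return ("none", None)
```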
Lookup Example: Gazetteer

Place Name            exact   partial
Skookum Gulch             1         0
Klamath Mountains         1         0
Northern California       1         0
California                1       492
Callahan*                 1         1
Silurian                  0         5
Siskiyou County*          1        14
United States             1       273
Yreka*                    1        12
North America             0         8

*within footprint of California
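
Exact and partial counts like these could be produced as follows. The slides do not define "partial"; this sketch assumes it means the candidate name occurs inside a longer gazetteer name. The entry list and `match_counts` are hypothetical and far smaller than a real gazetteer.

```python
# Toy gazetteer: a handful of hypothetical entry names.
ENTRIES = [
    "Yreka", "Yreka Creek", "Yreka Ditch",
    "California", "Gulf of California",
    "Skookum Gulch",
]


def match_counts(name: str, entries=ENTRIES):
    """Count exact matches and partial (substring) matches, ignoring case."""
    n = name.casefold()
    exact = sum(1 for e in entries if e.casefold() == n)
    partial = sum(1 for e in entries if n in e.casefold() and e.casefold() != n)
    return exact, partial
```

With this toy data, `match_counts("Yreka")` gives one exact and two partial matches, and a name that is absent from the list gives zero for both.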

Analysis Criteria
  • Placement in document
    • e.g. keywords, title > body
  • Frequency in document
  • Exact match in gazetteer
  • Accuracy of gazetteer footprint
    • e.g. points < bounding boxes
  • Scale of gazetteer footprint
    • Size of focus area / size of footprint
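
One way to combine these criteria is a multiplicative score. The slides give only the ordering hints (keywords and title above body, points below bounding boxes, and the focus-area ratio); every weight below, the `Candidate` type, and the `score` function are illustrative assumptions.

```python
from dataclasses import dataclass

# Illustrative weights only; the slides give orderings, not numbers.
PLACEMENT_WEIGHT = {"keywords": 3.0, "title": 3.0, "body": 1.0}
ACCURACY_WEIGHT = {"point": 0.5, "bbox": 1.0}


@dataclass
class Candidate:
    name: str
    placement: str         # "keywords", "title" or "body"
    frequency: int         # occurrences in the document
    exact_match: bool      # exact gazetteer match?
    footprint_kind: str    # "point" or "bbox"
    footprint_area: float  # square degrees; 0.0 for a point


def score(c: Candidate, focus_area: float) -> float:
    """Higher is better. Footprints much larger than the focus area are penalized."""
    if c.footprint_area > 0:
        scale = min(1.0, focus_area / c.footprint_area)
    else:
        scale = 1.0  # a point has no area, so no scale penalty is applied
    exact = 2.0 if c.exact_match else 1.0
    return (PLACEMENT_WEIGHT[c.placement] * c.frequency * exact
            * ACCURACY_WEIGHT[c.footprint_kind] * scale)
```

Under these assumptions a very large footprint such as a whole country scores far below a local feature, even when both are exact gazetteer matches found in the same part of the document.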
Analysis Example: Results
  • High confidence
    • Callahan in California
    • Yreka in California
    • Skookum Gulch
    • Klamath Mountains (eastern)
    • Siskiyou County
  • Low confidence
    • Northern California
    • United States
    • North America

Evaluation Example
  • Derived footprint
    • Eastern Klamath Mountains
    • Area of Callahan-Yreka
    • Skookum Gulch
  • Places shown
    • Skookum Gulch
    • Klamath Mountains
    • California
    • Callahan in California
    • Siskiyou County in California
    • United States
    • Yreka in California
  • Additional placenames
    • Shasta Butte City
    • Yreka City
    • Thompson's Dry Diggings
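
The slides do not say how the composite footprint is derived. One simple option, sketched here as an assumption, is the union of the bounding boxes of the high-confidence places. The sketch also decodes the coordinate string from the geo-parsing example, assuming each token is a hemisphere letter followed by degrees, minutes, and seconds (DDMMSS or DDDMMSS). `decode_box` and `union` are hypothetical names.

```python
import re


def decode_box(text: str):
    """Decode e.g. 'N410000N420000W1220000W1230000' to (west, south, east, north)."""
    lats, lons = [], []
    for hemi, digits in re.findall(r"([NSEW])(\d+)", text):
        # Last four digits are minutes and seconds; the rest are degrees.
        deg, minutes, sec = int(digits[:-4]), int(digits[-4:-2]), int(digits[-2:])
        value = deg + minutes / 60 + sec / 3600
        if hemi in "SW":
            value = -value
        (lats if hemi in "NS" else lons).append(value)
    return (min(lons), min(lats), max(lons), max(lats))


def union(boxes):
    """Smallest box (west, south, east, north) containing all given boxes."""
    wests, souths, easts, norths = zip(*boxes)
    return (min(wests), min(souths), max(easts), max(norths))
```

A bounding-box union can greatly overstate the area when the inputs are far apart, which is one reason to restrict it to high-confidence places.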

“What’s Here” Service
  • Components
    • AIRE
    • User Interface
    • Query Parser
    • Query Expansion
    • Gazetteer
    • Document Ranker
  • Example
    • Query: Bodies of Water near Chicago
    • Expansion terms: Lake Michigan, Chicago River
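
The expansion step in this example can be sketched with a toy gazetteer. Here "near" is assumed to mean that a feature's bounding box overlaps the place's box grown by a margin. The entries, their rough coordinates, the class names, and `expansion_terms` are all hypothetical.

```python
# Toy gazetteer: (name, class, (west, south, east, north)); coordinates are rough.
GAZETTEER = [
    ("Chicago", "populated places", (-87.9, 41.6, -87.5, 42.0)),
    ("Lake Michigan", "water bodies", (-88.0, 41.6, -84.8, 46.1)),
    ("Chicago River", "water bodies", (-87.7, 41.8, -87.6, 41.9)),
    ("Lake Erie", "water bodies", (-83.5, 41.4, -78.9, 42.9)),
]


def intersects(a, b) -> bool:
    """True when two (west, south, east, north) boxes overlap."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def grow(box, margin):
    """Expand a box by `margin` degrees on every side."""
    w, s, e, n = box
    return (w - margin, s - margin, e + margin, n + margin)


def expansion_terms(place: str, feature_class: str, margin: float = 0.25):
    """Names of features of the given class whose box lies near the place."""
    anchor = next(box for name, _, box in GAZETTEER if name == place)
    region = grow(anchor, margin)
    return [name for name, cls, box in GAZETTEER
            if cls == feature_class and intersects(region, box)]
```

With this data, `expansion_terms("Chicago", "water bodies")` returns Lake Michigan and Chicago River and leaves out Lake Erie.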

Manual Relevance Feedback
  • Components: AIRE, User Interface, Query Parser, Gazetteer
  • Place names in query: “Chicago”
  • Spatial synonyms: “Chicago, IL”, “Chicago River”

Automatic Relevance Feedback
  • Components: AIRE, Gazetteer, RF System, Document Ranker
  • Place names, surrounding type terms: “Bodies of Water”
  • Spatial query results: “Chicago River, Lake Michigan”
  • Expanded query

“What’s Here” Components
  • Place names → footprints
    • Requires: place name ranking scheme
      • Chicago, IL > Chicago tectonic plate in Brazil
  • Type terms → classes
    • Requires: class thesaurus API
      • “Bodies of Water” → “Water Bodies”
  • Gazetteer → spatial synonyms
    • Requires: gazetteer API; results ranking
      • “Bodies of Water near Chicago” → set of gazetteer queries
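
Splitting a query such as “Bodies of Water near Chicago” into a type term, a spatial relation, and a place name might look like the sketch below. The regular expression, the list of relations, the one-entry thesaurus, and `decompose` are assumptions. Only the “Bodies of Water” to “Water Bodies” mapping comes from the slide.

```python
import re

# Type term -> class; the single entry mirrors the slide's example.
CLASS_THESAURUS = {"bodies of water": "water bodies"}

# Hypothetical pattern: <type term> <relation> <place name>.
QUERY = re.compile(
    r"^(?P<type>.+?)\s+(?P<relation>near|in|within)\s+(?P<place>.+)$",
    re.IGNORECASE,
)


def decompose(query: str):
    """Return the parts of a spatial query, or None if it does not fit the pattern."""
    m = QUERY.match(query.strip())
    if m is None:
        return None
    type_term = m.group("type").lower()
    return {
        "feature_class": CLASS_THESAURUS.get(type_term, type_term),
        "relation": m.group("relation").lower(),
        "place": m.group("place"),
    }
```

The resulting class and place could then drive one gazetteer query per candidate footprint of the place name.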
The Light at the End of the Tunnel
  • You submit:
    • a document
  • You get:
    • a place
      • Best
      • Also-rans
      • Alternatives
  • What you do with this is your business
Brought To You By
  • UCSB
    • Linda Hill
    • Greg Janée
    • Dave Valentine
    • Satoshi Ikeda (Japan Patent Office)
  • IIT
    • Steven Beitzel
    • Ophir Frieder
    • David Grossman
    • Eric Jensen
    • Vasif Shaikh