1 / 26

“ Duplicate ” Entries in Gazetteers

“ Duplicate ” Entries in Gazetteers. jordan Hastings Department of Geography University of California Santa Barbara. Names & Features (1). Naming Features in the Environment Linguistic Necessity Identity and Ownership Navigation and Wayfinding Features Cover a Large Territory

rmargareta
Download Presentation

“ Duplicate ” Entries in Gazetteers

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. “Duplicate” Entries in Gazetteers jordan HastingsDepartment of GeographyUniversity of CaliforniaSanta Barbara

  2. Names & Features(1) • Naming Features in the Environment • Linguistic Necessity • Identity and Ownership • Navigation and Wayfinding • Features Cover a Large Territory • Crisp or Diffuse • Compact or Extended • Tangible or Abstract

  3. Names & Features(2) • Locations are Numerous & Various • Multiscale • Generalized • Dis-coordinated • Time-variant

  4. Names & Features(3) • Names are Numerous & Various • Polynymous • Mis-spelled • Multilingual • Time-variant

  5. Names & Features(4) Lake Bigler, thru 1920s Lake Bonpland (also Bondland), thru 1890s Da-ow-a-ga, thru 1850s

  6. Feature Types (1) • Dependable Type System • Because Features are “Objects” • Because Human Mind Categorizes • Types present in Taxonomy • Hierarchy is Natural in Environment • Because Human Mind Categorizes

  7. Feature Types(2) – Examples Cultural Environment • Nations -> States -> Provinces -> Districts

  8. Feature Types(2) - Examples • Physical Environment • Watersources: Springs-->Seeps • Watercourses: Rivers-->Streams-->Creeks • Waterbodies: Lakes-->Ponds-->Sloughs ?Glaciers

  9. Fundaments (1) • Definition: Gazetteer A spatial dictionary of named & typed features in the environment • Implications • Features uniquely identified • Searchable by name and type • Also searchable geospatially

  10. Fundaments (2) • Duplicates: An approximate notion • Firm types, ±close in hierarchy • Locations ±close dependent on scale • Names ±close dependent on language … or not at all • All aspects variant in time

  11. Fundaments (3) • Database Implications / Support • Custom Datatypes • Hierarchy • Geometry • Multiple Attribution (unlimited) • Names • Locations • Efficient Geospatial Processing

  12. Approach(1) • Independent Measures of Duplicates • 1. Type Thesaurus Metrics • Inter-feature: hierarchy, explicit linkages • 2. Geospatial Metrics • Intra-feature: size, compactness, … • Inter-feature: distance, overlap, … • 3. Geonomial Metrics • Intra-feature: NL translation [not considered yet] • Intra-feature: stemming, soundex, substitution

  13. Approach(2) • Unified Assessment of Duplicates • Weighted Combination of Measures • 1 Type • 2 Location(s) • 3 Name(s) • Geographic Visualization, over Maps • Final Authority of Human Cataloger

  14. Gazetteer “Duplicates”Processing Cycle random features prep grouped features rework

  15. Gazetteer “Duplicates”Processing Cycle random features prep grouped features rework

  16. random features prep grouped features weigh accepted suspended featuredatabase Gazetteer “Duplicates”Processing Cycle

  17. random features prep grouped features weigh accepted suspended featuredatabase Gazetteer “Duplicates”Processing Cycle review

  18. Gazetteer “Duplicates”Processing Cycle random features prep grouped features rework weigh review accepted suspended post featuredatabase reject trash

  19. [end]

More Related