Geoparsing
Geoparsing is the process of identifying geographic references in text and linking geospatial locations to these references so that the text can be accessed through spatial retrieval methods and is suitable for spatial analysis. Geoparsing is used to add geospatial locations to written text, oral discourse, and legacy scientific data in which location was recorded with placename references only. Applications include the processing of enterprise technical documents, intelligence surveillance, and unlocking a treasure trove of biological specimen and observation data heretofore not suitable for geospatial analysis.
The process, also known as toponym resolution, is based on linguistic analysis of text strings, looking for proper names in a context that indicates the likelihood that the name is a placename. For example, the capitalized word Cleveland can be identified as a potential placename on the basis of adjacent words and phrases, such as in, near, and south of, rather than being the name of a U.S. president (i.e., Grover Cleveland). These candidate names are submitted to a gazetteer lookup process. When a match is made to a single gazetteer entry, the associated information from the gazetteer can be linked to the text. The context of the proper names is used both to flag the name as a possible placename and to refine the meaning of the phrase containing the placename. For example, “in Cleveland” and “25 miles south of Cleveland” indicate different locations. The geoparsing software can use such information to assign a geospatial location derived from the geospatial footprint specified in a gazetteer entry, modified by any offset expressed in terms of distance, direction, and units of measure.
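The steps described above (context cues, gazetteer lookup, and offset adjustment) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the method of any particular geoparsing product: the one-entry `GAZETTEER`, the approximate coordinates for Cleveland, the regular expression, and the flat kilometers-per-degree conversion are all hypothetical simplifications.

```python
import math
import re

# Hypothetical one-entry gazetteer: name -> approximate (latitude, longitude).
GAZETTEER = {"Cleveland": (41.4993, -81.6944)}

# Compass bearings in degrees clockwise from north.
BEARINGS = {"north": 0.0, "east": 90.0, "south": 180.0, "west": 270.0}
KM_PER_UNIT = {"miles": 1.609344, "km": 1.0}
KM_PER_DEGREE_LAT = 111.32  # rough spherical approximation (assumption)

# Either "<distance> <unit> <direction> of" or a bare spatial cue ("in", "near"),
# followed by a capitalized word that becomes the candidate placename.
PATTERN = re.compile(
    r"(?:(?P<dist>\d+(?:\.\d+)?)\s+(?P<unit>miles|km)\s+"
    r"(?P<dir>north|south|east|west)\s+of"
    r"|\b(?:in|near))\s+(?P<name>[A-Z][a-z]+)"
)


def geoparse(text):
    """Return (matched phrase, latitude, longitude) for each resolved reference."""
    results = []
    for m in PATTERN.finditer(text):
        name = m.group("name")
        if name not in GAZETTEER:
            continue  # candidate has no gazetteer match
        lat0, lon0 = GAZETTEER[name]
        lat, lon = lat0, lon0
        if m.group("dist"):
            # Shift the gazetteer point by the stated distance and direction.
            km = float(m.group("dist")) * KM_PER_UNIT[m.group("unit")]
            bearing = math.radians(BEARINGS[m.group("dir")])
            lat = lat0 + km * math.cos(bearing) / KM_PER_DEGREE_LAT
            lon = lon0 + km * math.sin(bearing) / (
                KM_PER_DEGREE_LAT * math.cos(math.radians(lat0))
            )
        results.append((m.group(0), round(lat, 4), round(lon, 4)))
    return results


if __name__ == "__main__":
    sample = ("President Cleveland spoke. We met in Cleveland "
              "and camped 25 miles south of Cleveland.")
    for phrase, lat, lon in geoparse(sample):
        print(phrase, lat, lon)
```

In this sketch "President Cleveland" is never flagged because no spatial cue precedes the name, while "in Cleveland" and "25 miles south of Cleveland" resolve to different coordinates.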
In many cases, more than one gazetteer entry is a potential match for the candidate proper name. There are several ways to refine the matching process. For example, if the text surrounding the name contains a type term, such as lake or mountains, or if a general location for the place has been named, such as a country or state, these clues can be added to the gazetteer lookup process. So, if the text that has references to “Cleveland” also references “Ohio” prominently or frequently, then the assumption can be made that the “Cleveland” reference is the city in Ohio rather than some other populated place named “Cleveland,” such as “Cleveland, New York.”
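One way to picture this refinement is as a simple scoring pass over the candidate entries. The sketch below is illustrative only: the `Entry` fields, the three-row `GAZETTEER`, and the scoring weights are assumptions made for the example, and a real gazetteer would carry far more entries and attributes.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    name: str
    feature_type: str  # type term such as "city" or "village"
    region: str        # containing state or country


# Hypothetical, hand-built gazetteer with several places named Cleveland.
GAZETTEER = [
    Entry("Cleveland", "city", "Ohio"),
    Entry("Cleveland", "village", "New York"),
    Entry("Cleveland", "city", "Tennessee"),
]


def score(entry, text):
    """Score an entry by region mentions (weighted) and a matching type term."""
    lowered = text.lower()
    words = set(re.findall(r"[a-z]+", lowered))
    region_mentions = lowered.count(entry.region.lower())
    type_bonus = 1 if entry.feature_type in words else 0
    return 2 * region_mentions + type_bonus


def disambiguate(name, text, gazetteer=GAZETTEER):
    """Return the best-supported entry, or None if the context does not decide."""
    ranked = sorted(
        (e for e in gazetteer if e.name == name),
        key=lambda e: score(e, text),
        reverse=True,
    )
    if not ranked or score(ranked[0], text) == 0:
        return None  # no contextual clue at all
    if len(ranked) > 1 and score(ranked[0], text) == score(ranked[1], text):
        return None  # still ambiguous; leave unresolved rather than guess
    return ranked[0]


if __name__ == "__main__":
    text = ("Ohio regulators inspected the plant in Cleveland, "
            "and Ohio officials approved it.")
    print(disambiguate("Cleveland", text))
```

Returning `None` when two entries tie is a deliberate choice in this sketch: it keeps an unresolved reference visible instead of silently picking one candidate.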
The level of confidence in geoparsing results is often an issue, for several reasons. The lexical analysis itself is not perfect when applied to unstructured text. The quality of the gazetteer is also a factor in terms of the completeness of its coverage, the inclusion of alternate forms of the placenames, and the accuracy and detail of its geospatial information. In some cases, the gazetteer itself might include confidence levels for its data, especially when covering ancient features where descriptive information is contradictory or incomplete. When the textual reference is of the form “25 miles south of Cleveland,” the actual location can be estimated only to be within some area south of the coordinates given for Cleveland. For these reasons, geoparsing results are often accompanied by an indication of confidence. One method is to assign a point and a radius, with the length of the radius indicating the confidence level.
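The point-and-radius idea can be illustrated with a small heuristic. The sketch below is an assumption-laden simplification, not a published uncertainty model: the function name `point_radius`, the 45-degree default spread for a compass term such as "south," and the rule of adding the lateral error to the footprint radius are all hypothetical choices made for the example.

```python
import math
from dataclasses import dataclass


@dataclass
class PointRadius:
    lat: float
    lon: float
    radius_km: float  # larger radius means lower confidence


def point_radius(lat, lon, footprint_radius_km, offset_km=0.0,
                 direction_spread_deg=45.0):
    """Attach an uncertainty radius to an estimated location.

    footprint_radius_km: extent of the named place itself (assumed known
        from the gazetteer footprint).
    offset_km: distance stated in the text, such as 25 miles converted to km.
    direction_spread_deg: assumed angular width covered by a direction word.
    """
    half_spread = math.radians(direction_spread_deg / 2)
    # Sideways error grows with the stated distance from the named place.
    lateral_error_km = offset_km * math.sin(half_spread)
    return PointRadius(lat, lon, footprint_radius_km + lateral_error_km)


if __name__ == "__main__":
    in_city = point_radius(41.4993, -81.6944, footprint_radius_km=10.0)
    south_of = point_radius(41.1379, -81.6944, footprint_radius_km=10.0,
                            offset_km=25 * 1.609344)
    print(in_city)
    print(south_of)
```

Under these assumptions, a reference with an offset always receives a larger radius than a plain "in Cleveland" reference, reflecting the lower confidence in its location.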
...