Lecture 01: Introduction To GIS and GIS Data Models
Lecture 01: Introduction To GIS and GIS Data Models
Lecture 01: Introduction To GIS and GIS Data Models
2
Definition 1: A GIS is atoolbox
3
Definition 2: A GIS is anInformation System
(Database definitions)
• a System ‐ a group of connected entities and activities
4
Definition 3: GIS is an approach toscience
5
GIScience: Contributing Disciplines
• Geography • Statistics
• Cartography • Operations research
• Remote Sensing • Computer Science
• Photogrammetry • Mathematics
• Surveying • Information Science
• Geodesy • Management science
6
Scholarly Journals Emphasizing GIS Research
7
Components of a GIS
1. Data
2. Hardware (computer system)
3. Software
4. Brainware: People/Procedures/Plan
5. Infrastructure: GIS operation environ mental
8
Basic Elements of a GIS:Data
• Non‐Spatial Data
• Attributes or information that describes the spatialentity
9
GIS Data: Spatial is Special
• Experts estimate that as much as 80% of the cost associated with a GIS
system is related to the development and maintenance of its spatial data
10
DataInformation
Data – numbers, text, symbols Geographic Information
• Sea surface temperature,soil (map, digital form)
type, population density • Information about places onEarth’s surface
Geographic versus spatial
Information – differentiated Geographic refers to Earth’s surfaceand
from data near surface
• implying some degree of Spatial refers to any space (moregeneral)
selection, organization, and • Knowledge about where somethingis
preparation for particular • Knowledge about what is at a given location
purpose, or
• data given some degreeof
interpretation • Can be very detailed or verycoarse
• Can be relatively static or changerapidly
• Can be very sparse orvoluminous
11
Basic Elements of a GIS:Hardware
12
Basic Elements of a GIS:Software
13
Basic Elements of a GIS: aPlan
14
A typical GIS process: start to think about your
final project now
1. Understanding basic geographic concepts 4. Database manipulation
• Projections, datums, coordinate systems • Attribute data
• Reading maps • Database management
2. Formulating a game plan • Computer database types
• Planning the process 5. Analysis techniques
3. Acquiring data • Spatial analysis
• Data storage formats • Models andmodeling
• Data sources Cartographic
• Data challenges Interpolation
Dynamic modelling
6. Presenting the results
• Map creation anddesign
15
Evolution of GIS: From Stand‐alone to WebServices
Mapping TroupMovements
16
GIS Web Service Chain
Overlay
Service
Web
CoverageService Vector Data
Provider Service
17
Prevailing GIS Software
19
Part 2: GIS Data Models
GIS FileStructure
21
GIS Data Conceptual Model
The strategy chosen depends on whether one takes the field‐view versus
object‐view of reality…
22
GIS Data Model: Vector vs.Raster
Real World
0 1 2 3 4 5 6 7 8 9
0 R T
1 R T
point line 2 H R
3 R
4 R R
5 R
6 R T T H
7 R T T
polygon 8 R
9 R
23
Data Modeling Processing
25
Spatial Data
Spatial Data
Non-topological Topological
26
Vector Data Model
• point (node):0‐dimension
• single x,y coordinate pair
• zero area 2
y=2
Point: (1,2)
• tree, oil well, label location x=1
1
• line (arc):1‐dimension
• two (or more) connected x,y 1 2
coordinates 2
• road, stream 1 Line: (1,2), (2,1)
• polygon :2‐dimensions
1 2
• four or more ordered and connected
x,y coordinates 2
Polygon: (1,2),
• first and last x,y pairs are thesame (2,1), (1,1) , (1,2)
1
• encloses an area
• census tracts, county, lake 1 2
27
Two Common Vector Models
C A
28
Spaghetti Vector Model
• Lines and points are entered and may be visible, but the program does
not recognize the relationships betweenlines.
• Each line is represented as a separate feature with a start node and an
end node, possibly vertices inbetween.
• Still exists during data entry andediting
C
• You have to build topologysomehow A
29
Why Topology Matters
• Getting data to lineup,
connect, intersect,
move together
• Important for GIS
operations and analyses
• Coordinate
transformation
• Map projection
• Area calculations
• Queries
• In order to do this we
use topology
30
Definition: Topology
31
Topological Vector Data Model
32
Topology vs. Coordinate (Continue)
• A topologically accurate
map: relationships between
subway stations are
accurately shown.
33
How to define GISTopology
• Based on
• Point/Node: Where lines
begin, end, or intersect Point/Node
• Line/Link: Line segments
between two nodes
• Polygon: composed of
alternating links and
nodes Polygon Line/Link
• Unique identifiers are
assigned to each link,
node, and polygon
34
Define Topology
• Node Topology Table: List of the links that meet at each node.
• Link (or line segment) Topology Table: List of the beginning and end nodes for
each link; polygons to the right and left to the link. (“From node,” “To node,”
“right poly,” “left poly”
35
A Topology Example
36
Vector Data Creation
• input of the spatialdata
• Digitizing/scan then vectorized
• Build topology
• input of the attributedata
• linking spatial and attribute
data
37
Common Vector GIS DataFiles
• Coverage (topological)
• Shape File (non‐topological)
• GeoDatabase (topological)
• MapInfo (topological)
• TIN (topological)
• CAD (non topology)
38
Raster Data Model
Real World
Overlay grid on
the Real World
Raster representation
Cells
39
Raster Data Model
• The finer the grid size, the more precise the information about the real
world is
Real World
Cells are usually assigned the value of the object taking up the greatest part of
the cell area – Bolstad has a long discussion.
40
Attribute Data and Coor.System
41
Resolution
42
Raster Cell Size
43
Resolution and Scale
44
Raster as Thematic Layers
45
Primary Uses for RasterData
• Modelsdescribing continuous attributes of the real world.
• Elevation, soils, temperature, etc.
• Images (satellites, scanned maps, photographs).
• Output (e.G. Printers, plotters, monitors).
46
Vector vs. Raster Data Model
Continuous data (Raster) Discrete (Vector)
Simple data structure Complex data structure
Large data volumes Compact Data File
Easy overlay Overlay is more difficult
Rapid data collection Slow data collection
Poor network analysis Possibility of Network analysis
No topology stored (no relationships
Efficient Topology
shown)
High spatial variability Low spatial variability
Suitable for highly variable data Good for homogeneous data
Lower positional accuracy Potentially excellent positional accuracy
Determined by cell size Given by (X,Y) coordinates
Low geometric accuracy High geometric accuracy
Better suited for imagery Better suited for graphics
47
Data Conversion
• Data can be transformed from one of these data models to the other
Vectorization
•
Rasterization
•
Some information is always lost when converting from one data format to the
other.
48
Rasterization
49
Vectorization
50
Data In ArcCatalog
51
Data File Display Differently
52
ESRI GIS DataFiles
GeoDatabase
Shapefile Coverage
MXD file
Image
Pay attention to the icon shape and color
53
How shapefiles are stored
Shapefiles: simple, non‐topological format, storing the geometriclocation
and attribute information.
• .shp ‐ the file that stores the feature geometry.Required.
• .shx ‐ the file that stores the index of the feature geometry. Required.
• .dbf ‐ the dBASE file that stores the attribute information of features.
Required.
• .sbn and .sbx ‐ the files that store the spatial index of the features.
Optional.
• .fbn and .fbx ‐ the files that store the spatial index of the features for
shapefiles that are read‐only.Optional.
• .ain and .aih ‐ the files that store the attribute index of the active fields in
a table or a theme'sattribute table. Optional.
• .prj ‐ the file that stores the coordinate system information.Optional.
• .xml ‐ metadata for ArcGIS. Optional.
Tech details: http://www.esri.com/library/whitepapers/pdfs/shapefile.pdf
54
How coverage files are stored?
55
GeoDatabase
56
MXD FileFormat
57
Relative Path vs. Absolute Path
58
Absolute (full) and relative path
59
Why use relative vs. absolutepath?
60
Do not losethe big picture!
61