4.1. Raster datasets - formats and finding them on the Internet
Raster datasets are important in Web-based mapping. In this section, we will discuss two aspects related to Raster datasets: some common formats, and where you might go to find some already “ready made” for use in WebGIS projects.
4.1.1. Raster Data Formats
If you search Wikipedia [1] for GIS Data formats, you find the following definition under the subheading “Raster”:
“A raster data type is, in essence, any type of digital image represented by reducible and enlargeable grids. Anyone who is familiar with digital photography will recognize the Raster graphics pixel as the smallest individual grid unit building block of an image, usually not readily identified as an artifact shape until an image is produced on a very large scale. A combination of the pixels making up an image color formation scheme will compose details of an image, as is distinct from the commonly used points, lines, and polygon area location symbols of scalable vector graphics as the basis of the vector model of area attribute rendering. While a digital image is concerned with its output blending together its grid based details as an identifiable representation of reality, in a photograph or art image transferred into a computer, the raster data type will reflect a digitized abstraction of reality dealt with by grid populating tones or objects, quantities, cojoined or open boundaries, and map relief schemas. Aerial photos are one commonly used form of raster data, with one primary purpose in mind: to display a detailed image on a map area, or for the purposes of rendering its identifiable objects by digitization. Additional raster data sets used by a GIS will contain information regarding elevation, a digital elevation model, or reflectance of a particular wavelength of light, Landsat, or other electromagnetic spectrum indicators.
Raster data type consists of rows and columns of cells, with each cell storing a single value. Raster data can be images (raster images) with each pixel (or cell) containing a color value. Additional values recorded for each cell may be a discrete value, such as land use, a continuous value, such as temperature, or a null value if no data is available. While a raster cell stores a single value, it can be extended by using raster bands to represent RGB (red, green, blue) colors, colormaps (a mapping between a thematic code and RGB value), or an extended attribute table with one row for each unique cell value. The resolution of the raster data set is its cell width in ground units.”
As you search for data on the Internet, or if you create your own raster data by using a digital camera (e.g., such as a camera on an airplane or on a quadcopter to capture landcover imagery), you need to be familiar and be able to recognize, particular raster data formats. There is a large list of different formats. Take a moment to see the full list that the GDAL translation and utility software supports. Visit:
http://www.gdal.org/formats_list.html
There is a lot of raster data formats, isn’t there!
Understanding and recognizing at least the commonly used formats is an important skill to gain. Below, we provide some links to Internet information that describes some of the common formats you might encounter. Take a moment to familiarize yourself with these.
ESRI Grid. This is a proprietary raster data storage format that is native to the ArcGIS software package by ESRI. These are made up of a number of different files and are stored in a file directory or folder, not an individual file.
ESRI ASCII raster format. This is a non-proprieary raster format that is useful for transferring or importing ESRI GRID data into other raster software packages. These have an extension name of “.asc”.
GeoTIFF is a collaborative efforts by many different companies and organizations to establish an agreed upon TIFF-based interchange format for georeferenced raster imagery. The coordinate system information can be stored in the GeoTIFF file itself, or sometimes is accompanied by a Tiff World File (.tfw).
GIF. The Graphical Interchange Format is a raster image format that supports up to 8 bits per pixel per image, allowing a single raster image to be able to display up to 256 different colors. It also supports animations. Extension name: .gif.
IMG. While there are several uses of .img, in raster processing we often encounter .img files that are a proprietary raster format developed for the raster software ERDAS Imagine. For example, this format is one of the raster data formats provided by the National Map Viewer for the National Elevation dataset.
JPG. This stands for Joint Photographic Experts Group and is a commons standard for representing one band (greyscale) or three band (Red, Green, Blue or RGB) datasets. File extension name is .jpg.
JP2 (JPEG 2000) is an image compression standard that provides some increased compression performance compared to its predecessor JPG. File extension name is .jp2.
KML. Keyhole Markup Language. This is is a data format for storing place locations, polygons, images and other things for use with Google Earth, formally called Keyhole Earth Viewer. KML format is an accepted open standard by the OGC. KML files are often distributed in a .KMZ file, which is a zipped, compressed file format.
PNG. Portable Network Graphics. This is a raster graphics file format that supports “lossless data compression”, which means that the digital information is compressed by an algorithm that allows the original data to be completely reconstructed from the compressed data. This differs from “lossy compression” that reconstructs only an approximation of the original compressed data. PNG was invented as a replacement for another graphics format - GIF. Extension name .png.
TIF or TIFF. This Tagged Image File Format is a raster file format that has been around for some time, often used to store “raster graphics” images that were used in the publishing industry and by photographers. The extension name is either .tif or .tiff.
There is even a raster format for PostGIS, called PostGIS Raster (formerly called WKTRaster).
In short, for WebGIS, raster files are a critical component. You’ve already seen Web Mapping Services which transmit raster files. Also, soon we will be discussing the concept of “Tiling” or “Tiled Maps” which use raster images.
References
[1] https://en.wikipedia.org/wiki/GIS_file_formats#Raster_formats
4.1.2. Searching for Raster Data on the Internet
It is really incredible just how much available data these days. There are a variety of web-database repositories “out there” that provides access to GIS data (vector, raster) in general, and raster data specifically. You may have a lot of experience looking for GIS data on the Internet already. But if you don’t, take a few moments to visit some of the GIS data repositories in the table below, just to see the kinds of data that are available, and note that the table below is just a listing of some of the Internet data repositories available.
If you are looking to try out a couple of these sites to see what you can find for an area of your own interest, we’d recommend you try the National Map Viewer or the USGS Earth Explorer.
Name | Description | Website |
---|---|---|
GENERAL RESOURCES | ||
Mass GIS | Massachusetts spatial data | http://www.mass.gov/anf/research-and-tech/it-serv-and-support/application-serv/office-of-geographic-information-massgis/datalayers/layerlist.html |
National Atlas | U.S. spatial data | http://nationalatlas.gov/atlasftp.html |
USGS Sagemap | Spatial data for western US | http://sagemap.wr.usgs.gov/GISData.aspx |
UMass facilities | Campus projects, facilities & planning | http://www.umass.edu/cp/maps.htm |
National Map | U.S. spatial data | http://viewer.nationalmap.gov/viewer/ |
National Gap Analysis | Species and Ancillary Data | http://www.gap.uidaho.edu/species_data.html |
ArcGIS Open Data | Contributed data from the ESRI GIS community | https://opendata.arcgis.com/about |
Name | Description | Website |
---|---|---|
CLIMATE DATA | ||
NOAA National Weather Service | Current and past weather events | http://www.nws.noaa.gov/gis/ |
Worldclim | Global climate data (and 2050 and 2070 projections | http://worldclim.org |
Climate Wizard | US or global climate data (and 2050 projections) | http://www.climatewizard.org/ |
PRISM | US climate data (download the 'normals' for average climate) | http://www.prism.oregonstate.edu/ |
USGS climate change projections | US downscaled climate projections | http://cida.usgs.gov/gdp/ |
Name | Description | Website |
---|---|---|
OTHER ENVIRONMENTAL DATA | ||
Soil Information for Environmental Modeling and Ecosystem Management | Various soil metrics for the contiguous US | http://www.soilinfo.psu.edu/index.cgi?soil_data&conus&data_cov&poros&datasets |
Geospatial Data Gateway | U.S. Natural Resources | http://datagateway.nrcs.usda.gov/GDGHome.aspx |
US EPA Ecoregions | Ecoregions of North America | http://www.epa.gov/wed/pages/ecoregions/na_eco.htm |
Topography (fine resolution) | Under datasets -> Digital Elevation ->SRTM void filled | http://earthexplorer.usgs.gov/ |
Topography (coarse resolution) | 30 second is ~1 km, 2.5 min is ~ 5 km - download by tile if you want only part of the globe | http://www.worldclim.org/current |
Remotely sensed imagery | Various - select under datasets | http://earthexplorer.usgs.gov/ |
Hydrography | watersheds, catchments and other hydrography | http://www.horizon-systems.com/NHDPlus/NHDPlusV2_data.php |
Stream Stats - watershed delineation | Generates watershed above user-defined point location - Use Version 3 | http://water.usgs.gov/osw/streamstats/ |
Northeast terrestrial habitat | habitat layers that are northeast specific (big file) | http://conserveonline.org/workspaces/ecs/documents/ne-terrestrial-habitat-mapping-project |
Fire | Fire perimeters and land cover data for US | http://landfire.cr.usgs.gov/viewer/ |
WUI | Wildland urban interface (USA) | http://silvis.forest.wisc.edu/maps/wui_main |
Northeast ocean data | Various-marine and fisheries data, climate, managment, some marine fish | http://northeastoceanviewer.org/# |
Land cover data | National Land Cover Database (NLCD), raster | http://www.mrlc.gov/ |
Name | Description | Website |
---|---|---|
SPECIES DATA | ||
VegBank | Percent cover of plant species in the US and Canada | http://vegbank.org/vegbank/index.jsp |
Global biodiversity information facility | Global herbaria & specimen collections with location info | http://www.gbif.org/species |
Movebank | Animal track data for a few species | https://www.datarepository.movebank.org/ |
EDDMapps | Invasive plant distribution data in the US | http://www.eddmaps.org/ |
IOOS | Integrated Ocean Observing System (takes some digging) | http://www.ioos.noaa.gov/observing/welcome.html |
Name | Description | Website |
---|---|---|
HUMAN & POLITICAL | ||
TIGER shapefiles | Census demographic data and ACS block group data on economics | http://www.census.gov/geo/maps-data/data/tiger-data.html |
TIGER line files | Roads and political boundaries | http://www.census.gov/geo/maps-data/data/tiger.html |
Global anthromes | Anthropogenic land use in global ecosystems | http://ecotope.org/anthromes/v2 |