6 Spatial data acquisition and standards¶
Key messages¶
Spatial data collection is always in reference to some theoretical model of the earth. The recommended ‘model earth‘ or Terrestrial Reference Framework (TRF) to locate a point on the earth’s surface is GWS84 using decimal degrees Latitude and Longitude coordinates with at least four decimal places (d.dddd), for example Latitude: 52.0525, Longitude 5.2667. In this way locations are indicated with an accuracy of 10m.
Using a Navigational Satellite System (NSS) Modern smartphones have enough accuracy to map geolocations in the field using the internal receiver for NSS signals. Depending on the conditions accuracies between 2 and 9m can be reached.
The NAVSTAR GPS system is the standard system currently in use on most mobile devices, however some devices are capable of combining GPS with other navigation systems, providing a higher accuracy and shorter search time for satellites connections.
Make sure that the accuracy while measuring is < 10m. Automated averaging of multiple measurements of the same location is a practical solution to reduce error towards 1m.
To measure the boundary of an area (a polygon) only the corners should be measured manually minimizing the error of the area measured. On the contrary a strategy of measuring multiple points along the edge of an area or using an automated tracking function in your device reduces the accuracy.
Using screen digitization Locations and areas can also be mapped from a screen. Screen digitization is always in relation to another data source (aerial photograph, satellite image or GIS file). There are a number of points to take into account:
- The TRF of the reference source and the TRF that is used for the digitization. Local datasets used are possibly based on a different RTF. The combination of data sources with different TRF’s or other data handling using the wrong TRF leads to errors in the results. More advanced desktop GIS, like QGIS or ARCGIS are capable of making data transformations from one TRF to another TRF but need to be operated by a specialist.
- The accuracy of the reference data set is determining the accuracy of the end result. Field boundaries or boundaries between farms may not be clear on an aerial photograph for small holder farms. Boundaries may not be visible or appear fuzzy. It is best to check the quality of a third party data set by overlaying it with actual field measurements or other data with known accuracy.
Different approaches to digitize data from screen are described in this section.
6.1 Introduction¶
The collection of spatial information of objects, like the geolocation or the shape of objects, is of growing importance for first mile data collection. Spatial information helps :
- to identify objects (see the section on ‘Uniquely identifying data elements‘,
- to combine different sources of information, e.g. to identify all farms with a particular soil type using a soil map,
- to determine spatial relations, e.g. to select all farms with a market within 10 km and to determine spatial characteristics of an object e.g. the area of a field.
The geolocation is a description of the location of an object at the earth’s surface in coordinates relative to a mathematically defined Terrestrial Reference Frame (TRF). The standard TRF recommended in this document is GWS84 using decimal degrees Latitude and Longitude coordinates with at least four decimal places (d.dddd), for example Latitude: 52.0525, Longitude 5.2667. Four decimal places indicate an accuracy of about 10 meters and five decimal degrees indicate an accuracy of about 1 meter. Points in the Southern hemisphere have negative latitudes; points in the western hemisphere (the Americas) have negative longitudes. If the sign is positive (northern hemisphere latitudes and eastern hemisphere longitudes), it is not necessary to include a “+” sign. The WGS84 TRF is the first globally applicable TRF and is currently the default for world standards and for most applications and devices including the GeoJSON and the NAVSTAR Global Positioning System or GPS. When using spatial data, especially when using third party georeferenced data, it is essential to be aware of the coordinate system used to describe the geolocations. Combining 2 spatial datasets based on a different TRF will result in errors if not carried out properly. More background on fundamentals from geodesy and cartography needed for georeferencing can be found in the section on the complexity behind geodata.
Nowadays spatial information is often collected and stored digitally using an object model of points, lines and polygon or collections of these. A Point type object is described by a single coordinate pair. A line string is a one-dimensional object represented by a sequence of points and the line segments connecting them. A Polygon is a two-dimensional surface stored as a sequence of points defining an exterior bounding ring of at least 3 points and zero or more interior rings to represent holes in the polygon if needed. Spatial information about objects can be collected digitally in the field using a Navigational Satellite System (See section 6.2) or using a geographical information system( See section 6.3).
6.3 Using a Geographical Information System (GIS) to collect spatial data.¶
Another approach is to digitize areas and locations from a screen using a geografical information system or GIS. A IS can be desktop or webbased. The 2 main desktop applications are the open sources QGIS and ArcGIS. Both application has similar functionalities and need a specialist to operate. The community Geo for All has the mission to make geoanalysis accessible for all, providing freely accessible training programs on the use of QGIS. Beside desktop GIS systems, webbased GIS systems are available. Most of these web apps provide visualization and digitization functionalities. Examples are Google Earth and Google My Maps and are available free of charge.
Screen digitization is always in relation to another data source (aerial photograph, satellite image or GIS file). There are a number of points to take into account:
- The TRF of the reference source and the TRF that is used for the digitization. Local datasets used are possibly to based on a different RTF. The combination of data sources with different TRF’s or other data handling using the wrong TRF leads to errors in the results. More advanced desktop GIS, like QGIS or ARCGIS are capable of making data transformations from one TRF to another TRF but need to be operated by a specialist.
- The accuracy of the reference data set is determining the accuracy of the end result. Field boundaries or boundaries between farms may not be clear on an aerial photograph for small holder farms. Boundaries maybe not visible or appear fuzzy. It is best to check the quality of a third party data set by overlaying it with actual field measurements or other data with known accuracy.
The following procedure is derived from the SAN guidelines for certificate mapping produced by the Rainforest Alliance.
6.3.1 Point locations¶
If the location of interest can be identified in a web-map or via satellite view, then it may be easy to obtain the GPS coordinates from the screen. In Google Earth, just place the cursor at the desired location and read off the coordinate values from the “Status bar” at the bottom of the screen. In Google Maps, simply create a point or marker at the desired location and then click on that marker to display the properties, which will show the GPS coordinates (usually at the bottom left of the pop-up). When reporting these points, make sure that they are in Decimal Degrees.
6.3.2 The digitization of polygons¶
Using Google Earth Google Earth Pro is a program that you can download for free and install on your computer (download here). After installing the program, select from the main menu Tools -> Options and configure: show Lat/Long in Decimal Degrees (3D View tab).
The Google Earth screen is divided into 3 panels on the left-hand side of the screen and one larger panel with the “Map Viewing Area” in the center (see screenshot below). For the purpose of mapping farm polygons, you will mainly utilize the “Map Viewing Area” and the “Places panel”, which is where your data can be referenced and organized. The content layers under the folder “My Places” are automatically copied to your hard drive when you exit the program and re-loaded the next time it is launched.
If you have reference points collected in the field in .kml or .kmz format. They can simply be loaded into Google Earth by double-clicking on the .kml or .kmz file. Alternatively, you can also open kml/kmz, gpx and many other types of spatial data by clicking on File -> Open. Data imported into Google Earth are stored in the “Temporary Places” folder of the “Places panel.” You must remember to move the files up to one of your folders so that they are not lost when you exit the program.
When time-stamped GPS data are imported and displayed in Google Earth, the “Time slider” is automatically displayed at the top of the screen (figure 5). This slider has “Range Makers” that control the time and date range of the tracks and points that are displayed. This is important because sometimes you might think your data has disappeared, when it is only out of the range specified in the time slider. To view all the data, move the range markers to the far right and left sides to define a wider date range.
To draw a polygon, follow these steps (figure 6):
- Select the Add polygon tool.
- Click on the map at the location of the polygon corners (vertices), going around the entire edge of the polygon to define its shape.
- When done, give the polygon a name in the “Name” field and add any additional details in the “Description field.”
- Click on the “Style, Color” tab to define how the polygon is displayed.
- Click on OK to save the polygon.
Once you have created your polygon you should see it in the left Places panel. You can edit both the vertices and the properties by right-clicking on the item in the Places panel and selecting properties. To save the polygon(s) as a kml/kmz file, in the Place panel move all the polygons you want to save into one folder, right-click on the folder, select “Save Place As” and enter the name and location of the output file.
Using polygons with Google My Maps Google My Maps is an extension of Google Maps. It can be used as part of Google Maps on any internet browser (after you sign in using your Google account); on a smartphone it must be downloaded as a separate app. All maps and data created in either form can be displayed and edited in the other. Since maps are web-based, they can be shared and edited by multiple users. To use Google Maps, open the website and sign-in with your Google credentials. This will take you to the >My Maps home screen, where you can create a new map (figure 8). For a tutorial on how to create a Google My Maps clickhere.
Click on the “Create a New Map” button in the upper left corner to initiate your new map. Now you are ready to start using your map. Give the map a name, zoom-in to your area of interest and select the basemap from the various options (map, terrain, satellite, atlas and more).
Upload data into Google My Maps As Google My Maps is entirely web-based, whatever data you collect in the field on the My Maps app (waypoints, lines) will automatically sync into the desktop map. It is also possible to import spatial data stored as .kml, .kmz, .cvs or .gpx files. To do this, first click on Add layer and then click on the Import button that appears under the new layer. A window is displayed for you to upload or drag / drop files to be imported.
Create and edit polygons in Google My Maps Use the tools in the upper center of the map to create points, lines and polygons. To create a polygon, use the line tool and go around the perimeter of the desired polygon area while clicking on the map to create a vertex at every corner. To complete the polygon click again on the first point. If you have collected reference points from the field and imported them to Google My Maps, these can help indicate where to draw the polygon vertices. When you are finished, enter the polygon name and description, and save.
All features created can be symbolized by selecting the line thickness, icon type, and color. Note that when you create a point, the latitude / longitude coordinates are displayed at the bottom of the point’s information window. Maps are automatically saved as you work.
To export the map, click on the three dots in the top right of the map box, select “export to kml” and select whether you want to export all the map data, or just the data in a specific layer. Note that Google My Maps has limited ability to move and organize features into layers, so for complex sites, it might be necessary to export the map into a more full-featured program, such as Google Earth, to organization the multiple polygons.
6.4 Some more background on the complexity behind geodata¶
When working with spatial data it is important to be aware of the complexity behind it. Locations in space are generally described in mathematics, as points in a rectangular coordinate system consisting of three axis (x,y,z). However, when we are thinking and talking about the world around us in daily live, we are actually mentally flattening the curved surface of the earth in the 2 dimensions (x,y) of a flat plane and define height (z) as the elevation of the surface above sea level. This section explains some fundamentals from geodesy and cartography to get an understanding of this complexity in order to avoid mistakes during spatial data handling because of misinterpretations.
The description of the earth surface in coordinates is based on mathematical models called Terrestrial Reference Frames (TRF). A modern TRF is defined by a Datum and a Geoid model of the earth.
source=http://www.dauntless-soft.com source=http://www.asu.cas.cz)
A Datum is a numerical description of the earth surface as an ellipsoid. This ellipsoid is the used to pinpoint locations on the earth’s surface in latitude and longitude coordinates. Latitude is a geographic coordinate that specifies the north–south position of a point on the Earth’s surface. Latitude is an angle which ranges from 0° at the Equator to + 90° at the North Pole and -90° at the South pole. Lines of constant latitude are called parallels and run east–west as circles parallel to the equator. Longitude is a geographic coordinate that specifies the east-west position of a point on the Earth’s surface. Lines of constant longitude are lines running from the North Pole to the South Pole called meridians. The longitude of a location is measured as the angle east or west from a reference Meridian, ranging from 0° at the reference to +180° eastward and −180° westward. Different Datums are being used to describe different parts of the world, resulting into different locations for the same location when using the same coordinates in different reference frames. The distance between 2 points with the same coordinates using different datums may go up to several kilometers. Height in daily live is measured as the elevation of a point above sea level. This measurement is challenging when it needs to take place in landlocked country in the middle of a continent. As a starting point in a modern TRF uses an Earth Gravitational Model (EGM) to model the mean sea level for all water surfaces AND across all land surfaces. This model of the mean sea level across the globe is called a Geoid. Because the gravitaional field varies across the globe is the resulting model irregularry shaped. The Geoid’s of modern TRFs match each other within the centimeter meter accuracy, taking EGM as a starting point for the model.
Globally the WGS84 TRF developed by the US Department of Defense is the most widely used TRF for many years now as a result of the rise of the usage of Global Positioning Systems (GPS) for field work, mapping and navigation. This drastically simplified the exchange of spatial information and logically the WGS84 is also the standard recommended in this document. However there are still 2 points to be aware of:
- The WGS84 reference system has been updated several times in the past and will be updated in the future. For the mapping purposes as discussed in this document the WGS84 can be considered stable since 1996. Before this date the WGS84 TRF was using an ellipsoid rather than a geoid as a reference to express the height above the earth’s system. The difference in height between the WGS84 TRF and the height value based on the modern WGS84 is between −105 m and +85 m. The GeoidEval utility can be used to calculate the height difference between the old model and the new model at every location on earth. In addition paper maps or digital data coming from external sources may be using different datums. There is a whole history in mapping and geodesy finding the most accurate representation of the earth in different regions. The site Coordinate Systems Worldwide provides overview of all TRFs in use and the transformations between.
In practice when we are working with spatial data (e.g measuring distances, working on a paper map, a computer screen and even when using the GPS map display) we are working with spatial data presented on a flat surface not on a globe model. The mathematical transformation from a ellipsoid representation of x,y coordinates on a flat surface is called a map projection. To illustrate how different concepts work common projections are visualized by imaging the globe as a wire-frame model with a light source in the center, the shadows created beyond the sphere can be “projected” onto a flat surface.
source=Dylan Prentiss, Department of Geography, University of California, Santa Barbara.
Every type of projection is an imperfect representation of you data and will be distorted in some way. Some methodologies are specifically capable of preserving area relations, shapes objects or the distance between points, but none will preserve all. There are endless ways to make map projections depending on the region and the purpose of the map and this is important to realize when using external data sources. The Universal Transverse Mercator projection is popular world wide applicable map projection system. http://epsg.io/ also provides a comprehensive database and access to all coordinate systems worldwide, including the map projection and the TRF and mathematical transformations between them. The data based can easily be searched for your region by clicking on a map.
Popular applications, like google maps, bing maps, openstreet maps etc. are using the webmecator projection. Despite the popularity of these applications it has taken a long time before the webmecator projection was accepted by the Geodetic community as an appropriate map projection system. The key problem of this system is that the ellipsoid model of the earth to project lat long coordinates on a plane is replaced by a spherical model to speed up calculations and internet response times. This results in systematic errors up to 40 km’s when compared to a spheroid projection based on WGS84. In addition the ‘primitive’ mercator projection, dated from the middle ages, heavily distorts distances and areas at latitudes further from the equator, for example Greenland is projected with similar dimensions as latin America. The key advantage of the mercator is that it allows for effortless zooming in a squared world and angles between points on the map are preserved. In online applications as google maps, ArcGIS Online or bing maps the errors in distance and area calculations are generally fixed making calculations in an alternative TRF. However the National Geospatial-intelligence Agency still warns for the incompatibly with and non-compliance of this type of applications to the WGS 84 TRF. This ESRI website illustrates the differences that are produced by different map projections stressing the need to be aware of the TRF and map projection that is used while working spatial data, especially while calculating distance and surfaces.