DataClassesSpatial - AtlasOfLivingAustralia/ala-datamob GitHub Wiki

Introduction

2013/14 - i rescued this from the old sites portal, which is behind a closed door - maybe one day i'll revise it to read a little easier

Fundamentals of geospatial coordinates and datums

ALAhandlingspatiallocationdata.pdf

Regarding quality of coordinate locations

DataClassesCoordinateQuality

Regarding coordinate conversions

DataClassesCoordinateConversion#Conversion

Regarding spatial generalisation (location blurring / obfuscation)

DataClassesCoordinateConversion#Obfuscation

"Spatial Data" distinction

In the context of the Atlas there appear to be many forms of spatial data. Generally speaking, the notion relates a fact to a point in space. The fact may be one of a number of concepts that surface (i.e. appear, or are made accessible) in various sub-systems:

Context Fact type Notes
Occurrence point-location specimen/species observation

see darwincore terms dwc.decimalLatitude, dwc.decimalLongitude and associated metadata;

also, review DataClassesCoordinateQuality

Site point & radius, line, simple or complex boundary collecting event

can be defined as lines, eg: a trawl, or specified as closed boundaries such as a paddock or catchment boundary

dwc.footprintWKT

shape data define boundaries and are more complex than simple points, or point & radius

Environmental (gridded) sample at point

or more likely a 'smoothed' or averaged value across a defined area

gridded data are usually (/can be) environmental properties within a particular space

properties might be 'rainfall for summer months' or 'primary soil classification'

the areas are usually normalised to allow easy comparison of variables between data sets (eg: grids of 1km2 or 1 degree2), hence the name 'gridded'

Contextual (polygonal) simple or complex boundary a defined area, and usually some sort of associated classification/value, stored against a specific property

eg: property='state/territory', value='NSW'; or

property='soil classification', value='Pfry';

Descriptive locations textual often found against specimen records as verbatim data/quality caveats, an example might be '13 mi NW dry ck'

dwc.verbatimLocality

may also be used as evidence to georeference or standardise locations

Occurrence

Key terminology

Coordinates (aka. geocodes, latitude/longitude, lat/long, grid reference, ...) - effectively an x,y pair

Where data surface

as points in: biocache, spatial portal

Where data come from

Site

Key terminology

Where data surface

as shapes in: biocache?, spatial portal

Where data come from

Environmental (gridded)

Key terminology

Where data surface

'add to map->layers' in spatial portal,

biocache also stores a matrix of all environmental data at any given point - somewhat of an 'inverted index' for analysis purposes - visible as 'property-value' pairs within an occurrence record that has been ingested completely

(section 'Environmental sampling for this location' at time of writing, example: http://biocache.ala.org.au/occurrences/2f2cb5b5-819b-4bd3-854b-59849cd13622)

Where data come from

Contextual (polygonal)

Key terminology

Where data surface

'add to map->areas' in spatial portal,

as for gridded environmental properties (samples), placement within areas/classifications are also stored against occurrences

(section 'Additional political boundaries information' at time of writing, see previous biocache occurrence example)

Where data come from

Other information

Regarding Lambert Conformal Conic projection (a grid projection in use in some? rural areas)

the extent to which this grid projection is used is currently not known - research continues to discover this information - no data sets have been encountered to date that use this form of projection.

it is expected that the above logic (psuedo-code) and such will need to be modified to handle lcc if it proves to be widely used.

it may be that the prototype mobilisation utility will wind up using the geotrans libraries to handle conversions in a more robust fashion, although my gut feeling at this stage is that the complexity is better handled here within ala servers (maybe called up as a web-service?); in any case the utility should include a switch to enter a test-mode, to ensure that the version of the geotrans native code library that winds up being installed can be accessed from the jvm & performs correct conversions:
⚠️ **GitHub.com Fallback** ⚠️