5. AGS - ufarrell/sgp_phase2 GitHub Wiki

Alberta Geological Survey (AGS) Publications can be searched for here: https://ags.aer.ca/publications/all-publications

Six datasets were chosen for inclusion in SGP in 2022, selected based on their alignment with SGP goals (table below)

These included 4192 samples with 332226 results from 550 sites in Alberta, Canada.

Publication ID Publication Name Authors Published Date Link
DIG 2013-0003 Rock Eval and Total Organic Carbon of Sedimentary Rocks in Alberta (tabular data, tab-delimited format) Rokosh, C.D., Crocq, C.S., Pawlowicz, J.G., Brazzoni, T. 2013-06-06 https://ags.aer.ca/publications/all-publications/dig-2013-0003
DIG 2016-0001 Inorganic Geochemistry of Alberta Geological Units for Shale- and Siltstone-Hosted Hydrocarbon Evaluation (tabular data, tab-delimited format) Rokosh, C.D., Crocq, C.S., Pawlowicz, J.G., Brazzoni, T. 2016-07-12 https://ags.aer.ca/publication/dig-2016-0001
DIG 2016-0003 Bulk Mineralogy from X-Ray Diffraction Analysis of Alberta Stratigraphic Units Evaluated for Shale- and Siltstone-Hosted Hydrocarbon Resource Potential (tabular data, tab-delimited format) Rokosh, C.D., Crocq, C.S., Pawlowicz, J.G., Brazzoni, T. 2017-04-23 https://ags.aer.ca/publication/dig-2016-0003
DIG 2016-0006 Clay Mineral X-Ray Diffraction Analysis Results of Outcrop Samples from Alberta Stratigraphic Units Evaluated for their Shale- and Siltstone-Hosted Hydrocarbon Resource Potential (tabular data, tab delimited format) Rokosh, C.D., Crocq, C.S., Pawlowicz, J.G., Brazzoni, T. 2017-04-23 https://ags.aer.ca/publication/dig-2016-0006
DIG 2016-0007 Bulk Rock Geochemistry Data from X-Ray Fluorescence Analysis of Samples from Alberta Stratigraphic Units Evaluated for their Shale- and Siltstone-Hosted Hydrocarbon Resource Potential (tabular data, tab-delimited format) Rokosh, C.D., Crocq, C.S., Pawlowicz, J.G., Brazzoni, T. 2016-07-12 https://ags.aer.ca/publication/dig-2016-0007
DIG 2019-0021 Clay Mineral X-Ray Diffraction Analysis Results of Outcrop Samples from Alberta Stratigraphic Units Evaluated for their Shale- and Siltstone-Hosted Hydrocarbon Resource Potential (tabular data, tab delimited format) Lopez, G.P., Rokosh, C.D., Weiss, J.A., Pawlowicz, J.G.. 2020-11-03 https://ags.aer.ca/publication/dig-2019-0021

Geography

Lithology

73% fine-grained siliciclastics (shale, mudstone, siltstone), 13% carbonate (limestone, lime mudstone, carbonate).

Age

The figure below is based on interpreted age in Ma, and does not represent all samples - see Completeness and Data Collection/Processing below for more details.

Data

Data were grouped together in batches based on publication (as above), lab and lab method code. Each batch has one or more analyses based on experimental and analytical methods. Batch details are available here.

Categories below are based on those used on our search website (http://sgp-search.io/).

Completeness

Note that a batch of standard samples is included with this data source - none of those standards have any context information (block with no information at the bottom of this figure).

Data Collection/Processing

Data sheets were initially processed to remove duplicate samples in individual files. Samples from all sheets were then pasted together with the same column headers. Complete duplicate rows were deleted using the ‘delete duplicates’ function in Excel. This left a number of duplicates where there were slight differences in lithology, or differences in what was included in a data spreadsheet. For instance, some samples will have Depth_Unit_of_Measure while others don’t have that filled out (and thus are not exact duplicate rows). Some duplicate samples had different stratigraphic units or lithologies listed (yet were still from the same stratigraphic horizon and same locality). In these cases, this was noted and the lithology or formation that best matched surrounding samples was chosen. Sample standards were also included (242 samples).

In most cases AGS samples had only one of the following, which were used to populate SGP height_depth_m and/or min and max depth fields:

  • Depth_m
  • EL_MASL28
  • Upper_Core_Depth and Lower_Core_Depth
  • Upper_Log_Depth and Lower_Log_Depth
  • Top_EL_MASL28 and Base_EL_MASL28

Any measurements in feet were converted to m. See below for exact translation of these columns into the SGP format.

In order to identify unique sites the following columns were extracted (standards removed): Sample_ID, Site_Type, Site_Type2, Material, UWI_Label, Rig_Release_Date, Site_Name, Lat_NAD83, Long_NAD83, Lat_short, Long_short, KB_MASL28, Depth_m, EL_MASL28, Stratigraphic_Unit. Individual sites were best represented by unique combinations of Site_Name, Site_Type, Lat_NAD83, Long_NAD83.

Some contradictions/inconsistencies were cleaned as follows.

  • Some sites existed with the same exact lat-long but different names. In all cases the site names were clearly related. One name was chosen, the other is stored in the site notes.
  • Rock Eval data sheet (DIG 2013-0003) provided Site_IDs and dates which were not available in other sheets. Site_IDs appear to distinguish between related outcrop sites with the same name but different lat-longs. In these cases the Site_ID was added to the section name to help match the correct version of the site and coordinates to the corresponding samples e.g. Birch Mountains - Alice Creek, NTS 84I/03 (9000027), Birch Mountains - Alice Creek, NTS 84I/03 (9000080).
  • In some outcrop sections samples were linked to the same site with different lat-longs, but with no Site_ID. Where stratigraphy was different it was added to the site name in brackets e.g. Bolton Creek(Brazeau), Bolton Creek (Paskapoo), Bolton Creek (Scollard).
  • In rare cases where site name AND stratigraphy were the same, the sample IDs were added in brackets e.g. Cadomin area (sample 13197), Cadomin area (sample 13196).
  • Some minor inconsistencies were cleaned up - e.g. Bighorn River - subsection A = Bighorn River, subsection A.

SGP excel templates were used to format and import AGS data. For a list of included lithologies and matched SGP terms see here.

Interpreted Age

Interpreted ages for 3109 samples were determined by Erik Sperling and Edward Huang, using a combination of sources including, but not limited to, Macrostrat and Weblex (Natural Resources Canada Lexicon of Geologic Units). The justification and logic for age calls was recorded in every case (‘interpreted age justification’), meaning that a user can see exactly why a sample was given a specific age. 289 samples from 13 stratigraphic units remain without an interpreted age.

Data entry - AGS vs. SGP

An effort was made to match AGS columns to SGP columns (see table below), but in some cases compromises were required e.g. concatenating data into one SGP column. Where information was particularly important (e.g. stratigraphical names, lithology names) the data was cleaned so that it could be matched to the existing dictionaries, although verbatim was also included.

The table below is a summary based on the metadata provided by AGS. Most of the datasets share the column names listed here. However, there are some minor differences, including new column names and additional columns in later publications. In addition, full methodology details were provided for DIG 2019-0021 in a separate file. Information from that table was matched to columns in SGP batch and analysis tables (e.g. lab, lab method code, experimental method). Full metadata is available for each publication from AGS (see links above).

AGS Column Name AGS Description SGP table.column_name Notes
Sample_ID An integer used to identify the sample. sample.original_num Duplicate samples were replaced using the lower number.
Original_Sample_ID The Sample_ID's used to identify the original samples in a combined or duplicate sample type. sample.original_num
Site_ID An integer used to identify a site. SEE NOTES Only available in DIG XXXX-XX. Combined with site name to differentiate sites with the same name, different coordinates.
Site_Type Identifies the type of site, as described by the geologist. site.site_type, site.site_notes Site_Type of “oil or gas well” and “minerals borehole” were called “core”, but the original was included in site notes as follows: AGS Site_Type: oil or gas well
Material Specifies if a sample is composed of glass, pulp, sediment, rock, or rock chips.
UWI (or UWI_Label) Unique identifier assigned by the AER to a licensed wellbore.
Rig_Release_Date Date the drilling rig was released from operations at the well site. NOT IMPORTED
Site_Name Descriptive label given to each site to aid in identifying the site. For oil and gas wells, it is the official well name as it appears on the well licence. For outcrops, it is the descriptive name assigned by the geologist collecting the sample. site.section_name Combined occasionally with site_id, sample_id or unit name, where sites exist with the same name, different lat-longs/other details.
Lat_NAD83 Latitude in [decimal degrees] of the location of the well in NAD83. Oil and gas wells use bottom-hole coordinates. Outcrop sites are converted to latitude and longitude from UTM locations. site.lat_original, site.lat_dec NAD83 included as site.datum_original
Long_NAD83 Longitude in [decimal degrees] of the location of the well in NAD83. Oil and gas wells use bottom-hole coordinates. Outcrop sites are converted to latitude and longitude from UTM locations. site.long_original, site.long_dec NAD83 included as site.datum_original
KB_EL_MASL28 Elevation of the well’s kelly bushing in [metres above sea level], Canadian Geodetic Vertical Datum of 1928. sample.height_depth_m, sample.sample_notes Feet converted to m, verbatim is included in the sample_notes
Top_EL_MASL28 Upper limit elevation of the sampled interval in [metres above sea level], Canadian Geodetic Vertical Datum of 1928. sample.max_depth_m, sample.sample_notes Feet converted to m, verbatim is included in the sample_notes
Base_EL_MASL28 Lower limit elevation of the sampled interval in [metres above sea level], Canadian Geodetic Vertical Datum of 1928. This is a calculated number based on sampled thickness. If the interval was not stated, the sampled thickness was estimated at 40 cm. sample.min_depth_m, sample.sample_notes Feet converted to m, verbatim is included in the sample_notes
Depth_Unit_of_Measure Unit of measure for the depth of the sample. SEE NOTES Feet converted to m, verbatim is included in the sample_notes
Upper_Core_Depth Core depth of the upper limit of the sample thickness in the specified unit of measure. [see Depth_Unit_of_Measure] sample.min_depth_m, sample.sample_notes Feet converted to m, verbatim is included in the sample_notes
Lower_Core_Depth Core depth of the lower limit of the sample thickness in the specified unit of measure. If lower core depth was not recorded during sampling, we assigned a sampled interval of 10 cm (0.1 m) or 4.8 inches (0.4 foot) per the original unit of measure. sample.max_depth_m, sample.sample_notes Feet converted to m, verbatim is included in the sample_notes
Upper_Log_Depth Upper limit of the sample thickness in the specified unit of measure as determined by correcting the core depth using geophysical well logs, core gamma-ray logs, and/or core marker beds. sample.min_depth_m, sample.sample_notes Feet converted to m, verbatim is included in the sample_notes
Lower_Log_Depth Lower limit of the sample thickness in the specified unit of measure as determined by correcting the core depth using geophysical well logs, core gamma-ray logs, and/or core marker beds. sample.max_depth_m, sample.sample_notes Feet converted to m, verbatim is included in the sample_notes
Lithology Basic lithology of the sample. sample.lith_id, sample.verbatim_lith Lithology matched to SGP dictionaries, AGS version stored in verbatim_lith
Stratigraphic_Unit Stratigraphic unit assigned to the sample. lithostrat.strat_id, lithostrat.verbatim_strat Unit names matched to SGP dictionary format/names, AGS version stored in verbatim_strat
Date_Collected/Date_Clctd Date the sample was collected. For oil and gas well core samples, it is the date when the sample was collected at the Alberta Energy Regulator's Core Research Centre. For outcrop samples, it is the date the sample was collected in the field. NOT IMPORTED
Sample_Type Description of the sample type. SEE NOTES values e.g. core or combined. Not imported, but used to check site type, to remove sample duplicates.
Lab_Name The name of the laboratory performing the analyses. batch.lab_id Linked to the 'institution' table.
Methodology Methodology used by the laboratory for the analysis. analysis.ana_method_id Matched with the SGP dictionary of analytical methods.