Difference between revisions of "CETAF Specimen Preview Profile (CSPP)"

From CETAF Identifiers Wiki
Jump to: navigation, search
m (Additional recommended data terms: +sec)
Line 2: Line 2:
 
<!-- no need to double the title which is already h1 -->
 
<!-- no need to double the title which is already h1 -->
 
The {{abbr|CSPP}} is a minimal set of agreed ({{abbr|RDF}}) collection metadata elements implemented consistently across {{abbr|CETAF}} organisations (see [[Geneva meeting|minutes of the Geneva meeting]] of the ISTC stable identifier initiative). The purpose of CSPP is to provide a stable resource enabling preview functions in specimen portals. The CSPP is not meant to be comprehensive, which means that Linked Open (collection) Data implementations of CETAF institutions will usually provide much richer metadata with additional RDF-elements.
 
The {{abbr|CSPP}} is a minimal set of agreed ({{abbr|RDF}}) collection metadata elements implemented consistently across {{abbr|CETAF}} organisations (see [[Geneva meeting|minutes of the Geneva meeting]] of the ISTC stable identifier initiative). The purpose of CSPP is to provide a stable resource enabling preview functions in specimen portals. The CSPP is not meant to be comprehensive, which means that Linked Open (collection) Data implementations of CETAF institutions will usually provide much richer metadata with additional RDF-elements.
 +
__TOC__
  
 
== CSPP Elements ==
 
== CSPP Elements ==
Line 36: Line 37:
 
|}
 
|}
  
== Example CSPP-compliant {{abbr|RDF}}-document {{anchor|example CSPP-compliant RDF}} ==
+
== CSPP-compliant {{abbr|RDF}}-documents {{anchor|example CSPP-compliant RDF}} ==
  
 
The RDF in this example is for the {{abbr|CETAF-ID}} http://herbarium.bgbm.org/object/B100068798 and it can contain the following:
 
The RDF in this example is for the {{abbr|CETAF-ID}} http://herbarium.bgbm.org/object/B100068798 and it can contain the following:
Line 121: Line 122:
  
 
<div class="mw-collapsible mw-collapsed">
 
<div class="mw-collapsible mw-collapsed">
The provided RDF can then be checked by Apache Jena’s <code>rdfparse</code> command to quickly see all the properties but now being resolved into <code>subject</code>—<code>predicate</code>—<code>object</code>.
+
The RDF-example provided above can then be checked online by https://www.w3.org/RDF/Validator/ or on command line by Apache Jena’s <code>rdfparse</code> command to quickly see all the properties but now being resolved into <code>subject</code>—<code>predicate</code>—<code>object</code>. (toggle on the right to see hidden content)
 
<div class="mw-collapsible-content"><pre>
 
<div class="mw-collapsible-content"><pre>
 
<http://herbarium.bgbm.org/data/rdf/B100068798> <http://purl.org/dc/terms/subject> <http://herbarium.bgbm.org/object/B100068798> .
 
<http://herbarium.bgbm.org/data/rdf/B100068798> <http://purl.org/dc/terms/subject> <http://herbarium.bgbm.org/object/B100068798> .
Line 163: Line 164:
 
</div>
 
</div>
 
</div>
 
</div>
 +
 +
=== Additional recommended data terms ===
 +
 +
{| class="booktabledotted vertical-align-top"
 +
|-
 +
! Data Term !! {{abbr|RDF}} resource {{abbr|URI}} !! Notes
 +
|-
 +
| <code>dwc:institutionCode</code> || http://rs.tdwg.org/dwc/terms/institutionCode || The name (or acronym) in use by the institution having custody of the object(s) or information referred to in the record. ''Examples:'' MVZ, BGBM, CLO, UCMP
 +
|-
 +
| <code>dwc:basisOfRecord</code> || http://rs.tdwg.org/dwc/terms/basisOfRecord || The specific nature of the data record. ''Comments:'' Recommended best practice is to use the standard label of one of the Darwin Core classes. ''Examples:'' [http://rs.tdwg.org/dwc/terms/PreservedSpecimen PreservedSpecimen], [http://rs.tdwg.org/dwc/terms/FossilSpecimen FossilSpecimen], [http://rs.tdwg.org/dwc/terms/LivingSpecimen LivingSpecimen], [http://rs.tdwg.org/dwc/terms/MaterialSample MaterialSample], [http://rs.tdwg.org/dwc/terms/Event Event], [http://rs.tdwg.org/dwc/terms/HumanObservation HumanObservation], [http://rs.tdwg.org/dwc/terms/MachineObservation MachineObservation], [http://rs.tdwg.org/dwc/terms/Taxon Taxon], [http://rs.tdwg.org/dwc/terms/Occurrence Occurrence]
 +
|}
  
 
[[Category:Handbook for CETAF Stable Identifiers]]
 
[[Category:Handbook for CETAF Stable Identifiers]]

Revision as of 16:04, 9 June 2020

The CSPP is a minimal set of agreed (RDF) collection metadata elements implemented consistently across CETAF organisations (see minutes of the Geneva meeting of the ISTC stable identifier initiative). The purpose of CSPP is to provide a stable resource enabling preview functions in specimen portals. The CSPP is not meant to be comprehensive, which means that Linked Open (collection) Data implementations of CETAF institutions will usually provide much richer metadata with additional RDF-elements.

CSPP Elements

CSPP element Mandatory RDF resource URI Notes
title yes http://purl.org/dc/terms/title Mandatory. Collection may decide themselves on the content of the title element. This could usually be a scientific name (with or without authorship), but it could also contain additional information on collectors and projects for example.
kindOfMaterial no http://purl.org/dc/terms/type The nature of the specimen. Example: herbarium sheet, fossil, conserved in alcohol
scientificNameCurrent no http://rs.tdwg.org/dwc/terms/scientificName The latest full scientific name, with authorship and date information if known.
family no http://rs.tdwg.org/dwc/terms/family The full scientific name of the family in which the taxon is classified.
scientifcNameOriginal no http://rs.tdwg.org/dwc/terms/previousIdentifications The original name appearing on the specimen.
collectorNumber no http://rs.tdwg.org/dwc/terms/fieldNumber An identifier given to the Occurrence at the time it was recorded. Often serves as a link between field notes and an Occurrence record, such as a specimen collector’s number.
collectorName no http://rs.tdwg.org/dwc/terms/recordedBy A list (concatenated and separated) of names of people, groups, or organizations responsible for recording the original Occurrence. The primary collector or observer, especially one who applies a personal identifier (recordNumber), should be listed first.
webscaledImageLink no http://rs.tdwg.org/dwc/terms/associatedMedia Link to “nice” webscaled image.
latitude no http://rs.tdwg.org/dwc/terms/decimalLatitude The geographic latitude (WGS84, in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic center of a Location. Positive values are north of the Equator, negative values are south of it. Legal values lie between -90 and 90, inclusive.
longitude no http://rs.tdwg.org/dwc/terms/decimalLongitude The geographic longitude (WGS84, in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic center of a Location. Positive values are east of the Greenwich Meridian, negative values are west of it. Legal values lie between -180 and 180, inclusive.
isoCountry no http://rs.tdwg.org/dwc/terms/countryCode The standard (ISO 3166-1 alpha-2) code for the country in which the specimen has been collected. Recommended best practice is to use ISO 3166-1-alpha-2 country codes.
collectionDate no http://purl.org/dc/terms/created (Iso) date of the gathering event.
sourceLink no http://purl.org/dc/terms/publisher URI to a web page with information about the institution/collection/owner responsible for publishing the specimen.

CSPP-compliant RDF-documents

The RDF in this example is for the CETAF-ID http://herbarium.bgbm.org/object/B100068798 and it can contain the following:

<?xml version="1.0" encoding="utf-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:dwc="http://rs.tdwg.org/dwc/terms/"
    xmlns:dwciri="http://rs.tdwg.org/dwc/iri/"
    xmlns:dc="http://purl.org/dc/terms/"
    xmlns:owl="http://www.w3.org/2002/07/owl#">

  <!--This is metadata about this very RDF document (got to by redirect via HTTP response code 303 (=See Other)) -->
  <rdf:Description rdf:about="http://herbarium.bgbm.org/data/rdf/B100068798">
    <!-- relate to the CETAF-ID -->
    <dc:subject rdf:resource="http://herbarium.bgbm.org/object/B100068798"/>
    <!-- the date time this very RDF file was created -->
    <dc:created>2019-11-11T15:41:25+01:00</dc:created>
  </rdf:Description>
  <!--This is metadata about this specimen and the originally requested CETAF-ID -->
  <rdf:Description rdf:about="http://herbarium.bgbm.org/object/B100068798">
    <!--   CSPP element: title (mandatory)  -->
    <dc:title>Erysimum salangense Polatschek &amp; Rech.f.</dc:title>
    <dc:description>A herbarium specimen of Erysimum salangense Polatschek &amp; Rech.f. collected by Rechinger,K.H.</dc:description>
    <dc:creator>Rechinger, K.H.</dc:creator>
    <!--   CSPP element: collectionDate   -->
    <dc:created>1967-07-14</dc:created>
    <!--   CSPP element: kindOfMaterial   -->
    <dc:type>Specimen</dc:type>
    <!--   CSPP element: sourceLink   -->
    <dc:publisher>BGBM</dc:publisher>
    <dwc:materialSampleID>http://herbarium.bgbm.org/object/B100068798</dwc:materialSampleID>
    <dwc:basisOfRecord>Specimen</dwc:basisOfRecord>
    <dwc:collectionCode>B</dwc:collectionCode>
    <dwc:catalogNumber>B 10 0068798</dwc:catalogNumber>
    <!--   CSPP element: scientificNameCurrent   -->
    <dwc:scientificName>Erysimum salangense Polatschek &amp; Rech.f.</dwc:scientificName>
    <!--   CSPP element: scientifcNameOriginal   -->
    <dwc:previousIdentifications>Erysimum salangense Polatschek &amp; Rech.f.</dwc:previousIdentifications>
    <dwc:family>CRUCIFERAE</dwc:family>
    <dwc:genus>Erysimum</dwc:genus>
    <dwc:specificEpithet>salangense</dwc:specificEpithet>
    <dwc:country>Afghanistan</dwc:country>
    <!--   CSPP element: isoCountry   -->
    <dwc:countryCode>AF</dwc:countryCode>
    <dwc:locality>
      Afghanistan: NE-Afghanistan, Kathagan. Sar-i Hauz, in declivibus borealibus jugi Salang. substr. granit. Alt.: 2600 m. 14.07.1967, Leg.: K. H. Rechinger 37047.
    </dwc:locality>
    <!--   CSPP element: longitude   -->
    <dwc:decimalLongitude>69.033332824707</dwc:decimalLongitude>
    <!--   CSPP element: latitude   -->
    <dwc:decimalLatitude>35.366664886475</dwc:decimalLatitude>
    <dwc:eventDate>1967-07-14</dwc:eventDate>
    <dwc:recordNumber>37047</dwc:recordNumber>
    <!--   CSPP element: collectorName   -->
    <dwc:recordedBy>Rechinger,K.H.</dwc:recordedBy>
    <!--   CSPP element: collectorNumber   -->
    <dwc:fieldNumber>37047</dwc:fieldNumber>
    <dwciri:recordedBy rdf:resource="http://www.wikidata.org/entity/Q78738"/>
    <!--   CSPP element: webscaledImageLink   -->
    <dwc:associatedMedia rdf:resource="http://ww2.bgbm.org/herbarium/images/B/10/00/68/79/B_10_0068798.jpg"/>
    <!-- Experimental: This is a link to iiif data (see https://iiif.io) -->
    <dc:relation>
      <rdf:Description rdf:about="http://herbarium.bgbm.org/iiif/B100068798">
        <dc:identifier rdf:resource="http://herbarium.bgbm.org/iiif/B100068798"/>
        <dc:type rdf:resource="http://iiif.io/api/presentation/3#Manifest"/>
        <dc:subject rdf:resource="http://herbarium.bgbm.org/object/B100068798"/>
        <dc:format>application/ld+json</dc:format>
        <dc:description xml:lang="en">A IIIF resource for this specimen.</dc:description>
          <dc:created/>
        </rdf:Description>
    </dc:relation>
  </rdf:Description>
  <rdf:Description rdf:about="http://www.wikidata.org/entity/Q78738">
    <owl:sameAs rdf:resource="http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/d5fea488-5786-4106-af90-396ef452c3aa"/>
    <owl:sameAs rdf:resource="https://viaf.org/viaf/100383596/"/>
  </rdf:Description>
</rdf:RDF>

Note that …

  • rdf:Description first describes the very RDF resource (got to by redirect) and it relates to the requested CETAF-ID by dc:subject
  • the following rdf:Description elaborate the CSPP-details and more related properties to this species data (images, other same data objects etc.)

The RDF-example provided above can then be checked online by https://www.w3.org/RDF/Validator/ or on command line by Apache Jena’s rdfparse command to quickly see all the properties but now being resolved into subjectpredicateobject. (toggle on the right to see hidden content)

<http://herbarium.bgbm.org/data/rdf/B100068798> <http://purl.org/dc/terms/subject> <http://herbarium.bgbm.org/object/B100068798> .
<http://herbarium.bgbm.org/data/rdf/B100068798> <http://purl.org/dc/terms/created> "2019-11-11T15:41:25+01:00" .
<http://herbarium.bgbm.org/object/B100068798> <http://purl.org/dc/terms/title> "Erysimum salangense Polatschek & Rech.f." .
<http://herbarium.bgbm.org/object/B100068798> <http://purl.org/dc/terms/description> "A herbarium specimen of Erysimum salangense Polatschek & Rech.f. collected by Rechinger,K.H." .
<http://herbarium.bgbm.org/object/B100068798> <http://purl.org/dc/terms/creator> "Rechinger, K.H." .
<http://herbarium.bgbm.org/object/B100068798> <http://purl.org/dc/terms/created> "1967-07-14" .
<http://herbarium.bgbm.org/object/B100068798> <http://purl.org/dc/terms/type> "Specimen" .
<http://herbarium.bgbm.org/object/B100068798> <http://purl.org/dc/terms/publisher> "BGBM" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/materialSampleID> "http://herbarium.bgbm.org/object/B100068798" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/basisOfRecord> "Specimen" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/collectionCode> "B" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/catalogNumber> "B 10 0068798" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/scientificName> "Erysimum salangense Polatschek & Rech.f." .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/previousIdentifications> "Erysimum salangense Polatschek & Rech.f." .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/family> "CRUCIFERAE" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/genus> "Erysimum" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/specificEpithet> "salangense" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/country> "Afghanistan" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/countryCode> "AF" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/locality> "\n      Afghanistan: NE-Afghanistan, Kathagan. Sar-i Hauz, in declivibus borealibus jugi Salang. substr. granit. Alt.: 2600 m. 14.07.1967, Leg.: K. H. Rechinger 37047.\n    " .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/decimalLongitude> "69.033332824707" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/decimalLatitude> "35.366664886475" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/eventDate> "1967-07-14" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/recordNumber> "37047" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/recordedBy> "Rechinger,K.H." .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/fieldNumber> "37047" .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/iri/recordedBy> <http://www.wikidata.org/entity/Q78738> .
<http://herbarium.bgbm.org/object/B100068798> <http://rs.tdwg.org/dwc/terms/associatedMedia> <http://ww2.bgbm.org/herbarium/images/B/10/00/68/79/B_10_0068798.jpg> .
<http://herbarium.bgbm.org/object/B100068798> <http://purl.org/dc/terms/relation> <http://herbarium.bgbm.org/iiif/B100068798> .
<http://herbarium.bgbm.org/iiif/B100068798> <http://purl.org/dc/terms/identifier> <http://herbarium.bgbm.org/iiif/B100068798> .
<http://herbarium.bgbm.org/iiif/B100068798> <http://purl.org/dc/terms/type> <http://iiif.io/api/presentation/3#Manifest> .
<http://herbarium.bgbm.org/iiif/B100068798> <http://purl.org/dc/terms/subject> <http://herbarium.bgbm.org/object/B100068798> .
<http://herbarium.bgbm.org/iiif/B100068798> <http://purl.org/dc/terms/format> "application/ld+json" .
<http://herbarium.bgbm.org/iiif/B100068798> <http://purl.org/dc/terms/description> "A IIIF resource for this specimen."@en .
<http://herbarium.bgbm.org/iiif/B100068798> <http://purl.org/dc/terms/created> "" .
<http://www.wikidata.org/entity/Q78738> <http://www.w3.org/2002/07/owl/sameAs> <http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/d5fea488-5786-4106-af90-396ef452c3aa> .
<http://www.wikidata.org/entity/Q78738> <http://www.w3.org/2002/07/owl/sameAs> <https://viaf.org/viaf/100383596/> .

Additional recommended data terms

Data Term RDF resource URI Notes
dwc:institutionCode http://rs.tdwg.org/dwc/terms/institutionCode The name (or acronym) in use by the institution having custody of the object(s) or information referred to in the record. Examples: MVZ, BGBM, CLO, UCMP
dwc:basisOfRecord http://rs.tdwg.org/dwc/terms/basisOfRecord The specific nature of the data record. Comments: Recommended best practice is to use the standard label of one of the Darwin Core classes. Examples: PreservedSpecimen, FossilSpecimen, LivingSpecimen, MaterialSample, Event, HumanObservation, MachineObservation, Taxon, Occurrence