Standards compliance dashboard

From CETAF Identifiers Wiki
Revision as of 09:47, 28 January 2020 by Maarten Trekels (Talk | contribs) (NHM: completing information given by NHM)

Introduction

The goal of this webpage is to provide an overview of the use of different (biodiversity) standards within the institutes. The dashboard is created in the framework of work package 4 of Synthesys+.

The table contains a list of standards (columns) and institutes (rows). Whenever an institute is using one of the standards, the corresponding cell is colored yellow. For some of the standards, extra information is given in the cell (level of adopting the standard or version of the standard used).

List of standards

The standards considered in this dashboard are listed here, with links to a more detailed description of each standard.

This list is not exhaustive. The first version of the dashboard focuses on some of the more widely used or better-known standards; a larger overview of standards can be found at FAIRsharing.org. If another (important) standard is supported but missing from the list, don't hesitate to propose an addition.

Dashboard

Standards compliance dashboard
Institute Acronym CETAF ID IIIF DwC ABCD2 ABCD3 AC BCO DELTA ABCDEFG SDD
NATURAL HISTORY MUSEUM (London) NHM L2
NATURHISTORISCHES MUSEUM WIEN (Vienna) NHMW L0
ZOOLOGISCHES FORSCHUNGSMUSEUM ALEXANDER KOENIG (Bonn) ZFMK L3
HELSINGIN YLIOPISTO (Finland) LUOMUS L3
MUSEUM NATIONAL D'HISTOIRE NATURELLE (Paris) MNHN L3 v2.1
THE HEBREW UNIVERSITY OF JERUSALEM (Israel) HUJI
NATURHISTORISKA RIKSMUSEET (Stockholm) NRM L0
AGENTSCHAP PLANTENTUIN MEISE (Meise) BGM L3 v2.1
FREIE UNIVERSITÄT BERLIN BGBM L3 v2.1
AGENCIA ESTATAL CONSEJO SUPERIOR DE INVESTIGACIONES CIENTIFICAS (Madrid) CSIC
MUSEUM FUR NATURKUNDE (Berlin) MfN L3
ROYAL BOTANIC GARDEN EDINBURGH RBGE L3 v3.0 DwC Archive
ROYAL BOTANIC GARDENS KEW RBGK L3?
INSTITUT ROYAL DES SCIENCES NATURELLES DE BELGIQUE (Brussels) RBINS L0
MUSEE ROYAL DE L'AFRIQUE CENTRALE (Tervuren) RMCA L1
SENCKENBERG GESELLSCHAFT FUR NATURFORSCHUNG (Frankfurt) SGN L0
STAATLICHES MUSEUM FUER NATURKUNDE STUTTGART SMNS L1
STAATLICHE NATURWISSENSCHAFTLICHE SAMMLUNGEN BAYERNS SNSB L3

Adoption of the CETAF stable identifier

A full overview of the adoption of the CETAF stable identifier can be found here. This document contains the syntax that is used by the different institutes.

In the table above, the level of adoption of the CETAF stable identifier is indicated. The levels are defined as follows:

  • Level 0: Identifier syntax chosen
  • Level 1: Web page for each specimen, identifier redirects to it[1]
  • Level 2: Machine-readable representation for each specimen, with redirection that is aware of the requested accept-type[2]
  • Level 3: Machine-readable representation conforms to CSPP
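
The level definitions above can be read as a cumulative checklist. The following is a minimal sketch under that assumption (the page does not state explicitly that each level requires all lower ones); the function name and its boolean parameters are hypothetical and are not part of any CETAF tool.

```python
from typing import Optional


def cetaf_adoption_level(syntax_chosen: bool,
                         html_redirect: bool,
                         rdf_redirect: bool,
                         cspp_conformant: bool) -> Optional[int]:
    """Return the highest adoption level (0-3) that is fully met.

    Assumption: levels are cumulative, so a level only counts when
    every lower level is also satisfied. Returns None if not even
    Level 0 (identifier syntax chosen) is met.
    """
    checks = [syntax_chosen, html_redirect, rdf_redirect, cspp_conformant]
    level = None
    for index, passed in enumerate(checks):
        if not passed:
            break  # stop at the first unmet requirement
        level = index
    return level
```

For example, an institute whose identifiers redirect to a web page but offer no machine-readable representation would be reported as Level 1 by this sketch.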

Extra information

This section gives more detail on how individual institutes have adopted the different standards.

NHM


  • Collections and bio/geodiversity data:
    • Darwin Core: A set of Darwin Core fields has been implemented in the Collections Management System (EMu). These are not part of the core schema, but are mapped from EMu's native fields and updated daily via a script. The NHM Data Portal publishes collection data in Darwin Core format as far as the standard allows.
    • LOD-related: The NHM Data Portal has implemented N3, Turtle, XML and JSON-LD formats for LOD, with the vocabularies used listed at https://data.nhm.ac.uk/catalog.ttl. It uses the DCAT standard (https://www.w3.org/TR/vocab-dcat-2/) for describing data catalogues.
    • Gradstein and Ogg (2012) timescale for Chronostratigraphy: Used for chronostratigraphy module in the CMS and Data Portal. Likely to be updated to the International Chronostratigraphic Chart (http://www.stratigraphy.org/index.php/ics-chart-timescale) at some point in the future.
  • Molecular:
    • GGBN data standards (https://wiki.ggbn.org/ggbn/GGBN_Data_Standard): Not yet incorporated into the NHM's freezer management software (FreezerPro), but mapped to when publishing data to GGBN. Included in the NHM's collection data standards, and also used for a number of molecular datasets, e.g. the ABYSSLINE deep-sea benthic macroinvertebrate dataset.
    • OBIS-ENV-DATA: DwC extension used for marine environmental datasets
  • Digital media:
    • IIIF: Not yet implemented in the Data Portal, as the DAMS which serves images to the Portal doesn't support it (and won't in the foreseeable future). There is also an ongoing project to transfer collection images from tape to faster disk storage; once that is complete, the Data Portal team will build a IIIF service to sit on top of the Portal architecture. This will be piloted beforehand on a beetle drawer annotation system.
  • Library materials:
    • Resource Description and Access (RDA)
    • Descriptive Cataloguing of Rare Materials (DCRM)
    • MARC 21
    • Library of Congress Name Authority File (LCNAF)
    • Library of Congress Subject Headings (LCSH)
    • Library of Congress Genre/Form Terms (LCGFT)
  • Archives:
    • ISAD(G) (General International Standard Archival Description): Used for archive cataloguing
    • ISAAR(CPF) (International Standard Archival Authority Record for Corporate Bodies, Persons and Families) and NCA Rules (NCA Rules for the Construction of Personal, Place and Corporate Names): Used for authority records

NHMW

  • IIIF: implemented through the JACQ system

LUOMUS

MNHN

Information for MNHN and the French national infrastructure RECOLNAT (https://www.recolnat.org/): both implement Darwin Core to publish specimens to GBIF.

BGM

BGBM

RBGE

  • DwC: We publish our herbarium and living collections data to GBIF (and anyone else who wants it) through daily updated DwC Archive files.
  • CETAF IDs: All our herbarium specimens and living collection specimens are identified by CETAF ID. e.g.
  • The RDF metadata returned by our CETAF URIs includes links to other resources that we intend to build on over the next year. Currently the data includes links to:
    • A limited number of collectors based on ORCID or Wikidata URIs
    • IIIF Manifest files for herbarium specimen images
    • JPEG images
  • We have developed and maintain the CETAF Specimen URI Tester, which is a resource to test, validate and develop CETAF Specimen URIs
  • IIIF: We have a IIIF endpoint implementing the Presentation API (manifests) and Image API both version 3.0 for our herbarium specimen images and some experimental resources - details here.

ZFMK

SGN

  • ABCD2: publishing to GBIF

SMNS

  • ABCD2: publishing to GBIF, BOLD, GenBank, UNITE, MycoBank, TreeBASE, VertNet.

RMCA

SNSB

  • CETAF-ID: Example: UNIT GUID http://id.snsb.info/snsb/collection/*
  • DwC: the standard is used as Simple Darwin Core Text for publishing regional taxonomic lists with common names to GBIF.
  • ABCD: the standard is used for publishing specimen and observational data to GBIF, BioCASe, GFBio (German Federation for Biological Data) as well as various Special Interest Networks.
  • SDD: the standard is used for publishing trait and other kinds of descriptive data via GFBio; data management is done via DiversityDescriptions; SDD via NaviKey is used for taxonomic keys

Technical Notes

  1. If a redirect is implemented, it should use the HTTP response status code 303 “See Other” rather than the sometimes-used 302 (“Found” or “Moved Temporarily”), because it is hard to predict how a client will interpret a 302
  2. A request for an identifier sent with an Accept header of 'application/rdf+xml' will be redirected to the object's machine-readable RDF metadata record (you can test this by using the CETAF URI Tester)
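
The two notes above can be combined into a small redirect decision. The following is a minimal sketch, not any institute's actual implementation: the example.org target URLs and the function names are hypothetical, and the Accept header parsing is simplified (it handles q-values but not wildcard subtypes such as text/*).

```python
from typing import List, Tuple

RDF_TYPE = "application/rdf+xml"


def parse_accept(header: str) -> List[str]:
    """Return acceptable media types, most preferred (highest q) first."""
    entries = []
    for position, part in enumerate(header.split(",")):
        fields = [field.strip() for field in part.split(";")]
        media_type = fields[0].lower()
        if not media_type:
            continue
        q = 1.0  # default quality when no q parameter is given
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0
        if q > 0:  # q=0 means "not acceptable"
            entries.append((-q, position, media_type))
    return [media_type for _, _, media_type in sorted(entries)]


def redirect_for(specimen_id: str, accept_header: str) -> Tuple[int, str]:
    """Pick a 303 redirect target based on the client's Accept header.

    Hypothetical URL layout: /data/<id>.rdf for RDF, /page/<id> for HTML.
    """
    for media_type in parse_accept(accept_header):
        if media_type == RDF_TYPE:
            return 303, f"https://example.org/data/{specimen_id}.rdf"
        if media_type in ("text/html", "*/*"):
            break  # the client prefers a human-readable page
    # Default: human-readable specimen page (Level 1 behaviour)
    return 303, f"https://example.org/page/{specimen_id}"
```

In this sketch a browser-style header such as `text/html,*/*;q=0.8` leads to the specimen page, while a client sending `application/rdf+xml` is sent to the RDF record; both responses use status 303.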