Draft
Technical guidance for DCAT-AP-ES Application Profile implementation
DCAT-AP-ES 1.0.0

  • State secretariat for Digitization and Artificial intelligence
  • datos.gob.es
  • Aporta Initiative
More details about this document
Latest Published Version:
https://datosgobes.github.io/DCAT-AP-ES
Latest Editor's Draft:
https://github.com/datosgobes/DCAT-AP-ES/releases/tag/1.0.0
History:
Change History
Editors:
datos.gob.es platform Management Team (Aporta Initiative)
Feedback:
GitHub DCAT-AP-ES
Owners:
Spanish Government Open Data Initiative

DCAT-AP-ES technical guide

Introduction

This document presents a detailed specification of the metadata used to describe catalogs and reusable data resources.

The metadata is described following the Semantic Web paradigm, in which resources are described using RDF (Resource Description Framework), the standard model for data exchange on the Web. This approach allows different data cataloging systems to interact and exchange information effectively and consistently. The resulting semantic interoperability makes data resources easier to discover, which significantly increases their value for reuse.

The application profile, hereinafter referred to as DCAT-AP-ES, is the metadata model included in the new version of the Public Sector Information Resources Interoperability Technical Standard (NTI-RISP), which is currently under administrative processing. The model adopts the guidelines of the European metadata exchange schema DCAT-AP, with some additional restrictions and adjustments. DCAT-AP is in turn based on the DCAT specification, an RDF vocabulary created to improve interoperability among online data catalogs. DCAT was first published as a W3C Recommendation in 2014 and has since been developed by the Dataset Exchange Working Group. The European profile version used as a reference for DCAT-AP-ES is DCAT-AP 2.1.1, together with the elements of the DCAT-AP HVD 2.2.0 extension, which adds the modeling of High Value Datasets.

An open data catalog may consist solely of datasets or solely of data services, although it commonly contains both, each represented by instances of the classes and properties specified in this model.

In this document, the main classes of the application profile are detailed: Catalog, Dataset, Distribution, and Data Service, as well as other classes relevant for providing comprehensive descriptive information about the reusable resources cataloged according to the DCAT-AP-ES model. The set of controlled vocabularies that must be used to harmonize the properties describing the cataloged elements is also specified.

Conventions Guide

The datos.gob.es Conventions are provided as additional material to this technical guide. They define additional rules that address practical, semantic, or technical needs in the application of DCAT-AP-ES and complement this formal specification.

High Value Datasets

In response to the growing importance of data in society and the economy, the European Commission adopted Commission Implementing Regulation (EU) 2023/138 (High Value Datasets Implementing Regulation, HVD IR) on December 21, 2022. This regulation establishes clear guidelines for public bodies regarding the availability of high value datasets and aims to improve the quality, accessibility, and use of a specific set of key data within the public sector. To achieve this, the HVD regulation sets specific requirements for the metadata associated with the published datasets.

Relationship between the specification and data-specific regulations

The DCAT-AP-ES application profile defines the minimum core metadata that applies to all entities in the model. However, complying with the specification does not exempt publishers from complying with the specific regulations applicable in each sector, such as the HVD regulation. In particular, datasets that meet the HVD criteria may be subject to additional metadata or restrictions that are not covered by DCAT-AP-ES but are required by law.

DCAT-AP-ES Metadata Model

The fundamental elements of the model are detailed below: the UML diagram, the classes of the application profile, the namespaces used in the specification, and the set of controlled vocabularies.

Model Diagram

The DCAT-AP-ES model is presented below as a UML diagram that illustrates the specification described in this document. To facilitate interpretation, the details given in the description of each metadata element have been omitted. Only the key classes and some relevant supporting classes are shown.

DCAT-AP-ES Application Profile Classes

The most relevant classes used in the model are listed below:

  • Catalog. The Catalog class (dcat:Catalog) represents a catalog, which is a collection of data where each individual element is a metadata record describing some resource. The content of a catalog consists of collections of metadata about datasets, data services, or other types of resources, including other catalogs. It functions as a unified access point that facilitates the search and reuse of data resources.
  • Catalog Record. The Catalog Record class (dcat:CatalogRecord) describes individual entries within a data catalog, each being a specific metadata record. A catalog record references an entity in the catalog, which can be a dataset or a data service. It is mainly used to explicitly collect provenance information about the entries in a catalog.
  • Data Service. The Data Service class (dcat:DataService) represents a collection of operations accessible through an interface (API) that provides access to one or more datasets or data processing functions. Its use allows the cataloging of various types of data services, facilitating the implementation of functionalities for the programmatic handling and/or exploitation of data.
  • Dataset. The Dataset class (dcat:Dataset) represents a conceptualization of a collection of information published by a single identifiable agent. The notion of a dataset is broad, intending to accommodate the types of resources that arise from a publication context, which can be represented in many forms, including numbers, text, images, sound, and other media or types, any of which could be collected in a dataset.
  • Distribution. The Distribution class of a dataset (dcat:Distribution) represents an accessible and reusable form of a dataset, such as a downloadable file.
  • Agent. The Agent class (foaf:Agent) is used to represent any organization or person that has the competence to perform actions on a catalog and the cataloged resources. Its main function is to provide concrete references about the different actors who can intervene with different roles in the management of a data catalog.
  • Identifier. The Identifier class of a dataset (adms:Identifier) is used to express the unique reference assigned to a dataset within the context of a specific identifier scheme.
  • Location. The Location class (dct:Location) is used to identify a geographic region or place. It can be represented using a controlled vocabulary or by expressing geographic coordinates that delimit a specific area.
  • Period of Time. The Period of Time class (dct:PeriodOfTime) is used to define a time interval that is delimited by a start date and an end date.
  • Checksum. The Checksum class (spdx:Checksum) is used to specify the method implemented and the result obtained to ensure the integrity of dataset distributions, i.e., that their content has not been altered.
  • Relationship. The Relationship class between resources (dcat:Relationship) is used to specify additional information regarding a relationship between resources or agents, providing context on how these resources are interrelated.
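
To make the relationship between the three central classes concrete, the following is a minimal sketch, not an official example, of a catalog that contains one dataset with one distribution, written as RDF triples and serialized as N-Triples using only the Python standard library. The properties dcat:dataset, dcat:distribution, dcat:accessURL, and dct:title come from the DCAT and Dublin Core vocabularies. All example.org URIs, the title, and the helper name to_ntriples are hypothetical and chosen only for illustration.

```python
# Minimal sketch: a Catalog -> Dataset -> Distribution chain as RDF triples.
# All example.org URIs and the title below are hypothetical.

DCAT = "http://www.w3.org/ns/dcat#"
DCT = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

catalog = "http://example.org/catalog"
dataset = "http://example.org/dataset/air-quality"
distribution = "http://example.org/dataset/air-quality/csv"

# Each triple is (subject, predicate, object). An object is an IRI string,
# or a (text, language) tuple for a language-tagged literal.
triples = [
    (catalog, RDF_TYPE, DCAT + "Catalog"),
    (catalog, DCAT + "dataset", dataset),
    (dataset, RDF_TYPE, DCAT + "Dataset"),
    (dataset, DCT + "title", ("Air quality measurements", "en")),
    (dataset, DCAT + "distribution", distribution),
    (distribution, RDF_TYPE, DCAT + "Distribution"),
    (distribution, DCAT + "accessURL", "http://example.org/files/air-quality.csv"),
]


def to_ntriples(triples):
    """Serialize the triples as N-Triples, one statement per line."""
    lines = []
    for subject, predicate, obj in triples:
        if isinstance(obj, tuple):
            text, lang = obj
            # Escape backslashes and double quotes inside the literal.
            escaped = text.replace("\\", "\\\\").replace('"', '\\"')
            rendered = f'"{escaped}"@{lang}'
        else:
            rendered = f"<{obj}>"
        lines.append(f"<{subject}> <{predicate}> {rendered} .")
    return "\n".join(lines)


print(to_ntriples(triples))
```

A real implementation would normally rely on an RDF library and would include the mandatory properties defined for each class in the profile; this sketch only shows how the classes link to one another.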

Namespaces Used in the Model

Each property of a class that describes an attribute of the catalog, catalog record, data service, dataset, distribution, etc., reuses terms from existing vocabularies. Each term is identified by a URI formed by concatenating the namespace URI of the corresponding vocabulary (abbreviated by its prefix) with the name of the class or property. For example, the property dct:issued of the Catalog class can be written equivalently in abbreviated or expanded form:

Note on Namespaces

dct:issued is equivalent to http://purl.org/dc/terms/issued

The vocabularies whose namespaces are reused in the DCAT-AP-ES model are listed below:

| Vocabulary | Prefix | URI |
| --- | --- | --- |
| Asset Description Metadata Schema | adms: | http://www.w3.org/ns/adms# |
| Data Catalog Vocabulary (DCAT) | dcat: | http://www.w3.org/ns/dcat# |
| DCAT Application profile for data portals | dcatap: | http://data.europa.eu/r5r/ |
| Dublin Core Terms | dct: | http://purl.org/dc/terms/ |
| Friend Of A Friend (FOAF) | foaf: | http://xmlns.com/foaf/0.1/ |
| Location Core Vocabulary | locn: | http://www.w3.org/ns/locn# |
| Web Ontology Language (OWL) | owl: | http://www.w3.org/2002/07/owl# |
| Open Digital Rights Language | odrl: | http://www.w3.org/ns/odrl/2/ |
| Prov Family of Documents | prov: | http://www.w3.org/ns/prov# |
| Resource Description Framework | rdf: | http://www.w3.org/1999/02/22-rdf-syntax-ns# |
| Resource Description Framework Schema | rdfs: | http://www.w3.org/2000/01/rdf-schema# |
| Simple Knowledge Organization System (SKOS) | skos: | http://www.w3.org/2004/02/skos/core# |
| Software Package Data Exchange | spdx: | http://spdx.org/rdf/terms# |
| W3C Time Ontology | time: | http://www.w3.org/2006/time# |
| vCard Ontology | vcard: | http://www.w3.org/2006/vcard/ns# |
| XML Schema | xsd: | http://www.w3.org/2001/XMLSchema# |
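
The equivalence between abbreviated and expanded forms can be applied mechanically. The sketch below is one possible way to do it, using the prefixes and URIs listed above. The function names expand and compact are hypothetical helpers introduced for illustration and are not part of the specification.

```python
# Prefix-to-namespace map copied from the namespace list of the model.
NAMESPACES = {
    "adms": "http://www.w3.org/ns/adms#",
    "dcat": "http://www.w3.org/ns/dcat#",
    "dcatap": "http://data.europa.eu/r5r/",
    "dct": "http://purl.org/dc/terms/",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "locn": "http://www.w3.org/ns/locn#",
    "owl": "http://www.w3.org/2002/07/owl#",
    "odrl": "http://www.w3.org/ns/odrl/2/",
    "prov": "http://www.w3.org/ns/prov#",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "spdx": "http://spdx.org/rdf/terms#",
    "time": "http://www.w3.org/2006/time#",
    "vcard": "http://www.w3.org/2006/vcard/ns#",
    "xsd": "http://www.w3.org/2001/XMLSchema#",
}


def expand(term: str) -> str:
    """Turn an abbreviated term such as 'dct:issued' into its full URI."""
    prefix, separator, local_name = term.partition(":")
    if not separator or prefix not in NAMESPACES:
        raise ValueError(f"unknown or missing prefix in {term!r}")
    return NAMESPACES[prefix] + local_name


def compact(uri: str) -> str:
    """Turn a full URI into its abbreviated form, if a namespace matches."""
    matches = [(ns, prefix) for prefix, ns in NAMESPACES.items() if uri.startswith(ns)]
    if not matches:
        raise ValueError(f"no known namespace for {uri!r}")
    # Prefer the longest matching namespace in case several match.
    namespace, prefix = max(matches, key=lambda pair: len(pair[0]))
    return f"{prefix}:{uri[len(namespace):]}"


print(expand("dct:issued"))
print(compact("http://www.w3.org/ns/dcat#Dataset"))
```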

Controlled Vocabularies Used in the Model

To guarantee a minimum level of interoperability, the values of the properties listed in the table below must be taken from the controlled vocabularies indicated.

| Property | Entity | Vocabulary | Vocabulary URI |
| --- | --- | --- | --- |
| dcatap:availability | Distribution | Planned availability | http://publications.europa.eu/resource/authority/planned-availability |
| dct:accessRights | Dataset, DataService | Access right | http://publications.europa.eu/resource/authority/access-right |
| dct:accrualPeriodicity | Dataset | Frequency | http://publications.europa.eu/resource/authority/frequency |
| dct:format | Distribution | File type | http://publications.europa.eu/resource/authority/file-type |
| dcatap:hvdCategory | Dataset, DataService | HVD Category | http://data.europa.eu/bna/asd487ae75 |
| dct:language | Catalog, Dataset, CatalogRecord, Distribution | Language | http://publications.europa.eu/resource/authority/language |
| dct:license | Catalog, DataService, Distribution | Licence | http://publications.europa.eu/resource/authority/licence |
| dcat:mediaType | Distribution | IANA Media Types | http://www.iana.org/assignments/media-types/ |
| dct:spatial | Catalog, Dataset | | http://datos.gob.es/es/recurso/sector-publico/territorio |
| dct:spatial | Catalog, Dataset | | http://publications.europa.eu/resource/authority/continent |
| dct:spatial | Catalog, Dataset | | http://publications.europa.eu/resource/authority/country |
| dct:spatial | Catalog, Dataset | | http://publications.europa.eu/resource/authority/atu |
| dct:spatial | Catalog, Dataset | | http://publications.europa.eu/resource/authority/place |
| dct:spatial | Catalog, Dataset | | http://sws.geonames.org/ |
| dcat:theme | Dataset | | http://datos.gob.es/kos/sector-publico/sector |
| dcat:theme | Dataset | | http://publications.europa.eu/resource/authority/data-theme |
| dcat:theme | Dataset | | http://inspire.ec.europa.eu/theme |
| dcat:themeTaxonomy | Catalog | | http://datos.gob.es/kos/sector-publico/sector |
| dcat:themeTaxonomy | Catalog | | http://publications.europa.eu/resource/authority/data-theme |
| dcat:themeTaxonomy | Catalog | | http://inspire.ec.europa.eu/theme |
| dct:type | Agent | ADMS publisher type vocabulary | http://purl.org/adms/publishertype/1.0 |
| dct:type | Dataset | Dataset type | http://publications.europa.eu/resource/authority/dataset-type |
| adms:status | Distribution | Distribution status | http://publications.europa.eu/resource/authority/distribution-status |
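
As a rough illustration of how a publisher might pre-check values against this table, the sketch below tests whether a property value is a URI located under the expected vocabulary URI. This is an assumption made for the example, not a rule of the specification: a prefix test is only a heuristic, it covers just a few rows of the table, and it does not hold for every vocabulary (for instance, HVD category concepts are not necessarily located under the scheme URI). Proper validation would check membership in the published concept list. The function name uses_vocabulary is hypothetical.

```python
# Heuristic pre-check: is the value a URI under the expected vocabulary URI?
# Only a subset of the controlled-vocabulary table is covered in this sketch.

AUTHORITY = "http://publications.europa.eu/resource/authority/"

EXPECTED_VOCABULARY = {
    ("Dataset", "dct:accrualPeriodicity"): AUTHORITY + "frequency",
    ("Dataset", "dct:accessRights"): AUTHORITY + "access-right",
    ("Distribution", "dct:format"): AUTHORITY + "file-type",
    ("Distribution", "adms:status"): AUTHORITY + "distribution-status",
}


def uses_vocabulary(entity: str, prop: str, value: str) -> bool:
    """Return True if value looks like a concept URI of the expected vocabulary.

    Raises KeyError when the (entity, property) pair is not covered here.
    """
    base = EXPECTED_VOCABULARY[(entity, prop)] + "/"
    # The value must extend the vocabulary URI with a non-empty concept name.
    return value.startswith(base) and len(value) > len(base)


print(uses_vocabulary("Dataset", "dct:accrualPeriodicity", AUTHORITY + "frequency/MONTHLY"))
print(uses_vocabulary("Distribution", "dct:format", "CSV"))
```

The second call fails the check because a plain string such as "CSV" is not a concept URI from the File type vocabulary.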