DataCatalog DRAFT Profile
Version: 0.1-DRAFT-2018_04_25 (25 April 2018)
Bioschemas specification for describing data repositories and data catalogues in the life-sciences
If you spot any errors or omissions with this type, please file an issue in our GitHub.
Key to specification table
- Green properties/types are proposed by Bioschemas, or indicate proposed changes by Bioschemas to Schema.org
- Red properties/types exist in the core of Schema.org
- Blue properties/types exist in the pending area of Schema.org
- Black properties/types are reused from external vocabularies/ontologies
CD = Cardinality
Property | Expected Type | Description | CD | Controlled Vocabulary | Example |
---|---|---|---|---|---|
Marginality: Minimum. | |||||
@context | URL | Used to provide the context (namespaces) for the JSON-LD file. Not needed in other serialisations. |
ONE | ||
@type | Text | Schema.org/Bioschemas class for the resource declared using JSON-LD syntax. For other serialisations please use the appropriate mechanism. While it is permissible to provide multiple types, it is preferred to use a single type. |
MANY | Schema.org, Bioschemas | |
@id | IRI | Used to distinguish the resource being described in JSON-LD. For other serialisations use the appropriate approach. | ONE | ||
dct:conformsTo | IRI | Used to state the Bioschemas profile that the markup relates to. The versioned URL of the profile must be used. Note that we use a CURIE in the table here but the full URL for Dublin Core terms must be used in the markup (http://purl.org/dc/terms/conformsTo), see example. |
ONE | Bioschemas profile versioned URL | |
description |
Text |
Schema: A description of the item. |
ONE | ||
keywords |
Text |
Schema: Keywords or tags used to describe this content. Multiple entries in a keywords list are typically delimited by commas. Bioschemas: Use terms from Controlled Vocabularies where possible. |
ONE | ||
name |
Text |
Schema: The name of the item. |
ONE | ||
provider |
Organization Person |
Schema: The service provider, service operator, or service performer; the goods producer. Another party (a seller) may offer those services or goods on behalf of the provider. A provider may also serve as the seller. Supersedes carrier. Bioschemas: Contact information for this data repository/catalog. |
MANY | ||
rdf:type |
URL |
Bioschemas: This is used by validation tools to indentify the profile used. You must use the value specified in the Controlled Vocabulary column. |
ONE | ||
url |
URL |
Schema: URL of the item. |
ONE | ||
Marginality: Recommended. | |||||
alternateName |
Text |
Schema: An alias for the item. |
MANY | ||
citation |
CreativeWork Text |
Schema: A citation or reference to another creative work, such as another publication, web page, scholarly article, etc. Bioschemas: CreativeWork:Name,URL |
MANY | ||
dataset |
Dataset |
Schema: A dataset contained in this catalog. Inverse property: includedInDataCatalog. |
MANY | ||
dateCreated |
Date DateTime |
Schema: The date on which the CreativeWork was created or the item was added to a DataFeed. |
ONE | ||
identifier |
PropertyValue Text URL |
Schema: The identifier property represents any kind of identifier for any kind of Thing, such as ISBNs, GTIN codes, UUIDs etc. Schema.org provides dedicated properties for representing many of these, either as textual strings or as URL (URI) links. See background notes for more details. Bioschemas: Unique identifier for the data catalog. |
ONE | ||
license |
CreativeWork URL |
Schema: A license document that applies to this content, typically indicated by URL. Bioschemas: CreativeWork:Name,URL |
ONE | ||
publication |
PublicationEvent |
Schema: A publication event associated with the item. |
MANY | ||
sourceOrganization |
Organization |
Schema: The Organization on whose behalf the creator was working. |
MANY | ||
Marginality: Optional. | |||||
dateModified |
Date DateTime |
Schema: The date on which the CreativeWork was most recently modified or when the item’s entry was modified within a DataFeed. Bioschemas: The date on which the data catalog/repository was most recently modified. |
ONE | ||
fileFormat |
Text URL |
Schema: Media type, typically MIME format (see IANA site) of the content e.g. application/zip of a SoftwareApplication binary. In cases where a CreativeWork has several media type representations, ‘encoding’ can be used to indicate each MediaObject alongside particular fileFormat information. Unregistered or niche file formats can be indicated instead via the most appropriate URL, e.g. defining Web page or a Wikipedia entry. |
MANY |