DigitalNZ metadata dictionary

This metadata dictionary provides detailed information about the most common metadata fields used by the DigitalNZ API to describe items from content partner collections and make them easier to find, share, and use.


What metadata information does DigitalNZ collect?

Metadata is any information that describes digital content. When metadata is added to digital items in a standardised and consistent way (using prescribed metadata field names and schema), the items can be managed and organised so users are better able to discover, share, and use that content.

DigitalNZ’s field names and schema are based on Dublin Core but have been adapted to be more flexible and cater for a range of content and metadata levels.

Settings for metadata fields

Each field has settings that indicate whether the field is required, if it is searchable, and whether it can contain multiple values. These settings are listed for each field type.

Is the field required?

  • Compulsory - all records harvested into DigitalNZ must have these fields

  • Recommended - makes a record considerably more useful and should be provided wherever possible

  • Optional - additional fields available to capture any extra information

Is the field searchable?

  • Searchable - this field can be searched using the DigitalNZ search records API

  • Not searchable - this field cannot be searched

Can the field contain multiple values?

  • Single-valued - this field only has one value, for example, a DigitalNZ item can only have one thumbnail (dnz:thumbnail_url)

  • Multi-valued - this field can have many values, for example, a DigitalNZ item may be created by several different people (dc:creator)


Metadata field types

dnz : category

Compulsory | Searchable | Multi-value

This field is used for filtering, via tabs, on the DigitalNZ.org website. It is also used for the API and possibly other hosted/custom search integrations.

Category Definition Usage notes
Images Image is a primary/major component of resource. Items can also use other categories such as ‘Newspapers’ (eg images from Papers Past), ‘Articles’ (eg Kete), or ‘Books’ (eg for images from books at NZETC).
Videos Video is a primary/major component of resource. Items can also use other categories such as ‘Articles’ (eg a video included in a Te Papa article).
Audio Audio is a primary/major component of resource. Items can also be other categories such as ‘Articles’ (eg Te Ara).
Books Digitised books in various forms, including ephemeral booklets or born digital books. Maybe combined with ‘Images’ if the image exists as significant component of an individual item.
Newspapers Digitised newspapers. Maybe combined with ‘Images’ if the image exists as significant component of an individual item.
Manuscripts Digitised unpublished archival material such as letters, scrapbooks, diaries etc. Usually discrete collections so doesn’t need additional categories. Use dc_subject to add useful descriptors if not already supplied by content partner (eg letters, diaries).
Data Datasets or unprocessed information. Includes raw data, csv files, and downloadable sets of resources (eg map tiles/layers). Due to the diversity of this set there may be times where it is appropriate to combine, eg if maps tiles are available in a usable image format, or if data has been sourced in bulk from a newspaper collection.
Research papers Individual published outputs of research. Usually discrete collections so doesn’t need additional categories. Do check for ‘Videos’.
Articles Single items or groups (eg journals) of news, opinion, fact, or general information. Includes blogs. Not for use for newspaper articles, use ‘Newspaper’ instead. Examples include guides and reports. Can be combined with ‘Audio’, ‘Videos’, or ‘Image’ if there is enough contextual information to warrant it. However if the main focus of the item is the information and the image is only illustrative (ie secondary) do not apply the ‘Image’ category. Do utilise the thumbnail if possible.
Archives Other types of unpublished archival material not covered by ‘Manuscripts’ etc. Examples include Archives NZ NZDF Personnel Files.
Other Items that do not fit into existing categories. Examples include interactive web resources, slideshows, and any other outlying item types.
Stories DigitalNZ Stories created by users. Not strictly an item category, but needed.

dnz : collection

Compulsory | Searchable | Multi-value

The name of the collection(s) and/or website that the item belongs to. This field can also contain sub-collections or groupings within the main collection. This field is searchable but does not actually get displayed directly on DigitalNZ.org. A separate (single value) field named ‘display_collection’ containing the primary top level collection is used for displaying alongside the record and in the collection filter drop downs.

For example:

  • Papers Past

  • Grey River Argus

  • Te Ao Hou


dnz : content_partner

Compulsory | Searchable | Multi-value

The name of the organisation(s), institution(s), or individual(s) making content available through DigitalNZ. Usually the name of the organisation that has agreed to the DigitalNZ Metadata Contribution Terms.

For example:

  • Archives New Zealand Te Rua Mahara o te Kāwanatanga

  • Alexander Turnbull Library


dc : description

Compulsory | Searchable | Multi-value

Descriptive information about the record. Can be a summary, abstract, or any information about the record. In most cases this is limited to 350 characters, except where the provided descriptions are clear abstracts or summaries from trusted sources that are not too much longer than 350 characters.

For example:

  • ‘The drawing is from the book Why Cats Paint: a theory of feline aesthetics. Published by Heather Busch and Burton Silver …’

  • ‘During the late 1980s, Kiwi inventor John Britten developed and built a revolutionary racing motorcycle ...’


dnz : landing_url

Compulsory | Not searchable | Single-value

An HTTP URL resolving to a landing page for the content.

For example:


dnz : thumbnail_url

Compulsory | Searchable | Single-value

A URL resolving to a thumbnail of the content for display in search results list. Preferably around 250 px wide. If the image is larger than 255 px wide or 138 px high it will be resized on the DigitalNZ website.

For example:


dc : title

Compulsory | Searchable | Single-value

A name given to the resource or content object being described. If the titles of a series of resources are all identical then some extra information, like a date or identifier, can be added to the title so that the titles are unique.

For example:

  • US forces at Camp McKay store

  • [Looking east from Ponsonby over Auckland City to harbour,....]

  • The paleoaustral genus Protodolium Wilckens, 1922 (Mollusca: Gastropoda), and a new species from the Late Cretaceous of Chatham Islands, New Zealand


dnz : usage

Compulsory | Searchable | Multi-value

Human-understandable label defined by the DigitalNZ project team, ie ‘CC-BY-NC’ would have ‘Share’ and ‘Modify’; ‘No known copyright’ would have: ‘Share’, Modify’, and ‘Use commercially’. If the status of content is not identified by the content provider the value ‘Unknown’ is used.

For example:

  • A combination of ‘All rights reserved’, ‘Share’, ‘Modify’, ‘Use commercially’, ‘Unknown’


dc : creator

Recommended | Searchable | Multi-value

The name of the person, organisation, institution, or service who created the content, ie the photographer, artist, writer, or author. Often this will be the original copyright holder. Where the content is born digital, this may be the creator of the digital file. Where content has been digitised, it is usually the creator of the analogue source.

For example:

  • John Pine Snadden


dc : date

Recommended | Searchable | Multi-value

Stores date information to be used in the filters/facets. Because it is being used in the filters, this information is transformed during harvesting to a computer readable format. DigitalNZ isn’t able to currently deal with date ranges, so the work around is to take the first date of the date range. Also this date must be expressed down to a single day, so will default to 1st of the 1st if only a year is specified in the original source.

For example:

  • 2011-12-15T00:00:00.000Z


dnz : display_date

Recommended | Not searchable | Single-value

Stores date information in any format. This field does not have to be computer readable like dc:date so can store any text-based date expression.

For example:

  • circa 1960s

  • 1920s to 1930s

  • c1895

  • 15 Jan 1992


dnz : locations

Recommended | Not searchable | Single-value

This field contains geographical location information including latitude/longitude coordinates, text-based location info (ie city/region/street), and details about where the location information comes from (ie ‘Location provided by Museum of New Zealand Te Papa Tongarewa’).

For example:

  • <entry lat="-40.0733" lng="+176.8967" area="North Island, Hawkes Bay" specific_location="750 m South of Tuingara Point, Pourerere, Hawkes Bay" comment="Location provided by Museum of New Zealand Te Papa Tongarewa"/>


dnz : placename

Recommended | Searchable | Multi-value

Human-readable place name. Ideally from a controlled vocabulary such as the Getty Thesaurus of Geographic names or the New Zealand Gazetteer of Official Placenames.

For example:

  • Castlepoint, Wairarapa, New Zealand


dc : rights

Recommended | Searchable | Single-value

Rights information. Can be a rights statement explaining the rights of the record or a link to a webpage with more detailed rights information.

For example:

  • Crown Copyright 2009 administered through the Ministry for Culture and Heritage Te Manatū Taonga.


dnz : rights_url

Recommended | Not searchable | Multi-value

An HTTP URL resolving to a rights or terms of use page statement for the resource. Where records have a rights statement rather than a URL, this is recorded in dc:rights.

For example:


dc : subject

Recommended | Searchable | Multi-value

A repeatable field containing keywords about the record content. The harvester will typically split any comma (or similar character) separated lists into separate elements. A harvester may add useful subjects to records to aid findability and grouping if those terms don’t already exist in the data.

For example:

  • Clothing & dress

  • LB Theory and practice of education

  • Caring for sick

  • World War I


dc : type

Recommended | Not searchable | Multi-value

Terms describing general categories, functions, genres, or aggregation levels for content. This field is one of the main factors used to determine the TYPE search filter (such as memorabilia, magazine, people etc).

For example:

  • MovingImage

  • StillImage

  • Thesis


dc : contributor

Optional | Searchable | Multi-value

If the content comes from an aggregation service, this field can optionally provide more context as to the original organisation that provided the content.

For example:

  • School of Biological Sciences

  • Jordan, Bill


dc : coverage

Optional | Not searchable | Multi-value

The extent or scope of the content of the resource. Coverage can include spatial (geographical) location or temporal period (a period label, date, or date range). This is not the primary field for location or date information within DigitalNZ and is mainly used to match existing metadata field names.

DigitalNZ’s primary spatial/location field is dnz:location (which holds latitude and longitude coordinates if available) and dnz:placename for place names as text.

DigitalNZ’s primary date fields are dc:date (strict formated date) and dnz:display_date (more flexible text based date information).

For example:

  • Auckland Region (N.Z.)

  • 1980


dc : format

Optional | Searchable | Multi-value

Contains information about the physical attributes of the item or record. Often used to match an existing field name from the content source.

For example:

  • Watercolour/pencil

  • 112 x 112 mm


dc : identifier

Optional | Searchable | Multi-value

Keeps information like content provider record IDs, ISBNs, serial numbers etc.

For example:

  • KF5-803


dc : language

Optional | Searchable | Multi-value

Indicates the languages of the resource. Harvested as a language code like ‘en’ .

For example:

  • en

  • bo


dnz : object_url

Optional | Searchable | Single-value

A single HTTP URL resolving to the resource or content object itself, such as a PDF or image file. Where multiple versions exist, use the highest resolution version.

For example:


dc : publisher

Optional | Searchable | Multi-value

The name of the publisher of the digital object, or the original physical object if it has been digitised.

For example:

  • Taylor and Francis


dc : relation

Optional | Searchable | Multi-value

Information about a related resource. It could contain a URL to a related resource, or just be the title of a related collection.

For example:

  • K E Niven and Co :Commercial negatives


dc : source

Optional | Not searchable | Multi-value

Information about where the resource has come from, or a reference to another resource from which the content is derived, such as a book in the case of a digitised book.

For example:

  • Amer. fiction, 1774-1910 (microfilm, 1970-1978 ed.), v. 2 (1851-1875), reel R-19, no. 2144A

  • Archives footage preserved and made available by Archives New Zealand/Te Rua Mahara o te Kāwanatanga