Proposed Specification 8 September 2011
A diff of changes from the previous draft is available at this link.
Please refer to the errata for this document, which may include some normative corrections.
Copyright © 2010, 2011 International Digital Publishing Forum™
All rights reserved. This work is protected under Title 17 of the United States Code. Reproduction and dissemination of this work with changes is prohibited except with the written permission of the International Digital Publishing Forum (IDPF).
EPUB is a registered trademark of the International Digital Publishing Forum.
Table of Contents
package
Elementmetadata
Elementidentifier
Elementtitle
Elementlanguage
Elementmeta
Elementmeta
Element (OPF2) [OBSOLETE]link
Elementmanifest
Elementitem
Elementspine
Elementitemref
Elementguide
Element [DEPRECATED]bindings
ElementmediaType
Elementapplication/oebps-package+xml
Media TypeThis section is informative
This specification, EPUB Publications 3.0, defines publication-level semantics and conformance requirements for EPUB® 3, including the format of the Package Document and rules for how this document and other Publication Resources are associated to create a conforming EPUB Publication.
This specification is one of a family of related specifications that compose EPUB 3, the third major revision of an interchange and delivery format for digital publications based on XML and Web Standards. It is meant to be read and understood in concert with the other specifications that make up EPUB 3:
The EPUB 3 Overview [EPUB3Overview], which provides an informative overview of EPUB and a roadmap to the rest of the EPUB 3 documents. The Overview should be read first.
EPUB Content Documents 3.0 [ContentDocs30], which defines profiles of XHTML, SVG and CSS for use in the context of EPUB Publications.
EPUB Open Container Format (OCF) 3.0 [OCF3], which defines a file format and processing model for encapsulating a set of related resources into a single-file (ZIP) EPUB Container.
EPUB Media Overlays 3.0 [MediaOverlays30], which defines a format and a processing model for synchronization of text and audio.
This specification supersedes Open Package Format 2.0.1 [OPF2]. Refer to [EPUB3Changes] for information on differences between this specification and its predecessor.
A logical document entity consisting of a set of interrelated resources and packaged in an EPUB Container, as defined by this specification and its sibling specifications.
A resource that contains content or instructions that contribute to the logic and rendering of the EPUB Publication. In the absence of this resource, the Publication might not render as intended by the Author. Examples of Publication Resources include the Package Document, EPUB Content Documents, EPUB Style Sheets, audio, video, images, embedded fonts and scripts.
With the exception of the Package Document itself, Publication Resources must be listed in the manifest and must be bundled in the EPUB container file unless specified otherwise in Publication Resource Locations.
Examples of resources that are not Publication Resources include those identified by the Package Document link element and those identified in outbound hyperlinks that resolve outside the EPUB Container (e.g., referenced from an [HTML5] a
element href
attribute).
A Publication Resource that is not a Core Media Type. A Foreign Resource requires at least one fallback, as defined in Restrictions and Fallbacks.
A Publication Resource that is a Core Media Type and may therefore be included in the EPUB Publication without the provision of fallbacks.
A Publication Resource that conforms to one of the EPUB Content Document definitions (XHTML or SVG).
An EPUB Content Document is a Core Media Type, and may therefore be included in the EPUB Publication without the provision of fallbacks.
An EPUB Content Document conforming to the profile of [HTML5] defined in XHTML Content Documents [ContentDocs30].
XHTML Content Documents use the XHTML syntax of [HTML5].
An EPUB Content Document conforming to the constraints expressed in SVG Content Documents [ContentDocs30].
A specialization of the XHTML Content Document, containing human- and machine-readable global navigation information, conforming to the constraints expressed in EPUB Navigation Documents [ContentDocs30].
An EPUB Content Document that includes scripting or an XHTML Content Document that contains HTML5 forms elements.
Refer to Scripted Content Documents [ContentDocs30] for more information.
An EPUB Content Document referenced directly from the spine
A set of Publication Resource types for which no fallback is required. Refer to Publication Resources for more information.
A Publication Resource carrying bibliographical and structural metadata about the EPUB Publication, as defined in Package Documents.
The digital (or physical) embodiment of a work of intellectual content. Changes to the content such as significant revision, abridgement, translation, or the realization of the content in a different digital or physical form result in a new manifestation. There may be many individual but identical copies of a manifestation, termed 'instances' or 'items'. The ISBN is an example of a manifestation identifier, and is shared by all instances of that manifestation.
All instances of a manifestation need not be bit-for-bit identical, as minor corrections or revisions are not judged to create a new manifestation or work.
The Unique Identifier is the primary identifier for an EPUB Publication, as identified by the unique-identifier
attribute. The Unique Identifier may be shared by one or many Manifestations of the same work that conform to the EPUB standard and embody the same content, where the differences between the Manifestations are limited to those changes that take account of differences between EPUB Reading Systems (and which themselves may require changes in the ISBN).
The Unique Identifier is less granular than the ISBN. However, significant revision, abridgement, etc. of the content requires a new Unique Identifier.
The Package Identifier allows any instance of an EPUB Publication to be compared against another to determine if they are identical, different versions of the same Manifestation, or unrelated.
Refer to Package Identifier for more information.
A list of all Publication Resources that constitute the EPUB Publication.
Refer to manifest for more information.
An ordered list of Publication Resources, typically EPUB Content Documents, representing the default reading order of the Publication.
Refer to spine for more information.
An XML document that associates the XHTML Content Document with pre-recorded audio narration in order to provide a synchronized playback experience, as defined in [MediaOverlays30].
The rendering of the textual content of an EPUB Publication as artificial human speech using a synthesized voice.
A CSS Style Sheet conforming to the CSS profile defined in EPUB Style Sheets [ContentDocs30].
The region of an EPUB Reading System in which the content of an EPUB Publication is rendered visually to a User.
A Viewport capable of displaying CSS-styled content.
The ZIP-based packaging and distribution format for EPUB Publications defined in [OCF3].
The person(s) or organization responsible for the creation of an EPUB Publication, which is not necessarily the creator of the content and resources it contains.
An individual that consumes an EPUB Publication using an EPUB Reading System.
A system that processes EPUB Publications for presentation to a User in a manner conformant with this specification and its sibling specifications.
The keywords "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119].
All sections of this specification are normative except where identified by the informative status label "This section is informative". The application of informative status to sections and appendices applies to all child content and subsections they may contain.
All examples in this specification are informative.
This section defines conformance requirements for EPUB Publications and EPUB Reading Systems at the Publication level. Conformance requirements particular to specific Publication Resources and processing contexts are located in the specifications referenced herein.
An EPUB Publication must meet all of the following criteria:
› All Publication Resources it contains must be represented in the Package Document (as defined in manifest), adhere to the constraints for Core Media Types and Fallback and be located as per Publication Resource Locations.
› It must contain exactly one Package Document, which must conform to the content requirements defined in Package Document — Content Conformance.
› It must contain at least one EPUB Content Document conformant to the content requirements defined in EPUB Content Documents [ContentDocs30].
› It may contain zero or more EPUB Style Sheets conformant to the content requirements defined in EPUB Style Sheets — Content Conformance [ContentDocs30].
› It may contain zero or more PLS Documents conformant to the content requirements defined in PLS Documents — Content Conformance [ContentDocs30].
› It may contain zero or more Media Overlay Documents conformant to the content requirements defined in [MediaOverlays30].
› It may contain zero or more Publication Resources in addition to those listed above, each of which must adhere to the requirements in All Publication Resources.
› It must be packaged in a EPUB Container as defined in [OCF3].
An EPUB Reading System must meet all of the following criteria:
› It must process the EPUB Container as defined in [OCF3].
› It must process the Package Document as defined in Package Document — Reading System Conformance, and honor all presentation logic expressed through the Package Document (e.g., the reading order, fallback chains and bindings).
› It must not fail catastrophically if it encounters two distinct EPUB Publications with the same Unique Identifier.
› Unless specified as conditional behavior in this section, it must support all Core Media Type Resources.
› It may support an arbitrary set of Foreign Resource types, and must process fallbacks for unsupported Foreign Resources as defined in Restrictions and Fallbacks if not.
› It must process XHTML Content Documents as defined in XHTML Content Documents — Reading System Conformance [ContentDocs30].
› It must process SVG Content Documents as defined in SVG Content Documents — Reading System Conformance [ContentDocs30].
› If it has a CSS Viewport, it must support visual rendering of XHTML Content Documents as defined in EPUB Style Sheets — Reading System Conformance [ContentDocs30].
› If it has the capability to render raster images, it must support the raster image Core Media Types.
› If it has the capability to render vector images, it must support the vector image Core Media Types.
› If it has the capability to render pre-recorded audio, it must support the MP3 audio Core Media Type, should support the MP4 audio Core Media Type and should support Media Overlays [MediaOverlays30].
› If it supports Text-to-Speech (TTS) rendering, it should support PLS Documents [ContentDocs30], the CSS3 Speech features of the EPUB CSS Profile [ContentDocs30] and SSML attributes [ContentDocs30] in XHTML Content Documents.
› It must support the EPUB Canonical Fragment Identifiers scheme [EPUBCFI] for linking, and may support additional linking schemes as defined in the EPUB Linking Scheme Registry.
It is recommended that Reading Systems support at least one of the [H.264] and [VP8] video codecs, but this is not a conformance requirement; a Reading System may support no video codecs at all. Content creators and Reading System developers should take into consideration factors such as breadth of adoption, video playback quality, and technology usage royalty requirements when making a choice to include or implement video in either (or potentially, both) formats.
› It should process EPUB version 2 Publications as defined in [OPF2], [OPS2] and [OCF2].
› It must attempt to process any Publication whose Package Document version
attribute designates a version lower than 3.0
or which omits the version
attribute.
› It should attempt to process any Publication whose Package Document version
attribute designates a version higher than 3.0
or which omits the version
attribute.
› It must be a conformant non-validating processor [XML].
› It must be a conformant processor as defined in [XMLNS].
› It must support xml-stylesheet
processing instructions [ASSOCSS], and may support additional processing instructions.
› It must be a conformant application as defined by [XML Base].
A conforming Reading System is not necessarily a single dedicated program or device, but may exist as a distributed system.
This section is informative
The Package Document carries bibliographic and structural metadata about an EPUB Publication, and is thus the primary source of information about how to process and display it.
The Package Document is an XML document consisting of a set of container elements, each dedicated to housing information about a particular aspect of the Publication. These containers effectively centralize metadata for the Publication, detail the individual resources that compose it and provide reading order and other information for rendering the Publication to a User.
The following list summarizes the information a Package Document contains:
Publication metadata — mechanisms for including and/or referencing metadata applicable to the entire Publication and particular resources within it.
A Publication manifest — identifies (via IRI) and describes (via MIME media type) the set of resources that collectively compose the Publication.
A spine — an ordered sequence of ID references to top-level resources in the manifest from which all other resources in the set can be reached or utilized. The spine defines the default reading order of the Publication.
Fallback chains — an optional means for Publications to define an ordered list of top-level resources that can be considered content equivalents that a Reading System can choose between for rendering.
Bindings — an optional means of associating script-based implementations with custom media types.
A Package Document must meet all of the following criteria:
› It must meet the conformance constraints for XML documents defined in XML Conformance.
› It must be valid to the Package Document schema, as defined in Appendix A, Package Document Schema, and conform to all content conformance constraints expressed in Package Document Definition.
› The Package Document filename should use the file extension .opf
.
Package Documents have the MIME media type application/oebps-package+xml
[RFC4839].
An EPUB Reading System must meet all of the following criteria:
› It must process the Package Document in conformance with all Reading System conformance constraints expressed in Package Document Definition.
All elements [XML] defined in this section are in the http://www.idpf.org/2007/opf
namespace [XMLNS] unless otherwise specified.
package
ElementThe package
element is the root container of the Package Document and encapsulates Publication metadata and resource information.
package
The package
element is the root element of the Package Document.
version
[required]
Specifies the EPUB specification version to which the Publication conforms.
The attribute must have the value 3.0
to indicate compliance with this version of the specification.
unique-identifier
[required]
An IDREF [XML] that identifies the dc:identifier
element that provides the package's preferred, or primary, identifier.
Refer to Publication Identifiers for more information.
prefix
[optional]
Declaration mechanism for prefixes not reserved by this specification.
Refer to The prefix
Attribute for more information.
xml:lang
[optional]
Specifies the language used in the contents and attribute values of the carrying element and its descendants, as defined in section 2.12 Language Identification of [XML].
dir
[optional]
Specifies the base text direction of the content and attribute values of the carrying element and its descendants.
Inherent directionality specified using [Unicode] takes precedence over this attribute.
Allowed values are ltr
(left-to-right) or rtl
(right-to-left).
id
[optional]
The ID [XML] of this element, which must be unique within the document scope.
In this order: metadata
[required]
, manifest
[required]
, spine
[required]
, guide
[optional/deprecated]
, bindings
[optional]
metadata
ElementThe metadata
element encapsulates Publication meta information.
metadata
Required first child of package
.
The metadata
element has no attributes defined in this specification.
In any order: dc:identifier
[1 or more]
, dc:title
[1 or more]
, dc:language
[1 or more]
, DCMES Optional Elements
[0 or more]
, meta
[1 or more]
, OPF2 meta
[0 or more]
, link
[0 or more]
The minimal required metadata that Publications must include consists of three elements from the Dublin Core Metadata Element Set [DCMES] — title
, identifier
and language
— together with the modified
property from DCMI Metadata Terms [DCTERMS]. Refer to the example at the end of this section for an instance of a complete minimal metadata set.
Additional optional metadata is expressed using the DCMES optional elements and the meta
element.
Examples
The following example represents the minimal set of metadata that all Publications must contain.
<package … unique-identifier="pub-id"> … <metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> <dc:identifier id="pub-id">urn:uuid:A1B0D67E-2E81-4DF5-9E67-A64CBE366809</dc:identifier> <dc:title>Norwegian Wood</dc:title> <dc:language>en</dc:language> <meta property="dcterms:modified">2011-01-01T12:00:00Z</meta> </metadata> … </package>
identifier
ElementThe [DCMES] identifier
element contains a single identifier associated with the EPUB Publication, such as a UUID, DOI, ISBN or ISSN.
dc:identifier
http://purl.org/dc/elements/1.1/
Required child of metadata
. Repeatable.
id
[optional]
The ID [XML] of this element, which must be unique within the document scope.
The id
attribute is required on the identifier
element containing the unique identifier. See below.
Text
Every metadata
section must include at least one identifier
element containing an unambiguous identifier for the Publication. Multiple identifier
elements are permitted, but only one can be marked as the Unique Identifier via the package
element unique-identifier
attribute.
The following example shows the unique identifier
element for a Publication.
<package … unique-identifier="pub-id"> <metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> <dc:identifier id="pub-id">urn:uuid:A1B0D67E-2E81-4DF5-9E67-A64CBE366809</dc:identifier> … </metadata> </package>
This specification makes a distinction between the Unique Identifier for an EPUB Publication and the identifier that uniquely identifies a specific version of it (i.e., to be able to differentiate EPUB Publications containing different versions of the same Manifestation). Two copies of an EPUB that are bit-for-bit identical are the same version and must retain the same last modified date. If they are not bit-for-bit identical, they represent different versions, and must have different last modified dates.
To identify a specific version of a packaged Publication, a Package Identifier can be constructed by combining the Unique Identifier with the last modified date of the Publication. Changes between versions may include minor typographic or markup corrections, without affecting the Unique Identifier. Significant revisions to the content that result in a new edition require a change of the Unique Identifier. For more information on the semantics and requirements of the Package Identifier, refer to Package Identifier.
This specification imposes no additional restrictions or requirements on identifiers except that they must be at least one character in length. It is strongly recommended that all identifiers be fully qualified URIs, however.
Reading Systems must trim all leading and trailing whitespace from the element value, as defined by the XML specification [XML], before processing the value.
To determine whether an identifier
conforms to an established system or has been granted by an issuing authority, Reading Systems should parse the value of the property. For additional precision (e.g., if the scheme cannot be determined from the value or could lead to an ambiguous result), Authors may attach an identifier-type
property to assist in Reading System identification. When included, the identifier-type
property should take precedence over value parsing the identifier
.
The following example shows how an identifier
can be additionally marked as a DOI using the identifier-type
property.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> <dc:identifier id="pub-id">urn:doi:10.1016/j.iheduc.2008.03.001</dc:identifier> <meta refines="#pub-id" property="identifier-type" scheme="onix:codelist5">06</meta> … </metadata>
This specification does not require or endorse the use of any specific scheme for identifiers, and imposes no restrictions or requirements on identifier-type
identifiers beyond those specified in the property definition.
When an EPUB Publication is derived from another publication, the identifier for that source publication may be included in the Publication metadata, and must be represented using the DCMES source
element.
title
ElementThe [DCMES] title
element represents an instance of a name given to the EPUB Publication.
dc:title
http://purl.org/dc/elements/1.1/
Required child of metadata
. Repeatable.
id
[optional]
The ID [XML] of this element, which must be unique within the document scope.
xml:lang
[optional]
Specifies the language used in the contents and attribute values of the carrying element and its descendants, as defined in section 2.12 Language Identification of [XML].
dir
[optional]
Specifies the base text direction of the content and attribute values of the carrying element and its descendants.
Inherent directionality specified using [Unicode] takes precedence over this attribute.
Allowed values are ltr
(left-to-right) or rtl
(right-to-left).
Text
Every metadata
section must include at least one title
element containing the title for the Publication. Multiple title
elements are permitted, but the title-type
property should be attached to indicate the type of title (e.g., the main title of a work, a subtitle, etc.).
The following example shows how to indicate different title types.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:title id="t1">A Dictionary of Modern English Usage</dc:title> <meta refines="#t1" property="title-type">main</meta> <dc:title id="t2">First Edition</dc:title> <meta refines="#t2" property="title-type">edition</meta> <dc:title id="t3">Fowler's</dc:title> <meta refines="#t3" property="title-type">short</meta> … </metadata>
When adding the title-type
property, Authors should designate only one title
element as containing the main title for the Publication. If no means of determining title types is provided, or understood, Reading Systems must treat the first title
element in document order as the main title. This specification does not define how additional title
elements should be processed in such situations.
The optional display-seq
property may also be attached to each title
to indicate their primacy for display and other rendering purposes.
The following example shows how to indicate display sequence.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:title id="t1">The Red and the Black</dc:title> <meta refines="#t1" property="title-type">main</meta> <meta refines="#t1" property="display-seq">1</meta> <dc:title id="t2">A Chronicle of the Nineteenth Century</dc:title> <meta refines="#t2" property="title-type">subtitle</meta> <meta refines="#t2" property="display-seq">2</meta> <dc:title id="t3">A Chronicle of 1830</dc:title> <meta refines="#t3" property="title-type">subtitle</meta> <meta refines="#t3" property="display-seq">3</meta> … </metadata>
This specification imposes no additional restrictions or requirements on titles except that they must be at least one character in length.
Reading Systems must trim all leading and trailing whitespace from the element value, as defined by the XML specification [XML], before processing the value.
Examples
The following example shows how the title "THE LORD OF THE RINGS, Part One: The Fellowship of the Ring" could be classified.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> <dc:title id="t1">The Fellowship of the Ring</dc:title> <meta refines="#t1" property="title-type">main</meta> <dc:title id="t2">The Lord of the Rings</dc:title> <meta refines="#t2" property="title-type">collection</meta> <meta refines="#t2" property="group-position">1</meta> <dc:title id="t3">THE LORD OF THE RINGS, Part One: The Fellowship of the Ring</dc:title> <meta refines="#t3" property="title-type">extended</meta> … </metadata>
The following example shows how the complex title "The Great Cookbooks of the World: Mon premier guide de cuisson, un Mémoire. The New French Cuisine Masters, Volume Two. Special Anniversary Edition" could be classified.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> <dc:title id="t1" xml:lang="fr">Mon premier guide de cuisson, un Mémoire</dc:title> <meta refines="#t1" property="title-type">main</meta> <meta refines="#t1" property="display-seq">2</meta> <dc:title id="t2">The Great Cookbooks of the World</dc:title> <meta refines="#t2" property="title-type">collection</meta> <meta refines="#t2" property="display-seq">1</meta> <dc:title id="t3">The New French Cuisine Masters</dc:title> <meta refines="#t3" property="title-type">collection</meta> <meta refines="#t3" property="group-position">2</meta> <meta refines="#t3" property="display-seq">3</meta> <dc:title id="t4">Special Anniversary Edition</dc:title> <meta refines="#t4" property="title-type">edition</meta> <meta refines="#t4" property="display-seq">4</meta> <dc:title id="t5">The Great Cookbooks of the World: Mon premier guide de cuisson, un Mémoire. The New French Cuisine Masters, Volume Two. Special Anniversary Edition</dc:title> <meta refines="#t5" property="title-type">extended</meta> … </metadata>
language
ElementThe [DCMES] language
element specifies the language of the Publication content.
Every metadata
section must include at least one language
element with a value conforming to [RFC5646].
The following example shows a Publication is in U.S. English.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:language>en-US</dc:language> … </metadata>
Additional language
elements may be included for multilingual Publications, but each element's value must conform to [RFC5646].
Reading Systems must trim all leading and trailing whitespace from the element value, as defined by the XML specification [XML], before processing the value.
All elements from the [DCMES] element set — except for identifier
, language
and title
, as defined above — are designated as optional. These elements all conform to the following generalized definition:
contributor | coverage | creator | date | description | format | publisher | relation | rights | source | subject | type
http://purl.org/dc/elements/1.1/
Optional child of metadata
. Repeatable.
id
[optional]
The ID [XML] of this element, which must be unique within the document scope.
xml:lang
* [optional]
Specifies the language used in the contents and attribute values of the carrying element and its descendants, as defined in section 2.12 Language Identification of [XML].
dir
* [optional]
Specifies the base text direction of the content and attribute values of the carrying element and its descendants.
Inherent directionality specified using [Unicode] takes precedence over this attribute.
Allowed values are ltr
(left-to-right) or rtl
(right-to-left).
Text
* The xml:lang
and dir
attributes are permitted only on the following elements: contributor
, coverage
, creator
, description
, publisher
, relation
, rights
and subject
.
The value of all optional [DCMES] elements must be at least one character in length.
Reading Systems must trim all leading and trailing whitespace from the element value, as defined by the XML specification [XML], before processing the value.
Except as detailed below, this specification does not modify the [DCMES] definitions for these elements.
The DCMES contributor
Element
The contributor
element is used to represent the name of a person, organization, etc. that played a secondary role in the creation of the content of a Publication.
The use of the contributor
element is identical to the use of the creator
element in all other respects, as detailed in the next section.
The DCMES creator
Element
The creator
element represents the name of a person, organization, etc. responsible for the creation of the content of a Publication. The role
property can be attached to the element to indicate the function the creator played in the creation of the content.
The following example shows how to represent a creator
as an author using a MARC relators term.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:creator id="creator">Haruki Murakami</dc:creator> <meta refines="#creator" property="role" scheme="marc:relators" id="role">aut</meta> … </metadata>
The creator
element should contain the name of the creator as a Reading System will present it to a User. The file-as
property may be attached to include a normalized form of the name, and the alternate-script
property can be used to represent a creator's name in another language or script.
The following example shows the different ways a creator's name can be included to facilitate processing and rendering.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:creator id="creator">Haruki Murakami</dc:creator> <meta refines="#creator" property="role" scheme="marc:relators" id="role">aut</meta> <meta refines="#creator" property="alternate-script" xml:lang="ja">村上 春樹</meta> <meta refines="#creator" property="file-as">Murakami, Haruki</meta> … </metadata>
If a Publication has more than one creator, each should be included in a separate creator
element. The order in which to render the creator
names should be specified using the display-seq
property.
The following example shows how to indicate the display order for creator
elements.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:creator id="creator01">Lewis Carroll</dc:creator> <meta refines="#creator01" property="role" scheme="marc:relators">aut</meta> <meta refines="#creator01" property="display-seq">1</meta> <dc:creator id="creator02">John Tenniel</dc:creator> <meta refines="#creator02" property="role" scheme="marc:relators">ill</meta> <meta refines="#creator02" property="display-seq">2</meta> … </metadata>
If no means of establishing the primacy of creators is included, Reading Systems must use the order of creator
elements.
Secondary contributors should be represented using DCMES contributor
elements.
The DCMES date
Element
The date
element must only be used to define the publication date of the EPUB Publication. The publication date is not the same as the last modified date (the last time the content was changed), which must be included using the [DCTERMS] modified
property.
For compliance with EPUB 2 Reading Systems, the date string should conform to Date and Time Formats.
The following example shows a publication date.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:date>2000-01-01T00:00:00Z</dc:date> … </metadata>
Additional dates should be expressed using the specialized date properties available in the [DCTERMS] vocabulary, or similar.
The publication date may be common to all instances of a Publication or may change from instance to instance (if the Publication gets generated on demand, for example).
Only one date
element is allowed.
The DCMES source
Element
The source
element must only be used to specify the identifier of the source publication from which this EPUB Publication is derived.
The following example shows the ISBN identifier for a Publication together with the source ISBN identifier for the print work it was derived from.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:identifier id="isbn-id">urn:isbn:9780101010101</dc:identifier> <meta refines="#isbn-id" property="identifier-type" scheme="onix:codelist5">15</meta> <dc:source id="src-id">urn:isbn:9780375704024</dc:source> <meta refines="#src-id" property="identifier-type" scheme="onix:codelist5">15</meta> … </metadata>
The source
element allows the print source of the pagination of a Publication to be determined.
Only one source
element is allowed.
The DCMES type
Element
The type
element is used to indicate that the given Publication is of a specialized type (e.g., annotations packaged in EPUB format or a dictionary).
This specification does not define values for this element, however. The development of specialized Publication types, and the assignment of formal identifiers to represent them, will occur independently of this specification.
Only one type
element is allowed.
meta
ElementThe meta
element provides a generic means of including package metadata, allowing the expression of primary metadata about the package or content and refinement of that metadata.
meta
As child of the metadata
element. Repeatable.
property
[required]
A property.
Refer to Vocabulary Association Mechanisms for more information.
refines
[context dependent]
Identifies the expression or resource augmented by this element. The value of the attribute must be a relative IRI [RFC3987] pointing to the resource or element it describes.
The refines
attribute is optional depending on the type of metadata being expressed. When omitted, the meta
element defines a primary expression.
id
[optional]
The ID [XML] of this element, which must be unique within the document scope.
scheme
[optional]
A property data type value indicating the source the value of the element is drawn from.
Text
Each meta
element defines a metadata expression, where the property
attribute defines the statement being made in the expression and the text content of the element represents the assertion.
This specification defines two types of metadata expressions that can be defined using the meta
element:
A primary expression is one in which the expression defined in the meta
element establishes some aspect of the EPUB Publication. A meta
element that omits a refines
attribute defines a primary expression.
A subexpression is one in which the expression defined in the meta
element enhances the meaning of the expression or resource referenced in its refines
attribute. A subexpression may refine a media clip, for example, by expressing its duration, or refine a creator or contributor expression by defining the person's role.
Subexpressions are not limited to refining only primary expressions and resources; they may be used to refine the meaning of other subexpressions, thereby creating chains of information.
All of the [DCMES] elements represent primary expressions, and permit refinement by meta
element subexpressions.
This specification reserves a set of vocabularies for use in the property
attribute, but terms from any vocabulary may be used so long as a prefix is declared for the vocabulary.
The scheme
attribute can be used to identify the system or scheme that a meta
element's value is drawn from. The value of the scheme attribute is a property data type that resolves to the resource that defines the scheme.
The following example shows how a subexpression can be attached to an creator
to indicate it represents an author. The scheme
indicates the value is drawn from the MARC relators terms.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:creator id="creator">Haruki Murakami</dc:creator> <meta refines="#creator" property="role" scheme="marc:relators" id="role">aut</meta> … </metadata>
If a Reading System does not recognize the scheme
attribute value, it should treat the value of the element as a string.
Reading Systems should ignore all meta
elements whose property
attributes define expressions they do not recognize. A Reading System must not fail when encountering unknown expressions.
In order to ensure that a Package Identifier can be constructed, the metadata
element must contain exactly one meta
element defining a [DCTERMS] modified
property for the Publication. Additional modified
properties may be included, but they must have a different subject (i.e., they must include a refines
attribute pointing to an element or resource).
Every meta
element must express a value that is at least one character in length after whitespace normalization.
Unless an individual property explicitly defines a different whitespace normalization algorithm, Reading Systems must trim all leading and trailing whitespace from the meta
element values, as defined by the XML specification [XML], before further processing them.
Examples
The following example represents a more complete set of metadata that typical Publications will contain.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> … <dc:identifier id="pub-id">urn:uuid:A1B0D67E-2E81-4DF5-9E67-A64CBE366809</dc:identifier> <meta refines="#pub-id" property="identifier-type" scheme="xsd:string">uuid</meta> <dc:identifier id="isbn-id">urn:isbn:9780101010101</dc:identifier> <meta refines="#isbn-id" property="identifier-type" scheme="onix:codelist5">15</meta> <dc:source id="src-id">urn:isbn:9780375704024</dc:source> <meta refines="#src-id" property="identifier-type" scheme="onix:codelist5">15</meta> <dc:title id="title">Norwegian Wood</dc:title> <meta refines="#title" property="title-type">main</meta> <dc:language>en</dc:language> <dc:creator id="creator">Haruki Murakami</dc:creator> <meta refines="#creator" property="role" scheme="marc:relators" id="role">aut</meta> <meta refines="#creator" property="alternate-script" xml:lang="ja">村上 春樹</meta> <meta refines="#creator" property="file-as">Murakami, Haruki</meta> <meta property="dcterms:modified">2011-01-01T12:00:00Z</meta> </metadata>
The following example shows an identifier that has been issued by a metadata authority.
<package version="3.0" unique-identifier="pub-id" xmlns="http://www.idpf.org/2007/opf"> <metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> <dc:identifier id="pub-id">urn:uuid:1234-5678</dc:identifier> <dc:identifier id="isbn-id">urn:isbn:9780101010101</dc:identifier> <meta refines="#isbn-id" property="meta-auth" id="meta-authority-01">Metadata Authority Inc.</meta> <link refines="#meta-authority-01" rel="xml-signature" href="../META-INF/Signatures.xml#MAI-Signature"/> … </metadata> </package> <!-- in Signatures.xml --> <signatures> <Signature Id="MAI-Signature" xmlns="http://www.w3.org/2000/09/xmldsig#"> … </Signature> </signatures>
meta
Element (OPF2) [OBSOLETE]The meta
element defined in [OPF2] has been obsoleted and replaced by the new meta
element, but may be included as an optional repeatable child of the metadata
element for forwards compatibility purposes.
EPUB 3 Reading Systems must ignore this element.
link
ElementThe link
element is used to associate resources with a Publication, such as metadata records.
link
As a child of metadata
. Repeatable.
href
[required]
An absolute or relative IRI reference [RFC3987] to a resource.
rel
[required]
A space-separated list of property values.
id
[optional]
The ID [XML] of this element, which must be unique within the document scope.
refines
[optional]
Identifies the expression or resource augmented by this element. The value of the attribute must be a relative IRI [RFC3987] pointing to the resource or element it describes.
When the refines
attribute is omitted, the expression applies to the EPUB Publication as a whole.
media-type
[optional]
A media type [RFC2046] that specifies the type and format of the resource referenced by this link
.
Empty
The metadata
element may contain zero or more link
elements.
The href
attribute of the link
element identifies the location of the resource — inclusion of which is optional in the container file — and the rel
attribute defines the nature of the resource (i.e., its relation to the Publication or property specified in the refines
attribute). Reading Systems are not required to dereference these resources. Refer to Metadata link
Properties for the list of resource types that are recognized by this specification.
Resources identified by the link
element href
attribute must not be represented as item
s in the manifest.
When the link
element references a metadata record, precedence must be given to metadata defined inline in the Package Document metadata
element in the case of conflicts.
The optional refines
attribute can be attached when the referenced resource applies to another metadata item (e.g., to tie an XML Signature [XML DSIG Core] to a metadata authority). The resource applies to the Publication as a whole when the attribute is not present.
If a Reading System does not recognize the relationship of the resource as defined in the rel
attribute, it should ignore the link
element.
Examples
The following example shows the link
element used to associate three metadata resources with the Publication: an ONIX record, an XMP record, and a link to an informational web page. Note that as foaf
is not a predefined prefix, the metadata extensibility mechanism is employed to associate the vocabulary.
<package … prefix="foaf: http://xmlns.com/foaf/spec/"> <metadata> … <link rel="onix-record" href="http://example.org/onix/12389347"/> <link rel="xmp-record" href="http://example.org/xmp/12389347"/> <link rel="foaf:homepage" href="http://example.org/book-info/12389347" /> … </metadata> … </package>
manifest
ElementThe manifest
element provides an exhaustive list of the Publication Resources that constitute the EPUB Publication, each represented by an item
element.
This specification supports internationalized resource naming, so elements and attributes that reference Publication Resources accept IRIs as their value. For compatibility with older Reading Systems that only accept URIs, resource names should be restricted to the ASCII character set.
item
ElementThe item
element represents a Publication Resource.
item
As a child of manifest
. Repeatable.
id
[required]
The ID [XML] of this element, which must be unique within the document scope.
href
[required]
An IRI [RFC3987] specifying the location of the Publication Resource described by this item
.
media-type
[required]
A media type [RFC2046] that specifies the type and format of the Publication Resource described by this item
.
fallback
[conditionally required]
An IDREF [XML] that identifies the fallback for a non-Core Media Type.
Refer to Manifest Fallbacks for more information.
properties
[optional]
A space-separated list of property values.
Refer to Manifest item
Properties for a set of properties defined by this specification.
media-overlay
[optional]
An IDREF [XML] that identifies the Media Overlay Document for the resource described by this item
.
Refer to Packaging [MediaOverlays30] for more information.
Empty
Each item
element in the manifest
identifies a Publication Resource by the IRI provided in its href
attribute. The IRI may be absolute or relative. In the case of relative IRIs, Reading Systems must use the IRI of the Package Document as the base when resolving these to absolute IRIs. The resulting absolute IRI must be unique within the manifest
scope.
All Publication Resources must be referenced from the manifest
, regardless of whether they are included in the EPUB Container or made available remotely. Refer to Publication Resource Locations for media type-specific requirements regarding resource locations.
The Publication Resource identified by an item
element must conform to the applicable specification(s) as inferred from the MIME media type provided in the media-type
attribute. Core Media Type Resources must use the media type designated in EPUB Core Media Types.
All Foreign Resources must provide a fallback as defined in Restrictions and Fallbacks.
All Publication Resources must declare any applicable descriptive metadata properties as defined in Manifest item
Properties via the item
element properties
attribute. Exactly one item
must be declared as the EPUB Navigation Document using the nav
property.
Reading Systems must ignore all descriptive metadata properties that they do not recognize.
The manifest is not self-referencing: it must not include an item
element that refers to the Package Document itself.
The order of item
elements in the manifest is not significant. The presentation sequence of content documents is provided in the spine
.
Examples
The following example shows a manifest
that only contains Core Media Type Resources.
<manifest> <item id="nav" href="nav.xhtml" properties="nav" media-type="application/xhtml+xml"/> <item id="intro" href="intro.xhtml" media-type="application/xhtml+xml"/> <item id="c1" href="chap1.xhtml" media-type="application/xhtml+xml"/> <item id="c1-answerkey" href="chap1-answerkey.xhtml" media-type="application/xhtml+xml"/> <item id="c2" href="chap2.xhtml" media-type="application/xhtml+xml"/> <item id="c2-answerkey" href="chap2-answerkey.xhtml" media-type="application/xhtml+xml"/> <item id="c3" href="chap3.xhtml" media-type="application/xhtml+xml"/> <item id="c3-answerkey" href="chap3-answerkey.xhtml" media-type="application/xhtml+xml"/> <item id="notes" href="notes.xhtml" media-type="application/xhtml+xml"/> <item id="cover" href="./images/cover.svg" properties="cover-image" media-type="image/svg+xml"/> <item id="f1" href="./images/fig1.jpg" media-type="image/jpeg"/> <item id="f2" href="./images/fig2.jpg" media-type="image/jpeg"/> <item id="css" href="./style/book.css" media-type="text/css"/> <item id="pls" href="./speech/dict.pls" media-type="application/pls+xml"/> </manifest>
The following example shows a manifest
that references two Foreign Resources, and therefore uses the fallback chain mechanism to supply content alternatives. The fallback chain terminates with a Core Media Type.
<manifest> <item id="item1" href="chap1_docbook.xml" media-type="application/docbook+xml" fallback="fall1"/> <item id="fall1" href="chap1.xml" media-type="application/z3986-auth+xml" fallback="fall2" /> <item id="fall2" href="chap1.xhtml" media-type="application/xhtml+xml"/> … </manifest>
Refer also to the Manifest item properties examples for use of the properties
attribute.
spine
ElementThe spine
element defines the default reading order of the EPUB Publication content by defining an ordered list of manifest item
references.
spine
id
[optional]
The ID [XML] of this element, which must be unique within the document scope.
toc
[optional]
An IDREF [XML] that identifies the manifest item
that represents the superseded NCX
.
Refer to NCX Superseded for more information.
page-progression-direction
[optional]
The global direction in which the Publication content flows.
Allowed values are ltr
(left-to-right), rtl
(right-to-left) and default
.
When the default
value is specified, the Author is expressing no preference and the Reading System may chose the rendering direction. This value must be assumed when the attribute is not specified.
Multiple itemref
elements [required]
The spine
represents an ordered subset of the Publication Resources listed in the manifest
, with content items not being referenced being ancillary to those that do.
Reading Systems must provide a means of rendering a Publication in the order defined by the spine
, which includes: 1) recognizing the first primary (linear='yes'
) item
in the spine
as the beginning of the main reading order of the Publication; and, 2) rendering successive primary items in the order given in the spine
.
Although the page-progression-direction
attribute sets the global flow direction for a Publication, individual Content Documents and parts of Content Documents may override this setting (e.g., via the direction
and writing-mode
CSS properties). Reading Systems may also provide mechanisms to override the default direction (e.g., buttons or settings that allow the application of alternate style sheets).
NCX Superseded
The NCX feature defined in [OPF2] is superseded by the EPUB Navigation Document [ContentDocs30]. EPUB 3 Publications may include an NCX (as defined in OPF 2.0.1) for EPUB 2 Reading System forwards compatibility purposes, but EPUB 3 Reading Systems must ignore the NCX in favor of the EPUB Navigation Document.
itemref
ElementThe child itemref
elements of the spine
represent a sequential list of Publication Resources (typically EPUB Content Documents). The order of the itemref
elements defines the default reading order of the Publication.
itemref
As a child of spine
. Repeatable.
idref
[required]
linear
[optional]
Specifies whether the referenced content is primary.
The value of the attribute must be yes
or no
. The default value is yes
.
id
[optional]
The ID [XML] of this element, which must be unique within the document scope.
properties
[optional]
A space-separated list of property values.
Refer to Spine itemref
Properties for a set of properties defined by this specification.
Empty
Each itemref
element must reference an item
in the manifest via its idref
attribute.
Each referenced manifest item
must be either a) an EPUB Content Document or b) another type of Publication Resource which, regardless of whether it is a Core Media Type Resource or a Foreign Resource, must include an EPUB Content Document in its fallback chain.
Although the EPUB Navigation Document is required in EPUB Publications, it is optional to include it in the spine
.
The itemref
element linear
attribute indicates whether referenced item is considered primary (yes
) or auxiliary (no
) in the spine
. This attribute may be used to enable Reading Systems to distinguish presentation of body content from supplementary content which might be, for example, presented in a popup window or omitted from an aural rendition.
Any applicable descriptive metadata properties, such as those defined in the Spine itemref
Properties, should be declared via the properties
attribute.
Reading Systems must ignore all metadata properties expressed in the properties
attribute that they do not recognize.
Examples
The following example shows a spine
element corresponding to the manifest example above.
<spine page-progression-direction="ltr"> <itemref idref="intro"/> <itemref idref="c1"/> <itemref idref="c1-answerkey" linear="no"/> <itemref idref="c2"/> <itemref idref="c2-answerkey" linear="no"/> <itemref idref="c3"/> <itemref idref="c3-answerkey" linear="no"/> <itemref idref="notes" linear="no"/> </spine>
guide
Element [DEPRECATED]The guide
element [OPF2] is deprecated in favor of the landmarks
feature in the EPUB Navigation Document. Refer to The landmarks nav Element [ContentDocs30] for more information.
Authors may include the guide
element in the Package Document for EPUB 2 Reading System forwards compatibility purposes. EPUB 3 Reading Systems must ignore the guide
element when provided in EPUB 3 Publications whose EPUB Navigation Document includes the landmarks
feature.
bindings
ElementThe bindings
element defines a set of custom handlers for media types not supported by this specification.
The package
element may contain at most one bindings
element.
The bindings
element provides a means for Authors to include more sophisticated fallbacks than would otherwise be possible with the [HTML5] object
element's intrinsic fallback mechanisms. When present, Reading Systems that support scripting must utilize the bindings
element to handle object
elements that reference unsupported media types.
Each of the bindings
element's child mediaType
elements defines a unique handler for one of the foreign media types referenced in the Publication's XHTML Content Documents.
When an unsupported media type is encountered during processing of a document, the Reading System must look up the handler in the bindings
element by checking the media-type
attribute of each mediaType
element for a match (and before attempting any other type of fallback processing). If a match is found, the XHTML Content Document referenced in the element's handler
attribute must be instantiated instead of the referenced resource. If no match is found, the Reading System should continue with normal fallback processing (i.e., check for an intrinsic fallback for the object
).
The Reading System must instantiate the designated handler as if it had been referenced from the object
element's data
attribute with the following parameters:
src
the value of which must be an IRI [RFC3987] to the resource (i.e., the value of the object
element's data
attribute).
type
the value of which must be the resource media type (i.e., the value of the object
element's media-type
attribute).
Any additional param
children of the object
element must be similarly added as parameters using the param
's name
attribute as the new parameter name and its value
attribute as the new value.
For example, the following object
element containing a foreign media type:
<object data="horse.ogg" media-type="audio/ogg"/> <param name="autoplay" value="false"> </object>
would result in the following query string being sent to the handler XHTML Content Document after processing:
src=horse.ogg&type=audio/ogg&autoplay=false
All IRI reserved characters, plus the characters <
, >
, "
, space
, {
, }
, |
, \
, ^
and `
, in the generated query string must be encoded and decoded as per [RFC3987].
object
elements that reference media types handled by the bindings
element are only processed in spine-referenced XHTML Content Documents (i.e., they are ignored in container-constrained scripting contexts).
Example
The following partial example illustrates how bindings can be used to provide a slideshow.
Consider a Publication with the following Package Document:
<package …> … <manifest> <item id="pict1" href="images/Pict1.jpg" media-type="image/jpeg"/> … <item id="content" href="content.xhtml" media-type="application/xhtml+xml"/> <item id="impl" href="impl.xhtml" media-type="application/xhtml+xml" properties="scripted"/> <item id="slideshow" href="slideshow.xml" media-type="application/x-demo-slideshow"/> </manifest> <bindings> <mediaType handler="impl" media-type="application/x-demo-slideshow"/> </bindings> … </package>
and the following content in the file content.xhtml
:
<html …> … <body> … <object data="slideshow.xml" type="application/x-demo-slideshow"> <img src="images/Pict1.jpg"/> <img src="images/Pict2.jpg"/> <img src="images/Pict3.jpg"/> <img src="images/Pict4.jpg"/> </object> … </body> </html>
and the following content in the file slideshow.xml
:
<slides> <slide src="images/Pict1.jpg" dur="3"/> <slide src="images/Pict2.jpg" dur="3"/> <slide src="images/Pict3.jpg" dur="3"/> <slide src="images/Pict4.jpg" dur="3"/> </slides>
Depending on the capabilities of the User's Reading System, they will see one of the following renditions of the slideshow:
If the Reading System supports the native slideshow format, it will render a rotating set of images as specified in slideshow.xml
.
If the Reading System cannot support the slideshow media type but supports scripting, it can check the bindings
element in the Package Document for a scripted fallback. There it will find a reference to the item
element containing the handler document (impl.xhtml
). The Reading System can now load this document to render a JavaScripted equivalent of the slideshow (source not shown).
If the Reading System does not support the slideshow media type and also does not support scripting, it will use the fallback images specified in the object
element to show a static set of all the images.
mediaType
ElementThe mediaType
element associates a Foreign Resource media type with a handler XHTML Content Document.
mediaType
As a child of bindings
. Repeatable.
Empty
Each child mediaType
of a bindings
element must define a unique content type in its media-type
attribute, and the media type specified must not be a Core Media Type.
The required handler
attribute must reference the ID [XML] of an item
in the manifest
of the default implementation for this media type. The referenced item
must be an XHTML Content Document.
All XHTML Content Documents designated as handlers must have the scripted
property set in their manifest item
's properties
attribute.
The Package Document's author is responsible for including a primary identifier that is unique to one and only one particular EPUB Publication. This Unique Identifier, whether chosen or assigned, must be stored in a dc:identifier
element in the Package metadata and be referenced as the Unique Identifier in the package
element unique-identifier
attribute.
Although not static, changes to the Unique Identifier for a Publication should be made as infrequently as possible. New identifiers should not be issued when updating metadata, fixing errata or making other minor changes to the Publication.
The Unique Identifier of an EPUB Publication typically should not change with each minor revision to the package or its contents, as Unique Identifiers are intended to have maximal persistence both for referencing and distribution purposes. Each release of a Publication normally requires that the new version be uniquely identifiable, however, which results in the contradictory need for reliable Unique Identifiers that are changeable.
To redress this problem of identifying minor modifications and releases without changing the Unique Identifier, this specification defines the semantics for a Package Identifier, or means of distinguishing and sequentially ordering Publications with the same Unique Identifier. The Package Identifier is not an actual property in the package metadata
section, but is a value that can be obtained from two required pieces of metadata: the Unique Identifier and the last modification date of the Publication.
When the taken together, the combined value represents a unique identity that can be used to distinguish any particular version of an EPUB Manifestation from another. To ensure that a Package Identifier can be constructed, the Publication must include exactly one [DCTERMS] modified
property containing the last modification date (see meta). The value of this property must be an XML Schema [XSD-DATATYPES] dateTime conformant date of the form:
CCYY-MM-DDThh:mm:ssZ
The modification date must be expressed in Coordinated Universal Time (UTC) and must be terminated by the Z
time zone indicator.
Although not a part of the package metadata, for referencing and other purposes this specification requires that all string representations of the identifier be constructed using the at sign (@
) as the separator (i.e., of the form "id@
date"). Whitespace must not be included when concatenating the strings.
The following example shows how a Unique Identifier and modification date are combined to form the Package Identifier.
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"> <dc:identifier id="pub-id">urn:uuid:A1B0D67E-2E81-4DF5-9E67-A64CBE366809</dc:identifier> <meta property="dcterms:modified">2011-01-01T12:00:00Z</meta> … </metadata> results in the Package ID: urn:uuid:A1B0D67E-2E81-4DF5-9E67-A64CBE366809@2011-01-01T12:00:00Z
Note that it is possible that the separator character may occur in the Unique Identifier, as these identifiers may be any string value. The Package Identifier consequently must be split on the last instance of the at sign when decomposing it into its component parts.
The Package Identifier does not supersede the Unique Identifier, but represents the means by which different versions of the same Publication can be distinguished and identified in distribution channels and by Reading Systems. The sequential, chronological order inherent in the required format of the timestamp also places Publications in order without requiring knowledge of the exact identifier that came before.
The Package Identifier consequently allows a set of Publications to be inspected to determine if they represent the same version of the same Publication, different versions of a single Publication, or any combination of differing and similar Publications.
This section is informative
The property
, properties
, rel
and scheme
attributes use the property data type to represent terms from metadata vocabularies. Similar to a CURIE [RDFa10], the property data type represents an IRI [RFC3987] in compact form and simplifies the authoring of metadata from standardized vocabularies.
A property value is an expression that consists of a prefix and a reference, where the prefix — whether literal or implied — is a shorthand mapping of an IRI that typically resolves to a term vocabulary. When the prefix is converted to its IRI representation and combined with the reference, the resulting IRI normally resolves to a fragment within that vocabulary that contains human- and/or machine-readable information about the term.
To assist Reading Systems in processing property values, the means of establishing the IRI a prefix maps to is required, and this specification defines three such mechanisms:
a default vocabulary — defines the mapping when a property value does not include a prefix;
a set of reserved prefixes — these mappings are predefined (i.e., all Reading Systems recognize them) and can be used without having to be declared; and
the prefix
attribute — a declarative means of creating new prefix mappings on the root package
element.
The default vocabulary is a vocabulary that does not require a prefix to be declared in order to use its terms in package metadata, and whose terms must always be unprefixed.
To facilitate the inclusion of package metadata, this specification defines the Package Metadata Vocabulary as the default vocabulary for Package Documents.
If a property value does not include a prefix, the IRI [RFC3987] stem http://idpf.org/epub/vocab/package/#
must be used to generate the resulting IRI.
The IRI associated with the Package Metadata Vocabulary must not be assigned a prefix using the prefix
attribute.
This specification exclusively defines the following set of prefixes for use in package metadata.
Prefix | IRI |
---|---|
dcterms | http://purl.org/dc/terms/ |
marc | http://id.loc.gov/vocabulary/ |
media | http://www.idpf.org/epub/vocab/overlays/# |
onix | http://www.editeur.org/ONIX/book/codelists/current.html# |
xsd | http://www.w3.org/2001/XMLSchema# |
The prefixes listed in the previous table must not be redeclared using the prefix
attribute declaration mechanism. Similarly, the IRIs associated with each prefix must not be assigned to another prefix.
prefix
AttributeThe prefix
attribute defines additional prefix mappings not reserved by the specification.
The value of the prefix
attribute is a whitespace-separated list of one or more prefix-to-IRI mappings of the form:
prefixes | = | mapping , { whitespace, { whitespace } , mapping } ; | |
mapping | = | prefix , ":" , space , { space } , ? xsd:anyURI ? ; | |
prefix | = | ? xsd:NCName ? ; | |
space | = | #x20 ; | |
whitespace | = | (#x20 | #x9 | #xD | #xA) ; |
The following example shows prefixes for the Friend of a Friend (foaf
) and DBPedia (dbp
) vocabularies being declared using the prefix
attribute.
<package … prefix="foaf: http://xmlns.com/foaf/spec/ dbp: http://dbpedia.org/ontology/"> … </package>
The prefix
attribute must not be used to redefine the default vocabulary or the predefined prefixes.
The prefix '_' is reserved for future compatibility with RDFa [RDFa10] processing, so must not be defined.
The property data type is a compact means of expressing an IRI [RFC3987] and consists of an optional prefix separated from a reference by a colon.
property | = | [ prefix , ":" ] , reference; | |
prefix | = | ? xsd:NCName ? ; | |
reference | = | ? irelative-ref ? ; | /* as defined in [RFC3987] */ |
The property data type is derived from the CURIE data type defined in [RDFa10], and represents a subset of CURIEs.
The following example shows a property value composed of the prefix dcterms
and the reference modified
.
<meta property="dcterms:modified">2011-01-01T12:00:00Z</meta>
After processing, this property would expand to the following IRI:
http://purl.org/dc/terms/modified
as the dcterms:
prefix is a reserved prefix that maps to the IRI http://purl.org/dc/terms/
.
When a prefix is omitted from the property value, the expressed reference represents a term from the default vocabulary.
The following example shows a property value taken from the default vocabulary.
<meta … property="role">aut</meta>
This property would expand to:
http://idpf.org/epub/vocab/package/#role
when the IRI for the default vocabulary is concatenated with the reference.
An empty string does not represent a valid property value, even though it is valid to the definition above.
A Reading System must use the following rules to create an IRI [RFC3987] from a property:
If the property consists only of a reference, the IRI is obtained by concatenating the IRI stem associated with the default vocabulary to the reference.
If the property consists of a prefix and reference, the IRI is obtained by concatenating the IRI stem associated with the prefix to the reference. If no matching prefix has been defined, the property is invalid.
The resulting IRI must be valid to [RFC3987]. Reading Systems are not required to resolve this IRI, however.
This section is informative
The following sections both define a set of properties for use in package metadata and constitute a referenceable vocabulary. This vocabulary is the default vocabulary reserved by this specification for the use of unprefixed terms in package metadata.
The properties defined in this vocabulary are referenceable using the base IRI http://idpf.org/epub/vocab/package/#
.
meta
PropertiesThe meta
element properties enhance Publication metadata by providing additional level(s) of detail.
These properties must reference the expression or resource they augment in the refines
attribute on their parent meta
element.
The following tables detail the available properties.
› alternate-script | |
Description: | The This property is typically attached to |
Allowed value(s): | xsd:string |
Cardinality: | In the metadata section: Attached to other metadata: |
Extends: | All properties. |
Example: | <meta refines="#creator" property="alternate-script" xml:lang="ja">村上 春樹</meta> |
› display-seq | |
Description: | The When the |
Allowed value(s): | xsd:unsignedInt |
Cardinality: | In the metadata section: Attached to other metadata: |
Extends: | All properties. |
Example: | <meta refines="#t2" property="display-seq">1</meta> |
› file-as | |
Description: | The file-as property provides the normalized form of the associated property for sorting. |
Allowed value(s): | xsd:string |
Cardinality: | In the metadata section: Attached to other metadata: |
Extends: | All properties. |
Example: | <meta refines="#creator" property="file-as">Murakami, Haruki</meta> |
› group-position | |
Description: | The The A Publication can belong to more than one group. |
Allowed value(s): | A single xsd:unsignedInt or series of decimal-separated numbers (e.g., 1 or 2.2.1 ). |
Cardinality: | In the metadata section: Attached to other metadata: |
Extends: | All properties. |
Example: | <meta refines="#t3" property="group-position">2</meta> |
› identifier-type | |
Description: | The When the |
Allowed value(s): | xsd:string |
Extends: | identifier |
Cardinality: | In the metadata section: Attached to other metadata: |
Example: | <meta refines="#src-id" property="identifier-type" scheme="onix:codelist5">15</meta> |
› meta-auth | |
Description: | The meta-auth property provides the name of a party or authority responsible for an instance of package metadata. |
Allowed value(s): | xsd:string |
Cardinality: | In the metadata section: Attached to other metadata: |
Extends: | All properties. |
Example: | <meta refines="isbn-id" property="meta-auth" id="meta-authority-01">Metadata Authority Inc.</meta> |
› role | |
Description: | The When the |
Allowed value(s): | xsd:string |
Cardinality: | In the metadata section: Attached to other metadata: |
Extends: | contributor , creator |
Example: | <meta refines="#creator" property="role" scheme="marc:relators">aut</meta> |
› title-type | |
Description: | The When the |
Allowed value(s): | xsd:string |
Extends: | title |
Cardinality: | In the metadata section: Attached to other metadata: |
Example: | <meta refines="#title" property="title-type">main</meta> |
link
PropertiesThe following tables define properties for use in the metadata
link
element rel
attribute.
› marc21xml-record | |
Description: | The marc21xml-record property indicates the referenced resource is a MARC21 record [MARC21XML]. |
Cardinality: | Zero or one |
Extends: | Only applies to the Publication. Must not be used when the refines attribute is present. |
Example: | <link rel="marc21xml-record" href="pub/meta/nor-wood-marc21.xml"/> |
› mods-record | |
Description: | The mods-record property indicates the referenced resource is a MODS record [MODS]. |
Cardinality: | Zero or one |
Extends: | Only applies to the Publication. Must not be used when the refines attribute is present. |
Example: | <link rel="mods-record" href="pub/meta/nor-wood-mods.xml"/> |
› onix-record | |
Description: | The onix-record property indicates the referenced resource is an ONIX record [ONIX]. |
Cardinality: | Zero or one |
Extends: | Only applies to the Publication. Must not be used when the refines attribute is present. |
Example: | <link rel="onix-record" href="pub/meta/nor-wood-onix.xml"/> |
› xml-signature | |
Description: | The The |
Cardinality: | Zero or more |
Extends: | All properties. |
Example: | <link refines="#meta-authority-01" rel="xml-signature" href="../META-INF/signatures.xml#MAI-Signature"/> |
› xmp-record | |
Description: | The xmp-record property indicates the referenced resource is an XMP record [XMP]. |
Cardinality: | Zero or one |
Extends: | Only applies to the Publication. Must not be used when the refines attribute is present. |
Example: | <link rel="xmp-record" href="pub/meta/nor-wood-xmp.xml"/> |
item
PropertiesThe following tables define properties for use in the manifest
item
element properties
attribute.
The Applies to
field indicates which Publication Resource type(s) the given property may be specified on, the Cardinality
field indicates the number of times the property must appear within the Package Document scope, and the Usage
field indicates usage conditions.
› cover-image | |
Description: | The cover-image property identifies the described Publication Resource as the cover image for the Publication. |
Applies to: | All raster and vector image types |
Cardinality: | Zero or one |
Usage: | Optional. |
› mathml | |
Description: | The mathml property indicates that the described Publication Resource contains one or more instances of MathML markup. |
Applies to: | EPUB Content Documents |
Cardinality: | Zero or more |
Usage: | Must be set if and only if the criterion specified in Description above is met. |
› | |
Description: | The nav property indicates that the described Publication Resource constitutes the EPUB Navigation Document of the Publication. |
Applies to: | The EPUB Navigation Document |
Cardinality: | Exactly one |
Usage: | Required. |
› remote-resources | |
Description: | The (refer to Publication Resource Locations for more information). |
Applies to: | All Publication Resources with the capability of internal referencing (e.g., XHTML Content Documents, SVG Content Documents, EPUB Style Sheets and Media Overlay Documents). |
Cardinality: | Zero or more |
Usage: | Must be set if and only if the criterion specified in Description above is met. |
› scripted | |
Description: | The scripted property indicates that the described Publication Resource is a Scripted Content Document (i.e., contains scripted content and/or elements from HTML5 forms ). |
Applies to: | EPUB Content Documents |
Cardinality: | Zero or more |
Usage: | Must be set if and only if the criterion specified in Description above is met. |
› svg | |
Description: | The svg property indicates that the described Publication Resource contains one or more instances of SVG markup. |
Applies to: | XHTML Content Documents; the value is implied for SVG Content Documents. |
Cardinality: | Zero or more |
Usage: | Must be set if and only if the criterion specified in Description above is met. |
› switch | |
Description: | The switch property indicates that the described Publication Resource contains one or more instances of the epub:switch element. |
Applies to: | XHTML Content Documents. |
Cardinality: | Zero or more |
Usage: | Must be set if and only if the criterion specified in Description above is met. |
The mathml
, remote-resources
, scripted
, svg
and switch
properties must be specified whenever the resource referenced by an item
matches their respective definitions. These properties do not apply recursively to content included into a resource (e.g., via the HTML5 iframe
element). For example, if a non-scripted XHTML Content Document embeds a scripted Content Document, only the embedded document's manifest item
properties
attribute will have the scripted
value.
Examples
The following example shows a manifest
item
element that represents the cover image of a Publication.
<item properties="cover-image" id="ci" href="cover.svg" media-type="image/svg+xml" />
The following example shows a manifest
item
element representing a Scripted Content Document that also contains embedded MathML
.
<item properties="scripted mathml" id="c2" href="c2.xhtml" media-type="application/xhtml+xml" />
itemref
PropertiesThe following tables define properties for use in the itemref
element properties
attribute.
The Cardinality
field indicates the number of times the property must appear within the Package Document scope, and the Usage
field indicates usage conditions.
› page-spread-left | |
Description: | The page-spread-left property indicates that the first page of the associated item 's EPUB Content Document represents the left-hand side of a two-page spread. |
Cardinality: | Zero or more |
Usage: | Optional. This property must not be specified on an itemref that also specifies the page-spread-right property. |
› page-spread-right | |
Description: | The page-spread-right property indicates that the first page of the associated item 's EPUB Content Document represents the right-hand side of a two-page spread. |
Cardinality: | Zero or more |
Usage: | Optional. This property must not be specified on an itemref that also specifies the page-spread-left property. |
Examples
The following example shows how a two-page spread of a map might be indicated in the spine
.
<spine> <itemref idref="title"/> <itemref idref="ps-1-l" properties="page-spread-left"/> <itemref idref="ps-1-r" properties="page-spread-right"/> <itemref idref="toc"/> … </spine>
The following table lists the EPUB 3 Core Media Types. When a Publication Resource conforms to a Core Media Type specification, it is a Core Media Type Resource and can be included in the Publication without the provision of fallbacks (refer to Restrictions and Fallbacks for more information).
The columns in the table represent the following information:
The MIME media type [RFC2046] used to represent the given Publication Resource in the manifest.
The specification to which the given Core Media Type Resource must conform.
The Publication Resource type(s) that the Media Type and Content Type Definition applies to.
Media Type | Content Type Definition | Applies to |
---|---|---|
Image Types | ||
image/gif | [GIF] | GIF Images |
image/jpeg | [JPEG] | JPEG Images |
image/png | [PNG] | PNG Images |
image/svg+xml | SVG Content Documents [ContentDocs30] | SVG documents |
Application Types | ||
application/xhtml+xml | XHTML Content Documents [ContentDocs30] | XHTML Content Documents and the EPUB Navigation Document. |
application/x-dtbncx+xml | [OPF2] | The superseded NCX |
application/vnd.ms-opentype | [OpenType] | OpenType fonts |
application/font-woff | [WOFF] | WOFF fonts |
application/smil+xml | [MediaOverlays30] | EPUB Media Overlay documents |
application/pls+xml | [PLS] | Text-to-Speech (TTS) Pronunciation lexicons |
Audio Types | ||
audio/mpeg | [MP3] | MP3 audio |
audio/mp4 | [AAC LC], [MP4] | AAC LC audio using MP4 container |
Text Types | ||
text/css | EPUB Style Sheets [ContentDocs30] | EPUB Style Sheets. |
text/javascript | [RFC4329] | Scripts |
This specification does not define any video codecs as Core Media Types. Refer to the note in EPUB Publications — Reading System Conformance above for informative recommendations on support for video codecs in EPUB Publications.
All Publication Resources of an EPUB Publication must be Core Media Type Resources or must provide a Core Media Type fallback. The cases in which Foreign Resource may be used, and the requirement and rules for Core Media Type fallback provision in such cases, are detailed below.
› Foreign Resources may be referenced from EPUB Content Document elements that have explicit intrinsic fallback mechanisms (e.g., the [HTML5] object
, canvas
, audio
and video
elements). A Core Media Type resource must be provided via the given element's intrinsic fallback mechanism in such cases.
› For the [HTML5] video
element, the image referenced by the poster
attribute and text content embedded within the video
element are also considered valid Core Media Type fallbacks in addition to the video
element's intrinsic fallback capabilities. For the purpose of providing a last resort fallback for Reading Systems that do not support video or the given video format(s), at least one of these should be included with each occurrence of the video
element.
› For the [HTML5] audio
element, text content embedded within the element is also considered a valid Core Media Type fallback in addition to the audio
element's intrinsic fallback capabilities. For the purpose of providing a last resort fallback for Reading Systems that do not support audio, embedded text content should be included with each occurrence of the audio
element.
› In this version of this specification, the [HTML5] track
element is exempt from the Core Media Type usage rule: Foreign Resources may be referenced from track
without the provision of a Core Media Type fallback.
› Fonts embedded in Content Documents or EPUB Style Sheets using the @font-face
mechanism may be Foreign Resources. Reading Systems must use the rules for matching font styles [CSS3Fonts] when identifying a fallback for an unsupported font type.
› Foreign Resources may be referenced directly from spine itemref
elements, and in this case Manifest fallbacks must be provided.
Fallbacks must be provided for each Publication Resource referenced in a spine itemref
element that is not an EPUB Content Document.
Fallbacks are provided using the fallback
attribute on the manifest item
element that represents the Publication Resource. The fallback
attribute's IDREF [XML] value must resolve to another item
in the manifest
. This fallback item
may itself specify another fallback item
, and so on.
The ordered list of all the ID references that can be reached starting from a given item's fallback
attribute represents the fallback chain for that item. The order of the resources in the fallback chain represents the Authors' preferred fallback order.
A Reading System that does not support the Media Type of a given Publication Resource must traverse the fallback chain until it has identified at least one supported Publication Resource to be used in place of the unsupported resource. If the Reading System supports multiple Publication Resources in the fallback chain, it may select the resource to use based on specific properties of that resource, otherwise it should honor the Authors' preferred fallback order.
A fallback chain must contain at least one EPUB Content Document and must not contain any circular- or self-references to item
s in the chain.
Fallbacks may also be provided for Top-level Content Documents that are EPUB Content Documents; a Reading System may choose to utilize such fallbacks in order to find the optimal version of a Content Document to render in a given context. An example of when this feature can be utilized is when providing fallbacks for scripted content [ContentDocs30].
All Publication Resources must be located in the EPUB Container, with the following exceptions:
› Audio resources may be located in the Container or remotely.
› Video resources may be located in the Container or remotely.
Authors should prefer locating audio and video resources in the Container to allow the user access to the entire presentation regardless of connectivity status.
The above rules for Publication Resource locations apply regardless of whether the given resource is a Core Media Type Resource or a Foreign Resource.
The inclusion of remote resources in an EPUB Publication is indicated via the remote-resources
property on the manifest item
element.
Any Publication Resource that is an XML-Based Media Type must meet the following constraints:
› It must be a conformant XML 1.0 Document as defined in Conformance of Documents [XMLNS].
› External identifiers must not appear in the document type declaration [XML].
› It must not make use of XInclude [XInclude].
The above constraints apply regardless of whether the given Publication Resource is a Core Media Type Resource or a Foreign Resource.
The schema for Package Documents is available at http://www.idpf.org/epub/30/schema/package-30.nvdl.
This schema is normative.
Validation using this schema will require a processor that supports [NVDL], [RelaxNG] and [ISOSchematron].
Note, however, that the NVDL schema layer can be substituted by a multi-pass validation using the embedded RELAX NG and ISO Schematron schemas alone.
application/oebps-package+xml
Media TypeThis appendix registers the media type application/oebps-package+xml
for the EPUB Package Document. This registration supersedes [RFC4839].
The Package Document is an XML file that describes an EPUB Publication [Publications30]. It identifies the resources in the Publication and provides metadata information. The Package Document and its related standards are maintained and defined by the International Digital Publishing Forum (IDPF).
application
oebps-package+xml
None.
None.
Package Documents are UTF-8 or UTF-16 encoded XML.
Package Documents contain well-formed XML conforming to the XML 1.0 specification.
Clearly, it is possible to author malicious files which, for example, contain malformed data. Most XML parsers protect themselves from such attacks by rigorously enforcing conformance.
All processors that read Package Documents should rigorously check the size and validity of data retrieved.
There is no current provision in the EPUB Publications 3.0 standard for encryption, signing, or authentication within the Package Document format.
None.
This media type registration is for the EPUB Package Document, as described by the EPUB Publications 3.0 specification located at http://www.idpf.org/epub/30/spec/epub30-publications.html.
The EPUB Publications 3.0 specification supersedes the Open Packaging Format 2.0.1 specification, which is located at http://idpf.org/epub/20/spec/OPF_2.0.1_draft.htm and which also uses the application/oepbs-package+xml
media type.
This media type is in wide use for the distribution of ebooks in the EPUB format. The following list of applications is not exhaustive.
Adobe Digital Editions
Aldiko
Azardi
Apple iBooks
Barnes & Noble Nook
Calibre
Google Books
Ibis Reader
MobiPocket reader
Sony Reader
Stanza
none
.opf
TEXT
The IDPF maintains a registry of linking schemes at http://idpf.org/epub/linking/. Some of these schemes define custom fragment identifiers that resolve to application/oebps-package+xml
documents.
William McCoy, [email protected]
COMMON
International Digital Publishing Forum (http://www.idpf.org)
This appendix is informative
EPUB has been developed by the International Digital Publishing Forum in a cooperative effort, bringing together publishers, vendors, software developers, and experts in the relevant standards.
The EPUB 3 specifications were prepared by the International Digital Publishing Forum’s EPUB Maintenance Working Group, operating under a charter approved by the membership in May, 2010 under the leadership of:
Active members of the working group included:
IDPF Members
Invited Experts/Observers
For more detailed acknowledgements and information about contributors to each version of EPUB, refer to Acknowledgements and Contributors [EPUB3Overview].
[AAC LC] ISO/IEC 14496-3:2009 - Information technology -- Coding of audio-visual objects -- Part 3: Audio .
[ASSOCSS] Associating Style Sheets with XML documents 1.0 (Second Edition) . 28 October 2010.
[CSS3Fonts] CSS Fonts Module Level 3 .
[ContentDocs30] EPUB Content Documents 3.0 .
[DCTERMS] DCMI Metadata Terms .
[ISOSchematron] ISO/IEC 19757-3: Rule-based validation — Schematron .
[MARC21XML] MARC 21 XML Schema .
[MediaOverlays30] EPUB Media Overlays 3.0 .
[OCF2] Open Container Format 2.0.1 .
[OCF3] Open Container Format 3.0 .
[ONIX] ONIX for Books .
[OPF2] Open Packaging Format 2.0.1 .
[OPS2] Open Publication Structure 2.0.1 .
[OpenType] ISO/IEC 14496-22:2009 - Information technology -- Coding of audio-visual objects -- Part 22: Open Font Format .
[PLS] Pronunciation Lexicon Specification 1.0 (PLS) . 14 October 2008.
[PNG] Portable Network Graphics (PNG) Specification (Second Edition) . 10 November 2003.
[Publications30] EPUB Publications 3.0 .
[RDFa10] RDFa in XHTML: Syntax and Processing . A collection of attributes and processing rules for extending XHTML to support RDF. 14 October 2008.
[RFC2046] Multipurpose Internet Mail Extensions (MIME) Part Two: Media Types (RFC 2046) . November 1996.
[RFC2119] Key words for use in RFCs to Indicate Requirement Levels (RFC 2119) . March 1997.
[RFC3987] Internationalized Resource Identifiers (IRIs) (RFC 3987) . January 2005.
[RFC4839] Media Type Registrations for the Open eBook Publication Structure (OEBPS) Package File (OPF) (RFC 4839) . April 2007.
[RFC5646] Tags for Identifying Languages (RFC 5646) . September 2009.
[RelaxNG] ISO/IEC 19757-2: Regular-grammar-based validation — RELAX NG. Second Edition . 2008-12-15.
[Unicode] The Unicode Consortium. The Unicode Standard, Version 5.0.0, defined by: The Unicode Standard, Version 5.0 (Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0).
[WOFF] WOFF File Format 1.0 .
[XInclude] XML Inclusions (XInclude) Version 1.0 (Second Edition) . 15 November 2006.
[XML] Extensible Markup Language (XML) 1.0 (Fifth Edition) . 26 November 2008.
[XML Base] XML Base (Second Edition) . 28 January 2009.
[XML DSIG Core] XML-Signature Syntax and Processing Version 1.1 . 3 March 2011.
[XMLNS] Namespaces in XML (Third Edition) . 8 December 2009.
[XMP] Extensible Metadata Platform .
[XSD-DATATYPES] XML Schema Part 2: Datatypes Second Edition . 28 October 2004.
[EPUB3Changes] EPUB 3 Differences from EPUB 2.0.1 .
[EPUB3Overview] EPUB 3 Overview .
[RFC4329] Scripting Media Types . April 2006.