Fan of creative technology, elearning, instructional design and a little geeky

Preserving electronic publications

I found myself sitting in the classics library of one of Perth’s beautiful private colleges the other day… staring at the 1875 edition of the Encyclopedia Britannica. Taking a volume off the shelf and flipping through the fragile and slightly yellowish pages brought about a feeling of nostalgia… I couldn’t help but wonder how 135 years from now — in 2045 — someone would be sitting in a virtual (digital) library and look through those unique books we are publishing today in electronic formats.

1875 Encyclopedia Britannica

Frank Romano (2002), published an interesting paper of E-Books and the Challenge of Preservation, in which he also points to the fact that “Libraries and information repositories face a continuing challenge in maintaining files … “.  He points to issues not only related to file formats, but also to storage hardware and media, as well as operating systems and further comments that over time even the data coding system and metadata might change. Romano, concludes his article with a bald statement “Libraries and other data repositories must take a more active role in shaping the future of e-publishing” and points out that at the time of writing little consideration was given to the preservation of electronic books.

Trends in digital preservation look  at PDF (PDF/A) as a transformation document format for combined textual and graphics documents as well as scanned images (National Archives, 2004). PDF/A is the ISO-standard for the long-term preservation of electronic documents. Web based content would most likely be preserved in Hypertext Markup Language (HTML) or XML. Both TIFF and JPEG are  referred to as preservation formats for graphics.

EPUB construct

It is pretty easy to break open any EPUB file and get to both content and meta-data. The screenshot above taken in PDFXML Inspector (available from Adobe Labs), clearly displays how EPUB content is stored. The nature of electronic publications is that its content is reflowable and can be read on different devices, much like we see with web-pages. This would mean that in preservation we would need to focus not on the display or look and feel of the book itself but on its content. Library and Archives Canada (2010) refers to EPUB as a recommended preservation and standard format that “addresses the content and presentation without digital rights management (DRM)”.  This statement not only eases my original concern about electronic book preservation, it also confirms the importance of a standardized format.

I would add to Romano’s statement that not only Libraries and other data repositories must take a leading step in shaping the future of electronic publishing, but publishers would need to take an equal role in choosing their digital publication formats.  It would be interesting to also review the impact DRM would have on digital preservation…

References

Library and Archives Canada (2010). Local Digital Format Registry (LDFR). File Format Guidelines for Preservation and Long-term Access. Version 1.0. Retrieved 13 December 2010, from http://www.collectionscanada.gc.ca/digital-initiatives/012018-2220.01-e.html

National Archives. (2010). Transfer of Permanent E-records to NARA. Retrieved 11 December 2010, from http://www.archives.gov/records-mgmt/initiatives/transfer-to-nara.html

Romano, F. (2002). E-Books and the Challenge of Preservation. Building a National Strategy for Preservation: Issues in Digital Media Archiving (106). Retrieved 11 December 2010, from http://www.clir.org/pubs/reports/pub106/contents.html

Similar posts
  • Building EBooks with InDesign – Forced Line Br... Updated 3-Jan-2012 / Postscript added One of the pitfalls of converting print publications to EBooks, is that designers often use forced line breaks (Shift+Return/Enter) in print-layouts to control where headings and even paragraph lines are broken. Doing so can cause problems when the same document is also converted to EPUB later on. Problems can occur [...]
  • InDesign CS5.5 7.5.2 Update fixes EPUB issues Amongst a list of issues that’s been resolved with today’s Adobe InDesign CS5.5 7.5.2 update release are a couple of EPUB issues. Notably: An issue in the DOCTYPE of the EPUB, where InDesign would insert an extra space, which in turn caused iBooks to return an error and stop rendering the EPUB. The iBooks rendering [...]
  • EPUB Export & Relative to Page Size Updated 7 June 2011 (added additional notes ). One of the new InDesign CS5.5’s EPUB Export Option features is the ability to control whether images are exported with Fixed width or Relative to Page Size setting. In this blog-post I’m running through a few [...]
  • InDesign Character Styles & EPUB I’ve been head deep into EPUB… and wanted to share another finding with InDesign CS5.5. A short post only, because I’m in the middle of work 🙂 InDesign CS5.5 now recognises character styles that are applied through nested styles and GREP styles during EPUB export.  That’s very cool!  It adds a span tag with class [...]
  • InDesign CS5.5 Lists & EPUB. Updated 10-May-2011 (added alternate option, thanks to Bob’s comment) Publishers quite often generate multiple paragraph styles for bulleted and numbered lists behaviour. For instance the first or last bullet point in a list might have slightly different space before and after settings applied to [...]

No Comments Yet

Leave a Reply

Your email address will not be published. Required fields are marked *