PDFA in A Nutshell - 1b PDF
PDF/A in a Nutshell
Long-Term Archiving with PDF
Olaf Drümmer
o.druemmer@callassoftware.com
Alexandra Oettler
pdfakompakt@alexandra-oettler.de
Dietrich von Seggern
d.seggern@callassoftware.com
ISBN: 978-3-9811648-1-7
This work and all its parts are protected by copyright. All rights, including translation, reproduction, presentation, use of illustrations and tables, radio
broadcasting, microfilming, any other means of replication, and storage in data processing systems, are reserved. This also applies to extracts. Any
replication of this work or of parts thereof, even in isolated cases, is only permissible in accordance with the currently valid version of the German
copyright legislation of September 9th 1965. A copyright fee must always be paid. Violations fall under the prosecution act of German Copyright Law.
Printed in Germany
The use of general descriptive names, trade names, trademarks, and so on, in this publication, even if not specifically identified, does not imply that
these names are not protected by the relevant laws and regulations or that they can be used by anyone.
Layout, design, and composition: Alexandra Oettler; Cover design: Anja Godolt; Cover picture: Sepp Huberbauer – photocase.com/de
Printing: Galrev Druck- und Verlagsgesellschaft Hesse & Partner OHG
Preface
Our world is getting more digital by the day. A lot of information and documents only exist in digital form today, but will they still be legible "tomorrow"? That was the theme of an interesting TV show appropriately called "The Digital Disaster". It began with cave drawings from the Stone Age and papyrus rolls from ancient Egypt, both of which have survived as documents for thousands of years. What documents from the 21st century will future generations be able to find and still read? But it's happening much quicker than you may realize. I always carry a 3½ inch floppy disk in my pocket, and it demonstrates a lot of the problems of long-term archiving. It begins with the hardware: where can you buy a 3½ inch floppy disk drive today? And even if you find one, there's a good chance that the disk is physically damaged. If these two hardware hurdles are successfully cleared, then what kind of software or document will we find on the floppy disk? Are the appropriate viewing and processing programs still available? And this example is a mere 15 years old!

My short anecdote leads us to the demands placed on the long-term archiving of documents. Electronic archiving is critical for businesses and organizations, because documents today often only exist in digital format. The length of time that business documents have to be archived varies from sector to sector and country to country, but some examples can help us to get an idea. Federal laws often require an archiving period of around 10 years. Banks and insurance companies demand that customer dossiers be retained for more than 50 years. In engineering, archival periods of 100 years are common for aircraft; bridges will hopefully last a good deal longer.

Saving documents in proprietary formats for this length of time is really not a good idea. This leads to the second problem with the digital document world: many users already have a real "format zoo", which can quickly become unmanageable (if it isn't already so). Proprietary document formats have to be migrated on a regular basis so that newer versions of the processing software can still read them.

Employees working on customer dossiers aren't really impressed when 10 different viewing programs are opened up at the same time. In some of the programs they might not even know how to navigate around in a document. In order to solve this problem, a document and archiving format is needed that guarantees the required long-term archiving period and offers the option of a single format type.

This is where PDF/A as an ISO standard for long-term archiving enters the stage. The "A" stands for "Archive", and the PDF/A standard was specifically created for long-term archiving. It envisions a single PDF/A archive for all documents in an organization, from input through to output, and includes all of the areas in between.

You will find many more advantages to PDF/A on the following pages, written with the aim of converting the very formal ISO standard into a form that is easily understood and enhanced with practical examples. Since PDF/A resolves a lot of the critical problems that users have, the PDF/A Competence Center was formed as an association with the aim of providing information about PDF/A, promoting the distribution of the standard, and acting as a central point of contact for your questions dealing with PDF/A. We hope that this booklet gives you a good overview and introduction to PDF/A, and also helps as a motivator for implementing the standard.

Berlin, in September, 2007
Thomas Zellmann,
Chairman PDF/A Competence Center

PS: a special thanks goes out to our member callas software GmbH, who initiated the German version of this booklet and provided it to the PDF/A Competence Center for translation into English and for further distribution.
Preface
Throughout history, it has always been important to preserve our past for future generations. Until the last 20 years in our paper centric world, this was a fairly easy task. One would simply take the folders of papers or other objects that were to be preserved and send them off to an archive for safe keeping or place them in a fire retardant container. With electronic documents this task is not as easily approached, which is how PDF/Archive or PDF/A came into being.

PDF/Archive addresses the growing need to electronically archive documents in a way that would ensure preservation of their contents over an extended period of time. Additionally, it ensures that the documents will be able to be retrieved and rendered with a consistent and predictable result each time they are viewed.

AIIM, the Enterprise Content Management Association, and NPES – The Association for Suppliers of Printing, Publishing and Converting Technologies were approached by numerous organizations which were being faced with the need to preserve large quantities of electronic documents over long periods of time. After reviewing the options of maintaining this electronic history in TIFF, XML, native format or PDF, it was decided that PDF would be the best format as it would enable the accurate rendering of the document as it had been intended to be displayed. However, in order to ensure the long term preservation of the electronic documents, PDF would need to be enhanced slightly.

The joint effort of AIIM and NPES brought together the document and content management experts with the graphics experts who had already developed the PDF/X family of standards. When we announced the proposed work to develop a subset of PDF tags for long-term preservation of electronic documents, we were overwhelmed by the interest to participate from virtually every area in the world.

With its expertise as an accredited standards developer and as the secretariat of ISO TC 171, Document Management Applications and ISO TC 171 SC2, Document Applications, AIIM brought to the project the means for gaining ISO approval and wider adoption of the standard. ISO 19005-1, Document management – Electronic document file format for long-term preservation – Part 1: Use of PDF 1.4 (PDF/A-1) became an approved ISO standard within 22 months of introduction as a new project through the dedicated efforts of many records managers, archivists, software developers and end users.

While adoption of the standard has been a little slower than we had anticipated, we are encouraged by the continuing interest and growing adoption of the standard. This book along with the continuing efforts of AIIM and the PDF/A Competence Centre will continue to increase the adoption rate of PDF/A in the industry.

Silver Spring, in September, 2007
Betsy Fanning
AIIM, Director, Standards
Table of Contents
Durable documents with the PDF/A standard
pdfaPilot PDF/A
Images
Resolution is not part of the PDF/A standard
Permitted and prohibited compression types
Transparency
Colors
Fonts
Metadata
PDF/A and metadata
Accessibility
Creating an accessible PDF file from Word
Electronic signatures
Security levels
Digital signatures in PDF with Acrobat
Challenges in practice
Enhancements in PDF/A-2
Looking towards PDF/A-3
PDF/A-1 developments
PDF/A in one hundred years time
Glossary
About:
AIIM

Illustrations: photocase.com/de
1. Durable documents with the PDF/A standard

There are certain documents that people want to keep because of their sentimental value: Love letters, photographs of their first day at school, or holiday snaps, for example. Other documents have to be kept for legal reasons. These documents include birth certificates, academic certificates and reports, invoices that are needed for tax purposes, insurance documents, and contracts.

In the days when everything existed on paper – in the pre-digital era – the main problem was remembering which index file, folder, or shoe box you'd used to store your letters or contracts. In today's world of digital documents, the task of archiving is fundamentally different. Thanks to search functions or database solutions, even the most forgetful of us can easily find a particular document or photo on our computers. In addition, any possible space problems can be solved simply by purchasing additional storage. However, there are certain risks and uncertainties that might influence the shelf life of digital documents. These risks do not only arise from the physical durability of the data carriers used, although it is clear that magnetic tape, CD-ROMs, and DVDs will not necessarily last any longer than paper and ink. Photographic prints dating from 1900 still exist today. It's debatable whether or not we will similarly be able to view the millions of digital snapshots being taken and stored on mobile phone memory cards all over the world in, for example, 2107.

In addition to the restrictions imposed by the limited lifetime of data carriers, the
document format and software used also present a considerable challenge for the durability of electronic documents.

Yesterday's, today's, and tomorrow's software
It's a common problem: Opening old documents in brand-new programs doesn't always work. The rate of success for the opposite direction (new documents in old programs) is even less encouraging. Software developers do try to achieve backward compatibility that enables files that are, say, five years old to be opened using a current program release. However, this can change the layout and page rendering, meaning that not everything is displayed exactly as it ought to be. More recent software tends to generate documents with additional features that older versions may not be able to display. In some cases, it is not even possible to open current files in previous versions of a program. For example, whereas a Microsoft Word 95 file can normally be opened in Word 2003, it is not possible to open a Word 2003 document in Word 95.

Because software production cycles are becoming ever shorter – one major release per year is not unusual – the challenge that arises from new program developments is greater than that caused by the aging of storage media. The successful long-term archiving of digital files is at least as threatened by the constant rollout of new program versions as by damaged data or data carriers.

Open files are not always complete
File formats are not all equally suitable for the long-term, secure archiving of content. If a file format cannot store all the elements required for the complete display of content – graphics and fonts as well as text – then problems may well arise when someone tries to use the file later on. If, for example, the program used cannot find linked external images, a page cannot be displayed as required. Instead, the frame where the image should appear displays only a rough preview of the image or a question mark. The problem of open files for which not all illustrations and fonts are available has been causing irritating delays for printers and their suppliers for a long time. However, the introduction of PDF, a format that can store all the components required for a printed document, has greatly simplified work in this area. In addition, layout files such as XPress or InDesign are now becoming increasingly less common in printers' archives. Instead, printers are storing the actual PDF documents that were used for the printing task.

TIFF as an archive format
For a long time, many public authorities and companies that need to store large quantities of correspondence, records, invoices, contracts, and similar information in digital archives have been using the pixel image format TIFF (Tagged Image File Format). This format digitalizes templates containing text and images pixel by pixel. TIFF is an established image file format that has both advantages and disadvantages. Pixel-based formats store the appearance of templates. Problems with missing graphics and fonts do not occur, since the format stores all of the template elements as an image. Since TIFF is widespread and is subject to few file handling complications when upgrading to a new program version, many users believe that the future of the format is guaranteed. However, while TIFF may indeed be a de facto standard, it is not an official norm for safe archiving. Other disadvantages include the relatively large file size and the fact that scanned texts cannot be searched without OCR (text recognition), since this format converts them to image elements.

TIFF-G4 – a black and white TIFF variant that works with a compression method developed for fax technology – is commonly used for archiving.
PDF specifications:

Since it was introduced at the start of the 1990s, the PDF file format has been in a state of constant development. The current PDF specification is version 1.7, which was introduced with Acrobat 8. Today, it is extremely rare to come across PDF files with a version number lower than 1.3, and modern PDF generation programs only have backward compatibility to version 1.3 at the most.

With each PDF version, Adobe Systems publishes a reference that describes the features and functions of the version in detail. The specification history contains 'milestones' – important features that were introduced with the new version. Some of these milestones are listed below.

Acrobat 1 (1993, PDF 1.0): PDF 1.0 incorporates most of the functions offered by the page description language PostScript Level 2. All basic functions for text, vector graphics, and raster graphics are available.

Acrobat 2 (1994, PDF 1.1): This version supports the Lab color space and CalRGB. It also supports TrueType fonts.

Acrobat 3 (1996, PDF 1.2): This version enables color separation and supports Unicode and CID fonts (Chinese, Japanese, and Korean). It also supports ZIP compression.

Acrobat 4 (1999, PDF 1.3): PDF 1.3 contains the complete PostScript Level 3 graphics model. It enables multi-channel color spaces (DeviceN) and supports ICC profiles for the reliable reproduction of colors. It introduces smooth shades and page geometry boxes, which are useful for prepress processes (TrimBox, CropBox, and BleedBox).

Acrobat 5 (2001, PDF 1.4): From this version, PDF files can contain transparency. This version also introduces 'tagged PDF' (= structured PDF), which enables content accessibility. The security options are enhanced with this version. In addition, the image compression type JBIG2 is supported.

Acrobat 6 (2003, PDF 1.5): With this version, PDF documents can contain layers (also called 'optional content'). JPEG2000 image compression is supported.

Acrobat 7 (2004, PDF 1.6): This version supports OpenType fonts. With this version, 3D content can be inserted. Users can create virtual page sizes with edges of up to 381 km in length.

Acrobat 8 (2006, PDF 1.7): Unicode path specifications simplify the correct specification of links, even across international language systems. The new Acrobat 'PDF packages' function allows several independent PDF documents to be forwarded in a single file. The recipient requires Acrobat or Reader 8.
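The version history above can be related to a concrete file: a PDF normally announces its version in a '%PDF-x.y' header at the start of the file. The following is a minimal Python sketch, not part of the booklet, that reads this header and looks up the matching Acrobat release from the list above. The function names are illustrative assumptions, and the sketch ignores the fact that later PDF versions allow the header value to be overridden inside the document.

```python
import re
from typing import Optional

# Release table copied from the milestone list above.
ACROBAT_BY_PDF_VERSION = {
    "1.0": "Acrobat 1 (1993)",
    "1.1": "Acrobat 2 (1994)",
    "1.2": "Acrobat 3 (1996)",
    "1.3": "Acrobat 4 (1999)",
    "1.4": "Acrobat 5 (2001)",
    "1.5": "Acrobat 6 (2003)",
    "1.6": "Acrobat 7 (2004)",
    "1.7": "Acrobat 8 (2006)",
}

HEADER_PATTERN = re.compile(rb"%PDF-(\d\.\d)")


def pdf_header_version(data: bytes) -> Optional[str]:
    """Return the version named in the '%PDF-x.y' header, or None if absent."""
    # The header is expected at the very start of the file. Some producers
    # prepend a few bytes, so a small window is searched instead.
    match = HEADER_PATTERN.search(data[:1024])
    return match.group(1).decode("ascii") if match else None


def describe(data: bytes) -> str:
    """Summarize the header version and the Acrobat release that introduced it."""
    version = pdf_header_version(data)
    if version is None:
        return "no PDF header found"
    release = ACROBAT_BY_PDF_VERSION.get(version, "not in the list above")
    return f"PDF {version}: {release}"
```

For a file on disk, the first kilobyte is enough: `describe(open(path, "rb").read(1024))`, where `path` is a placeholder for the file in question.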
However, only the new PDF/A standard can guarantee that users will be able to view exactly the same content as when their documents were created. This format brings the kind of legal certainty that can be decisive in many business and administrative contexts.

Why PDF/A and not PDF?
Why has a special PDF standard now been defined for archiving documents? Are traditional PDF documents not 'good enough' for long-term archiving? PDF has some excellent characteristics that lend themselves to the creation of archive documents. Like a container, a PDF can incorporate completely different elements such as text, images, and fonts. In addition, it reproduces layouts that are true to the original and is cross-platform capable. However, certain requirements must be met in order to enable the exact reproduction of content.

■■ Required: One 'must' is that users require full access to all elements belonging to a document. For example, fonts must be embedded – a link to the font in question is not sufficient. Otherwise, if, in 10 years' time, a user who tries to open a document does not have a required font on his or her computer, special characters or symbols will not be displayed correctly.

■■ Prohibited: In addition, some PDF features must be avoided. Such elements are prohibited because they would undermine the required document durability, and include interactive elements and PDF layers. These features inhibit the unambiguity that is required from an effective PDF/A file. For example, in the case of a PDF document with layers, users printing it out in 50 years' time might well ask themselves which layers are valid and which are not. This kind of decision needs to be made now – when the PDF is created.

A PDF/A document is basically a traditional PDF document that fulfills precisely defined specifications. In order to prevent users from repeatedly having to test and discuss the best appearance of a well-functioning archive PDF, industry experts decided in 2002 to work together to develop the PDF/A standard.

The introduction of the PDF/A standard
The PDF/A standard for long-term archiving was adopted by ISO (International Organization for Standardization) in autumn 2005. The PDF/A standard was published with the number 'ISO 19005-1:2005' and is based on PDF specification 1.4. An additional part, PDF/A-2, is currently being prepared. This part shall refer to PDF Version 1.7.

The PDF/A standard aims to enable the creation of PDF documents whose visual appearance will remain the same over the course of time. These files should be software-independent and unrestricted by the systems used to create, store, and reproduce them. As far as PDF/A is concerned, practice soon caught up with theory. While Acrobat Professional 7 contained only 'draft' PDF/A functions, Acrobat 8, which has been available since the end of 2006, now offers creation and verification features that comply with the adopted standard.

Many new PDF/A tools and solutions for creating and verifying files have entered the market since the introduction of the standard – from 'small' tools for individual users who want to create PDF/A documents every now and again to extensive server solutions that can create a hundred thousand archive documents from databases in just a few hours' time.

ISO is an international organization for standardization, active primarily in technical and electronic fields. The PDF/A standard was developed by industry and development experts.

PDF/A Competence Center
International companies and experts from the field of PDF technology have joined forces to form the PDF/A Competence Center. It aims to promote the exchange of information and experiences relating to long-term archiving. Users can visit www.pdfa.org for up-to-date advice and background information as well as a discussion forum on PDF/A.

PDF/A has two levels of compliance:

PDF/A-1a (Level A) applies to semantic correctness and structure. Each character must have a Unicode equivalent. The structure is expressed by tags.

PDF/A-1b (Level B) applies to visual integrity.

Any file that meets the requirements for PDF/A-1a will also comply with PDF/A-1b, which is less strict.
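The relationship between the two compliance levels (every PDF/A-1a file also satisfies PDF/A-1b, but not the other way round) can be written down as a small check. The following Python sketch is purely illustrative: the names `Level` and `satisfies` are assumptions made for this example and do not come from the standard or from any tool.

```python
from enum import Enum


class Level(Enum):
    """The two PDF/A-1 compliance levels described in this chapter."""
    A = "PDF/A-1a"  # semantic correctness and structure (tags, Unicode)
    B = "PDF/A-1b"  # visual integrity only


def satisfies(file_level: Level, required: Level) -> bool:
    """Return True if a file conforming to file_level also meets required."""
    # Level A includes everything Level B demands, so a Level A file
    # satisfies either requirement.
    if file_level is Level.A:
        return True
    # A Level B file only satisfies a Level B requirement.
    return required is Level.B
```

An archive that requires only visual integrity can therefore accept both kinds of file, whereas an archive that requires tagged structure has to reject files that are only PDF/A-1b.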
[...]cal documents can also be scanned for conversion to PDF/A. Solutions and services for mass processing are available for users who wish to scan a large number of pages or documents.

■■ Creating PDF/A from PDF: Many users already have PDF documents that are not PDF/A-compliant. It is often not possible to recreate such documents from the source program because, for example, they were not created locally but were sent to the user in question by e-mail. There are several methods for converting PDFs to PDF/As. Acrobat 8 Professional is one of the applications that can be used. However, Adobe is not the only company to market software for this particular task. There are many different products on the market, ranging from single-user solutions to systems for high throughput.

■■ Is this really a PDF/A file? When working with PDF/A on a daily basis, file verification is also important. Is it sensible to believe the sender of a PDF document when he or she says that it's a PDF/A file? Before received files are saved in an archive, they must be checked to make sure that they are PDF/A-compliant. There are various tools that enable file verification: In addition to Acrobat 8 Professional, there are other applications including Berlin-based callas software's pdfaPilot, which enables the verification and creation of PDF/A files as well as providing some additional functions.

Who stands to benefit from PDF/A?
Many sectors and professions have been waiting for a PDF standard for archiving. It is useful not only for archives, administrative departments, industry, and commerce but also for research and teaching. Many different types of content can be saved as PDF/A files. Below are a few randomly selected examples from various fields.

■■ Saving e-mails as PDF/A: Today, more and more correspondence, some of it of a contractual nature, is being sent by e-mail. Anyone who has switched from one e-mail program to another knows the difficulties involved in transferring old mail to the new system. Since PDF/A is a safe format, it makes sense to save e-mail archives on back-up media in the form of PDF/A at regular intervals.

PDF/A validation with Preflight: The Preflight validation and correction tool is part of Acrobat 8 Professional. It generates PDF/A files and checks existing PDF/A documents to make sure that they comply with the standard.
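The question "Is this really a PDF/A file?" has two parts: what a file claims about itself, and whether that claim holds. PDF/A files state their part and conformance level in their XMP metadata, using the properties `pdfaid:part` and `pdfaid:conformance`. The Python sketch below reads only this claim. It is an illustration under stated assumptions, not a validator: it assumes the XMP packet is stored uncompressed so that it can be found by scanning the raw bytes, and the function name is made up for this example. Verifying the claim still requires a real checking tool such as the ones named above.

```python
import re
from typing import Optional, Tuple

# XMP may store each property as an element (<pdfaid:part>1</pdfaid:part>)
# or as an attribute (pdfaid:part="1"); both forms are matched.
PART = re.compile(rb"pdfaid:part(?:>|=[\"'])\s*(\d)")
CONFORMANCE = re.compile(rb"pdfaid:conformance(?:>|=[\"'])\s*([A-Za-z])")


def claimed_pdfa_level(data: bytes) -> Optional[Tuple[int, str]]:
    """Return (part, conformance) as claimed in the XMP metadata, or None.

    This reports what the file says about itself. It does not check
    whether the file actually complies with the standard.
    """
    part = PART.search(data)
    conformance = CONFORMANCE.search(data)
    if part is None or conformance is None:
        return None
    return int(part.group(1)), conformance.group(1).decode("ascii").upper()
```

A result of `(1, "B")` means that the file presents itself as PDF/A-1b. A result of `None` means that no claim was found in the raw bytes, which is a good reason to send the file through conversion or full verification before archiving it.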
Annotations: Comments that take the form of sound or movies are not permitted. Traditional text/label-style annotations are permitted.
Referenced content: Referenced (non-embedded) images or page content are not permitted.
Alternate images: Alternate images (for lower-resolution screen display) are not permitted.
Programming languages: Embedded JavaScript is not permitted.
Actions: Certain actions, such as opening movies or sound files or sending or resetting forms, are not permitted.
Forms: Permitted, but with restrictions.
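Several of the restrictions listed above correspond to well-known PDF name tokens, for example `/JavaScript` and `/JS` for scripts, `/Sound` and `/Movie` for multimedia annotations and actions, `/Alternates` for alternate images, and `/SubmitForm` and `/ResetForm` for form actions. The following Python sketch looks for these tokens in the raw bytes of a file. It is a rough first-pass heuristic, not a compliance check: tokens inside compressed streams are not seen, a hit can be a false positive, and the function name and the selection of tokens are assumptions made for this example.

```python
import re
from typing import Dict, List

# PDF name tokens (without the leading '/') that hint at features the
# table above lists as not permitted in PDF/A-1.
SUSPECT_NAMES: Dict[str, str] = {
    "JavaScript": "embedded JavaScript",
    "JS": "embedded JavaScript",
    "Sound": "sound annotation or action",
    "Movie": "movie annotation or action",
    "Alternates": "alternate images",
    "SubmitForm": "submit-form action",
    "ResetForm": "reset-form action",
}


def find_suspect_features(data: bytes) -> List[str]:
    """Return sorted descriptions of suspect features found in raw PDF bytes."""
    findings = set()
    for name, label in SUSPECT_NAMES.items():
        # A PDF name is '/' plus the token. The lookahead requires that the
        # token ends here, so '/JS' does not match a longer name.
        pattern = re.compile(
            rb"/" + re.escape(name.encode("ascii")) + rb"(?![A-Za-z0-9])"
        )
        if pattern.search(data):
            findings.add(label)
    return sorted(findings)
```

An empty result does not prove compliance. The sketch is only meant to show how the rules in the table map onto things that can be found in a file.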
What's to be done with JPEG and TIFF-G4 archives?
There are basically two options for converting large document archives that currently use TIFF-G4 or JPEG to PDF/A along with their existing inventory: Permanently or temporarily.

If the number of documents handled is not too high and regular access to the data is required, converting the image files to PDF/A is worthwhile. This involves using mass conversion solutions that package pixel information into PDF and can enable text searching using text recognition.

However, if users only need to call up data from an archive every now and then, 'on the fly' solutions can be used to generate a PDF/A file from a particular original image file.
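The difference between the two options can be sketched in a few lines of Python. The example is deliberately abstract: the converter is passed in as a function, because the booklet does not name a specific conversion product. `Converter`, `convert_permanently`, and `OnTheFlyArchive` are hypothetical names chosen for this illustration.

```python
from typing import Callable, Dict

# Hypothetical converter: takes the bytes of a TIFF-G4 or JPEG image and
# returns the bytes of a PDF/A file. A real product would be plugged in here.
Converter = Callable[[bytes], bytes]


def convert_permanently(images: Dict[str, bytes], convert: Converter) -> Dict[str, bytes]:
    """Option 1: convert the whole inventory once and keep the results."""
    return {name: convert(data) for name, data in images.items()}


class OnTheFlyArchive:
    """Option 2: keep the original images and convert only on request."""

    def __init__(self, images: Dict[str, bytes], convert: Converter) -> None:
        self._images = images
        self._convert = convert
        self.conversions = 0  # how many conversions have actually been run

    def get(self, name: str) -> bytes:
        """Generate a PDF/A file from one original image when it is called up."""
        self.conversions += 1
        return self._convert(self._images[name])
```

With permanent conversion, the cost is paid once for every document, whether or not it is ever requested. With the on-the-fly approach, the cost is paid per request, which suits archives that are only consulted every now and then.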
Is XPS an alternative to PDF/A?
With applications that use PostScript for high-quality printing – including all professional publishing applications such as Adobe PageMaker, Quark XPress, CorelDraw, and Adobe Photoshop – the user is confronted with nothing more than a crutch: He or she ends up with a file export that has the quality of a screenshot.

Even if Microsoft and third-party suppliers manage to iron out some of these issues during coming years, XPS will remain a format that, while being extremely well-suited to depicting typical Office documents, cannot depict other document types or can only store them with unnecessary restrictions. In this respect, XPS does not offer the universal support of all document types that has made PDF such a powerful and popular format.

The best thing about XPS is therefore that it can be used to create a much higher quality PDF from applications that support it than previous methods such as GDI, PCL, or PostScript printer drivers. From Version 8, Adobe Acrobat offers an import filter for XPS. It enables users to easily open an XPS and simply save it as a PDF.

There is, of course, a suspicion that Microsoft intends to overstretch the capabilities of XPS by using its market power to position the format not just as a spool format or printer language but also as a universal exchange format – but it certainly doesn't fulfill the requirements for the latter as well as PDF. PDF should remain the more reliable format for many years to come.

Acrobat can handle a whole range of document formats. Users can now use the 'Open...' command in Acrobat 8 to open XPS documents as PDF files.
2. PDF/A creation: Analog, digital, and mass processing

PDF/A is always the destination, but the point of departure can differ greatly from user to user. This chapter concentrates on three main tasks: Converting paper documents to PDF/A, exporting Microsoft Office files and other documents in a way that allows them to be archived, and mass processing PDF archive files. The special process flow for converting existing PDF files to PDF/A is explained in detail later on.

PDF/A from scanned documents
'Analog to digital' conversion is normally required when users have received the documents that need to be archived as printed pages rather than creating them themselves. For example, telephone bills are often sent as printouts by mail. In some cases, documents that need to be retained are only available as printouts because the digital originals have been deleted from users' computers. In addition, many documents were created by typewriter or by hand in the days before computerization.

In such cases, the only way to digitize document pages is by using a document scanner. As well as the type of scanner (flatbed scanner or a device with bypass feed), the scope of features provided by the software also has a bearing on whether or not the digitalization process can create a faultless PDF/A and dictates which additional features can be used to enhance the usability of the document (for example, OCR for full-text searching).

Photo: Roufoto – photocase.com/de
Incidentally, all modern scanners support the use of PDF as the initial format (in addition to image formats such as JPEG or TIFF). However, not all scanners are currently able to generate PDF/A. This restriction is sure to change as the standard becomes more prevalent. Due to space restrictions, it is impossible to mention all established scan pro[...]

Important settings
Once the scanner is connected up and switched on, the user can trigger the creation of a PDF in Acrobat by choosing 'File' → 'Create PDF' → 'From Scanner' in the menu bar. In the dialog box that then appears, the scanner being used can be selected from the list of devices and the user can define whether the application is to scan only the front side of the document or both the front and back sides. In the 'Output' area, users can decide whether the current scan process should generate a new PDF document or append the scanned material to an existing PDF. The 'Make PDF/A Compliant' checkbox is especially useful here, and should be selected. Quality settings for the PDF document can be made using a slider or in more detail using the 'Options' button.

Acrobat Scan: Users can use a checkbox to define that a digitalized PDF document is to be standardized in line with PDF/A. If required, OCR (text recognition), accessibility, and metadata options can be activated.
Text recognition, accessibility, and metadata functions can be used to give the new PDF additional features.
The text recognition function creates searchable text (otherwise, the scanned
area includes a language setting and other fine-tuning settings for text recognition. For example, the user can define whether the scan output should be a searchable image or formatted text with graphics. However, note the following: Even the second, superior option is no guarantee of PDF/A-1a-compliance, since errors can still occur when structures are reconstructed. This is why the restricted version, PDF/A-1b, is used here, too.
Text Recognition and Metadata: These options give the PDF additional features such as searchable text and metadata. These features are not PDF/A-relevant, but they do enhance the functionality of the PDF file.
Recognize Text – Settings: The ‘PDF Output Style’ field contains options for generating a simple PDF image with searchable text or a more complex PDF file with separate areas for text and graphics where possible.
Converting pages that have already been scanned to PDF/A
The procedure used to convert scanned documents that already exist in the form of pixel data to PDF documents in Acrobat Professional is rather different. First, the image file (TIFF or JPEG) is imported by choosing ‘File’ → ‘Create PDF’ → ‘From File’ from the menu. It is then converted to a PDF file. The ‘Document’ menu contains the ‘Optimize Scanned PDF’ function. Once the document has been converted into a PDF, the user can use this function to improve it before subjecting it to text recognition.
Text recognition is also called from the ‘Document’ menu item. It is triggered with the ‘OCR Text Recognition’ → ‘Recognize Text Using OCR’ command.
The user can then check that the process worked correctly: Clicking ‘Find All OCR Suspects’ triggers a search for image elements that could not be converted to text.
Document optimization and text recognition: The ‘Optimize Scanned PDF’ function can be used to enhance the source material for text recognition, e.g. by removing edge shadows. Following this process, the user can use Adobe’s ‘Recognize Text Using OCR’ function to generate searchable text.
Saving or exporting the document as a PDF/A
The PDF document must now be converted to PDF/A. This can be achieved in just a few steps using the Export function or with the ‘Save As’ command. Both methods involve the use of the integrated Acrobat Professional Preflight engine, which carries out the conversion to PDF/A. Regardless of whether the user chooses the Export option or ‘Save As’, only the PDF/A-1b level in the ‘Settings’ will be successful.
Even after text recognition, metadata input, and the integration of structural information for accessibility, scanned documents do not automatically have advanced PDF/A-1a features.
When the user clicks ‘OK’, Acrobat generates a PDF/A file from the PDF document.
Scanned documents are always converted to PDF/A-1b: To ensure a successful PDF/A conversion, the preset PDF/A-1b-compliance specification must not be changed.
An important factor for determining the size of PDF files is whether the document is read in black and white (line scan), grayscale, or in color – color data consists of much more information than bitonal data and the resulting data quantity is therefore also larger.
Various image compression types have been developed over the past years to enable users to save memory space when storing image data. The best known of these methods is JPEG compression. PDF/A permits compression, but not all types. JPEG and JBIG2 are permitted, but JPEG2000 is not. In addition to the type of compression, the compression level is also important for a scanned text. This is for readability reasons. Higher compression levels can render the image or text progressively less legible.
Berlin-based LuraTech has been working on effective image compression for digitalized company documents for years. During the course of the development of PDF/A, LuraTech has enhanced the product and service scope of scan-to-image and scan-to-PDF solutions by adding a scan-to-PDF/A function. The JBIG2 compression used has been improved by a type of layer technology that enables color documents to be digitalized in a legible manner while using relatively low amounts of memory.
In addition to compression, there are various text recognition functions and options for integrating metadata into PDF/A files.
More information on the Internet at: www.luratech.com
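To make the effect of the scan mode on data volume concrete, the uncompressed size of a page can be estimated from its dimensions, resolution, and bit depth. The following is only a rough sketch: the A4 page size, the 300 dpi resolution, and the function names are assumptions chosen for illustration, and real files will be smaller once compression is applied.

```python
MM_PER_INCH = 25.4


def raw_scan_bytes(width_mm, height_mm, dpi, bits_per_pixel):
    """Uncompressed size in bytes of one scanned page (row padding ignored)."""
    width_px = round(width_mm / MM_PER_INCH * dpi)
    height_px = round(height_mm / MM_PER_INCH * dpi)
    return width_px * height_px * bits_per_pixel // 8


def a4_scan_sizes(dpi=300):
    """Raw size of an A4 page (210 x 297 mm) for the three scan modes."""
    modes = {"bitonal": 1, "grayscale": 8, "color": 24}  # bits per pixel
    return {mode: raw_scan_bytes(210, 297, dpi, bits) for mode, bits in modes.items()}


sizes = a4_scan_sizes()
for mode, size in sizes.items():
    print(f"{mode:9s} {size / 1_000_000:6.1f} MB")
```

Under these assumptions a color scan carries 24 times as much raw data as a bitonal scan of the same page, which is why the choice of scan mode matters before any compression is considered.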
Users can choose between two default settings in the main window of Acrobat Distiller. PDF/A in color space RGB is mainly suited for use on computer screens. CMYK PDF/A is intended for printing out with either an office printer or with professional four color printing on an offset printer.
PDF/A settings: Users can change the compression level and resolution in the ‘Images’ section. The compression type can be changed to ‘ZIP’. The preset sRGB output intent in the Standards section is the generally recommended intent for RGB. In the case of CMYK, the preset US profile can be changed to a profile more suited to the European market.
PDF/A settings in detail
Changes to the preset default settings for the generation of PDF/A should only be made after due consideration to avoid creating non-compliant documents. These settings can be modified by choosing ‘Settings’ → ‘Edit Adobe PDF Settings’.
Settings that influence the resolution
and compression of images can be made
in the ‘Images’ section. Files with lower
resolution and higher compression values
are smaller, but this can worsen the dis-
play quality. However, the compression
type can be changed to ZIP, which does
not impair the image quality.
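ZIP compression leaves image quality untouched because it is lossless: decompressing returns exactly the bytes that went in. The following minimal sketch illustrates this with Python's standard zlib module, which implements the Deflate method behind ZIP. The sample pixel data is made up, and the assumption that Acrobat's ‘ZIP’ option behaves like this Deflate round trip is ours, not a statement from the Acrobat documentation.

```python
import zlib


def zip_roundtrip(pixels: bytes, level: int = 9):
    """Compress and decompress; report packed size and whether data survived intact."""
    packed = zlib.compress(pixels, level)
    restored = zlib.decompress(packed)
    return len(packed), restored == pixels


# Made-up sample: 100 rows, each with 200 white and 56 black RGB pixels.
sample = bytes([255, 255, 255] * 200 + [0, 0, 0] * 56) * 100
packed_size, identical = zip_roundtrip(sample)
print(len(sample), packed_size, identical)
```

Lossy methods such as JPEG trade exactly this bit-for-bit identity for smaller files, which is why higher JPEG compression can degrade legibility while ZIP cannot.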
When creating PDF/A with the CMYK
color space, European users should take a
look at the ‘Standards’ section. The Out-
put Intent presetting here is intended for
the US market. (The term ‘output intent’
comes from the color management field
and refers to the regulation of color set-
tings for printing.) In this area, users can
select an output intent that is more suited
for use in Europe, such as the European
ICC profile ‘ISO Coated FOGRA27’, which
is contained in the Acrobat 8 scope of de-
livery.
If a change is made to a default profile,
the changed profile is saved as a copy; Dis-
tiller default settings cannot be overwrit-
ten.
settings to export PDFs from Office 2007, they should expect to experience problems in conjunction with Acrobat 8.0. According to the manufacturers, this is due to the fact that the rollout dates of the two software solutions were so close together. An Acrobat update to version 8.1 should solve these incompatibility issues.
Acrobat 7 offered support for preliminary versions of the PDF/A standard. As of Acrobat 8, full support of the final PDF/A standard is offered.
Office 2003 and the PDFMaker
It is only possible to generate PDF/A documents from Office 2003 using the PDFMaker add-in and a connection to Acrobat (or the Adobe Distiller). Acrobat 8 Professional provides current conversion settings for PDF/A. Users can create both PDF/A-1a-compliant and PDF/A-1b-compliant files from Office programs.
Settings for PDF/A-1b
The Office application menu (for example, in Word) has an ‘Adobe PDF’ entry that enables the triggering of PDF generation and access to the presettings. The ‘Change Conversion Settings’ command opens a dialog box where users can select options and make additional settings.
The ‘Settings’ tab consists of a dropdown menu with various options delivered with Adobe Distiller. There are two PDF/A-1b variants here – one for four-color CMYK output, and one for RGB monitor output. In this example, the RGB variant is used. Clicking the ‘Advanced Settings ...’ button opens detailed Adobe PDF settings. Users can change the image resolution and compression type here, but it is important to take care not to make changes that could endanger the PDF/A-compliance of files (for example, for Acrobat compatibility). However, let us return to the conversion settings tabs in the Microsoft application.
Be careful with the security settings
Because security settings – passwords for opening, printing, or changing PDF files – are not permitted in PDF/A files, users should not make any changes on the ‘Security’ tab. Users who wish to protect their PDF/A files must protect the storage location of these files. This can be achieved by implementing password protection for a folder or drive, for example.
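Whether security settings have crept into a file can be checked roughly before archiving. In the PDF format, an encrypted file announces itself through an ‘/Encrypt’ entry in its trailer, so a crude byte search already catches the common case. This is only a sketch under simplifying assumptions: the function name and the two sample trailers are invented for illustration, the search can produce false positives if the same characters appear in page content, and it is no substitute for a real PDF/A validator.

```python
import re

# "/Encrypt" as a complete key name, not the start of a longer name.
_ENCRYPT_KEY = re.compile(rb"/Encrypt(?![A-Za-z])")


def looks_encrypted(pdf_bytes: bytes) -> bool:
    """Heuristic: True if the raw file contains an /Encrypt key."""
    return _ENCRYPT_KEY.search(pdf_bytes) is not None


# Made-up trailer fragments for demonstration.
plain_trailer = b"trailer\n<< /Size 12 /Root 1 0 R >>\nstartxref\n1234\n%%EOF"
protected_trailer = (
    b"trailer\n<< /Size 12 /Root 1 0 R /Encrypt 11 0 R >>\nstartxref\n1234\n%%EOF"
)

print(looks_encrypted(plain_trailer), looks_encrypted(protected_trailer))
```

In practice the bytes would come from reading the candidate file in binary mode, and any hit would be a reason to remove the security settings before converting to PDF/A.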
Bookmarks
Users can choose to use Word formats for the generation of PDF bookmarks. Bookmarks are permitted for PDF/A. Users can make personal specifications for styles, headings, or Word bookmarks.
The ‘Word’ tab contains the ‘Enable advanced tagging’ checkbox, which is useful for users who want to generate structured PDFs.
So how do you create a PDF/A-1a-compliant file?
The conversion setting for PDF/A-1a takes the form of a checkbox in the PDFMaker Settings. If this checkbox is activated, the settings in the ‘Advanced Settings’ pulldown menu are locked to prevent users from making conflicting settings.
PDF/A-1a: This PDF conversion setting is activated by selecting a checkbox. It activates a function that can convert the advanced features of the higher compliance level, such as fonts and structure, from Office documents into the resulting PDF files.
PDF/A using the 3-Heights PDF Producer
Exporting PDF from Windows applications is not only a facility that is offered in more recent Office versions or in conjunction with the Adobe Distiller – there is a whole range of converters that can generate PDF documents. However, only a few products are capable of handling PDF/A.
PDF Tools AG’s 3-Heights PDF Producer produces PDF/A-compliant files for long-term archiving. This tool is capable of creating PDF documents that meet various
for redistribution on clients and multi-user servers. Swiss-based PDF Tools AG provides a whole host of tools and libraries for the creation and processing of PDFs. The company’s products can be purchased directly or via OEM partners. A free test version of the 3-Heights Producer Developer Kit (SDK) is available on the manufacturer’s Web site: www.pdf-tools.com.
3-Heights PDF Producer: This solution latches on to Windows’ print functions to deliver different types of PDFs, including PDF/A.
3. From PDF to PDF/A: Converting
PDFs to archive PDFs
Many users already use PDF to store docu-
ments in digital archives in companies,
public authorities, or privately. Now that
the PDF/A standard has been adopted, they
have the opportunity to create archive doc-
uments from their existing files, thereby
ensuring that they can be used in the long
term. In addition, recipients of traditional
PDF files that need to be retained but are
not yet available as PDF/A can now convert
them to archive PDF documents. In order
to do so, they need to know the answer to
the following question: How do you create
PDF/A documents from PDF files?
From PDF to PDF/A
clicking ‘Save As’, the Preflight module is responsible for converting the file.
The Preflight module is opened from the Acrobat ‘Advanced’ menu or by pressing Shift+Ctrl+X.
The lower section of the main Preflight window immediately provides information on the status of the opened PDF file with regard to the PDF standard: Is the document PDF/A and/or PDF/X-compliant? (PDF/X is a prepress standard.) If the PDF file was not created as a PDF/A, the user receives a message telling him or her that the file is ‘not a PDF/A file’. If the user now wants to trigger PDF/A conversion, he or she can simply click the PDF icon.
The Preflight tool uses a dialog box to ask the user whether the existing PDF files should be converted to PDF/A-1a or to a restricted PDF/A-1b version.
Preflight: The PDF/A icon is also a pushbutton that triggers conversion to PDF/A.
Conversion to PDF/A-1b
In the first scenario, the user selects the ‘PDF/A-1b’ standard and sets the output condition to ‘sRGB’ in the dialog box. This indicates that the PDF in question is destined to be displayed on a monitor. Since the PDF file quite possibly already contains an output intent, the tool provides a checkbox that specifies that the present intent is to be used. In addition, another checkbox prevents the embedding of the ICC color profile if it is not required. This reduces the resulting file size.
When the user clicks the ‘OK’ button, the Preflight tool searches the existing PDF document to see whether it meets the prerequisites for successful conversion to PDF/A. If the prerequisites are met, the conversion takes place. The green tick in this example shows that no problems occurred during the conversion. Details on the conversion process are shown in the Results window in the form of a list. The list contains information such as the fact that the tool added the file name suffix ‘_A1b’ to the source document.
Following the conversion: The Results window shows the steps that were carried out and informs the user that the conversion was successful.
Conversion to PDF/A-1a
The second scenario describes the conversion of a PDF file to PDF/A-1a. The proce-
Converting PDF to PDF/A
Thanks to its largely self-explanatory user interface, callas software’s pdfaPilot allows even inexperienced users with no prior knowledge to convert documents to PDF/A and verify them. This professional tool is a plug-in for Adobe Acrobat Standard and Professional Versions 6, 7, and 8. The conversion from existing PDF documents to PDF/A normally needs three steps and can be achieved in a maximum of four:
ceive tips on how to solve the problems encountered in order to be able to carry out a
lowing Internet address: www.callassoftware.com
High-volume processing with pdfaPilot CLI
The pdfaPilot CLI (Command Line Interface) is designed for high-volume PDF/A conversion and validation. This solution enables the server-based, automated generation of PDF/A files in companies or administrative departments.
Automation: pdfaPilot is also available as a command-line (CLI) module. pdfaPilot Validator CLI is a pure validation tool and pdfaPilot Converter CLI can validate, correct, and convert files.
4. Is this really a PDF/A file?
PDF/A validation
A PDF/A document created with Adobe Acrobat can be easily recognized by the file name suffix ‘_A1a’ or ‘_A1b’. Other PDF/A generators use similar procedures.
So why is an additional check needed when you receive a PDF/A file by e-mail or open a document from an archive?
The answer is simple: Because PDF/A files cannot be protected from further editing by measures including encryption or passwords. Doing so would contradict the PDF/A regulations, since PDF/A content must be available in its entirety without security measures.
This means that a PDF/A file that was once standard-compliant can lose that compliance as a result of unintentional or deliberate changes without it being obvious that it is no longer compliant with the standard.
However, further investigation using tools such as Adobe Acrobat Preflight, callas software’s pdfaPilot, or PDFlib 7 by PDFlib, all of which are specially designed for PDF/A validation, can safely and reliably uncover this kind of problem.
Of course, even deception cannot be ruled out – it is quite possible for users to manually add a file suffix such as ‘_A1b’ to a PDF file before sending it even if the file in question has never actually been converted to PDF/A. This is why checks consti-
Validation with Preflight
Acrobat 8 Professional’s Preflight tool is not designed only for the creation of PDF/A files – it can also be used to test and validate PDF/A documents for their actual compliance with the standard.
Calling up Preflight: This tool is called from the Acrobat menu (using the ‘Advanced’ menu item), by pressing Ctrl+Shift+X, or by clicking the tool icon.
The PDF/A icon at the bottom left of the Preflight window gives a quick overview of the PDF/A compliance of an open document. If a user opens a PDF/A file that has not yet been validated, the yellow question mark icon appears along with a message that names the output intent contained in the PDF document and informs the user that the file has not yet been validated.
If the PDF/A icon does not appear in the Preflight window, the status display may be deactivated in the Preflight preferences.
PDF/A status: The status icon has three possible states: A file can be not yet validated, successfully validated, or have failed the validation.
Clicking the icon starts the Preflight PDF/A check. The tool works through a list of conditions that the PDF document must fulfill in order to comply with the PDF/A standard. More than one hundred specifications must be observed in order for a document to be declared standard-compliant.
If the check finds no deviations from the standard, the software indicates that the PDF/A file is standard-compliant (indicated by the green tickmark) and names the output intent.
Successful validation: Clicking on the PDF/A icon with the yellow question mark starts the validation process. The result (in this case – successful) appears after a few seconds. Everything's fine.
No valid PDF/A file: In this example, the Preflight validation process has found a problem: The insertion of a watermark after the creation of the file added a PDF layer to the file. PDF layers are not permitted in accordance with the PDF/A standard.
The PDF/A validation fails if the document being checked does not meet all of the specifications stipulated by the standard. If this is the case, the system informs the user that problems have occurred by means of a red X. The Preflight results window contains a list of the problems encountered. Users can click the entries for more information on the various error messages. Preflight can also highlight the places where these problems were found (if the elements allow it to do so). The detailed information can also be viewed by double-clicking an entry in the list.
Because these error messages are not always self-explanatory, this publication contains an appendix that lists detailed background information on all possible errors in alphabetical order. Preflight also gives the user tips on how to repair errors that have occurred or how to avoid them next time around (see information starting on page 68).
Following a failed validation attempt, the PDF/A status is also indicated by a red X in the main Preflight window.
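A file name suffix says nothing reliable about a file, but a PDF/A file does carry its own declaration: its XMP metadata names the part and conformance level in the properties ‘pdfaid:part’ and ‘pdfaid:conformance’. The sketch below reads that declaration from the raw bytes. It rests on assumptions: the function name and sample packet are invented, the metadata is taken to be stored uncompressed, and the result is only what the file claims about itself. A claim is not a validation, so a full check with a tool such as Preflight is still needed.

```python
import re

# The value may be written as an element (<pdfaid:part>1</...>) or an attribute.
_PART = re.compile(rb"""pdfaid:part(?:>|=["'])\s*(\d)""")
_CONF = re.compile(rb"""pdfaid:conformance(?:>|=["'])\s*([A-Za-z])""")


def claimed_pdfa_level(pdf_bytes: bytes):
    """Return the level the file claims, e.g. 'PDF/A-1b', or None if it claims none."""
    part = _PART.search(pdf_bytes)
    conf = _CONF.search(pdf_bytes)
    if part is None or conf is None:
        return None
    return f"PDF/A-{part.group(1).decode()}{conf.group(1).decode().lower()}"


# Made-up XMP fragment for demonstration.
sample_xmp = (
    b'<rdf:Description xmlns:pdfaid="http://www.aiim.org/pdfa/ns/id/">'
    b"<pdfaid:part>1</pdfaid:part>"
    b"<pdfaid:conformance>B</pdfaid:conformance>"
    b"</rdf:Description>"
)

print(claimed_pdfa_level(sample_xmp))
print(claimed_pdfa_level(b"%PDF-1.4 no metadata here"))
```

A quick comparison of this claim with the file name suffix can flag obviously mislabeled files before the slower, authoritative validation is run.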
5. Archive PDFs in everyday life:
What issues might arise?
PDF/A requirements can change according to the environment in which the PDF/A files are used and the task to be done. One user might produce PDF/A files that only contain text and no illustrations, another might require signatures, and a third might need to create PDF documents that can be archived and also conform with accessibility requirements. The information below provides details on several usage possibilities and areas where PDF/A can be used.
Images
All images contained in PDF/A files must be clearly reproducible. This can only be ensured by integrating them into the files. They may not be allowed to ‘go missing’ over the course of time, as can happen with other file formats that specify a link to an external storage location rather than integrating images into files. Most of us will, at some point, have called up a Web page only to find that the illustrations are missing and question marks or red crosses in frames are displayed instead. This cannot happen with PDF/A.
An image on a PDF/A page is also clearly reproducible because it exists once and only once. On rare occasions – and only in the prepress area – alternate images are used. These images contain a lower-resolution variant for the screen and a high-resolution variant for printing. PDF/A does not permit alternate images, partly
PDF/A applications in everyday life
When flattening transparency, the user can choose between different quality levels (from low resolution to high resolution), since this process generates new images out of overlapping graphic objects.
However, users must be careful when removing highlighted text. Instead of using transparency flattening, which would make the yellow highlighting opaque and hide the text, the Acrobat PDF Optimizer function ‘Discard all comments, forms and multimedia’ should be used. This function can be called from the ‘Discard User Data’ area.
Hidden text: Transparency flattening should not be used for highlighted text. It is better to use the ‘Discard all comments, forms and multimedia’ function, since the Highlight Text Tool is a comments tool.
Colors
The colors of illustrations and graphics in a document should always appear exactly the same – whether displayed on one’s own monitor, on a colleague’s monitor, or viewed as a printout. Nothing is more annoying than a company logo that, when used in a presentation or brochure, fails to depict the corporate identity because, for example, it appears orange rather than magenta.
Thanks to PDF/A, such problems are a thing of the past, since the PDF/A standard guarantees the reliable reproduction of colors for text, image, and graphical elements.
Which color should it be? Without color management, the correct depiction of colors in company logos is a question of luck.
Color management
PDF/A uses color management to safely depict colors. Color management is based on the use of color profiles that are appended to image files, graphical documents, and PDF files to act as a kind of instruction manual.
The RGB color space is widespread in Office environments. sRGB (‘Standard RGB’) is now being used to enable colors to be displayed or printed as reliably as possible on different devices and printers. The sRGB profile is suitable for images, graphical elements, and text in Office documents. It was developed by Hewlett-Packard and Microsoft in 1996 to make printed pages as similar to those displayed on the screen as possible. Common modern monitors and printers support sRGB color adjustment.
Adobe RGB is another widespread RGB profile. It was published by Adobe Systems in 1998. This profile is most useful to people who work with digital photographs, since cyan and green tones appear to be more natural with Adobe RGB than with sRGB. For documents always intended for four-color printing (production or digital printing), the ISO Coated color profile constitutes a good choice.
Acrobat, Preflight, and pdfaPilot, is ideal. On the other hand, PDF/A files that are intended for printing can be given an ISO Coated profile.
The incorrect reproduction of colors can sometimes affect the message of an image: Was the evening spent at the lake depicted in these two photographs a warm evening or a cool one?
Tracking information: The information on tracking has been lost in the case of the overlapping letters.
Overlapping letters such as those that can occur when copying text are also eliminated by compliance with the PDF/A standard. The gobbledegook shown here is caused by missing tracking information. This problem cannot occur if PDF/A is used.
Unique characters with PDF/A-1a – thanks to Unicode
In addition to the points mentioned above, a further font requirement applies to
acter and symbol that exists worldwide (even for historic script). The Unicode Consortium and ISO work together on this project. Unicode encodes only abstract characters, not glyphs (the various graphical depictions of letters).
U+0061: All of these letter ‘a’s have the same Unicode numbers, regardless of the font.
The use of Unicode encodings for PDF/A-1a brings the advantage of all character-based text being completely unique. This enables text to be searched precisely and reliably for content as well as allowing content to be reused. This is not completely guaranteed in the case of PDF/A-1b documents, although it should usually be the case.
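The distinction between an abstract character and its glyphs can be seen directly in any Unicode-aware programming language. The short sketch below uses Python's standard unicodedata module to show the code point and official name of a character; the helper name `describe` is our own. Whatever font later draws the letter, the stored character remains the same number.

```python
import unicodedata


def describe(ch: str):
    """Return the code point in U+XXXX notation and the Unicode character name."""
    return f"U+{ord(ch):04X}", unicodedata.name(ch)


print(describe("a"))  # ('U+0061', 'LATIN SMALL LETTER A')
print(describe("A"))  # ('U+0041', 'LATIN CAPITAL LETTER A')
```

A search for ‘a’ therefore only has to compare numbers, which is what makes text with reliable Unicode mappings searchable and reusable.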
scription’ tab contains fields that specify the title (which does not have to be the same as the file name), author, subject, and keywords (freely definable). The ‘Title’ field is normally filled with the file name of the original file. The other fields can contain metadata from the original file if the user gave them XMP-compliant data and as long as the PDF is not being created using the Distiller. Programs in the Adobe Creative Suite pass on XMP metadata to PDF documents that are created using the Export function. The extent to which metadata can be transferred from Word or Excel files to the corresponding PDFs depends on factors including the program version being used.
Clicking the ‘Additional Metadata...’ button displays a whole range of further categories including options for copyright information, personal processing notes, and additional descriptions. Other programs than Acrobat (such as Adobe Bridge) and products and solutions offered by other suppliers are recommended for the mass allocation of metadata in PDFs.
Document Properties: There are four basic metadata fields on the initial screen of this area: Title (this field is usually prefilled on the basis of the source document), Author, Subject, and Keywords. Note the ‘Additional Metadata...’ button. It calls the dialog box shown below.
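The four basic fields exist twice in a PDF/A file: once in the classic document information entries and once in the XMP metadata, and PDF/A-1 expects the two to agree. As we understand the standard, Title corresponds to ‘dc:title’, Author to ‘dc:creator’, Subject to ‘dc:description’, and Keywords to ‘pdf:Keywords’. The sketch below compares two already extracted sets of values; the function name and the plain-dictionary input format are assumptions made for illustration, and reading the values out of a real file is left to a PDF library or validator.

```python
# Classic document information key -> corresponding XMP property.
INFO_TO_XMP = {
    "Title": "dc:title",
    "Author": "dc:creator",
    "Subject": "dc:description",
    "Keywords": "pdf:Keywords",
}


def metadata_mismatches(info: dict, xmp: dict):
    """List (info key, XMP property) pairs whose values are missing or differ in XMP."""
    problems = []
    for info_key, xmp_key in INFO_TO_XMP.items():
        if info_key in info and info[info_key] != xmp.get(xmp_key):
            problems.append((info_key, xmp_key))
    return problems


# Made-up values for demonstration.
info = {"Title": "Annual Report", "Author": "J. Doe", "Keywords": "finance"}
xmp = {"dc:title": "Annual Report", "dc:creator": "Jane Doe"}

print(metadata_mismatches(info, xmp))
```

Such a comparison is useful when metadata is assigned in bulk by external tools, since an update that touches only one of the two places leaves the file inconsistent.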
addition, tags can be used to distinguish between content and additional elements such as headers and footers or other background elements that do not directly belong to the content. Tags are also helpful for graphics and images on PDF pages. How do screen readers deal with images? If the creator has given the image ‘alternate text’ that explains the subject, the user is told not only that there is a graphic at the relevant point in the text but also that the graphic displays a guitar, for example.
It is relatively easy to generate a PDF/A document from an accessible PDF and vice versa. Note that the conversion to PDF/A takes place at the very end of this process. Once a valid PDF/A file has been created, it cannot be changed – otherwise, it loses its compliance status.
Automatically meaningful
structures?
Neither accessible PDF nor PDF/A-1a can enable a check
of the tagged PDF to make sure that the structures of a
document are meaningful or correct. Both types of
check can only determine whether structural informa-
tion exists in the specifications of the PDF file – not
whether any structural information found makes
sense.
For this reason, the standard stipulates that structural
information may not be added automatically later on. It
must be imported during the creation of the PDF or
added manually afterwards.
The automatic creation of structures might be possible
without causing problems for very simple PDF files.
However, if a user uses an automated process to recon-
struct a structure, he or she must make sure that the
process is validated.
Since note icons and input masks work with RGB, the PDF/A file in question must have an RGB output intent such as ‘sRGB’.
There are also comment types that are not permitted. It is easy to understand why text edit comments are prohibited. If such annotations exist, it is to be assumed that a text correction that should have been made has actually been overlooked. Care should also be taken with comments that use transparency to mark a document. This includes the Highlight Text Tool and the stamps delivered with Acrobat, e.g. ‘Approved’.
Hyperlinks are comments
It might be surprising, but from a technical point of view hyperlinks are also comments. They may not be retained in their original form if PDF/A-compliance is to be achieved – instead, they must be flattened. If a user attempts to convert a PDF file that contains links into a PDF/A file, the system issues two error messages per hyperlink: ‘Annotation has no Flags entry’ and ‘Annotation not set to print’.
The Preflight correction profiles ‘Remove all annotations’ and ‘Flatten comments’ can be useful here. In this case, the result of both procedures is the same. Once the links have been discarded, Preflight can usually convert the PDF file into a PDF/A file without any difficulty.
Hyperlinks are comments: The illustration below shows that this PDF/A conversion cannot be carried out in Preflight because of links in the document.
Removing or flattening annotations: Preflight corrections can remove or flatten comments. In the latter case, the annotations are still visible but they lose their typical comment features.
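The two error messages reported for hyperlinks refer to the flags value that every annotation can carry. In the PDF format this value is a set of bits, of which ‘Print’ has the value 4, while ‘Invisible’, ‘Hidden’, and ‘NoView’ have the values 1, 2, and 32. As we read PDF/A-1, the flags entry must be present, Print must be set, and the other three must be clear. The sketch below applies this rule to a single flags value; the function name and the wording of the third message are our own inventions, and only the first two messages are taken from the text above.

```python
INVISIBLE, HIDDEN, PRINT, NO_VIEW = 1, 2, 4, 32  # bit values of annotation flags


def annotation_flag_problems(flags):
    """Check one annotation's flags value; None means the entry is missing."""
    problems = []
    if flags is None:
        problems.append("Annotation has no Flags entry")
        flags = 0  # a missing entry behaves like a value of 0
    if not flags & PRINT:
        problems.append("Annotation not set to print")
    if flags & (INVISIBLE | HIDDEN | NO_VIEW):
        # Hypothetical message wording, not taken from Preflight.
        problems.append("Annotation is invisible, hidden, or not viewable")
    return problems


print(annotation_flag_problems(None))  # typical link annotation without flags
print(annotation_flag_problems(PRINT))  # acceptable: empty list
```

This also explains why a typical link produces two messages at once: without a flags entry, the Print bit cannot be set either.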
an alternative solution must be found for non-embedded fonts in form fields.
Embedding fonts for PDF/A forms
Many current tools do not enable the embedding of fonts in PDF form fields. However, these fonts must be contained within the PDF file in order to achieve PDF/A-compliance.
The Acrobat Preflight tool cannot embed fonts in form fields. Following a failed attempt to convert a document containing them into a PDF/A document, the tool issues the following error message: ‘Font not embedded’.
There is now a tool that can carry out this task – the Acrobat pdfaPilot plug-in from callas software. Among many other correction functions, it allows form PDF files to be converted into PDF/A-compliant documents. For the process to work, all of the fonts required for the PDF document being converted must be available and accessible on the computer. In addition to the function for embedding fonts, pdfaPilot also solves many of the common color problems that occur in forms.
used for the PDF conversion process. However, the PDF/A conversion may take place in Adobe or in another PDF/A converter. Older plans are often line scans in formats such as TIFF G4. Such plans can be converted to PDF and then to PDF/A using Acrobat Professional (or other PDF conversion solutions). It is often possible to use the text recognition function to give drawings searchable text during this process.
No 3D models in PDF/A
Designs created in 2D can be archived as PDF/A without any problems. This is not the case for 3D models. Three-dimensional designs have only been supported since Acrobat 7 (PDF 1.6), whereas PDF/A-1 is based on PDF 1.4. They are therefore not permitted in PDF/A-compliant files.
Electronic signatures
Our everyday life is now digital. Within the space of a few years, e-commerce has become much more widespread and business agreements are now often made online using e-mail. Digital communication between public authorities and citizens is no longer a thing of the future – just think of electronic tax return systems such as EFTPS.
Because parties taking part in such transactions are not in the presence of each other or witnesses, it is more important than ever to ensure that digital documents can be reliably checked for authenticity. Electronic signatures enable a completely digital flow of communication and transactions of a contractual nature.
The terms ‘digital signature’ and ‘electronic signature’ are often used interchangeably. However, the term ‘digital signature’ refers to a cryptographic, technical process, whereas ‘electronic signature’ is a legal term.
Proving authenticity by means of a mark or signature dates back nearly as far as the first written evidence of mankind. Even the Mesopotamians signed their records with a seal or stamp. The practice of signing documents with a stamp instead of by hand – which is still used in China and Japan today – has a history that dates back over millennia. Magnificent wax seals are known to have been used during the Middle Ages. Placing one’s own signature at the bottom of a contract is a relatively new procedure, just as general literacy is a relatively new achievement for our culture.
But now we are faced with another problem – how can we make digital files into legal documents?
The simplest way of electronically signing a file is to place a scanned signature on a page of the document in the form of an image file. This procedure can be legally recognized, as it is in the United States.
6. The outlook: PDF/A in the future
PDFs are extremely practical and it is difficult to argue against their usefulness for many application areas. PDF as a format has ‘grown up’ over a period of 14 years, and today the format itself and the software required to use it take various mature forms. In addition, the adoption of the PDF/A standard has made PDF a highly reliable format, both for today and for the future. Does this make the issue of technical formats and procedures for the long-term, secure archiving of digital documents a closed topic? What has PDF/A achieved and what remains to be done?
Enhancements in PDF/A-2
The PDF/A standard constitutes an extremely solid base, at least regarding the unambiguous and reliable visual reproduction of content. Without a doubt, the PDF/A standard will be developed further to incorporate technical fine-tuning of the PDF format in a manner that allows archiving.
The second part of the PDF/A standard – PDF/A-2 – is planned for 2009. It is important to note that the second part will not invalidate PDF/A-1; PDF/A-1-compliant documents will still be valid and reliable archive files. It will not be necessary to migrate existing PDF/A-1 archives to PDF/A-2 once the new PDF/A standard is published. Doing so would benefit nobody. However, in some cases it might make sense to archive new documents as PDF/A-2 files. For example, PDF/A-2 will support the JPEG2000 image compression format. If files contain image data in JPEG2000 format, it is clearly sensible to archive that data in JPEG2000, thereby avoiding the need to carry out a recompression into JPEG (which can cause albeit low
levels of data loss) or ZIP (which increases the amount of memory required to save files).
Looking towards PDF/A-3
A third part to the standard is already being discussed – PDF/A-3. This part should deal with ‘dynamic’ PDF documents. PDF/A-1 deals exclusively with PDF documents whose content and depiction do not change and may not be modified (as is the case with paper documents). In the case of PDF files that contain audio or video data, self-playing animated presentations, ‘walkable’ 3D models, or complex form logic with database connections, it is only possible to preserve a snapshot of a certain display point or specific content form when printing them or archiving them as PDF/A-1. This is hardly an ideal solution. It is certain to take several years to complete and adopt the PDF/A-3 standard, since depicting dynamic content is far more difficult than capturing static, two-dimensional visual content.
PDF/A-1 developments
In any case, there are bound to be new developments for PDF/A-1 itself – not for the PDF/A-1 standard, but for related issues.
Digital signatures
One closely monitored issue is the interaction of PDF/A-1 and digital signatures. The PDF/A standard intentionally permits digital signatures, but just as deliberately refrains from stipulating actual implementation methods. One important reason why there is not yet an ISO standard on digitally signing PDF/A documents is that requirements and legislation for digital signatures differ from country to country. Despite the prevalence of economic processes that are generally globalized, this area is subject to an extremely high degree of disparity and uncertainty. In addition, digital signature technology is still a long way from being mature, and is not as widespread or easy to use as PDF technology, which is accessible to all and sundry.
Full-text searching
Another important aspect is full-text searching. This function usually works so well with traditional PDFs that we take its permanent availability for granted. However, there are always a couple of actual hits that are, in fact, missed – even if we do not notice it, precisely because the hits in question are not found by the text search. The failure of a full-text search to find certain hits can result from something as basic as a typing error (for example, if the name ‘Smith’ is typed as ‘Smiht’). However, perfectly legitimate differences in spelling or punctuation can also cause a hit to be missed: For example, the number one thousand point zero is written
as 1,000.0 in the US, 1.000,0 in Germany, and 1’000,0 in Switzerland. There is also a great variety of ways to write down telephone numbers – various countries use spaces, brackets, or hyphens to improve readability or conform to different format rules. The PDF/A standard requires all characters to have unique Unicode names for the PDF/A-1a compliance level. However, it is not possible for the PDF/A standard to ensure that a unique Unicode character-to-code assignment is correct. Only a human operator can decide whether or not an X is being passed off as a U.
Structured content
There is also room for improvement in relation to the structure of content in PDF files: Countless documents that require archiving do not only contain structured content (a reading order and important specifications such as title, image caption, or sequential body text) but also include specific, uniquely identifiable data specifications. For example, telephone bills always contain fields such as ‘Customer Number’ and ‘Invoice Number’, and they always state the amount owed. Tagged PDF (which is also specified in PDF/A-1a for the structure of content) already helps a great deal in this area. However, it would be even more useful if this kind of data specification could be determined and read directly and uniquely, as data records are read from a database. Format-related ambiguities also need to be eliminated. In fact, the current state of technology enables these tasks to be accomplished, but a standard that makes documents and software interoperable is still required.
PDF/A in one hundred years’ time
The aspects mentioned above will probably be implemented during the next five or ten years. But what will the world of PDF/A be like in fifty or even one hundred years’ time? For example, what is the probability of a person interested in the beginnings of PDF/A being able to find and read a printout (or microfilm or TIFF version) or a PDF/A-1 version of this publication in the year 2107? This can partly be answered by a harsh truth: Money makes the world go round. As long as PDF/A is widely used, its popularity will create a market where providers of solutions and services can make a profit. There is no need to worry about the future of the format unless this market shrinks below a critical size. Unwanted stumbling blocks in the form of patents and other industrial property rights are less likely than for many other formats, even if they cannot be completely ruled out in the society in which we live. Consider, for example, Unisys and its LZW patent that has only recently expired, Forgent and its JPEG patent claims, or Microsoft, which had to pay over one and a half billion US dollars to Alcatel-Lucent in a dispute over the widely used MP3 format. In any case, one thing’s for sure: In fifty or one hundred years’ time, all of these patents will have expired.
However, one question remains: Have we already reached the critical PDF/A target required to ensure an enduring market, and, if not, when will the target be reached? As far as the rollout and practical implementation of PDF/A are concerned, we are still only beginning. However, in terms of the advantages offered by PDF/A, there cannot be any doubt that, by 2010, PDF/A will be so widely used that this critical target is certain to be met. This is why: No format other than PDF and/or PDF/A is so ideally suited, practical, and widespread when it comes to archiving the rapidly increasing number of digital documents. Moreover, no other format has been adopted as an ISO standard, which makes manufacturers far more likely to use it.
As mentioned previously, the PDF/A landscape will continue to develop in various ways in order to achieve technical progress and meet application-specific requirements. However, it is important to note that this will not result in a need to revise the basic principles of the standard: The ISO PDF/A-1 standard provides a very solid basis for the field – even in the light of planned additional parts to the PDF/A norm. It describes a strong foundation that is not subject to noteworthy change in the long term. This fact is sure to facilitate the strategic and economic justification of investment when implementing PDF/A-based archiving processes.
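The number-format problem described under ‘Full-text searching’ in this chapter can be made concrete with a small Python sketch. It is only an illustration, not part of PDF/A or of any search product: the table of separators and the function name `normalize_number` are assumptions introduced here, and the locale conventions are the three quoted in the text. The idea is that a search index could store a normalized value alongside the text as written, so that all three spellings lead to the same hit.

```python
from decimal import Decimal

# Hypothetical table: (grouping separators, decimal separator) per country,
# taken from the three examples quoted in the text.
SEPARATORS = {
    "US": (",", "."),
    "DE": (".", ","),
    "CH": ("'\u2019", ","),  # straight or typographic apostrophe
}


def normalize_number(text: str, country: str) -> Decimal:
    """Convert a locally formatted number into a plain decimal value."""
    grouping, decimal_sep = SEPARATORS[country]
    # Remove grouping separators first, then unify the decimal separator.
    for char in grouping:
        text = text.replace(char, "")
    return Decimal(text.replace(decimal_sep, "."))


us = normalize_number("1,000.0", "US")
de = normalize_number("1.000,0", "DE")
ch = normalize_number("1\u2019000,0", "CH")
print(us == de == ch)
```

The sketch only works because the country is passed in explicitly. Without that information, a string such as 1.000 remains ambiguous, which is exactly the kind of decision the text says only a human operator (or additional structured data) can make.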
What the error messages mean
Preflight results and troubleshooting for PDF/A
During PDF/A conversion or validation in Acrobat 8 Professional, the Preflight tool informs the user of problems that prevent compliance with the standard. Although the user receives a short explanation of each error in the info window, not all descriptions are easy to understand. For this reason, this alphabetical list of all PDF/A error messages that might occur is intended to help provide an overview of why the tool has declared a document to be non-PDF/A-compliant. In addition, each error message is followed by a description of measures that users can carry out in advance or later on in order to enable the production of a valid PDF/A file.
contain 3D comments. Remedy: The Acrobat 8 Preflight module can be used to discard 3D comments.
■■ Additional actions (AA) used: PDF files can dynamically alter their content during visualization. Actions can be contained in the PDF file for this purpose. The PDF/A standard stipulates that the visualization of a document must be guaranteed and always the same. For this reason, active content is not permitted in PDF/A files. The only exception is elements for page navigation. Remedy: The Acrobat PDF Optimizer can be used to remove these actions.
PDF/A does not differentiate between page objects and comments for colors, so the requirements for text, graphics, and images also apply here. Remedy: Such files can normally be converted to PDF/A using the ‘sRGB’ output intent.
■■ Annotation has no Flags entry: A comment element in a PDF file must contain certain additional information that determines its appearance when displayed on a monitor or printed out. This information is missing for the comments in this PDF file. This means that it is unclear whether/how the comment will be rendered when the document is displayed or printed out. The PDF/A standard stipulates that all information required for visualizing a comment must be contained within the PDF file in which the comment appears. Remedy: These comments are normally invisible components of hyperlinks. They can be removed using the Acrobat PDF Optimizer (‘Discard all comments, forms and multimedia’).
■■ Annotation Hidden flag set: Comments can be set to ‘Hidden’ to prevent them from being displayed on the monitor. The ‘Hidden’ flag is used to do this. Because the PDF/A standard stipulates that the visualization of a document must be ensured and because it is impossible to guarantee that the ‘Hidden’ flag will be correctly evaluated, PDF/A documents may not use this flag for comments. Remedy: These comments can be discarded using the Acrobat PDF Optimizer (‘Discard all comments, forms and multimedia’).
■■ Annotation Invisible flag set: Comments in PDF files can be set to ‘invisible’ to prevent them from being displayed on a monitor. The ‘Invisible’ flag is used to do this. Because the PDF/A standard stipulates that the visualization of a document must be ensured and because it is impossible to guarantee that the ‘Invisible’ flag will be correctly evaluated, PDF/A documents may not use this flag for comments. Remedy: These comments can be discarded using the Acrobat PDF Optimizer (‘Discard all comments, forms and multimedia’).
■■ Annotation not set to print: Comments in PDF files can be defined as non-printing to prevent them being printed out. Because the PDF/A standard stipulates that the visualization of a document must be ensured and that a document must be identical whether displayed on a monitor or output using a printer, comments in a PDF/A document must not be defined as not to be printed. Remedy: Current PDF-to-PDF/A converters correct this error during the conversion process.
■■ Annotation NoView flag set: Comments in PDF files can be set to ‘NoView’ to prevent them from being displayed on a monitor. Because the PDF/A standard stipulates that the visualization of a document must be ensured and that a document must be identical whether displayed on a monitor or output using a printer, comments in a PDF/A document must not be defined as not to be displayed. Remedy: These comments can be discarded using the Acrobat PDF Optimizer (‘Discard all comments, forms and multimedia’).
■■ Annotation’s AP (appearance) contains only N entry is not true: Comments in PDF files can contain differing visualization methods that are used, for example, depending on whether the mouse cursor is moved over a comment symbol or the comment symbol is clicked. These effects are, of course, not possible when a document is output on a printer. Because the PDF/A standard stipulates that the visualization of a document must be guaranteed and that it must appear identical when output on a printer and when displayed on a monitor, comments in a PDF/A document may not have different visualization variants for mouse effects. Remedy: The Acrobat Professional PDF Optimizer has an option called ‘Discard all comments, forms and multimedia’ in the ‘Discard User Data’ area. This option corrects this error.
■■ Author mismatch between Document Info and XMP metadata: In this PDF document, the data on the author in the XMP area does not match the data in the general document properties. The PDF/A standard stipulates that document information must exist in the XMP area. If this data is also contained in the document properties, it must be identical to the entries in the XMP area. Remedy: New PDF/A conversion.
■■ Belongs to transparency group: A group of page objects is defined as ‘transparent’. The PDF/A standard stipulates that all features used in a PDF file must be displayed in a single unique way on a monitor or in a printout. Because this cannot be ensured in the case of transparent objects and their backgrounds, transparency is not permitted in PDF/A files. Remedy: Adobe Acrobat Professional (Version 6, 7, or 8) includes a flattener module that can be used to remove transparencies.
■■ Bits per color component > 8: Images with a color depth of more than 8 bits per color component are used in this PDF. Color depths above 8 bits are not reliably supported by all visualization devices (monitors and printers). In addition to this, such fine nuances cannot be visualized technically on most devices in a way that ensures that differing color depths do not lead to differences in color or brightness when visualized. For this reason, images with more than 8 bits per color component are not permitted in PDF/A files. Remedy: The PDF must be regenerated using images that have a color depth of no more than 8 bits. Acrobat 8’s Preflight module also has a correction option that reduces the color depth of images from 16 bits to 8 bits.
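The annotation flag messages in this list (no Flags entry, Hidden, Invisible, not set to print, NoView) all refer to one integer in the annotation dictionary, the F entry, in which each flag occupies one bit. The following Python sketch shows how such a value could be tested. The bit positions are those defined in the PDF Reference; the function name `annotation_flag_problems` and the mapping of bits to the message texts are assumptions made for this illustration and do not describe how Preflight works internally.

```python
# Bit values of the annotation flags entry (F), per the PDF Reference.
INVISIBLE = 1 << 0  # bit 1
HIDDEN = 1 << 1     # bit 2
PRINT = 1 << 2      # bit 3
NO_VIEW = 1 << 5    # bit 6


def annotation_flag_problems(flags):
    """Return the messages from the list above that apply to one annotation.

    flags is the integer value of the F entry, or None if the entry is absent.
    """
    if flags is None:
        return ["Annotation has no Flags entry"]
    problems = []
    if flags & HIDDEN:
        problems.append("Annotation Hidden flag set")
    if flags & INVISIBLE:
        problems.append("Annotation Invisible flag set")
    if not flags & PRINT:
        problems.append("Annotation not set to print")
    if flags & NO_VIEW:
        problems.append("Annotation NoView flag set")
    return problems


print(annotation_flag_problems(4))     # only Print set: no problems
print(annotation_flag_problems(2))     # Hidden set, Print missing
print(annotation_flag_problems(None))  # F entry absent
```

A value of 4 (only the Print bit) is the simplest value that passes all four checks, which fits the remark above that converters can repair the ‘not set to print’ case automatically.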
■■ CharSet missing or incomplete for Type 1 font: A font is not fully embedded and contains no list of embedded symbols (CharSet). If a font in Type 1 format is not fully embedded, it must contain a list of the embedded characters to enable conversion to PDF/A. The list must include all characters used in this font in the PDF file. In this case, a font is not fully embedded in this PDF file and its list of embedded symbols is missing or is incomplete. Remedy: In order to resolve this problem, the PDF file must be created again using a different font or with the same font but in its complete form. Alternatively, the incomplete font may be used, but only with the relevant CharSet.
■■ CIDset in subset font is incomplete: A font is not fully embedded and contains no list of embedded symbols (CharSet). If a font in CID 1 format is not fully embedded, it must contain a list of the embedded characters to enable conversion to PDF/A. The list must include all characters used in this font in the PDF file. In this case, a font is not fully embedded in the PDF file and its list of embedded characters is incomplete. Remedy: In order to resolve this problem, the PDF file must be created again using a different font or with the same font but in its complete form. Alternatively, the incomplete font may be used, but only with the relevant CharSet.
■■ CIDset in subset font missing: A font is not fully embedded and contains no list of embedded symbols (CharSet). If a font in CID 1 format is not fully embedded, it must contain a list of the embedded characters to enable conversion to PDF/A. In this case, a font is not fully embedded in the PDF file but the list of embedded characters is missing. Remedy: In order to resolve this problem, the PDF file must be created again using a different font or with the same font but in its complete form. Alternatively, the incomplete font may be used, but only with the relevant CharSet.
■■ CIDSystemInfo and CMap dict not compatible: A font is not fully embedded and contains no list of embedded symbols (CharSet). The characters stored in a font are allocated number codes in accordance with an allocation table. These number codes are used to display the characters in the PDF that uses them. No allocation table has been specified for a font in this PDF. Remedy: In order to resolve this problem, the PDF file must be created again using a different font or with the same font but in its complete form. Alternatively, the incomplete font may be used, but only with the relevant CharSet.
■■ CMap not embedded for custom CMap: A font is not fully embedded and contains no list of embedded symbols (CharSet). This font has no clear information regarding the assignment of characters to letters (the CMap is missing). The letters and other characters used in PDF texts require ‘fonts’ that determine their exact appearance when visualized. The characters stored in a font are allocated number codes in accordance with an allocation table. These number codes are used to display the characters in the PDF that uses them. These allocation tables are made up differently depending upon the font format (PostScript Type 1, Type 3, or TrueType) and are known as ‘encodings’. MacRoman (Macintosh) and WinAnsi (Windows) are standard encodings. ‘CID fonts’ can use encodings that deviate from these standards. The PDF/A standard stipulates that a font that uses its own encoding must get the encoding in question from a corresponding table (CMap). This PDF does not use standard encoding and does not contain an encoding table (CMap). This PDF can therefore not be converted to PDF/A. Remedy: In order to resolve this problem, the PDF file must be created again using a different font or with the same font but in its complete form. Alternatively, the incomplete font may be used, but only with the relevant CharSet.
■■ CMYK used but PDF/A OutputIntent not CMYK: Device-dependent color (DeviceCMYK), but no CMYK output intent. Because the PDF/A standard stipulates that colors must appear the same (as far as is technically possible) regardless of the output device, either a PDF/A document must only contain device-neutral colors or the color properties of the output device must be defined using an output intent profile. If a document contains DeviceRGB or DeviceCMYK colors, an output intent of the same type must therefore exist. Remedy: Preflight contains a correction option that converts the alternate visualization to CMYK (SWOP). This correction must be duplicated and an RGB color space such as sRGB must be used as the target. The correction can then be assigned to a profile. The alternate visualization of the spot color can then be modified. pdfaPilot also solves this problem.
■■ CMYK used for alt. color but PDF/A OutputIntent not CMYK: A spot color has been defined in DeviceCMYK but the output intent is not defined for CMYK. Because the PDF/A standard stipulates that colors must appear the same (as far as is technically possible) regardless of the output device, either a PDF/A document must only contain device-neutral colors or the color properties of the output device must be defined using an output intent profile. If a document contains DeviceRGB or DeviceCMYK colors, an output intent of the same type must therefore exist. Remedy: Preflight contains a correction option that converts the alternate visualization to CMYK (SWOP). This correction must be duplicated and an RGB color space such as sRGB must be used as the target. The correction can then be assigned to a profile. The alternate visualization of the spot color can then be modified. pdfaPilot also solves this problem.
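The rule behind the two CMYK messages above (‘If a document contains DeviceRGB or DeviceCMYK colors, an output intent of the same type must therefore exist’) can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions: the function name `output_intent_problems`, its parameters, and the wording of the returned messages are hypothetical, and the sketch covers only the two device color spaces named in the text.

```python
# Output intent type required by each device-dependent color space
# that the text mentions.
REQUIRED_INTENT = {"DeviceRGB": "RGB", "DeviceCMYK": "CMYK"}


def output_intent_problems(device_spaces_used, output_intent_type):
    """List rule violations for one document.

    device_spaces_used: set of color space names found in the file.
    output_intent_type: "RGB", "CMYK", or None if no output intent exists.
    """
    problems = []
    for space in sorted(device_spaces_used):
        needed = REQUIRED_INTENT.get(space)
        if needed is None:
            continue  # color spaces outside the scope of this sketch
        if output_intent_type is None:
            problems.append(f"{space} used but no PDF/A OutputIntent")
        elif output_intent_type != needed:
            problems.append(f"{space} used but PDF/A OutputIntent not {needed}")
    return problems


print(output_intent_problems({"DeviceCMYK"}, "RGB"))
print(output_intent_problems({"DeviceRGB"}, "RGB"))
```

The first call reports a mismatch, the second reports nothing. A file that mixes DeviceRGB and DeviceCMYK cannot satisfy the rule with a single output intent, which is why the remedies above convert one of the two to the other or to a device-neutral color space.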
■■ Compressed object streams used: Since PDF 1.5, which Adobe introduced with Acrobat 6, some objects in PDF files can be compressed as object streams. This technique is used in this PDF. The PDF/A standard only permits objects that are compatible with PDF 1.4. Cross-object compression is therefore not permitted in PDF/A files. Remedy: Use the Acrobat PDF Optimizer to save the file as a PDF 1.4 file.
■■ Contains action of type ImportData: Active content that imports data from an external file. PDF files can dynamically alter their content during visualization. Actions can be contained in the PDF file for this purpose. The PDF/A standard stipulates that the visualization of a document must be guaranteed and always the same. For this reason, active content is not permitted in PDF/A files. The only exception is elements for page navigation. Remedy: The PDF must be redesigned and generated again so that all content is present within the file itself.
■■ Contains action of type Launch: Active content that triggers another application. PDF files can dynamically alter their content during visualization. Actions can be contained in the PDF file for this purpose. This PDF file contains an action for launching another application. The PDF/A standard stipulates that the visualization of a document must be guaranteed and always the same. For this reason, active content is not permitted in PDF/A files. The only exception is elements for page navigation. Remedy: The PDF must be redesigned and generated again so that all content is present within the file itself.
■■ Contains action of type Movie: Active content that shows a movie in another window. PDF files can dynamically alter their content during visualization. Actions can be contained in the PDF file for this purpose. This PDF file contains an action for playing a movie. The PDF/A standard stipulates that the visualization of a document must be guaranteed and always the same. For this reason, active content is not permitted in PDF/A files. The only exception is elements for page navigation. Remedy: The Acrobat PDF Optimizer contains an option called ‘Discard all comments, forms and multimedia’ that can be used to remove movies.
■■ Contains action of type ResetForm: Active content that influences the content of form fields. PDF files can dynamically alter their content during visualization. Actions can be contained in the PDF file for this purpose. This PDF file contains an action for emptying form fields (ResetForm). The PDF/A standard stipulates that the visualization of a document must be guaranteed and always the same. For this reason, active content is not permitted in PDF/A files. The only exception is elements for page navigation. Remedy: The Acrobat PDF Optimizer contains an option called ‘Discard all form submission, import and reset actions’ that corrects this problem.
■■ Contains action of type Sound: Contains audio data (sound). PDF files can dynamically alter their content during visualization. Actions can be contained in the PDF file for this purpose. This PDF file contains an action for playing sound. The PDF/A standard stipulates that the visualization of a document must be guaranteed and always the same. For this reason, active content is not permitted in PDF/A files. The only exception is elements for page navigation. Remedy: The Acrobat PDF Optimizer contains an option called ‘Discard all comments, forms and multimedia’ that can be used to remove audio data.
■■ Creation Date mismatch between Document Info and XMP metadata: The ‘CreateDate’ entry in the XMP document information deviates from the ‘Created’ entry in the document properties. The PDF/A standard stipulates that document information must exist in the XMP area. If this data is also contained in the document properties, it must be identical to the entries in the XMP area. Remedy: A PDF file can contain descriptive document information including the author, creation date, title, and other details. This information can be opened using the ‘File’ menu and changed in the general Document Properties dialog. Otherwise, the PDF/A file can be recreated from scratch.
■■ Creator mismatch between Document Info and XMP metadata: The ‘PDF creator’ entry in the XMP document information deviates from the corresponding entry in the document properties. The PDF/A standard stipulates that document information must exist in the XMP area. If this data is also contained in the document properties, it must be identical to the entries in the XMP area. Remedy: New PDF/A conversion.
■■ Custom annotation used: A comment in the PDF document does not use a standard PDF comment type. The PDF specification allows PDFs to contain comments in custom formats. This function can be used in specialized applications to position special, additional elements in PDF files. These comments must have the comment type ‘Custom’. Use is seldom made of these options. Because custom comments can only be visualized on specialized output devices, they may not be used in PDF/A files. Remedy: These comments must be discarded.
■■ Destination profiles in OutputIntents differ: There are multiple output intents with different profiles. Because the PDF/A standard stipulates that colors must appear the same
(as far as is technically possible) regardless of the output device, either a PDF/A document must only contain device-neutral colors or the color properties of the output device must be defined using an output intent profile. If a document contains DeviceRGB or DeviceCMYK colors, an output intent of the same type must therefore exist. The output intent must describe the color properties of the output device using an ICC profile. To ensure unambiguity, a PDF/A file may only have different output intents if all of the output intents used have the same ICC profile. Remedy: Preflight contains a correction option that converts the alternate visualization to CMYK (SWOP). This correction must be duplicated and an RGB color space such as sRGB must be used as the target. The correction can then be assigned to a profile. The alternate visualization of the spot color can then be modified. pdfaPilot can also carry out this task.
■■ Device process color used but no PDF/A OutputIntent: Device-dependent color exists, but no output intent. Because the PDF/A standard stipulates that colors must appear the same (as far as is technically possible) regardless of the output device, either a PDF/A document must only contain device-neutral colors or the color properties of the output device must be defined using an output intent profile. If a document contains DeviceRGB or DeviceCMYK colors, an output intent of the same type must therefore exist. Remedy: Preflight contains a correction option that converts the alternate visualization to CMYK (SWOP). This correction must be duplicated and an RGB color space such as sRGB must be used as the target. The correction can then be assigned to a profile. The alternate visualization of the spot color can then be modified. pdfaPilot can also carry out this task.
■■ Device process color used in alt. color space but no PDF/A OutputIntent: A spot color has been defined as a device color, but no output intent exists. Because the PDF/A standard stipulates that colors must appear the same (as far as is technically possible) regardless of the output device, either a PDF/A document must only contain device-neutral colors or the color properties of the output device must be defined using an output intent profile. If a document contains DeviceRGB or DeviceCMYK colors, an output intent of the same type must therefore exist. Remedy: Preflight contains a correction option that converts the alternate visualization to CMYK (SWOP). This correction must be duplicated and an RGB color space such as sRGB must be used as the target. The correction can then be assigned to a profile. The alternate visualization of the spot color can then be modified. pdfaPilot can also carry out this task.
■■ Document contains JavaScripts: Active content in the form of JavaScript changes the visualization of pages. PDF files can dynamically alter their content during visualization. This active content can be contained in PDF files in the form of JavaScript, for example. This is the case in this PDF. JavaScript is often used in connection with active form elements (for example, buttons). The PDF/A standard stipulates that the visualization of a document must be guaranteed and always the same. For this reason, active content is not permitted in PDF/A files. Remedy: The Acrobat PDF Optimizer contains the ‘Discard all JavaScript actions’ option in the ‘Discard Objects’ area. This option can be used to correct the error.
■■ Document is damaged and needs repair: The PDF file is incorrectly formatted. Every PDF/A file must be a basically correct PDF file. The file that is currently open does not conform to the PDF specification and can therefore not be converted to PDF/A. Remedy: The problem can possibly be resolved by opening the file in Adobe Acrobat and saving it once again using the ‘Save As’ option. Otherwise, it might be possible to clean it up using the ‘Save Optimized As’ option.
■■ Document is encrypted: The document is encrypted and cannot be analyzed. PDF files can be encrypted in order to password-protect certain functions. This means that a PDF can be displayed on the monitor without restrictions but a password is required in order to print or modify it. Encryption is not permitted in a PDF/A file, as its visualization would then be dependent upon information stored externally (i.e. a password). Remedy: The PDF file cannot be converted to PDF/A in this form. If the required password is known, the PDF file’s password protection can be removed in Adobe Acrobat and the file can then be saved.
■■ Embedded PostScript operator: The document uses PostScript code for the page description. PostScript code can also be used in PDF files. This option was primarily used at the beginning of the PDF format era by programs that did not offer full PDF support. However, there are very few programs that can visualize this PostScript code on a monitor. PostScript code is used in this PDF document to describe the page objects. Because the PDF/A standard stipulates that all components of a PDF file must be reliably visualized, the use of PostScript code is not permitted in PDF/A files. Remedy: This error occurs very rarely. Current PDF-to-PDF/A converters solve this problem during the conversion process by removing the PostScript entries.
■■ EmbeddedFiles entry in Names dictionary: The document contains an embedded file. In PDF files, other files can be embedded as an ‘attachment’ in a similar manner as with an e-mail. The corresponding program is required to view these files (for example, Microsoft Word if a Word file is em-
bedded). Because the PDF/A standard stipulates that it must be possible to visualize all components of a PDF file without the aid of other software, file attachments are not permitted in PDF/A files. Remedy: The Acrobat PDF Optimizer contains an option called ‘Discard file attachments’ in the ‘Discard Objects’ area. This option corrects the error.

■■ Encoding entry prohibited for symbolic TrueType font: This symbol font contains an allocation table for ‘normal’ fonts. The PDF/A standard stipulates that a TrueType font that is also a symbol font may not use an entry for this type of standard encoding, since standard encoding only defines ‘normal’ characters and not the special characters contained in symbol fonts. This PDF can therefore not be converted to PDF/A. Remedy: In order to resolve this problem, the PDF file must be created anew, using a different font.

■■ File header not compliant with PDF/A: The PDF file header (PDF version entry or binary digit string) is not compliant. The PDF/A standard stipulates that a PDF file must comply with the general file header regulations in the PDF specification (1.6). Remedy: The ‘Save As’ command in Acrobat can be used to solve this problem.

■■ File size is above 2GB: The file is too large (the maximum permitted size is 2 GB). Extremely large files may lead to rendering problems when the files in question are printed out or displayed on a monitor. The maximum file size is therefore limited to 2 GB in the PDF/A standard. Remedy: No repair possible. It may be possible to recreate the PDF file with a more effective compression type.

■■ Font not embedded (and text rendering mode not 3): Text uses a non-embedded font. The letters and other characters used in PDF texts require ‘fonts’ that determine their exact appearance when visualized. A text in this PDF uses a font that is not embedded into the PDF. It is therefore only possible to visualize this PDF correctly if this font is installed on the computer or printer being used. Because PDF/A stipulates that a PDF may not require external dependencies in order to be visualized, PDF/A files must not contain any fonts that are not embedded. The only exception is text that is not displayed but is merely used for the full-text search instead (text rendering mode = 3). Remedy: The PDF must be regenerated with all used fonts embedded.

■■ Form field does not have appearance dict: Form field is ‘invisible’. A PDF file can contain form fields. These form fields must contain additional information to ensure that they can be visualized. This PDF contains form fields that do not contain the required additional information. Because the PDF/A standard stipulates that the visualization of a document must be guaranteed and that it must be identical when output on a printer and when displayed on a monitor, a PDF/A document must only contain form fields with a visual representation. Remedy: The Acrobat PDF Optimizer contains an option called ‘Discard all comments, forms and multimedia’ that can be used to remove form fields.

■■ Form field’s AP (appearance) contains only N entry is not true: Form fields in PDF files can contain differing visualization methods that are used, for example, depending on whether the mouse cursor is moved over a form field or the form field is clicked. These effects are, of course, not possible when a document is output on a printer. Because the PDF/A standard stipulates that the visualization of a document must be guaranteed and that it must appear identical when output on a printer and when displayed on a monitor, form fields in a PDF/A document may not have different visualization variants for mouse effects. Remedy: The Acrobat Professional PDF Optimizer has an option called ‘Discard all comments, forms and multimedia’ in the ‘Discard User Data’ area. This option corrects this error.

■■ Glyphs missing in embedded font: A font does not contain all of the characters required. The letters and other characters used in PDF texts require ‘fonts’ that determine their exact appearance when visualized. This PDF contains an embedded font that does not describe all of the symbols used in the texts set in this font. This means that there is no visual representation for the characters that are missing. The PDF/A standard stipulates that all fonts used must be embedded and that a visual representation for all used characters must exist. This PDF can therefore not be converted to PDF/A. Remedy: To resolve this problem, the PDF file must be created anew.

■■ ICC profile version 4 or newer: This file uses an ICC profile for color definition that has a newer version than is permitted by PDF/A. The file can therefore not be converted to PDF/A. The ICC profile may also be defective. Remedy: Note that it is very unusual for ICC profiles that are more recent than Version 3 to be used. Tools such as the Acrobat Preflight module can be used to find out which component uses the profile in question. This enables the error to be eliminated. The object in question must be recreated and the PDF file must then be regenerated.

■■ ID in file trailer missing or incomplete: No file ID entry available. Every PDF file should contain an internal ID that gives it a certain uniqueness and is altered each time the document is changed. The PDF/A standard requires the presence of this ID. Remedy: The ‘Save As’ command in Acrobat can be used to solve this problem.
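
Three of the checks described above (file header, file size, and file ID) are simple enough to illustrate in a few lines of Python. The following is only a minimal sketch, not how Preflight or pdfaPilot work internally. Its assumptions: the header rule is read as ‘%PDF-1.n’ at the very start of the file followed by a comment line containing at least four bytes above 127 (taken from the PDF/A-1 standard rather than from this text), 2 GB is counted as 2 × 1024³ bytes, and the ID test is a plain byte search after the last ‘trailer’ keyword, which a real validator would replace with proper parsing. All function names are hypothetical.

```python
import re

MAX_BYTES = 2 * 1024**3  # assumption: the 2 GB limit counted in binary units


def header_ok(data: bytes) -> bool:
    """Check '%PDF-1.n' at offset 0, then a comment line with four bytes > 127."""
    match = re.match(rb"%PDF-1\.\d(\r\n|\r|\n)", data)
    if match is None:
        return False
    second_line = data[match.end():match.end() + 5]
    return (
        len(second_line) == 5
        and second_line[:1] == b"%"
        and all(byte > 127 for byte in second_line[1:])
    )


def size_ok(size_in_bytes: int) -> bool:
    """Check the file size against the limit quoted in the text."""
    return size_in_bytes <= MAX_BYTES


def trailer_has_id(data: bytes) -> bool:
    """Heuristic only: look for an /ID key after the last 'trailer' keyword."""
    start = data.rfind(b"trailer")
    return start != -1 and b"/ID" in data[start:]


# A tiny hand-made byte string that merely looks like a PDF (not a usable file).
SAMPLE = (
    b"%PDF-1.4\n%\xe2\xe3\xcf\xd3\n"
    b"1 0 obj\n<< /Type /Catalog >>\nendobj\n"
    b"trailer\n<< /Size 2 /Root 1 0 R /ID [<01ab> <01ab>] >>\n"
    b"startxref\n0\n%%EOF\n"
)

print(header_ok(SAMPLE), size_ok(len(SAMPLE)), trailer_has_id(SAMPLE))
```

For a file on disk, the size would normally be read with `os.path.getsize` instead of loading the whole file into memory. Files that store their trailer in a cross-reference stream have no ‘trailer’ keyword, so the ID heuristic would report them as missing an ID even when one is present.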
■■ Image has OPI information: OPI (Open Prepress Interface) is a procedure used in prepress. It involves the replacement of images with alternate images when printing via an OPI server. Since PDF files that carry OPI information can give different results depending on the output chosen (display on a monitor, printing on a desktop printer, or printing via an OPI server), OPI comments are not permitted in PDF/A files. Remedy: OPI comments can be removed using the Acrobat PDF Optimizer.

■■ Inadequate namespace URI for PDF/A entry: The PDF/A entry in the document information is incorrectly formatted. A PDF/A file must have a corresponding entry in its document information. This entry can also be displayed with Adobe Acrobat. To display it, the user must choose ‘Properties’ from the ‘File’ menu in Adobe Acrobat and then click the ‘Additional Metadata...’ button. The entry appears in the ‘Advanced’ section under http://www.aiim.org/pdfa/ns/id/. The PDF/A entry is available in this document but it is incorrectly formatted. Remedy: New PDF/A conversion.

■■ Incorrect PDF/A version number (must be 1): The PDF/A entry in the document information is incorrectly formatted (version number is not ‘1’). A PDF/A file must have a corresponding entry in its document information. This entry can also be displayed with Adobe Acrobat. To display it, the user must choose ‘Properties’ from the ‘File’ menu in Adobe Acrobat and then click the ‘Additional Metadata...’ button. The entry appears in the ‘Advanced’ section under http://www.aiim.org/pdfa/ns/id/. This group must contain a ‘pdfaid:part: 1’ entry. This entry specifies the version of the PDF/A standard. In this PDF file, the entry has a value that is not equal to ‘1’. Remedy: New PDF/A conversion.

■■ Incorrect PDF/A-1a conformance level (must be “A”): This message only appears when validating PDF/A-1a (and not when validating PDF/A-1b). The PDF/A entry does not have the compliance level PDF/A-1a. A PDF/A file must have a corresponding entry in its document information. This entry can also be displayed with Adobe Acrobat. To display it, the user must choose ‘Properties’ from the ‘File’ menu in Adobe Acrobat and then click the ‘Additional Metadata...’ button. The entry appears in the ‘Advanced’ section under http://www.aiim.org/pdfa/ns/id/. This group must contain a ‘pdfaid:conformance:’ entry that specifies ‘A’ and declares that the file must comply with PDF/A-1a (and not ‘only’ with PDF/A-1b). PDF/A-1a files are always compliant with PDF/A-1b. Remedy: Preflight can correct this entry if converting the file to PDF/A-1a.

■■ Incorrect PDF/A-1b conformance level (must be “B”): The PDF/A entry does not have the compliance level PDF/A-1b. A PDF/A file must have a corresponding entry in its document information. This entry can also be displayed with Adobe Acrobat. To display it, the user must choose ‘Properties’ from the ‘File’ menu in Adobe Acrobat and then click the ‘Additional Metadata...’ button. The entry appears in the ‘Advanced’ section under http://www.aiim.org/pdfa/ns/id/. This group must contain a ‘pdfaid:conformance:’ entry that specifies ‘A’ or ‘B’ and declares that the file must comply with either PDF/A-1a or PDF/A-1b. PDF/A-1a files are always compliant with PDF/A-1b. This PDF contains a conformity entry, but it is neither ‘A’ nor ‘B’. Remedy: Preflight can correct this entry.

■■ Interpolate key for image not false: An image has an ‘interpolation key’ that is not supported by PDF/A viewers. The rendering or printout of an image is based upon the resolution of the output device. This means that it depends on the vertical or horizontal resolution if a monitor is being used and on the thickness of the lines with which the printing drum can be ‘imaged’ if a laser printer is being used. If the image resolution is significantly less than the resolution of the output device, additional pixels must be added. This process is known as interpolation. Interpolation is normally carried out in accordance with a standard procedure. However, an image in a PDF file can contain a key stating that a particular interpolation procedure must be used. Nevertheless, this option is rarely used these days and the key is ignored by most output devices. Because PDF/A files must appear the same regardless of the output device used, images in PDF/A files may not have interpolation keys. Remedy: PDF-to-PDF/A tools such as Preflight or pdfaPilot have correction functions that make files containing interpolation keys standard-compliant.

■■ Invalid rendering intent: Only the following standard rendering intents are permitted in PDF/A files: Relative Colorimetric, Absolute Colorimetric, Perceptual, and Saturation. It is very unusual for a different rendering intent to be specified in a PDF. However, this PDF uses a different rendering intent. Remedy: This is a very unusual error. It can be corrected using pdfaPilot.

■■ Invalid WMode: The stream direction is entered incorrectly in this font. The letters and other characters used in PDF texts require ‘fonts’ that determine their exact appearance when visualized. In addition to the appearance of the characters, a font must contain information regarding the font ‘stream direction’, since the characters of a font may not always be strung together horizontally from left to right as is the case with Latin fonts. For example, in some Far Eastern fonts, characters are strung together in a vertical direction (from top to bottom). This PDF uses a font with incorrect stream direction information. It is therefore impossible to ensure that the PDF will always be visualized in exactly the
same way regardless of the output device. This PDF cannot be converted to PDF/A. Remedy: This problem can be avoided by using a different font for the text in question.

■■ JPEG2000 compression used: An image in this document is compressed in JPEG2000. Images placed in a PDF are usually compressed to keep the size of the PDF file to a minimum. Various compression methods such as ZIP, LZW, and JPEG can be used to compress image data. Images compressed using JPEG2000 can be decompressed in stages during visualization, meaning that an image can be displayed even if it is not fully decompressed. However, this process is only supported in more recent PDF versions and must not be used in PDF/A files. The PDF/A standard does not support objects that were not permitted in the PDF 1.4 specification that was published by Adobe with Acrobat 5. Remedy: The Acrobat PDF Optimizer can be used to apply JPEG or ZIP compression (without downsampling) to all images. Alternatively, the file can be saved as a PDF 1.4 file.

■■ Keyword mismatch between Document Info and XMP metadata: The ‘Keywords’ entry in the XMP document information deviates from the corresponding entry in the document properties. The PDF/A standard stipulates that document information must exist in the XMP area. If this data is also contained in the document properties, it must be identical to the entries in the XMP area. Remedy: New PDF/A conversion.

■■ Last Modification Date mismatch between Document Info and XMP Metadata: The ‘ModifyDate’ entry in the XMP document information deviates from the corresponding entry in the document properties. The PDF/A standard stipulates that document information must exist in the XMP area. If this data is also contained in the document properties, it must be identical to the entries in the XMP area. Remedy: The ‘Save As’ command in Acrobat can be used to solve this problem.

■■ Layers used: The file contains layers that can be used to switch the visibility of objects on and off. Layers can be used in PDF files to define that certain page content should only be visualized under certain circumstances. Whether or not an object placed on a layer is visible depends on whether the viewer has set the layer in question to ‘visible’ or ‘invisible’. (It is also possible for visibility of a layer to be linked to other factors such as the zoom level with which a PDF is viewed; in this case, very small details in a drawing might only be visible when using a large zoom value.) Because PDF/A stipulates that the visual appearance of a PDF must always be exactly the same, layers cannot be used in PDF/A files. Remedy: The Acrobat PDF Optimizer ‘Discard hidden layer content and flatten visible layers’ option or the Preflight ‘Merge Layers’ option can be used to correct this error.

■■ LZW compression used: Objects used in a PDF are often compressed to keep the size of the PDF file to a minimum. Various compression methods are permitted for doing this, including ZIP, LZW, and JPEG (for images). This PDF file uses LZW compression. LZW compression is a lossless compression method that is patented. It can quite easily be replaced by ZIP compression, which is also lossless and uses a similar algorithm but is not patent-protected. The PDF/A standard only permits objects that can be visualized without restriction, even in the future. This also includes legal restrictions that might exist because of the LZW patent. For this reason, LZW compression is not permitted in PDF/A files. Remedy: The Acrobat PDF Optimizer contains an option that can be used to apply JPEG or ZIP compression (without downsampling) to all images in the ‘Images’ section.

■■ Marked entry in MarkInfo missing: This message only appears when validating PDF/A-1a (and not when validating PDF/A-1b). The document does not contain any information on its structure (in the document catalog). The stricter PDF/A-1a standard stipulates that a PDF file must contain structural information. Remedy: To resolve this problem, the PDF file must be given the relevant structural information. This information can be added when the PDF is generated. Some PDF export modules have a ‘Tagging’ option or an option with a similar name. This enables structural information to be transferred into a PDF. It is also possible to add structural information later on in Adobe Acrobat Professional. Alternatively, it might be possible to convert the file to PDF/A-1b instead.

■■ Marked entry in MarkInfo not boolean: This message only appears when validating PDF/A-1a (and not when validating PDF/A-1b). The document contains no correctly formatted information on its structure. The stricter PDF/A-1a standard stipulates that a PDF file must contain structural information. Remedy: To resolve this problem, the PDF file must be given the relevant structural information. This information can be added when the PDF is generated. Some PDF export modules have a ‘Tagging’ option or an option with a similar name. This enables structural information to be transferred into a PDF. It is also possible to add structural information later on in Adobe Acrobat Professional. Alternatively, it might be possible to convert the file to PDF/A-1b instead.

■■ Marked entry in MarkInfo not set to true: This message only appears when validating PDF/A-1a (and not when validating PDF/A-1b). The entry for structural information is defined
as ‘not available’ in the document. The stricter PDF/A-1a standard stipulates that a PDF file must contain structural information. Remedy: To resolve this problem, the PDF file must be given the relevant structural information. This information can be added when the PDF is generated. Some PDF export modules have a ‘Tagging’ option or an option with a similar name. This enables structural information to be transferred into a PDF. It is also possible to add structural information later on in Adobe Acrobat Professional. Alternatively, it might be possible to convert the file to PDF/A-1b instead.

■■ MarkInfo missing: This message only appears when validating PDF/A-1a (and not when validating PDF/A-1b). The document does not contain any information on its structure (in the structure info directory). The stricter PDF/A-1a standard stipulates that a PDF file must contain structural information. Remedy: To resolve this problem, the PDF file must be given the relevant structural information. This information can be added when the PDF is generated. Some PDF export modules have a ‘Tagging’ option or an option with a similar name. This enables structural information to be transferred into a PDF. It is also possible to add structural information later on in Adobe Acrobat Professional. Alternatively, it might be possible to convert the file to PDF/A-1b instead.

■■ Max. nesting level of graphic states exceeded: The PDF file contains very deeply nested page objects that can cause problems when it is printed out. Every PDF/A file must be a basically correct PDF file. The file currently open violates the restrictions of the PDF specification, which limits the degree of nesting for page objects. It is therefore not compliant with the PDF specification and cannot be converted to PDF/A. Remedy: The problem can possibly be resolved by opening the file in Adobe Acrobat and saving it once again using the ‘Save As’ option. Otherwise, it might be possible to clean it up using the ‘Save Optimized As’ option.

■■ Max. number of colorants for DeviceN exceeded: An object uses too many color channels in a DeviceN color space. DeviceN is a multi-channel color space in which spot colors can also be used, for example. DeviceN objects can be used in PDF/A files but the number of channels is restricted to a maximum of 8. Remedy: It is extremely unusual for a DeviceN color space to use more than 8 channels. Tools such as the Acrobat Preflight module can be used to find out which component uses the color space in question. The error can then be corrected. The object in question must be recreated and the PDF file must then be regenerated.

■■ Metadata does not conform to XMP: The PDF/A standard stipulates that document information must exist in the XMP area. This document has XMP metadata but it is incorrectly formatted. Remedy: The file must either be converted to PDF/A again or the PDF document must be regenerated.

■■ Metadata entry missing: No document information for the PDF/A entry. The PDF/A standard stipulates that document information must exist in the XMP area. There is no XMP document information in this PDF file. Remedy: The ‘Save As’ command in Acrobat can be used to solve this problem. XMP metadata is created during the generation of PDF/A.

■■ Metadata not embedded as plain text: The PDF/A standard stipulates that document information must exist in the XMP area and must not be compressed. However, the XMP metadata in this PDF is compressed. Remedy: The file must either be converted to PDF/A again or the PDF document must be regenerated.

■■ More than one encoding in symbolic TrueType font’s cmap: A symbol font has more than one allocation table, which means that characters cannot be uniquely identified. The PDF/A standard stipulates that a TrueType font that is also a symbol font may only contain a single encoding entry. Otherwise, unique allocation is impossible. Remedy: To resolve this problem, the PDF file must be created anew.

■■ Named action with a value other than standard page navigation used: PDF files can dynamically alter their content during visualization. Actions can be contained in the PDF file for this purpose. The PDF/A standard stipulates that the visualization of a document must be guaranteed and always the same. For this reason, active content is not permitted in PDF/A files. The only exception is elements for page navigation. Remedy: This problem can be solved using the Acrobat PDF Optimizer or pdfaPilot.

■■ NeedAppearances flag present but not set to false: Form fields can be filled with variable content. These initially empty fields can have an entry that defines that they must be filled with variable content (either through user input or input that is determined dynamically such as the system environment or time). Because the PDF/A standard stipulates that the visualization of a document must always be identical whether it is displayed on a monitor or output on a printer, form fields in PDF/A documents must not contain this entry. Remedy: As long as the content in question is not adversely affected, the ‘Flatten form fields’ PDF Optimizer function can be used to correct this error.

■■ Number of PDF/A-1 OutputIntent entries > 1: There are multiple output intents for PDF/A. This is not compliant with the PDF/A standard. Output intents are used to define the colors used in a PDF in accordance with a specific output
procedure (for example, printing or display on a monitor). This uniquely defines color specifications. Consequently, only a single PDF/A output intent may be specified for a PDF/A file. Remedy: Acrobat 8 Preflight has a correction option that removes all output intents from a document. The user has to create a new profile and choose the option ‘Remove Output Intent’ from the list of predefined corrections. Since the correction removes all output intents, the document must then be converted to PDF/A again.

■■ OutputConditionIdentifier missing or empty in PDF/A OutputIntent: The output intent is incomplete. The output condition identifier is missing. Remedy: New PDF/A conversion.

■■ Page description contains invalid operator: The PDF file uses invalid commands for the page description. Every PDF/A file must be a basically correct PDF file. The file that is currently open uses a command in its page description that is not defined in the PDF specification. This file is not a valid PDF file. It is therefore not possible to convert it to PDF/A. Remedy: The problem can possibly be resolved by opening the file in Adobe Acrobat and saving it once again using the ‘Save As’ option. Otherwise, it might be possible to clean it up using the ‘Save Optimized As’ option. If the problem still exists, the PDF file must be regenerated.

■■ PDF contains data after end of file marker: Every PDF file should have an end of file marker. No further data should follow this marker. In this PDF, there is data after the end of file marker. Remedy: The ‘Save As’ command in Acrobat can be used to solve this problem.

■■ PDF contains EF (embedded file) entry: The document contains an entry for an embedded file. In PDF files, other files can be embedded as an attachment in a similar manner as with an e-mail. The corresponding program is required to view these files (for example, Microsoft Word if a Word file is embedded). Because the PDF/A standard stipulates that it must be possible to visualize all components of a PDF file without the aid of other software, file attachments are not permitted in PDF/A files. Remedy: The Acrobat PDF Optimizer contains an option called ‘Discard file attachments’ in the ‘Discard Objects’ area. This option corrects the error.

■■ PDF/A Destination profile version 4 or newer: The file has an output intent, but the ICC profile used is not compatible with PDF/A. Because the PDF/A standard stipulates that colors must appear the same (as far as is technically possible) regardless of the output device, a PDF/A document must either contain only device-neutral colors or have the color properties of the output device defined by an output intent profile. If a document contains DeviceRGB or DeviceCMYK colors, an output intent of the same type must therefore exist. Remedy: New PDF/A conversion.

■■ PDF/A entry missing: No PDF/A entry in the document information. A PDF/A file must have a corresponding entry in its document information. This entry can also be displayed with Adobe Acrobat. To display it, the user must choose ‘Properties’ from the ‘File’ menu in Adobe Acrobat and then click the ‘Additional Metadata...’ button. The entry appears in the ‘Advanced’ section under http://www.aiim.org/pdfa/ns/id/. There must be a ‘pdfaid:part: 1’ entry for the PDF/A version (only version 1 at present) and a ‘pdfaid:conformance:’ entry for the conformity level (PDF/A-1a or PDF/A-1b). The conformity level must be ‘A’ or ‘B’. PDF/A-1a files are always compliant with PDF/A-1b. Remedy: The file can be converted to PDF/A again.

■■ PDF/A OutputIntent has no destination profile: Because the PDF/A standard stipulates that colors must appear the same (as far as is technically possible) regardless of the output device, a PDF/A document must either contain only device-neutral colors or have the color properties of the output device defined by an output intent profile. If a document contains DeviceRGB or DeviceCMYK colors, an output intent of the same type must therefore exist, along with its destination profile. Remedy: Preflight contains a correction option that converts the alternate visualization to CMYK (SWOP). This correction must be duplicated and an RGB color space such as sRGB must be used as the target. The correction can then be assigned to a profile. The alternate visualization of the spot color can then be modified. pdfaPilot can also carry out this task.

■■ Producer mismatch between Document Info and XMP metadata: The PDF ‘Producer’ entry in the XMP document information deviates from the corresponding entry in the document properties. The PDF/A standard stipulates that document information must exist in the XMP area. If this data is also contained in the document properties, it must be identical to the entries in the XMP area. Remedy: New PDF/A conversion.

■■ Prohibited annotation type: A PDF can contain different types of comment. Some of these comment types are intended for multimedia content: Sound and movie comment types. These types of comment cannot be reproduced by printers. The FileAttachment comment type allows files in other formats to be embedded into a PDF. Only specialized visualization systems are able to render these file attachments. These types of comment are not permitted in PDF/A files as they cannot be visualized on all output devices. In ad-
dition, all types of comment that were not specified in PDF PDF/A files. Remedy: Adobe Acrobat Professional (Version 6,
1.4 (from Adobe Acrobat 5 onwards) are not permitted in 7, or 8) includes a flattener module that can be used to remove
PDF/A since PDF/A is based on PDF 1.4. Remedy: The Acro- transparencies.
bat PDF Optimizer provides an option for removing file at-
tachments in the ‘Discard User Data’ area. ■■ Stream object contains F entry: To be visualized in its
entirety, this PDF requires additional files. Because the PDF/A
■■ RGB used but PDF/A OutputIntent not RGB: Device- standard stipulates that a PDF must be complete and must
dependent color (DeviceRGB) is used but no RGB output in- not require any other information for its visualization, this
tent exists. Because the PDF/A standard stipulates that colors PDF cannot be converted to PDF/A. Remedy: It is unusual for
must appear the same (as far as is technically possible) re- external files to be required for the visualization process. In
gardless of the output device, either a PDF/A document must order to trace the cause, a file that produces this error must
only contain device-neutral colors or the color properties of be checked – with Adobe Preflight, for example – to see which
the output device must be defined using an output intent pro- objects are to blame.
file. If a document contains DeviceRGB or DeviceCMYK col-
ors, an output intent of the same type must therefore exist. ■■ Stream object contains FDecodeParams entry: External
Remedy: New PDF/A conversion. files are required for the visualization of the file in question
(FDecodeParams entry). To be visualized in its entirety, this
PDF requires additional files. Because the PDF/A standard stipulates that a PDF must be complete and must not require any other information for its visualization, this PDF cannot be converted to PDF/A. Remedy: It is unusual for external files to be required for the visualization process. In order to trace the cause, a file that produces this error must be checked – with Adobe Preflight, for example – to see which objects are to blame.
■■ RGB used for alt. color but PDF/A OutputIntent not RGB: A spot color has been defined in DeviceRGB but the output intent is not defined for RGB. Because the PDF/A standard stipulates that colors must appear the same (as far as is technically possible) regardless of the output device, either a PDF/A document must only contain device-neutral colors or the color properties of the output device must be defined using an output intent profile. If a document contains DeviceRGB or DeviceCMYK colors, an output intent of the same type must therefore exist. Remedy: New PDF/A conversion.
■■ Scaling factor used: Page contains a zoom factor or downsizing factor. This PDF defines a change of image scale. This particular characteristic was first introduced with Adobe PDF 1.6 (as of Acrobat 7). The PDF/A standard only permits objects that are compatible with PDF 1.4. A change of image scale is therefore not permitted in PDF/A files. Remedy: This problem can be corrected using the Acrobat PDF Optimizer by selecting PDF 1.5 compatibility (Acrobat 6).
■■ SMask entry present with a value other than None: A partially transparent mask is used in this PDF file. Masks hide background objects. They can however be set to ‘transparent’ in PDF files so that objects positioned behind them still remain partially visible. You can set a percentage value on a scale of 0% to 100% to define the extent to which the background of a ‘transparent’ object should be visible. The color values of the foreground mask and background object must be offset against one another for the reproduction of such constructions when displaying files on a monitor or printing them out. However, this method of blending is not clearly defined. The PDF/A standard stipulates that all features used in a PDF file must be displayed in a single unique way on a monitor or in a printout. Because this cannot be ensured in the case of transparent objects and their backgrounds, transparency is not permitted in PDF/A files.
■■ Stream object contains FFilter entry: External files are required for the visualization of the file in question (FFilter entry). To be visualized in its entirety, this PDF requires additional files. Because the PDF/A standard stipulates that a PDF must be complete and must not require any other information for its visualization, this PDF cannot be converted to PDF/A. Remedy: It is unusual for external files to be required for the visualization process. In order to trace the cause, a file that produces this error must be checked – with Adobe Preflight, for example – to see which objects are to blame.
■■ Stream size is above 2 GB: A data stream in this file is too large (the maximum permitted size is 2 GB). Extremely large files may lead to rendering problems when the files in question are printed out or displayed on a monitor. The maximum file size and also the size of internal data objects in PDF/A files is therefore limited to 2 GB. Remedy: No repair possible. It may be possible to recreate the PDF file with a more effective compression type.
■■ Subject mismatch between Document Info and XMP metadata: The ‘Subject’ entry in the XMP document information deviates from the corresponding entry in the document properties. The PDF/A standard stipulates that document information must exist in the XMP area. If this data is also contained in the document properties, it must be identical to the entries in the XMP area. Remedy: New PDF/A conversion.
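The ‘Subject mismatch’ message above (like the corresponding ‘Title mismatch’ message) comes down to comparing two sets of entries. The following Python lines are only a minimal sketch of that comparison, not the way Preflight works internally. They assume that the document properties and the XMP entries have already been read into plain dictionaries; the function name, the dictionary keys, and the sample values are hypothetical.

```python
def find_metadata_mismatches(doc_info, xmp):
    """Return the names of document-info entries that differ from the XMP entries.

    Both arguments are plain dictionaries (an assumption made for this sketch).
    An entry that exists in the document properties but is missing from the
    XMP area is also reported, because the XMP area must contain it.
    """
    mismatches = []
    for key, info_value in doc_info.items():
        if xmp.get(key) != info_value:
            mismatches.append(key)
    return sorted(mismatches)


# Hypothetical sample values
doc_info = {"Title": "Annual Report", "Subject": "Finance"}
xmp = {"Title": "Annual Report", "Subject": "Finances 2007"}

print(find_metadata_mismatches(doc_info, xmp))
```

An empty result means that every entry in the document properties has an identical counterpart in the XMP area.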
■■ Text cannot be mapped to Unicode: This message only appears when validating PDF/A-1a (and not when validating PDF/A-1b). This PDF file contains characters that could not be allocated to a Unicode ID since the relevant information is not available in the PDF file. Remedy: In order to resolve this problem, the PDF file must be created anew, using a different font. Alternatively, it might be possible to convert the file to PDF/A-1b instead.
■■ Title mismatch between Document Info and XMP metadata: The ‘Title’ entry in the XMP document information deviates from the corresponding entry in the document properties. The PDF/A standard stipulates that document information must exist in the XMP area. If this data is also contained in the document properties, it must be identical to the entries in the XMP area. Remedy: This information can be opened using the ‘File’ menu and changed in the general Document Properties dialog.
■■ TR2 entry used with value other than Default: Underlying gradation curve. The color values of a page object (text, images, graphics, and so on) can be changed with the aid of gradation curves (or ‘transfer curves’). On the basis of a ‘transfer curve’, a new value is determined for every color value when a document is displayed on a monitor or printed out. This value is then displayed or printed in place of the ‘original value’. Because not all devices can handle transfer curves, they are not permitted in PDF/A files. Remedy: The Preflight ‘Apply transfer curves’ function corrects this error.
■■ Transparency used: Objects in this PDF file are defined as ‘transparent’. The PDF/A standard stipulates that all features used in a PDF file must be displayed in a single unique way on a monitor or in a printout. Because this cannot be ensured in the case of transparent objects and their backgrounds, transparency is not permitted in PDF/A files. Remedy: Adobe Acrobat Professional (Version 6, 7, or 8) includes a flattener module that can be used to remove transparencies.
■■ Type 2 CID font: CIDToGIDMap invalid or missing: Not all characters (glyphs) can be allocated to this font (CIDToGIDMap is missing or incorrect). A font in this PDF does not have a complete allocation table for allocating character codes to character representations in the font. In PDF/A files, the reliable allocation of codes to character representations must be ensured. Remedy: In order to resolve this problem, the PDF file must be created anew, using a different font.
■■ Uses NChannel color: An object in this file uses the NChannel color space, which is not allowed in PDF/A. The document defines colors in the ‘NChannel’ color space, introduced with Acrobat 7 (PDF 1.6). NChannel is an extension of the DeviceN color space, permitted since PDF 1.3. Both color spaces allow the specification of color values in multi-channel color spaces in which spot colors, for example, may also be used. The PDF/A standard does not support objects that were not permitted in the PDF 1.4 specification that was published by Adobe with Acrobat 5. Remedy: Use the Acrobat PDF Optimizer to save the PDF file as a PDF 1.4 version.
■■ Uses OpenType font: OpenType fonts may not be used in PDF/A files. There are various common font formats (PostScript Type1, Type3 or TrueType). They can be embedded in all PDF versions. As of PDF 1.5 (Acrobat 6), OpenType format fonts can also be embedded. This font format is used by one of the fonts embedded in this PDF file. The PDF/A standard only permits the use of objects that are compatible with PDF 1.4. Fonts in OpenType format must therefore not be used or embedded. Remedy: In order to resolve this problem, the PDF file must be created anew, using a different font.
■■ Width information for glyphs incomplete: Width information is missing for some characters in a font used in this document. The letters and other characters used in PDF texts require ‘fonts’ that determine their exact appearance when visualized. The characters used in the text are then depicted and arranged in accordance with the representation stored in the font. The precise position of each character depends upon the tracking of the previous character. The PDF/A standard stipulates that width information must be available for every single character used in a document. Remedy: To resolve this problem, the PDF file must be created anew.
■■ Width information for glyphs is inconsistent: Deviating specifications exist for character width. The letters and other characters used in PDF texts require ‘fonts’ that determine their exact appearance when visualized. The characters used in the text are then depicted and arranged in accordance with the representation stored in the font. The precise position of each character depends upon the width of the preceding symbol. The width specification for any one character is defined both in the font that is embedded in the PDF and in the PDF itself. The PDF/A standard stipulates that the width specifications in the embedded font and in the PDF file must be identical. Remedy: To resolve this problem, the PDF file must be created anew.
■■ Wrong encoding for non-symbolic TrueType font: This font does not use standard character to symbol allocation (MacRoman or WinAnsi). This PDF can therefore not be converted to PDF/A. Remedy: In order to resolve this problem, the PDF file must be created anew, using a different font.
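The two width messages above describe two separate conditions: a used character has no width entry in the PDF, or the width in the PDF differs from the width in the embedded font. The following Python lines are a minimal sketch of both checks, not the Preflight implementation. They assume that the widths have already been read into dictionaries keyed by character code; the function name and all sample values are hypothetical.

```python
def check_glyph_widths(used_codes, pdf_widths, font_widths):
    """Classify used character codes by their width information.

    missing:      codes used in the text without a width entry in the PDF
    inconsistent: codes whose width in the PDF differs from the embedded font
    """
    missing = []
    inconsistent = []
    for code in used_codes:
        if code not in pdf_widths:
            missing.append(code)
        elif code in font_widths and pdf_widths[code] != font_widths[code]:
            inconsistent.append(code)
    return missing, inconsistent


# Hypothetical sample values: character codes mapped to widths
used_codes = [65, 66, 67]
pdf_widths = {65: 722, 66: 667}
font_widths = {65: 722, 66: 650, 67: 722}

missing, inconsistent = check_glyph_widths(used_codes, pdf_widths, font_widths)
print("missing:", missing)
print("inconsistent:", inconsistent)
```

In this sample, code 67 has no width entry in the PDF and code 66 carries two different widths, so each of the two messages would apply to one character.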
Glossary
Explanation of terms relating to PDF/A
■■ Accessibility: In the digital world, accessibility aims to ensure that users with impaired vision, restricted neuromuscular skills, and other disabilities can also take part in the exchange of information. Web pages and other files must be designed in a way that provides a clear flow structure to help screen readers reproduce their content correctly. PDF files can be both accessible and PDF/A-compliant at the same time. Accessibility is increasingly regulated by legislation in the US and Europe.
■■ Adobe Acrobat: Program for creating and processing PDF files. Version 1 was introduced by → Adobe Systems in 1993. The current version is Version 8. Acrobat Standard, Professional, and Elements offer different features but all belong to the Acrobat family. Acrobat Professional includes Adobe → Distiller, which can create PDF documents from → PostScript and EPS data.
■■ Adobe Reader: Adobe Reader (previously called Acrobat Reader) is Adobe’s free → PDF viewer. This program runs on various computer and mobile-device platforms and has been downloaded from the company’s sites millions of times. The free distribution of this program has contributed to the success of the PDF format. Adobe Reader 8 also allows form files to be saved if the creator of the documents has enabled the function in Acrobat Professional.
■■ Adobe Systems: US software company founded in 1982 by John Warnock and Charles Geschke. Warnock and Geschke developed the → PostScript format for printing files. The name ‘Adobe’ refers to a type of clay or the brick made from it, and a river called Adobe Creek flows near the company’s headquarters. Their well-known products include Photoshop, Illustrator, InDesign, and Acrobat. PDF was developed by Adobe.
■■ CCITT Group 4: The ‘Comité Consultatif International Téléphonique et Télégraphique’ (International Telephone and Telegraph Consultative Committee) developed this lossless compression procedure for black and white images (line art) for use when sending faxes.
■■ CMYK: This abbreviation stands for Cyan, Magenta, Yellow, and Key (= black). Different sized dots in these four colors can be distributed in various ways to realistically depict most color images as well as graphics and text. However, fluorescent colors and other shades cannot be displayed well using CMYK. Spot colors are used to display these shades.
■■ Color management: This technology aims to enable the uniform display of colors regardless of whether they appear on a monitor or in proofs, newspaper printouts, or art printouts. Color profiles (usually → ICC profiles) are very important for color management – they enable the device-independent display of colors. Color management encompasses all production stages from digitalization using a scanner or digital camera to editing and displaying the result on a monitor or printing it out.
images compressed in JPEG. However, other elements of a PDF file that are not components of the page description can also be compressed. As of PDF 1.6, these elements can even be merged together into a compressed object (cross-object compression).
■■ Conversion: The term ‘conversion’ refers to changing a file from one file format to another.
■■ Digital signature: Electronic signatures are important in many fields of business and administration. They are used to identify the originator of a document as well as the read and usage authorizations of its recipient. Digital signatures must be suitable for encryption. They must also be impossible to falsify. They can be managed and used in PDF documents using programs such as Acrobat and → Adobe Reader. The use of PDF/A with digital signatures requires a precisely planned process flow.
The Distiller enables the creation of PDF/A-1b documents. It is not possible to convert files to PDF/A-1a because the structures required for compliance with the stricter compliance level cannot be adopted or generated.
■■ DMS: This abbreviation stands for Document Management System. It encompasses the management, in electronic systems, of documents that used to be printed out on paper. DMS is an important component of electronic document archiving systems.
■■ Document scanner: These special devices are for creating large document sets in the shortest amount of time possible. Document scanners enable entire batches of documents to be digitalized (both the front and back of pages). Scanned material is increasingly stored as PDF files. If the files in question are to be archived, it makes sense to save them as PDF/A files.
■■ Document properties: The document properties (document info) for PDF files contain four entries: Title, Author, Subject, and Keywords. These entries constitute basic → metadata information. In → Adobe Acrobat and → Adobe Reader, document info can be displayed by pressing Ctrl+D.
Document Properties in Acrobat 8 Professional. This area displays information such as the title and author of a document.
■■ Glyph: A glyph is a graphical representation of a character. A character is an abstract concept of a letter or symbol. A glyph is the actual graphic used to represent it.
■■ ICC profile: ICC profiles are important features of → color management. An ICC profile is a data record that describes the color space a device (a monitor, printer, scanner, or similar device) uses to specify or reproduce colors. ICC stands for International Color Consortium, a group that consists of manufacturers of graphics, image editing, and layout programs.
■■ Image resolution: A digital image consists of pixels (image elements). The number of pixels per inch determines the quality of an image. Because they carry more information, high-resolution images have larger file sizes. The usual screen resolution is 72 ppi (pixels per inch – 1 inch is 2.54 centimeters). 300 ppi is often used for printing.
The left-hand image above is rendered in 72 ppi; the right-hand image has a resolution of 300 ppi. Both images have been significantly magnified.
procedure was the predecessor of → JBIG (bi-level images, black and white files) and → JPEG2000 (improved compression).
■■ JPEG2000: An image compression standard that, like → JPEG, was developed by the Joint Photographic Experts Group. JPEG2000 supports both lossless and lossy compression. This image file format can include a range of metadata that facilitates file management and makes it easier to find images on the Internet. JPEG2000 is not permitted by PDF/A-1a and -1b, but PDF/A-2 will support it.
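The figures given in the ‘Image resolution’ entry above (72 ppi for screens, 300 ppi for printing, 1 inch = 2.54 centimeters) can be turned into a small calculation of how many pixels an image needs along one edge. The following Python lines are a minimal sketch; the function name and the sample length of 10.16 cm (4 inches) are hypothetical choices for illustration.

```python
CM_PER_INCH = 2.54  # 1 inch is 2.54 centimeters


def pixels_needed(length_cm, ppi):
    """Return the number of pixels needed to cover length_cm at the given resolution."""
    length_inch = length_cm / CM_PER_INCH
    return round(length_inch * ppi)


# An image edge of 10.16 cm (4 inches) at screen and at print resolution
print(pixels_needed(10.16, 72))
print(pixels_needed(10.16, 300))
```

The same edge needs roughly four times as many pixels at 300 ppi as at 72 ppi, which is why high-resolution images have larger file sizes.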
Instead, it is an auxiliary module that contains elements required to make programs available.
■■ LZW: An older, lossless image compression procedure from the 1970s/1980s. It is named after its creators – Abraham Lempel, Jacob Ziv, and Terry A. Welch. This procedure is not supported by PDF/A because it was subject to licensing restrictions for a considerable period of time.
■■ Metadata: A digital document can have additional data on its properties. This information is called metadata. Metadata provides information on attributes such as the author of a file and the title of a document. It enables documents to be categorized by keywords and supports the addition of copyright information. Adobe products use modern → XMP metadata.
ment of an output intent can modify colors in line with the requirements of a different output device. For example, since Version 6, Adobe Acrobat has displayed a PDF with an output intent for offset printing differently to how it would display the same PDF with an output intent for newspaper printing.
■■ PDF: This abbreviation stands for ‘Portable Document Format’. It is a platform-independent, open file format that has been developed by → Adobe Systems since 1993. Like a container, a PDF document can contain diverse elements: images, text, sound, movies, 3D objects, form elements, and many more. The functional scope of PDF is constantly being enhanced. The current version is PDF specification 1.7, which was introduced with → Adobe Acrobat 8.
■■ PDF viewer: Program for displaying PDF documents. In addition to the → Adobe Reader, such programs include the ‘Preview’ program that belongs to the current version of Apple’s operating system. There are both free PDF viewers for various platforms and viewers that must be purchased.
■■ PDF version: PDF is constantly being developed. With each new Acrobat version, Adobe publishes a new PDF specification. The document containing the specification is called a ‘PDF Reference’. PDF 1.7 has been available since the rollout of Acrobat 8 (tip: to determine the corresponding Acrobat version, add one to the digit after the decimal point of the PDF version number – for example, PDF 1.3 belongs to Acrobat 4).
■■ PDF/A: Standard developed by → ISO, the International Organization for Standardization, especially for the long-term archiving of PDF files. The PDF/A-1 standard was adopted under the name ISO 19005-1:2005 in 2005. This first version used the PDF 1.4 specification to define the elements that are permitted in PDF/A files. PDF components that were only introduced in later versions of the PDF specification are therefore prohibited in PDF/A files. Such components must be modified or removed. The PDF/A-2 standard, which is already being compiled, will be based on a more recent PDF specification.
■■ Preflight: This plug-in, which is delivered with Acrobat, is a tool for checking PDF files. It is developed by the Berlin-based company callas software. As of Acrobat 8, Preflight can carry out corrections as well as checking PDFs. Acrobat also uses Preflight to carry out all of its PDF/A validations and conversions. In addition to using the validation and verification profiles delivered with Preflight, users can also create their own profiles.
■■ RGB: This color space consists of the primary colors red, green, and blue and is used for displaying documents on color monitors. The additive color model has 256 grades (0 to 255) for each of these three basic colors. White is created if all three components have the value 255; black is formed if they all have the value 0.
■■ sRGB: sRGB (standard RGB) color space. It was jointly developed in 1996 by Hewlett-Packard and Microsoft.
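The tip in the ‘PDF version’ entry above (PDF 1.3 belongs to Acrobat 4, PDF 1.7 to Acrobat 8) can be written as a small helper. The following Python lines are a minimal sketch; the function name is hypothetical, and the rule is only assumed to hold for the PDF 1.x versions named in this book (1.3 to 1.7).

```python
def acrobat_version_for(pdf_version):
    """Return the Acrobat version that introduced the given PDF 1.x version.

    Applies the rule of thumb from the glossary: take the digit after the
    decimal point and add one. Only PDF 1.3 to 1.7 are accepted here.
    """
    major, minor = pdf_version.split(".")
    if major != "1" or not 3 <= int(minor) <= 7:
        raise ValueError("rule of thumb only covers PDF 1.3 to 1.7")
    return int(minor) + 1


for version in ["1.3", "1.4", "1.5", "1.6", "1.7"]:
    print("PDF", version, "-> Acrobat", acrobat_version_for(version))
```

For PDF 1.4, the version on which PDF/A-1 is based, the helper returns Acrobat 5.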
Not yet validated: The Acrobat Preflight tool can be used to check the validity of PDF/A documents.
Tags in Acrobat Professional: All elements of a tagged PDF file are given marks that clearly assign them to a content type and style. Tags also control their sequence.
About: The PDF/A Competence Center
Association for Digital Document Standards – ADDS
The PDF/A Competence Center is an initiative of the Association for Digital Document Standards (ADDS) e.V., founded in September 2006. A particularly important aim of the association is to promote the exchange of information and experience in the area of long-term archiving in accordance with ISO 19005 (PDF/A).
The new ISO standard for long-term archiving, PDF/A, is generating considerable interest in the market. In order to meet the high demand for information and to encourage the exchange of ideas concerning PDF/A, callas software GmbH, Compart Systemhaus GmbH, LuraTech Europe GmbH, PDF Tools AG and PDFlib GmbH have founded the PDF/A Competence Center.
The executive chairman is Thomas Zellmann, a managing partner of LuraTech. Dr. Hans Baerfuss, CEO of PDF Tools AG, Switzerland, is the executive vice-chairman.
The association is geared towards developers of PDF solutions, companies that work with PDF/A in the area of DMS/ECM, interested individuals, and also users who want to implement PDF/A in their organizations. Although the months directly after the founding saw new members predominantly from German-speaking regions, the executive committee has expanded its activities internationally beginning in 2007.
Interested parties can thus benefit from the combined knowledge of competent PDF/A suppliers. The newly founded association offers numerous services including conducting events, working on further standardizations and serving as a central competent point of contact for answering all questions about PDF/A.
Work on the ISO Standard
Several members of the PDF/A Competence Center are technically oriented and actively participate in the further development of the PDF/A standard as members of the responsible ISO committee (ISO TC 171 – Document management applications).
Member companies test each other’s products for compliance with the ISO standard and compatibility in order to guarantee a high level of quality. It is planned to also offer test suites and compliance checks for products from other suppliers. This happens in the context of the Technical Working Group (TWG).
Events around the PDF/A Standard
In order to meet the high informational needs around PDF/A in the market, the PDF/A Competence Center organizes seminars and events in different locations on a regular basis.
For details about current activities, please check the Events page at pdfa.org on the Internet.
AIIM
The Enterprise Content Management Association
AIIM is the international authority on Enterprise Content Management (ECM). ECM comprises the technologies used to capture, manage, store, preserve, and deliver content and documents related to organizational processes. ECM tools and technologies provide solutions to help users with the four C’s of business: Continuity, Collaboration, Compliance, and Costs.
AIIM provides:
■■ Market Education: AIIM provides unbiased information through its ECM Solutions Seminar (held throughout the U.S. and Canada); the Managing Information and Documents Road Show (held throughout the UK); InfoIreland (held in Dublin); AIIM Webinars; AIIM E-DOC Magazine and M-iD (Managing Information and Documents Magazine) – the leading industry print publications in North America and the UK; and our online Solution Centers for financial services, healthcare, and state & local government.
■■ Professional Development: AIIM’s industry education road map offers business and government professionals a variety of training opportunities. Our ECM & ERM Certificate Programs provide instruction on the Why?, What?, and How? of Enterprise Content Management and Electronic Records Management via Web-based and/or classroom courses.
PDF/A in a Nutshell – Long Term Archiving with PDF
PDF/A is the PDF for long-term archiving. PDF/A – which was adopted at the end of 2005 – is the first file format which, since it is an ISO standard, guarantees that documents created today will also be able to be opened and used in the future. ‘PDF/A in a Nutshell’ allows the user to take a look behind the scenes of the standard and provides practical instructions on generating PDF/A that conforms with the stipulations of the standard in his or her working environment. This book also serves as a comprehensive introduction to a subject matter that is still very new as well as providing practical examples for different software tools and industry solutions able to generate and work with PDF/A.
Extracts from the content of the book:
■ Why PDF/A?
■ The PDF/A-1a and PDF/A-1b conformity levels
■ PDF/A with Acrobat 8 Professional
■ Archive PDFs from Microsoft Office 2003 and 2007
■ Scanning documents to create PDF/A and applying text recognition
■ High-volume PDF/A creation
■ Validating PDF/A
■ Accessible PDF/A documents
■ Future-proof contracts
■ Forms in PDF/A
■ Fonts and images in PDF/A
■ Reliable colors on monitors and when printing
The authors:
Olaf Drümmer:
Olaf Drümmer is the co-author of ‘Postscript- und PDF-Bibel’, and has played a crucial role in the standardization of PDF/X (since 1999) and PDF/A (since 2002). He is a member of several international institutions and associations: DIN, ECI, Ghent PDF Workgroup, PDF/A Competence Center, and PDF/X-ready. Olaf Drümmer is CEO of callas software GmbH. callas software develops the Preflight functions integrated in Acrobat since Version 6 (2003).
Alexandra Oettler:
Alexandra Oettler is a technical writer who has worked as a freelance journalist specializing in software for many years. She regularly has articles on DTP software in practice published in specialist prepress journals. She also writes user manuals for PDF and prepress programs and teaches software training courses. As the editor in chief of the pdfnews.de Web site, she provided German-speaking readers with daily information on new products, schedules, and tips between 2001 and 2004.
Dietrich von Seggern:
After completing his university studies in print technology, Dietrich von Seggern worked as a prepress manager. He worked on research projects related to the transmission of digital print data. Later, he became the manager of the digital advertisement transmission department at the marketing organization of the German newspaper publishers (ZMG). He has been working as the head of Product Management at callas software GmbH in Berlin for several years now.
ISBN 978-3-9811648-1-7