PDF Accessibility (PDF UA, WCAG, Section 508) Support - danfickle/openhtmltopdf GitHub Wiki
Note: PDF/UA only documents have not been compliant in recent releases due to missing Dublin Core metadata (see #707. This will be fixed in version 1.0.10
. In the meantime PDF/UA documents which also use PDF/A compliance should continue to work.
This project is now able to produce PDF documents which are accessible to screen readers, in compliance with standards like PDF/UA, Section 508 and WCAG 2. As well as being important to screen-reader users, many governments now require that PDF documents submitted be accessible.
<html lang="EN-US">
<head>
<title>All-in-one PDF/UA Testcase</title>
<meta name="subject" content="PDF/UA all-in-one"/>
<meta name="author" content="openhtmltopdf.com team"/>
<meta name="description" content="An example containing everything for easy testing"/>
<bookmarks>
<bookmark name="Simple Paragraphs" href="#para"/>
<bookmark name="Lists" href="#lists">
<bookmark name="Ordered" href="#ordered"/>
<bookmark name="Unordered" href="#unordered"/>
</bookmark>
<bookmark name="Images" href="#images"/>
<bookmark name="Links" href="#links"/>
<bookmark name="Tables" href="#tables"/>
<bookmark name="Backgrounds" href="#backgrounds"/>
<bookmark name="Conclusion" href="#conclusion"/>
</bookmarks>
<style>
@page {
margin: 30px 20px;
@top-center {
font-family: 'TestFont'; /* Font provided with builder. */
font-size: 16px;
color: blue;
content: "This is PDF/UA page " counter(page) " of " counter(pages) ".";
}
}
body {
margin: 0;
font-family: 'TestFont'; /* Font provided with builder. */
font-size: 15px;
}
</style>
</head>
<body style="">
<h1 id="title">All-in-one accessible (PDF/UA, Section 508, WCAG) PDF example</h1>
<h2 id="para">Simple paragraphs</h2>
<p>Paragraph one. Some text that goes over multiple lines. OK, this is getting to the required length. Need another sentence to get there in the end.</p>
<p>Paragraph two. Some text that goes over multiple lines. OK, this is getting to the required length. Need another sentence to get there in the end.</p>
<p>Paragraph three. Some text that goes over multiple lines. OK, this is getting to the required length. Need another sentence to get there in the end.</p>
<h2 id="lists">Lists</h2>
<h3 id="ordered">Ordered</h3>
<ol>
<li>One</li>
<li>Two</li>
<li>Three</li>
</ol>
<h3 id="unordered">Unordered</h3>
<ul>
<li>Bullet item one</li>
<li>And two</li>
<li>And three</li>
</ul>
<h2 id="images">Images</h2>
<img src="../../demos/images/flyingsaucer.png" alt="The FlyingSaucer logo. We should get our own!"/>
<h2 id="links">Links</h2>
<p>This is an external link to the project <a title="The openhtmltopdf.com homepage" href="https://openhtmltopdf.com">homepage</a>.</p>
<p>This is an internal link to the <a title="Go to top" href="#title">top</a> of the document.</p>
<h2 id="tables">Tables</h2>
<table>
<caption>Simple table example with fake data</caption>
<thead>
<tr><th>Col One</th><th>Col Two</th></tr>
</thead>
<tbody>
<tr><td>One</td><td>Two</td></tr>
<tr><td>Three</td><td>Four</td></tr>
<tr><td>Five</td><td>Six</td></tr>
</tbody>
<tfoot>
<tr><td>Footer1</td><td>Footer2</td></tr>
<tr><td>Footer3</td><td>Footer4</td></tr>
</tfoot>
</table>
<h2 id="backgrounds">Backgrounds</h2>
<div style="background-color: red; height: 40px; border-radius: 10px; border: 1px solid gray;">
<p>Some text on a background. Remember to use a good contrast if using background colors.</p>
</div>
<h2 id="conclusion">Conclusion</h2>
<p>Remember to keep it simple for PDF/UA compliance.</p>
</body>
</html>
try (FileOutputStream os = new FileOutputStream("/path/to/doc.pdf")) {
PdfRendererBuilder builder = new PdfRendererBuilder();
builder.useFastMode(); // required
builder.usePdfUaAccessbility(true); // required
builder.usePdfAConformance(PdfRendererBuilder.PdfAConformance.PDFA_3_U); // may be required: select the level of conformance
// Remember to add one or more fonts.
builder.useFont(new File("path/to/font.ttf"), "TestFont");
builder.withHtmlContent(html, baseUrl);
builder.toStream(os);
builder.run();
}
- Provide a
lang
attribute on thehtml
element with the document language as an ISO code. - Provide a proper title in the
title
element. - Provide a meta description, subject and author.
- Provide bookmarks linking to the sections of your document, especially for documents larger than a handful of pages.
- Provide a page header or footer with page number information.
- Content in page margins and fixed position elements is marked as pagination artefacts so avoid putting critical information in such places.
- If using background colors or images, remember to use a good contrast for text over them.
- Images and other replaced elements like SVG are required to have an
alt
attribute with alternate text. - Links are required to have a
title
attribute with a description of the link target. - PDF/UA prohibits the use of built-in fonts so make sure to provide a font and specify it for the
body
element and any page margin rules (eg.@top-center
). - Avoid using
overflow:hidden
. The PDF specification makes it very hard to include clips in tagged content and this implementation is not expected to be clip compatible. - Avoid using transforms in content (you can use them in running boxes such as the page margin and fixed position elements).
- Avoid using
position
withrelative
orabsolute
. Out-of-flow content upsets the reading order. Again you can use them in running elements. - Use the
float
property with caution, it can upset the reading order of the document. - Avoid using
table
elements for layout. Instead, if desiring a table based layout, use adiv
withdisplay: table
,display: table-row
, etc. - Avoid, if possible, images over two or more pages.
- Nest header levels correctly (
h1
,h2
, etc). - This implementation does not yet cover form controls, such as the
input
element.
- Test your output documents with the PDF Accessibility Checker or PAC. This is Windows software and is free (though not open-source). Pay special attention to the screen-reader preview (making sure the text in the proper order and markup).
- Test with Acrobat Pro, if available.
- Also consider testing with actual screen-reader software.
You can see:
This PDF/UA implementation is indebted to the work of @chris271 at StackOverflow and Github.