Blogs: PDF

Blog posts filtered by the PDF subject tag.

The tech clinic offers OPF members the opportunity to book one-to-one online sessions to discuss any technical aspects of their work. This might include: getting started with OPF tools, e.g. installation and basic usage; help with integrating open source tools into local automated workflows and systems; investigating problems and issues with open source tools; assistance […]

By Becky, posted in Becky's Blog

14th Dec 2017  10:00 AM  0 Reads  No comments

In digital preservation we rely on automation and tools for some of our most crucial tasks like format identification and validation. One of the most widespread tools for format validation is JHOVE. As there is no other validation tool which checks the well-formedness and validity of plain PDF files, the quality and infallibility of JHOVE’s […]

By Becky, posted in Becky's Blog

21st Nov 2017  10:00 AM  0 Reads  No comments

The PREFORMA project that commissioned and funded the development of veraPDF draws to an end this year. Recent activity has been focused on PREFORMA acceptance testing, formalising the decisions of the PDF Association’s Technical Working Group and fixing issues reported by the community. In this webinar, we’ll be presenting the results of recent development and […]

By Becky, posted in Becky's Blog

19th Sep 2017  2:00 PM  0 Reads  No comments

Course Overview: If you have a digital preservation strategy that involves digital files, you’ll know how important it is to understand the file formats in which your data is encoded. To do this comprehensively involves at least three main operations: identifying the format, characterising the format, and validating the format. To put it another way, […]

By Becky, posted in Becky's Blog

18th Jul 2017  12:00 AM  0 Reads  No comments

Some four years ago I wrote a blog post that demonstrated how Apache Preflight (the PDF/A validator tool that is part of Apache PDFBox) can be used to detect features in a PDF that are potential preservation risks. A follow-up blog applied Schematron rules to the Preflight output in an attempt at doing policy-based assessments. […]

By johan, posted in johan's Blog

1st Jun 2017  1:53 PM  750 Reads  No comments

The PDF Association is pleased to announce its first event in the southern hemisphere! With special focus on archiving and accessibility, PDF Day Australia is a unique opportunity for those in the Asia / Pacific region to engage with the world’s leading PDF technology experts on a broad range of subjects. Comprising 17 sessions in […]

By Becky, posted in Becky's Blog

25th Nov 2016  12:00 AM  0 Reads  No comments

On 11th October we held our first JHOVE online hack day. Our aim was to catalogue error messages produced by JHOVE to get a better understanding of their meaning and potential preservation impact. Background: organising an online hack day. We have been considering running online hackathons because attending face-to-face events has become more difficult as […]

By Becky, posted in Becky's Blog

19th Oct 2016  10:06 AM  1625 Reads  No comments

The PREFORMA project is running a webinar series throughout September. The final webinar focuses on veraPDF. Overview: The PREFORMA project’s prototyping phase finishes at the end of 2016. The veraPDF consortium will be producing a v1.0 release candidate of the software library and applications in December 2016. This webinar demonstrates the current state of veraPDF […]

By Becky, posted in Becky's Blog

22nd Sep 2016  2:00 PM  0 Reads  No comments

Inspired by Jenny Mitcham’s blog post about developing her first file format signature, I thought it would be fun to take a crack at creating one myself. I previously dipped my toe into the world of contributing to PRONOM by looking at a few mis-identifications and multi-identifications, but I had yet to create a file […]

By Andrea Byrne, posted in Andrea Byrne's Blog

8th Sep 2016  9:14 AM  1827 Reads  4 Comments

Practical experiences with archiving PDF files: This webinar deals with archiving PDF files. As PDF files in our repository have myriad data producers, the heterogeneity of PDF files is overwhelming. Unfortunately, this heterogeneity also produces errors. Usually, the original data producers can no longer be contacted, so we have to do the […]

By Becky, posted in Becky's Blog

20th Jul 2015  1:00 PM  0 Reads  No comments