Putting JHOVE to the acid test: A PDF test-set for well-formedness validation in JHOVE

21st Nov 2017 : 10:00 AM - 11:00 AM

In digital preservation, we rely on automation and tools for some of our most crucial tasks, such as format identification and validation. One of the most widely used tools for format validation is JHOVE. Because no other validation tool checks the well-formedness and validity of plain PDF files, the quality and reliability of JHOVE’s PDF module is especially important. Unfortunately, the absence of comparable tools also means that JHOVE’s PDF validation cannot be checked by benchmarking it against other tools.

As of today, there is no ground-truth data set that can be used to understand and test PDF validation at the structural level. In this webinar, we present a corpus of lightweight files designed to test the well-formedness validation criteria of JHOVE’s PDF module. Based on the findings from checking this data set with JHOVE, we give an overview of how reliable JHOVE is, what works well, and where inconsistencies remain.

Session leads

Yvonne Tunnat, Deutsche Zentralbibliothek für Wirtschaftswissenschaften
Michelle Lindlar, Technische Informationsbibliothek

Date and time

Tuesday 21 November at 10:00 GMT / 11:00 CET. The webinar will last approximately one hour.


Open Preservation Foundation