Hi Caylin. Thanks for posting this. It's always interesting to see approaches to this problem of validating large collections.
From your post, it looks like you’ve applied this method to at least the collection of 1,111 PDF files. Could you share your findings? I’d be very interested to see the numbers around your impact levels and any other insights the process gave you.