The metadata in the PDFs doesn't contain reliable creation dates for these documents. They could be 80 year old documents that have been recently digitized or the editing of the PDFs might have altered their creation date. To overcome this, we want to try using a VLM or other methods to extract the document date from the first/last page of the PDF.