Splitting PDF Invoices by Invoice Number
Introduction
It is a common task to split a PDF document that contains multiple invoices (or similar documents) into separate files. The AutoSplit™ software can split PDF documents that contain variable-length invoices, account statements, and reports into separate PDF files. The following tutorial shows how to split a PDF file that contains multiple invoices. Each output PDF file will contain only a single invoice and will be named using the corresponding invoice number.
Input Document Description
The input PDF document contains multiple invoices of variable length. The goal is to split it into multiple documents that contain each invoice as a separate file and name it, using invoice numbers from the first page of each document for file names.
Splitting Approach
It is common that each invoice has its separate page numbering, typically in a form of "Page N of M" text pattern. The easiest method to split such documents into separate files is to use "Page 1 of " or “1 of N” text as a separator. Since this text always occurs on the first page of each invoice, it is natural to use it as a reliable separator. Using “Page 1 of” is the very common method for splitting invoices or statements.
Output Results
Each invoice is now saved as a separate PDF file and named by the corresponding invoice number.