Splitting PDF Documents By a Keyword List

Introduction

It is often necessary to split a PDF document at pages that contain specific keywords. The AutoSplit™ software can search a PDF document and check every page for the presence of user-specified keywords. If at least one keyword appears on a page, that page is marked as a splitting page. The document is then split at these pages, and multiple PDF documents are created.
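The page-marking rule described above can be illustrated with a minimal Python sketch. This is not AutoSplit™ code: the function name `find_split_pages`, the sample page text, and the use of plain case-sensitive substring matching are all assumptions made for illustration, and the text of each page is assumed to have been extracted already.

```python
def find_split_pages(page_texts, keywords):
    """Return 0-based indices of pages whose text contains at least one keyword.

    Hypothetical helper: plain substring matching is an assumption,
    not a description of how AutoSplit matches keywords.
    """
    return [
        index
        for index, text in enumerate(page_texts)
        if any(keyword in text for keyword in keywords)
    ]


# Made-up page text for a four-page document.
sample_pages = [
    "Cover page ABC-200001",
    "Body text ABC-200002",
    "Body text ABC-200003",
    "Exhibit ABC-200004",
]
print(find_split_pages(sample_pages, ["ABC-200001", "ABC-200004"]))  # prints [0, 3]
```

A page is marked as soon as any single keyword matches, which mirrors the "at least one keyword" rule.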

Input Document Description

The sample PDF document used in this tutorial contains 20 pages with Bates numbers from ABC-200001 to ABC-200020 in the lower right corner of each page. The goal is to split the PDF document at pages with specific Bates numbers from a user-specified list and to name each output PDF document using the corresponding Bates number.

Splitting Approach

We are going to use the "Page with Keywords From List" separator option to split the PDF document at pages that contain any of the following Bates numbers (keywords): ABC-200001, ABC-200004, ABC-200005, ABC-200008, ABC-200012.

Output Results

The AutoSplit™ plug-in has split the input PDF document at the pages with the specified Bates numbers and created 5 output PDF documents:

The first document with pages from ABC-200001 to ABC-200003.
The second document with page ABC-200004 only.
The third document with pages from ABC-200005 to ABC-200007.
The fourth document with pages from ABC-200008 to ABC-200011.
The fifth document with pages from ABC-200012 to the last page of the input PDF document.
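
The grouping listed above can be reproduced with a short Python sketch. It only models the page ranges and does not read or write PDF files. The function name `group_pages` and the `<Bates number>.pdf` naming pattern are assumptions made for illustration. Starting a leading group when the first page carries no keyword is also an assumption, since that case does not arise in this tutorial.

```python
def group_pages(bates_numbers, split_keywords):
    """Group consecutive pages so that each splitting page starts a new group.

    Hypothetical helper that models the result only; it is not AutoSplit code.
    """
    keywords = set(split_keywords)
    groups = []
    for bates in bates_numbers:
        # Start a new group at a splitting page, or at the very first page
        # (assumed behavior when the first page has no keyword).
        if bates in keywords or not groups:
            groups.append([bates])
        else:
            groups[-1].append(bates)
    return groups


# The 20-page sample document: ABC-200001 through ABC-200020.
pages = [f"ABC-{200000 + n}" for n in range(1, 21)]
keywords = ["ABC-200001", "ABC-200004", "ABC-200005", "ABC-200008", "ABC-200012"]

for group in group_pages(pages, keywords):
    # Assumed naming pattern: the Bates number of the group's first page.
    print(f"{group[0]}.pdf: {group[0]} to {group[-1]}")
```

Run against the sample data, the sketch yields five groups of 3, 1, 3, 4, and 9 pages, matching the five output documents listed above.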