This article explains how to split multi-page documents supported by Parseur — including PDFs and TIFF images — into individual documents.
Why Split Multi-Page Documents?
Sometimes, your PDF or TIFF image contains multiple documents bundled into a single file. While you can create a template or use Parseur’s AI engine to extract data from the first document, Parseur requires splitting the original file into individual documents to extract data from each section properly.
Using the Split Document feature ensures that all documents within the bundle are processed, as if looping through each one separately.
Setting Up the Split Document Feature
Parseur allows you to automate this process by specifying how many pages each split should contain. Here’s how to configure the settings:
Open your Parseur mailbox.
Click on Upload / Import from the left menu.
Click on Adjust split settings under the drag-and-drop upload box:
There are three controls around document splitting:
Split documents every X amount of pages
Self explanatory, Parseur will split the document by a static amount of pages.
Split documents by page ranges
Many users have large documents including only a few pages of data they need to process. Contrasting from our keep pages by range function, this option allows you to split a document into multiple documents from the page ranges you define.
Split documents by keyword
Lastly, you can split documents by keywords, telling it to split before or after every instance of a keyword.
Frequently Asked Questions
Why is Parseur only processing the first page of my document?
If each page of your multi-page document contains repeating data, use the Split Document feature to ensure that Parseur processes all pages individually. After enabling this feature, delete the original upload and re-upload the document to apply the new settings.
What’s the difference between Split Document and Table Fields?
Both features capture repetitive patterns, but they serve different purposes:
Split Document: Use this feature when the entire document contains identical fields repeated across multiple pages or sections. It will divide the original file into separate documents, each processed individually.
Table Fields: Use this feature to extract data organized in tables or lists within a single page or across multiple pages.
Tip: If you are working with a single PDF or TIFF that contains multiple documents, each with their own tables, you can combine Split Document and Table Fields.
Can I use different splitting settings for different document types?
The split settings are global for each mailbox. If you need different split configurations, you can either:
Adjust the split settings manually each time you upload documents, or
Create separate mailboxes for each document type with unique split settings.
Can I reverse the document splitting?
After your documents have been processed, you will see the original document uploaded with a Split status, when clicking on this document you are given a few options to either reverse the document split or to re-split the document again:
Note: The Reverse document split and re-split document functions will consume credits on use.