Parseur's advanced AI technology now empowers you to extract data from documents by simply leveraging the field names within your mailbox. No more manual template setup: just seamless and accurate data extraction, regardless of language or document complexity.
AI data extraction features:
Parseur's AI extraction feature introduces a new era of efficiency and convenience:
Template-less extraction: Bid farewell to template creation and updates. Our AI-driven solution eliminates the need for manual setup, allowing you to automatically extract data from documents.
Any type of document layout: Since no templates are involved, the AI engine can extract data from documents with a great variety of layouts.
Field names are the key: Guiding our AI to extract the precise data you require is as simple as naming the desired fields within your mailbox. These field names serve as intuitive cues for our AI to identify and extract the corresponding data.
Multilingual proficiency: Parseur's AI understands and extracts data from documents in any language, ensuring global accessibility and applicability.
Limitations:
The AI engine has the following limitations to bear in mind:
Page count limitation: when using table fields, the AI can extract data from only a limited number of pages. The exact number varies with the text density of your pages. You can use the Split PDF feature to have your document split into smaller ones at upload.
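As a rough illustration of what splitting a long document into smaller ones means, the Python sketch below computes fixed-size page ranges. It is not Parseur's Split PDF implementation, whose exact behavior is not described here; the function name `page_ranges` and the `max_pages` value are hypothetical, and no exact page limit is documented.

```python
def page_ranges(total_pages: int, max_pages: int) -> list[tuple[int, int]]:
    """Return 1-based, inclusive (first, last) page ranges of at most max_pages each."""
    if total_pages < 0 or max_pages < 1:
        raise ValueError("total_pages must be >= 0 and max_pages >= 1")
    return [
        (start, min(start + max_pages - 1, total_pages))
        for start in range(1, total_pages + 1, max_pages)
    ]

# max_pages=10 is a placeholder, not a documented Parseur limit.
print(page_ranges(total_pages=23, max_pages=10))
```

With these placeholder values, a 23-page document is cut into pages 1 to 10, 11 to 20, and 21 to 23.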
How to parse documents using AI
Getting started with Parseur's AI parsing feature is quick and intuitive.
Step 1: Create a new mailbox
Use the AI-assisted mailbox creation mode to have Parseur set up a mailbox for you.
Alternatively, if you choose the Manual Creation mode, make sure the "Use AI" toggle is on after choosing your mailbox type.
You can also activate AI on an existing mailbox in the mailbox settings.
Step 2: Upload a sample document
Upload a representative sample document that showcases the type of data you want to extract.
Step 3: Configure your fields
Wait for Parseur to analyze the document.
Then, if your mailbox already has fields, Parseur will immediately start the extraction process.
Option 1: Create simple fields
For custom mailboxes where there are no default fields, you will need to create some fields:
Click on the uploaded document to view it.
Navigate to the Fields tab.
Add the specific fields you wish to extract.
Name these fields so that the AI can easily understand them, for example "InvoiceNumber" or "customer_address".
Option 2: Create table fields to capture repeating data
To extract a list of repeating data, use the New Table button.
Then click on the "Add fields to <your field>" button to name the individual fields you want to extract from the table.
Repeat this for each field you want to add. For example: quantity, description, sku, price, etc.
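To picture the difference between a simple field and a table field, here is a minimal Python sketch of one possible shape for an extracted result. The structure, the table name `LineItems`, and every value are invented for illustration; this article does not specify Parseur's actual output format.

```python
# Hypothetical shape of one parsed document: a simple field holds a single value,
# a table field holds one row (dict) per repeated item. All values are invented.
parsed = {
    "InvoiceNumber": "INV-001",  # simple field
    "LineItems": [               # table field (hypothetical name)
        {"quantity": 2, "description": "Widget", "sku": "W-1", "price": 9.50},
        {"quantity": 1, "description": "Gadget", "sku": "G-7", "price": 20.00},
    ],
}

def order_total(rows: list[dict]) -> float:
    """Sum quantity * price over every row of a table field."""
    return sum(row["quantity"] * row["price"] for row in rows)

total = order_total(parsed["LineItems"])
print(f"{len(parsed['LineItems'])} rows, total {total:.2f}")
```

Each field added to the table becomes a key that repeats in every row, which is why a table suits repeating data such as line items.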
Step 4: Process your document and check the results
After adding all desired extraction fields, click the "Process" button to initiate the AI-driven data extraction process.
Frequently Asked Questions (FAQ)
Parseur AI didn't fetch the value I wanted for some of my fields. How can I train the Parseur AI model to do better?
Tip #1: use better field names
Parseur uses the names of your fields to find the relevant data in your documents. If the wrong value is fetched, try renaming the field to something more accurate that the AI will understand better. Think of the AI as a data entry trainee that needs guidance to understand what you want.
For example, to capture the invoice number in invoice documents:
Don't name the field "Field", "Test", or "invno".
Name it "InvoiceNumber", "invoice_number", or "Invoice number".
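One way to see why the recommended names work is to split each name into plain words. The Python sketch below is only an illustration of the naming advice, not a description of how Parseur's AI reads field names; the helper `name_words` is hypothetical.

```python
import re

def name_words(field_name: str) -> list[str]:
    """Split a field name into lowercase words (camelCase, snake_case, or spaced)."""
    # Add a space at each lowercase/digit-to-uppercase boundary:
    # "InvoiceNumber" becomes "Invoice Number".
    spaced = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", " ", field_name)
    # Split on whitespace, underscores, and hyphens; drop empty pieces.
    return [w.lower() for w in re.split(r"[\s_-]+", spaced) if w]

for name in ["InvoiceNumber", "invoice_number", "Invoice number", "invno", "Field"]:
    print(f"{name!r:18} -> {name_words(name)}")
```

The three recommended spellings all reduce to the same two words, invoice and number, whereas "invno" stays a single opaque token and "Field" says nothing about the data it should hold.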
Tip #2: delete unused or duplicate fields
The more fields you have, the more the AI tends to get some of them wrong. If tip 1 didn't help, try to restrict the number of extracted fields to the core of what you need.
Tip #3: consider using the template engine for some layouts
Because the AI is a probabilistic model, it cannot guarantee 100% accuracy for all documents. If you need better results and can't get them with the AI engine, consider creating templates for some of the layouts. Read more about the pros and cons of our AI parsing engine vs. template parsing engines.
Parseur only retrieved 1 data point from my documents. I have other similar data points in my document. How do I tell Parseur to extract all the data?
If the data repeats within a page, use Table fields instead of single fields:
Go to the Fields tab when viewing a document
Click New Table
Name it something the AI will understand (for example, if you are extracting contact details, name the table something like "ContactList")
Click Create
Click Add Field and name each field similar to the single fields you had previously
Delete the single fields so as not to confuse the AI
Reprocess your documents and check to see if you get the right results
If your document contains several individual documents (like several invoices, for example), use the Split PDF feature described below.
I have long documents; will AI be able to extract data from them?
The AI will only be able to extract data from the first few pages of your document. The exact number depends on the text density of your pages.
If you have long documents, you can consider the following options:
If you have a PDF consisting of several individual documents bundled together, you can use the Split PDF feature to have Parseur cut the document into individual ones.
You can also consider using one of our two template engines: the Text engine for emails and text documents, and the OCR engine for PDFs.
I have some templates and the AI engine enabled in my mailbox. Which engine will be used to parse my documents?
Matching templates take priority over the AI engine. If no template matches, Parseur uses the AI engine to extract your data.
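The priority rule can be sketched as a small decision function. This is an illustrative Python sketch, not Parseur's code: the function name, its parameters, and the "none" outcome (for a mailbox with no matching template and AI turned off) are assumptions.

```python
def choose_engine(has_matching_template: bool, ai_enabled: bool) -> str:
    """Return which engine would parse a document under the stated priority rule."""
    if has_matching_template:
        return "template"  # a matching template always takes priority
    if ai_enabled:
        return "ai"        # fallback when no template matches
    return "none"          # assumption: neither engine can parse the document

print(choose_engine(has_matching_template=True, ai_enabled=True))
print(choose_engine(has_matching_template=False, ai_enabled=True))
```

The first call selects the template engine even though AI is enabled; the second falls back to the AI engine.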
How secure is my data when using the AI engine? Do you share my data to improve the AI model?
Parseur uses state-of-the-art AI models from Azure, Google and AWS to parse your data. Your data is processed in the European Union. The data remains yours and we don't re-use or share it to improve the AI models.
What is the difference between AI engine v1 and v2 in the mailbox settings?
AI v1 is our legacy AI model, introduced in late 2023. AI v2 is our newest model, introduced in July 2024.
The v2 model improves extraction accuracy and can parse much larger documents. We recommend that you use v2 by default and only try v1 if v2 doesn't give you satisfactory results.