Parseur's AI engine can now extract data from your documents using only the field names defined in your mailbox. No more manual template setup: just seamless and accurate data extraction, regardless of language or document complexity.
AI Extraction Features:
Parseur's AI extraction feature introduces a new era of efficiency and convenience:
Template-less extraction: Bid farewell to template creation and updates. Our AI-driven solution eliminates the need for manual setup, allowing you to automatically extract data from documents.
Field names are the key: Guiding our AI to extract the precise data you require is as simple as naming the desired fields within your mailbox. These field names serve as intuitive cues for our AI to identify and extract the corresponding data.
Multilingual proficiency: Parseur's AI understands and extracts data from documents in any language, ensuring global accessibility and applicability.
The AI engine is currently in beta, and there are a few limitations to bear in mind:
Page count limitation: For now, the AI can extract data from about 5 to 10 pages of any document. The exact number can be slightly higher or lower, depending on the text density of your pages. In any case, Parseur will not charge you more than 10 credits per document.
No support for "Reprocess All": For performance reasons, we currently have to limit using the "Reprocess All" button to non-AI mailboxes only.
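To make the credit cap concrete, here is a minimal sketch of how the charge for one document could be estimated. It assumes one credit per processed page, which is an assumption made for illustration and is not stated above; only the cap of 10 credits per document comes from this article. The function name `estimated_credits` is hypothetical.

```python
# Cap stated in the page count limitation above.
MAX_CREDITS_PER_DOCUMENT = 10


def estimated_credits(page_count: int, credits_per_page: int = 1) -> int:
    """Estimate the credits charged for one document.

    The per-page rate is an assumption for illustration; only the
    10-credit cap per document is taken from the article.
    """
    if page_count < 0:
        raise ValueError("page_count must be non-negative")
    return min(page_count * credits_per_page, MAX_CREDITS_PER_DOCUMENT)


print(estimated_credits(4))   # 4
print(estimated_credits(25))  # 10, because the cap applies
```

Under these assumptions, a long document never costs more than 10 credits, even though the AI may only read its first pages.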
How to parse documents using AI
Getting started with Parseur's AI parsing feature is quick and intuitive.
Step 1: Create a new mailbox
Choose from our pre-defined mailboxes or create a customized mailbox tailored to your needs.
If you have an existing mailbox for which you want to use AI, enable the AI engine as described in step 2 below.
Step 2: Enable AI at mailbox level (or user level)
After selecting the mailbox type, click the AI checkbox to activate it.
You can also activate AI on an existing mailbox in the mailbox settings.
Finally, you can also activate AI for all of your existing mailboxes in your user account. Click on your name in the left menu > Account > Manage account > AI engine.
Step 3: Upload a sample document
Upload a representative sample document that showcases the type of data you want to extract.
Step 4: Configure your fields (Optional)
Wait for Parseur to analyze the document.
Then, if your mailbox already has fields (for example, if you chose one of our pre-defined mailboxes), Parseur will immediately start the extraction process.
Create simple fields
For custom mailboxes where there are no default fields, you will need to create some fields:
Click on the uploaded document to view it.
Navigate to the Fields tab.
Add the specific fields you wish to extract.
Name these fields so that the AI can easily understand them, for example "InvoiceNumber" or "customer_address".
Create table fields to capture repeating data
To extract a list of repeating data, use the New Table button.
Then click on the "Add fields to <your field>" button to name the individual fields you want to extract from the table.
Repeat this for each field you want to add. For example: quantity, description, sku, price, etc.
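To picture how simple fields and a table field relate, the sketch below models one parsed document as a Python dictionary. The structure, the table name `items`, and the sample values are all hypothetical and chosen for illustration; this article does not describe Parseur's actual output format. Only the field names (InvoiceNumber, customer_address, quantity, description, sku, price) come from the examples above.

```python
from decimal import Decimal

# Hypothetical parsed result: simple fields hold one value each,
# while the table field "items" holds one dictionary per repeating row.
parsed_document = {
    "InvoiceNumber": "INV-001",
    "customer_address": "1 Example Street",
    "items": [
        {"quantity": "2", "description": "Widget", "sku": "W-1", "price": "9.50"},
        {"quantity": "1", "description": "Gadget", "sku": "G-7", "price": "20.00"},
    ],
}


def table_total(rows):
    """Sum quantity * price over all rows of a table field."""
    return sum(
        (Decimal(row["quantity"]) * Decimal(row["price"]) for row in rows),
        Decimal("0"),
    )


print(table_total(parsed_document["items"]))  # 39.00
```

Each field added to the table becomes a key on every row, which is why clear, distinct names such as quantity and price matter.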
Step 5: Process your document and check the results
After adding all desired extraction fields, click the "Process" button to initiate the AI-driven data extraction process.
Frequently Asked Questions (FAQ)
Parseur AI didn't fetch the value I wanted for some of my fields. How can I train the Parseur AI model to do better?
Tip #1: use better field names
Parseur uses the names of your fields to find the relevant data in your documents. If the wrong value is fetched, try renaming the field to something more precise that the AI will understand better. Think of the AI as a data entry trainee that needs guidance to understand what you want.
For example, to capture the invoice number in invoice documents:
❌ don't name the field
✅ name it
Tip #2: delete unused or duplicate fields
The more fields you have, the more the AI tends to get some of them wrong. If tip #1 didn't help, try to restrict the number of extracted fields to the core of what you need.
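If you keep a list of your field names, a small script can help spot likely duplicates before you delete them. The sketch below is only an illustration: the helper names `normalize` and `find_duplicate_fields` are hypothetical, and the rule used (ignore case, spaces, and punctuation) is an assumption about what counts as a duplicate, not something Parseur does for you.

```python
import re
from collections import defaultdict


def normalize(name: str) -> str:
    """Lowercase a field name and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


def find_duplicate_fields(field_names):
    """Group field names that normalize to the same string."""
    groups = defaultdict(list)
    for name in field_names:
        groups[normalize(name)].append(name)
    # Keep only groups with more than one spelling of the same name.
    return [names for names in groups.values() if len(names) > 1]


fields = ["InvoiceNumber", "invoice_number", "customer_address", "Invoice Number"]
print(find_duplicate_fields(fields))
# [['InvoiceNumber', 'invoice_number', 'Invoice Number']]
```

Fields with the same meaning but different wording will not be caught by this check, so a manual review is still worthwhile.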
Tip #3: consider using the template engine for some layouts
Because AI is a probabilistic model, it cannot guarantee 100% accuracy for all documents. If you need better results and cannot get them with the AI engine, consider creating templates for some of your layouts. Read more about the pros and cons of our AI parsing engine vs. the template parsing engine.
I have long documents; will AI be able to extract data from them?
AI will only be able to extract data from the first few pages of your document. The exact number of pages depends on the text density of your document.
If you have long documents, you can consider the following options:
If you have a PDF consisting of several individual documents all bundled together, you can use the Split document feature to have Parseur cut the document into individual ones.
I have some templates and the AI engine enabled in my mailbox. Which engine will be used to parse my documents?
Matching templates take priority over the AI engine. If no template matches, Parseur uses the AI engine to extract your data.
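The priority rule can be sketched as a small decision function. This is an illustration of the rule described here, not Parseur's implementation: the function name, the `Template` and `Extractor` types, and the sample template are hypothetical. What happens when no template matches and AI is disabled is not covered in this article, so the sketch simply returns no result in that case.

```python
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical types: a template returns extracted fields when it
# matches the document, and None when it does not.
Template = Callable[[str], Optional[dict]]
Extractor = Callable[[str], dict]


def choose_engine_and_parse(
    text: str,
    templates: Sequence[Template],
    ai_enabled: bool,
    ai_extract: Extractor,
) -> Tuple[str, Optional[dict]]:
    """Try matching templates first, then fall back to the AI engine."""
    for template in templates:
        result = template(text)
        if result is not None:
            return "template", result
    if ai_enabled:
        return "ai", ai_extract(text)
    # Assumption: without a match and without AI, nothing is extracted.
    return "none", None


def invoice_template(text: str) -> Optional[dict]:
    """Toy template that only matches text starting with 'INVOICE'."""
    if text.startswith("INVOICE"):
        return {"InvoiceNumber": text.split()[-1]}
    return None


def fake_ai_extract(text: str) -> dict:
    """Stand-in for the AI engine, for demonstration only."""
    return {"InvoiceNumber": "guessed-by-ai"}


print(choose_engine_and_parse("INVOICE 42", [invoice_template], True, fake_ai_extract))
print(choose_engine_and_parse("Receipt 7", [invoice_template], True, fake_ai_extract))
```

The first call is handled by the template because it matches; the second falls through to the AI stand-in.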
How secure is my data when using the AI engine? Do you share my data to improve the AI model?
Parseur uses Azure AI to parse your data. Your data is processed in the European Union. Your data is not used to improve the AI model.
Help Us Help You!
We greatly value your feedback and insights as we refine and optimize this AI feature during its beta phase.
If you have questions, suggestions, or encounter any challenges, our support team is just a click away!
Click on the chatbox icon at the bottom right of your screen to get in touch. Your input is invaluable in shaping the future of Parseur's AI parsing feature.