What are the document formats supported by Parseur?
Parseur can extract data from a wide range of document types commonly used in the workplace. Below is a list of supported formats:
File extensions | Document type |
bmp | Bitmap images |
csv | Comma-Separated Values |
doc, docx | Microsoft Word Documents |
eml | Emails (Multipart MIME encoded) |
gif | GIF images (First frame only) |
html, htm, xhtml | HTML Documents |
ics | Internet Calendar and Scheduling Files |
jpe, jpeg, jpg | JPEG images |
msg | Microsoft Outlook Email Messages |
odt, ods | OpenDocument Text/Spreadsheets |
Portable Document Format (PDF) | |
png | PNG images |
rtf | Rich Text Files |
tiff | TIFF images (supports multi-page) |
txt | Plain Text Files |
xls, xlsx, xlsm | Microsoft Excel Spreadsheets |
xml, hl7 | XML documents |
zip | Zipped Archives (decompressed automatically, with supported files appearing in the document queue) |
Need Support for a Format Not Listed?
If you need Parseur to support a specific file type not mentioned here, please reach out to us!
Frequently Asked Questions (FAQs)
Can Parseur Extract Data from Password-Protected PDFs?
No, Parseur currently cannot extract data from password-protected PDFs.
You can help us prioritize this feature by upvoting the request: Unlock and parse password-protected PDFs.
What Is the Maximum Document Size Parseur Supports?
Emails sent or forwarded to Parseur: Up to 35MB
Documents uploaded directly in the app or via API: Up to 256MB
Images sent to Parseur by email or via upload or via the API: Up to 20MB and maximum dimensions of 10,000 pixels in width or height. Minimum size of 32x32 pixels, maximum size of 40 Megapixels.
How Can I Stop Parseur from Parsing Images in Emails?
To disable image processing, go to your mailbox settings under the “Processing” tab and uncheck the “Enable image file processing” option. Don’t forget to click Save!
How Can I Access the Original Document?
You can download the original file using the OriginalDocument
Metadata Field in the Fields section of your mailbox. Reprocess your documents to see the change. Read more about using Metadata fields.
How Do I Upload the Original Document to Cloud Storage?
Once the OriginalDocument
Metadata Field is enabled, you can use its URL with Zapier or other tools (like Google Drive or Dropbox) that support file uploads.
To do this, simply map the Original Document URL to the file field in your Zap. Zapier will automatically download and upload the document to your desired app.
Best Practices
Extracting Text from PDFs
You can extract text from PDFs using AI-powered parsing or Parseur’s OCR template engine.
Merging CSV and Excel Files
Parseur can automatically consolidate CSV and Excel attachments based on their column headers, without needing a custom template.
The parsed data will be stored in the “Sheet” table field.
Use the table field download option from the Export section, or the “New Table Processed” trigger in Zapier to access the data.
Note: If you prefer not to use Parseur’s default CSV parsing, you can create a custom template. Any custom template will override the default parsing behavior.