Skip to main content
All CollectionsSending emails and documents
Document formats supported by Parseur
Document formats supported by Parseur

Types of documents Parseur can extract text from, FAQ and Best Practices

Updated over a week ago

What are the document formats supported by Parseur?

Parseur can extract data from a wide range of document types commonly used in the workplace. Below is a list of supported formats:

File extensions

Document type

bmp

Bitmap images

csv

Comma-Separated Values

doc, docx

Microsoft Word Documents

eml

Emails (Multipart MIME encoded)

gif

GIF images (First frame only)

html, htm, xhtml

HTML Documents

ics

Internet Calendar and Scheduling Files

jpe, jpeg, jpg

JPEG images

msg

Microsoft Outlook Email Messages

odt, ods

OpenDocument Text/Spreadsheets

pdf

Portable Document Format (PDF)

png

PNG images

rtf

Rich Text Files

tiff

TIFF images (supports multi-page)

txt

Plain Text Files

xls, xlsx, xlsm

Microsoft Excel Spreadsheets

xml, hl7

XML documents

zip

Zipped Archives (decompressed automatically, with supported files appearing in the document queue)

Need Support for a Format Not Listed?

If you need Parseur to support a specific file type not mentioned here, please reach out to us!

Frequently Asked Questions (FAQs)

Can Parseur Extract Data from Password-Protected PDFs?

No, Parseur currently cannot extract data from password-protected PDFs.

You can help us prioritize this feature by upvoting the request: Unlock and parse password-protected PDFs.

What Is the Maximum Document Size Parseur Supports?

  • Emails sent or forwarded to Parseur: Up to 35MB

  • Documents uploaded directly in the app or via API: Up to 256MB

  • Images sent to Parseur by email or via upload or via the API: Up to 20MB and maximum dimensions of 10,000 pixels in width or height. Minimum size of 32x32 pixels, maximum size of 40 Megapixels.

How Can I Stop Parseur from Parsing Images in Emails?

To disable image processing, go to your mailbox settings under the “Processing” tab and uncheck the “Enable image file processing” option. Don’t forget to click Save!

A screenshot showing the Enable Image File Processing checkbox

How Can I Access the Original Document?

You can download the original file using the OriginalDocument Metadata Field in the Fields section of your mailbox. Reprocess your documents to see the change. Read more about using Metadata fields.

Use OriginalDocument metadata field to access the original file

How Do I Upload the Original Document to Cloud Storage?

Once the OriginalDocument Metadata Field is enabled, you can use its URL with Zapier or other tools (like Google Drive or Dropbox) that support file uploads.

To do this, simply map the Original Document URL to the file field in your Zap. Zapier will automatically download and upload the document to your desired app.

Best Practices

Extracting Text from PDFs

You can extract text from PDFs using AI-powered parsing or Parseur’s OCR template engine.

Merging CSV and Excel Files

Parseur can automatically consolidate CSV and Excel attachments based on their column headers, without needing a custom template.

  • The parsed data will be stored in the “Sheet” table field.

  • Use the table field download option from the Export section, or the “New Table Processed” trigger in Zapier to access the data.

Note: If you prefer not to use Parseur’s default CSV parsing, you can create a custom template. Any custom template will override the default parsing behavior.

Did this answer your question?