What are the document formats supported by Parseur?

Parseur can extract data from most documents commonly used in the workplace.

Here is the list of supported document formats that you can extract text from:

File extensions      Document type

csv                  Comma-Separated Values
doc, docx            Microsoft Word
eml                  Emails (Multipart MIME encoded)
html, htm            HTML Document
pdf                  Portable Document Format
rtf                  Rich Text Format
txt                  Text File
xls, xlsx, xlsm      Microsoft Excel Documents
xml, hl7             XML documents

Do you need Parseur to support a specific file type not listed here? Let us know!

What is the maximum document size allowed?

Parseur applies different file size limits depending on how you send documents:

  • the maximum size of emails sent or forwarded to Parseur is 35 MB

  • the maximum size of documents uploaded directly into the app or via the API is 256 MB
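If you send documents from a script, it can help to check them against the extensions and size limits listed in this article before sending. The following is a minimal sketch only: the function name `check_document` and the `channel` labels are hypothetical and not part of Parseur, and it assumes that 1 MB means 1024 × 1024 bytes, which this article does not specify.

```python
from pathlib import Path

# Extensions listed in this article as supported.
SUPPORTED_EXTENSIONS = {
    "csv", "doc", "docx", "eml", "html", "htm", "pdf",
    "rtf", "txt", "xls", "xlsx", "xlsm", "xml", "hl7",
}

MB = 1024 * 1024  # assumption: the article does not say how a megabyte is counted

# Limits from this article, keyed by hypothetical channel labels.
LIMITS_MB = {"email": 35, "upload": 256}


def check_document(filename: str, size_bytes: int, channel: str = "upload") -> list[str]:
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    extension = Path(filename).suffix.lower().lstrip(".")
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {extension or '(none)'}")
    limit_bytes = LIMITS_MB[channel] * MB
    if size_bytes > limit_bytes:
        problems.append(f"file exceeds the {LIMITS_MB[channel]} MB limit for {channel}")
    return problems
```

For example, `check_document("report.eml", 36 * MB, "email")` reports a size problem, while the same file checked with `channel="upload"` passes.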

How to access the document in its original format?

Use the OriginalDocument Extra Field to download the file in its original format.

How to upload the original document to your cloud storage or app?

Once the OriginalDocument Extra Field is enabled, you can use its URL with any Zapier connector that supports files (such as Google Drive or Dropbox). To do so, map the Original Document URL to the file field in your Zap. Zapier will download the document and upload it to your favorite app.
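If you prefer to fetch the file from your own script instead of Zapier, the same idea can be sketched in Python. This is an illustration under assumptions: the function name `save_original_document` is hypothetical, and it assumes the URL can be fetched with a plain request, which this article does not confirm.

```python
import shutil
import urllib.request
from pathlib import Path


def save_original_document(url: str, destination: Path) -> Path:
    """Stream the document found at `url` into `destination` and return the path."""
    # Assumption: the URL is directly downloadable without extra headers.
    with urllib.request.urlopen(url, timeout=30) as response, open(destination, "wb") as out:
        shutil.copyfileobj(response, out)  # copies in chunks, so large files are not held in memory
    return destination
```

Pass the value of the Original Document URL as `url` and choose any local path as `destination`.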

What are the best practices to extract text from PDFs?

There are a few things you should know about creating a template for PDFs. Check out this article for more information.

What are the best practices to consolidate CSV and Excel attachments?

Parseur can automatically combine CSV and Excel files without creating a template. Parseur will combine the files based on their column headers.

Parseur will store the parsed result in the "Sheet" table field.

Because the result is in a table field, make sure to use the table field download option in the Export section or the "New Table Processed" trigger in Zapier. See the end of this article for more information about exporting table field data.

Note: If you don't want to use Parseur's default parsing method for CSVs, you can create your own template, and it will take priority over the default parsing.
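To picture what combining files by column headers means, here is a minimal sketch. It is not Parseur's actual implementation: the function name `combine_by_headers` is hypothetical, and it assumes that columns are matched by exact header name and that missing values are left empty.

```python
import csv
import io


def combine_by_headers(csv_texts):
    """Merge several CSV documents into one header list and one list of rows."""
    headers = []  # union of all column headers, in first-seen order
    rows = []
    for text in csv_texts:
        reader = csv.DictReader(io.StringIO(text))
        for name in reader.fieldnames or []:
            if name not in headers:
                headers.append(name)
        rows.extend(reader)
    # Align every row on the combined headers; absent columns become empty strings.
    return headers, [{h: row.get(h, "") for h in headers} for row in rows]


first = "sku,qty\nA1,2\n"
second = "qty,sku,price\n5,B7,9.99\n"
headers, rows = combine_by_headers([first, second])
```

In this example the two files list their columns in a different order and the second has an extra `price` column. The combined result has the columns `sku`, `qty`, and `price`, with an empty `price` for the row from the first file.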
