Advanced Document Extraction

Extract text from documents and images using AI vision (OCR). Use the Advanced Document Extraction action when a previous step in your Workflow outputs a file — such as a PDF attachment or a downloaded document — and you need that content as text for later steps like Extract Information or Generate Text.

Use Advanced Document Extraction when your document has complex layouts like tables or multi-column structures, poor scan quality, non-Latin scripts with diacritics, or when accuracy is very important.

Add the action

In the Workflow builder, click + between blocks and search for Advanced Document Extraction or OCR, then select it from the action library.

Action library with Advanced Document Extraction selected

Advanced Document Extraction action added to the Workflow

Select the document

In the Document field, reference the file you want to read. Use # to reference variables from a previous step.Common sources include:

Image uploads from a Manual trigger file input (images are not auto-parsed like PDFs and Word docs)
Email attachments from an Email trigger
Downloaded files from Download File from URL or integration download actions (Google Drive, SharePoint, Box, etc.)
Files inside a Loop when processing multiple attachments or documents one at a time

Use the output in later steps

The action outputs the extracted text as a single text variable. Reference it in subsequent actions to summarize, categorize, extract fields, or generate a response.

Supported file types

Type	Formats
Documents	PDF, Word (`.doc`, `.docx`), PowerPoint (`.pptx`), Keynote (`.key`)
Images	PNG, JPEG, WEBP, GIF

If the file type isn’t supported, the action stops the Workflow with an error. Convert the file to a supported format before passing it in.

Advanced: custom instructions

Under Advanced Settings, add Custom Instructions to tell the AI what to focus on or how to format the output. For example:

Extract all table data as markdown tables
Focus on the invoice number, date, and line items only
Preserve headings and bullet lists from the original document

Custom instructions are useful when you need a specific structure rather than a full verbatim transcription.

Pair Advanced Document Extraction with Extract Information to turn unstructured document text into structured fields you can send to CRM actions, spreadsheets, or email steps.

Request input from team Research Agent

Getting started

Agents

Workflows

Knowledge Base

Meetings

Extensions

Settings and admin

Security and compliance

Advanced Document Extraction

Supported file types

​Supported file types

​Related

Supported file types

Related