Skip to main content
The extract node uses AI to pull structured data from documents based on a schema you define. It is the core action node for document processing in DocPipe. You define the fields you want, and the AI extracts them from the document content.

Configuration

FieldTypeRequiredDescription
EngineselectYesExtraction engine: Engine 1 or Engine 2
Schemaschema editorYesDefines the fields to extract (name, type, description)
PrecisionselectNoProcessing precision: Fast, Standard, or Advanced
Extraction hintsstringNoNatural language instructions to guide the AI extraction
JSON pathstringNoJSONPath expression to extract a subset of the output

Engines

  • Engine 1: Faster, lower cost. Best for simple documents with clear structure.
  • Engine 2: More capable. Best for complex documents, handwriting, or ambiguous layouts.

Inputs and outputs

Allowed inputs: Trigger nodes, route, parse, review. Output: Structured JSON data matching the configured schema.

Credit cost

EnginePrecisionCost per page
Engine 1Fast1 credit
Engine 1Standard2 credits
Engine 1Advanced3 credits
Engine 2Fast2 credits
Engine 2Standard3 credits
Engine 2Advanced5 credits