Haystac Platform / Chicago

Document correlation for high-volume inbound content.

Chicago automatically separates, identifies, and organizes documents arriving in mixed batches so downstream systems start with the right document in the right context.

Auto-separate Break mixed files into distinct business documents with minimal setup.
Auto-identify Classify content by meaning, not just rigid templates or keywords.
Ready for downstream workflows Feed parsing, routing, and automation layers with cleaner, structured input.
Where Chicago Fits
Step 1: Organize content Turn bundled inbound files into separated, labeled, workflow-ready documents.
Built on embeddings Chicago uses embeddings to understand semantic similarity and improve classification fidelity.
Feeds Nashville and Orion Once documents are separated and identified, they can move into parsing and insight generation.

What Chicago does

Chicago addresses the digital mailroom problem by separating batches of inbound documents and identifying what each document is before further processing occurs.

Separate mixed files

Split a single uploaded file into discrete business documents such as invoices, claims, statements, forms, and supporting materials.

Classify by meaning

Use semantic understanding to distinguish similar-looking documents and reduce manual sorting effort.

Prepare for downstream processing

Create cleaner inputs for parsing, adjudication, workflow automation, and generative insight.

Correlation-first Organize content before extraction or decisioning begins.
Embeddings-based Capture meaning and context beyond keyword matching.
Minimal setup Reduce manual effort required to create classification-ready workflows.
Platform-native Feeds directly into Nashville, Orion, and Polaris.

How Chicago works

Chicago is designed for high-volume environments where documents arrive in bundles, packets, or mixed uploads and need to be organized before business logic can be applied.

Step 1 Receive inbound content

Files enter through a host application, workflow layer, or upstream intake process.

Step 2 Separate documents

Chicago detects document boundaries inside mixed content and breaks them apart.

Step 3 Classify each item

Each separated document is identified and labeled using semantic classification.

Step 4 Return structured output

Results are passed back to the host or routed into downstream Haystac services.

Key capabilities

Chicago is built for noisy, inconsistent, real-world inbound content.

  • Auto-separate documents contained within a single uploaded file or packet.
  • Semantic classification using embeddings-based understanding.
  • Supports digital mailroom and intake-heavy workflows.
  • Returns structured JSON-ready correlation results to the host environment.

Best-fit use cases

Chicago is especially valuable anywhere inbound documents arrive mixed together.

  • Claims packets containing multiple insurance documents.
  • Loan files with applications, statements, IDs, and supporting records.
  • Government intake scenarios with mixed forms and attachments.
  • Transportation and logistics workflows with shippers, invoices, and bills of lading.

Start with cleaner inputs.

Chicago helps teams organize inbound content before extraction, decisioning, and automation begin.

Talk to Haystac