Haystac has completely changed how we manage documents. What used to take hours of manual processing is now done in minutes—with greater accuracy.
Chicago automatically separates, identifies, and organizes documents arriving in mixed batches so downstream systems start with the right document in the right context.
Chicago addresses the digital mailroom problem by separating batches of inbound documents and identifying what each document is before further processing occurs.
Split a single uploaded file into discrete business documents such as invoices, claims, statements, forms, and supporting materials.
Use semantic understanding to distinguish similar-looking documents and reduce manual sorting effort.
Create cleaner inputs for parsing, adjudication, workflow automation, and generative insight.
Chicago is designed for high-volume environments where documents arrive in bundles, packets, or mixed uploads and need to be organized before business logic can be applied.
Files enter through a host application, workflow layer, or upstream intake process.
Chicago detects document boundaries inside mixed content and breaks them apart.
Each separated document is identified and labeled using semantic classification.
Results are passed back to the host or routed into downstream Haystac services.
Chicago is built for noisy, inconsistent, real-world inbound content.
Chicago is especially valuable anywhere inbound documents arrive mixed together.
Chicago is the correlation layer within the broader Haystac platform.
Chicago helps teams organize inbound content before extraction, decisioning, and automation begin.
Haystac is a fully containerized AI platform that deploys behind your firewall — on-prem, in your private cloud, or hybrid. Built for banks, insurers, hospitals, and agencies that need to classify, extract from, reason over, and act on the documents that keep getting flagged by security.
“Are there indicators of fraud in claim FB-9183?”
Sensitive claims, loan files, charts, and case data can’t leave your environment without violating contracts, regulations, or customer commitments.
12–24 months of security architecture, framework hardening, governance controls, and audit cycles before a single model runs in production. Fewer than 20% of AI experiments reach production.
Knowledge workers process documents by hand while cloud-first competitors automate the same workflows. The cost of waiting is operational drag, every quarter.
of regulated-industry data sits locked and unusable for AI — not because the AI isn’t good enough, but because the AI is in the wrong place.
Haystac is a fully containerized AI platform that deploys behind your firewall. Your documents stay where they are. The models train on your content. Every answer is traceable to a source document you already own. NIST 800-53 aligned out of the box.
Four products. One pipeline. Sort · Extract · Answer · Act — each independently deployable, all designed to work together.
Inbound documents grouped by meaning, not templates. Mailroom triage, claims intake, case files — classified before they hit a human queue.
Pull every field from forms, tables, handwriting, irregular layouts — without OCR brittleness. Outputs are JSON, ready for your systems.
Get answers from your own documents. Every response cited. Every reasoning chain auditable. Bias toward retrievable content reduces hallucination.
Approve, route, escalate, generate — multi-step workflows with a full audit trail. Turns content-driven understanding into operational follow-through.
Most AI rollouts in regulated industries die in security review. Haystac changes the inputs to that conversation — and the conversation that follows.
Claims, charts, loan files, case files — finally usable.
Every answer cited to source. NIST 800-53 out of the box.
Containerized, pre-hardened, ready to install behind your firewall.
Sort, extract, answer, route — without manual handoffs for in-policy work.
REST API. Runs alongside your ECM, CRM, ERP, and case systems.
Same architecture from 500K to 50M+ pages a year.
Haystac has completely changed how we manage documents. What used to take hours of manual processing is now done in minutes—with greater accuracy.
We reduced document processing time by 80% after implementing Haystac. The AI-powered classification and extraction features have been game-changers for our team.
Our workflow was drowning in paperwork before Haystac. Now we’ve automated data extraction and improved efficiency across departments.
With Haystac we eliminated manual errors in invoice processing, saving thousands in operational costs each quarter.
Process loan files, KYC packets, and trade docs without leaving your data center. Decisions cited to source.
Triage claims, automate underwriting reviews, and read every policy without exposing PII.
Extract from charts and prior authorizations under HIPAA, on infrastructure you control.
FedRAMP-aligned AI for case files, FOIA responses, and benefits adjudication.
“Are there indicators of fraud in this claim file?”
Haystac reviews claim forms, adjuster notes, photos, and prior history. Returns a structured summary with cited evidence — ready for an SIU referral or routine close.
“Which of our procedures need to change under this new regulation?”
Haystac analyzes policies, identifies impacted processes, produces a traceable impact assessment — with line-level citations to both the regulation and the affected procedure.
“Does this treatment qualify under the patient’s plan?”
Haystac evaluates clinical documentation, coverage rules, and payer guidelines. Returns a defensible coverage decision with citations to the exact policy clauses.
“Can we auto-process this in-policy claim?”
Haystac classifies, extracts, validates against business rules, and routes. Escalates exceptions to a human queue. Logs every step for the audit trail.
The Haystac team led or founded the companies that defined every IDP wave before this one — ABBYY, Kofax, Sapiens, Solaris, ILOG. Now they’re building the platform they always wished existed: domain-specific AI that runs inside regulated environments, not around them.

20+ years in enterprise software. Founded Solaris Development (Partners Healthcare, MGH, Dana Farber). Early sales at ILOG — helped scale to $30M in five years.

30+ years in distributed systems, rules engines, and computational linguistics. Co-founder and principal architect at Sapiens. M.S., Latvian State University.

20+ years in ML research. Author of the core algorithms behind Haystac — scalable classification, conceptual clustering, fuzzy pattern matching for noisy text.

CIO at ABBYY. Former CTO at Kofax — led the company’s move into mobile capture, NLP, and process automation. 45+ patents in text analytics and image processing.
Haystac applies that playbook to the next generation of AI — with a relentless focus on control, domain-specific intelligence, and real-world adoption in regulated industries.
Haystac goes to market through partners — VARs, OEMs, and BPOs already operating inside regulated environments. They handle the deployment. We handle the platform.
Maximum control. Your hardware, your network, your air gap if you need one.
Skip the hardware procurement. Deploy into your own AWS, Azure, or GCP account.
Sensitive ingest stays on-prem. Reasoning runs in private cloud. One platform across both.
Yes. Fully containerized (Docker). On-prem, customer-managed VPC, or hybrid. No outbound connectivity. No SaaS dependencies. No telemetry phoning home.
RAG and Graph RAG architecture. Every response is constrained to retrieved enterprise content, with line-level source attribution. The model can’t answer from outside your corpus — that’s the point.
Day One inside existing infrastructure. DIY private AI takes 12–24 months. We measure deployment in weeks, not quarters.
REST API. MCP for tool invocation. Runs alongside your ECM, CRM, ERP, and case systems. No rip-and-replace.
Partner-led. Talk to a Haystac partner for a deployment quote scoped to your environment, document volume, and use cases.
Bring your documents, your systems, and your compliance constraints. We’ll show you how Haystac works inside the environment you already control — in 30 minutes.
Request a demo →