Services · Four core areas

Four things we
know how to build.

We don't do everything. These are the four areas where we've shipped enough production systems to know what the hard parts actually are — and what to do about them.

01 / 04

Document Intelligence
Systems

Extract structure from chaos.

docsextractvalidateoutputjsonhumanrawllm · rulesschemashipped

Most teams come to us after they've already tried document extraction once. The pilot worked on clean PDFs, then broke the moment real documents arrived — scanned invoices with handwriting in the margins, forms filled out inconsistently, tables that span pages, PDFs that were actually just photos of PDFs.

That gap between 'the model extracted the right fields on the sample' and 'the system reliably handles what the business actually receives' is where most document projects die. We build for the second case. That means configurable pipelines instead of hardcoded rules, a real ground-truth dataset from day one, an evaluation harness that tells you when accuracy drops on new document types, and a human review loop for the cases the model is uncertain about.

The output is boring by design: a pipeline that an operations team can trust, tune, and extend without calling us back every quarter.

What this looks like in practice
  • LLM-powered extraction pipelines
  • Schema-driven, client-configurable outputs
  • Variable document format handling (PDFs, reports, tables)
  • Human-in-the-loop validation workflows
  • Ground truth and evaluation systems
  • Two-pass verification architecture
02 / 04

Knowledge Graphs &
Query Systems

Make your data queryable and explorable.

querygraphpeopleorgsdocseventsplacestagsdesigned around the questions that matter

A knowledge graph is only as useful as the questions you can actually ask it. Most graph projects fail because the schema was designed around the source data instead of the questions the business needs answered — so you end up with a technically correct graph that no one can query usefully.

We work backwards. We start with the five or ten questions that matter most to your team, then design the schema, entity types, and relationships to make those questions fast and obvious. Along the way we handle the messy parts most tutorials skip: entity disambiguation, schema evolution without reloading the whole graph, and making the graph queryable in plain language when that's what the end users actually need.

The graphs we build sit in production for years, not months. That only happens if they're designed to answer real questions from day one.

What this looks like in practice
  • Knowledge graph architecture and schema design
  • Graph databases (Neo4j-based systems)
  • Data ingestion pipelines from extracted sources
  • Natural language → structured query interfaces
  • Query validation and refinement layers
  • Template-based query generation for speed and reliability
03 / 04

Entity Resolution
Systems

The hardest part of the system — and the most critical.

MESSYJohn SmithJ. Smithjohnny s.JOHN SMITHEmatchingfuzzy · rules · mlRESOLVEDJohn Smithid: 4782four records → one entity

Entity resolution is the problem that quietly kills data projects. You have the same person in four systems spelled five ways, the same company with three different tax IDs across merged subsidiaries, products that are 'the same thing' by one definition and 'different SKUs' by another. The right answer depends on what the business is trying to do — and the business usually can't tell you until they see the wrong answer.

We approach this iteratively. We build a first pass with sensible defaults, put it in front of the people who'll use it, watch what they flag as wrong, and fold their feedback into the matching rules. After three or four rounds the system converges on something that reflects how your team actually thinks about identity, not how a textbook says they should.

The hardest part isn't the matching algorithm. It's building a system that can be corrected without a rewrite when the business definition of 'same entity' shifts — which it will.

What this looks like in practice
  • Multi-stage matching pipelines (rules + embeddings + graph signals)
  • Proprietary 5-tier resolution cascade
  • Deduplication and disambiguation workflows
  • Identity stitching across documents and datasets
  • Confidence scoring and human review loops
  • Domain-agnostic — deployable across industries
04 / 04

AI Systems Architecture
& Infrastructure

Most AI failures are architectural, not model-related.

THE SYSTEMmodelswappabledatapipelinesevalsground truthretrylogichumanreviewmonitorobservabilityversionrollbackthe model is the last thing we pick

Most AI failures we've seen in production weren't model failures. They were architecture failures — systems where a model was plugged into a pipeline that had no error boundaries, no retry logic, no way to swap components out, no visibility into why things broke, and no clean separation between the model and everything around it.

When we design an AI system, the model is the last component we pick. First we decide where human review belongs, how we'll evaluate the system in production, what happens when the model is wrong, how we'll version and roll back changes, and what the failure modes are when downstream systems time out. Only then do we pick the model — and usually we pick it last because by that point the right choice is obvious.

This is unglamorous work and it's what separates AI systems that are still running in year three from the ones that got quietly replaced by a script.

What this looks like in practice
  • End-to-end pipeline design (ingestion → processing → output)
  • Model orchestration and control layers
  • Evaluation and testing frameworks
  • Data quality and feedback systems
  • Internal tooling for operators and analysts
  • Continuous accuracy monitoring and regression detection

The model you picked will be replaced in eighteen months. The pipeline around it is what decides whether your team still trusts the system in year three.

— From an internal note
Get in touch

Working on one
of these four?

If you're building in any of these areas and the project needs to actually work in production, we'd like to hear about it. We take on a small number of new engagements each year, and the best ones start with a real conversation about where the hard parts are.

Start a conversation