HOW AI WORKS // NOT A CHATBOT // AN INTEGRATED AGENT

We don't build chatbots. We build agents that work inside your organization.

Give an AI real access to your systems and your people, and it stops being a box you type into. It becomes an integrated agent — one that reasons over your data, acts across your tools, and works alongside your team as a first-class member of the organization, under the controls you set. Not a smarter search box. Something that takes on real work and raises the level of the whole org. This page is the field guide to how that works — and to the toolkit you build it from.

THE SHIFT

Stop picturing a smarter assistant. Picture a new member of the organization — one that reaches across your tools, brings your scattered knowledge to bear on a decision, takes the next step, and loops in your people where judgment is needed. That move, from answering to working, is the whole subject of this page.

01 // A NEW WAY TO WORK

Not a better answer. A different way to work.

The same model sits behind both. What changes is everything around it — what it can see, what it can touch, and whether it can act. That difference is the whole game.

Knowledge

CHAT BOX

Answers from only what you paste into the box — plus whatever the model happened to be trained on.

INTEGRATED AGENT

Brings your collective knowledge to the problem at once — ERP records, email, files, the expert's playbook — not just what one system happens to hold.

Reach

CHAT BOX

Talks about your work. Can't touch the tools you actually run it in.

INTEGRATED AGENT

Works across your stack — ERP, EHS, email, Excel, the line-of-business apps your teams live in all day.

Output

CHAT BOX

An answer you then go and act on yourself.

INTEGRATED AGENT

Action. It reads, decides, and takes the next step — under the controls and approvals you set.

Cadence

CHAT BOX

One turn at a time, with you in the loop on every keystroke.

INTEGRATED AGENT

A supervised loop that runs on its own where you let it, and escalates to a person where you don't.

Read your data, reason over it, take the next step — then check the result and go again. That loop, wired into your tools and your team, is the integrated agent at work.

02 // THE ANATOMY

The chat box is one wire into a much larger machine.

What you've used is an interface. What does the work is a harness — the production engineering wrapped around a reasoning loop. Hover any node to see what we cover.

FIG · CTG/N-AI-HARNESS-01 · INTERACTIVE

HOVER A NODE

THE HARNESS

A system, not a chatbot

A harness is a closed-loop working system around the model. The loop reasons and acts; the harness makes it observable, testable, and safe to run in production. Hover any node — each is something we cover and build.

1 loop · 6-part harness · 0 magic

03 // THE TOOLKIT INSIDE — EIGHT PATTERNS

Inside that infrastructure, a toolkit of eight patterns.

The harness houses them; these are what do the work. Pick by problem shape, not sophistication — each names the problem it fits, the one it doesn't, and where it breaks. A 30-line script can do what a six-agent crew was built to do; the skill is knowing which problem you have.

PATTERN 01

Scripts

The rule is writable, the inputs are stable.

The smallest unit of automation, and still the one that does the most work in most companies. A list of steps a computer follows without judgment — it runs the same way at 2 a.m. on a holiday as it does at noon on a Tuesday.

WHERE IT FITS: Repeated work with stable schemas: pulling a daily report, normalizing a feed, posting a webhook, calling an API on a cron. The first thing to try, almost always.
WHERE IT BREAKS: Brittle to schema drift, silent on edge cases. No judgment — that's the feature and the limit. If the next step needs “well, it depends,” a script alone is wrong.

PATTERN 02

Machine learning

The rule isn't writable, but you have data.

ML predicts, classifies, ranks, and scores. You reach for it when the input is messy, the rule is statistical, and you have labeled examples to learn from. The right tool when “if/else” runs out because the rule lives in the data.

WHERE IT FITS: Demand forecasting, fraud scoring, churn prediction, defect classification, anomaly detection, ranking, route optimization — a number or a label that depends on patterns too tangled to hand-code.
WHERE IT BREAKS: Needs data infrastructure, labeling discipline, monitoring, and a retraining cadence. And an LLM is the wrong tool here — a logistic regression beats a chat completion at predicting a number, every time.

PATTERN 03

Visual workflows

The bottleneck is integration breadth, not reasoning depth.

Node-based tools like n8n: a visual canvas, a thousand-plus connectors, LLM nodes built in. The graph is editable by anyone who can read it. When the work is moving things between systems, it's hard to beat for time-to-first-win.

WHERE IT FITS: Cross-system glue: lead routing, ticket dispatch, daily report assembly, file moves, notification chains, CRM hygiene. The litmus test — “take this from here, do something simple, notify someone.”
WHERE IT BREAKS: Long-running stateful work, deep agent loops, complex retries, anything that needs version-controlled tests and CI. Past a hundred nodes or shipping as a product, code-first wins.

PATTERN 04

Code-first graphs

The agent loop has to ship as a product.

LangGraph and the like: stateful, multi-agent workflows expressed as a directed graph. Each node mutates shared state, interrupts pause for human review, and the whole graph compiles, persists, streams, and replays — with first-class tracing and evaluation.

WHERE IT FITS: Production agent systems that must be reliable, testable, and ownable by an engineering team: a research agent, a support triage that routes and resolves, a document generator with a human checkpoint. Anything needing unit tests, eval suites, and a deploy pipeline.
WHERE IT BREAKS: Costs developer hours up front. No canvas, no drag-and-drop. If your team won't write code — or the work is mostly moving data between SaaS apps — the wrong tool. Visual workflows are faster.

PATTERN 05

Skills

You need to teach the model “how we do this here.”

A Skill is a folder with a playbook inside. The model loads it only when the task matches — same model, more capable on the things you teach it. Published as an open standard, so the same skill works across Claude Code, Codex, and other harnesses.

WHERE IT FITS: Encoding procedure: a brand voice, a document template, a regulatory checklist, a deployment runbook, a customer onboarding sequence. Skills compound under version control — the library gets better, the model gets better, no retraining required.
WHERE IT BREAKS: Skills are recipes, not the kitchen. If the task needs a model, a tool, or external data, the skill is only part of the system. A poorly written description means the skill never loads — the most common failure.

PATTERN 06

Skill orchestration

One identity needs to run many procedures.

A workflow of skills, run under a stable identity, is an agent with a job. The identity is the role, the principles, the voice. The skills are the procedural library. The orchestration decides which skill, in what order, under what conditions.

WHERE IT FITS: Roles, not tasks: a “lead qualifier” that always sounds like your sales team and routes to a human at the right threshold; a “release manager” that follows the same checklist every time; a “client onboarder” that runs the same skills in the same order.
WHERE IT BREAKS: Lives or dies on the quality of the identity and the skill descriptions. A vague identity drifts; vague descriptions load the wrong skill. Treat the identity as a real artifact — write it, version it, review it.

PATTERN 07

Agent harnesses

The model needs hands, not just a mouth.

A harness is everything around the model: filesystem access, code execution, a permission system, hooks, subagents with isolated context, tool integration, durable session state. Without one, an LLM is a chat box. With one, it's an operator that can read files, run tests, edit code, and hit APIs across hours of work.

WHERE IT FITS: Any work that requires real action — editing files, running tests, calling APIs across long sessions, coordinating subagents, enforcing policy on what the model may do. The harness is what lets a model finish a job rather than describe one.
WHERE IT BREAKS: Build versus adopt is the real decision. Off-the-shelf harnesses give you the ecosystem for free; building your own gives full control of the lifecycle. Most operators should adopt; some should fork; few should build from scratch.

PATTERN 08

Retrieval

The answer lives in your data, not the model.

The model knows what it was trained on; your business knows what it wasn't. Retrieval is the bridge — saved knowledge made available on demand, in the form most useful to the question. RAG, grep, semantic search, hybrid, structured queries, knowledge graphs.

WHERE IT FITS: Choose retrieval shape by question shape: grep for an exact term or identifier, vector for a conceptual match, hybrid when both matter (the common production answer), a knowledge graph when the value is in relationships.
WHERE IT BREAKS: Bad chunking, stale indexes, missing citations, no evaluation. Retrieval is the part of an AI system most likely to silently give the wrong answer — the model will happily reason on top of irrelevant context. Treat retrieval quality as its own metric.

04 // PRODUCTION SYSTEMS ARE STACKS

Real builds use four to six patterns at once.

Pure single-pattern systems are rare in production. The interesting builds plug each pattern into the next — each doing the work it does best. Selection still matters; combination multiplies the value of getting each one right.

Lead-to-pipeline

Script · ML (lead score) · Retrieval (account) · Skills (drafting) · Visual workflow (CRM)

Document automation

Skills (templates & voice) · Harness (file I/O) · Retrieval (priors) · Script (formatting)

Coding agent

Harness (filesystem & exec) · Skills (style) · Code-first graph (plans) · Retrieval (codebase)

Support triage

ML (intent) · Retrieval (KB) · Skills (responses) · Code-first graph (escalation) · Visual workflow (tickets)

Forecast + narrative

ML (forecast) · Script (chart prep) · Skills (executive voice) · Harness (delivery)

4–6 typical Patterns per production system

Rare Pure single-pattern builds

Cross-functional Who owns the stack

05 // A SHORT RUBRIC FOR PICKING

Most selections come down to a few questions, asked in order.

Answer them honestly and the right shape is usually obvious. Skip them and you'll over-build the simple half of the problem and under-build the hard half.

The wrong pattern used elaborately is worse than the right pattern used simply. Sophistication is a side effect, not a goal.

Can you write the rule in plain language?

Scripts (01). Try first, always.

Is the answer a number or label, with historical examples?

Machine learning (02).

Is the work mostly moving data between systems?

Visual workflows (03).

Does the agent loop need to ship as a product?

Code-first graph (04) on a harness (07).

Does the model need to learn “how we do this here”?

Skills (05) under an identity (06).

Does the model need to take real action — files, code, APIs?

An agent harness (07).

Does the answer live in your data, not the model?

Retrieval (08) — shape from the question shape.

Start at 01 and walk down.

Reaching for the agent crew before you've tried the script is the classic over-build.

Combine without apology.

A production system using five patterns isn't complicated — it's correct.

Buy the harness, write the skills, own the orchestration.

The harness moves fastest with community. Skills are your IP. Orchestration is your business logic.

06 // WHERE AI SITS

Three altitudes of AI — and a portfolio across all three.

Productivity wins are fast and broad; differentiating systems are slow and defensible. A good roadmap funds both — and knows which is which.

TIER / 01

Productivity AI

Tools that lift individual and team efficiency — assistants, copilots, retrieval over your documents. Fastest to adopt, broadest reach, lowest moat.

TIER / 02

Embedded AI

AI built into the tools and workflows you already run — monitoring, automation, classification quietly doing work inside existing systems.

TIER / 03

Differentiating AI

Custom systems that become a competitive advantage — proprietary agents and workflows wired to your data and your way of working. The roadmap's deep end.

07 // THE ELEVEN BUILDING BLOCKS

Each is useful but limited. Composed, they look like intelligence.

Underneath every pattern, the model does some combination of these eleven things. If a workflow maps to them, AI can do it — the trick is breaking the work into its components.

Read & extract

Pull structured info from any doc, email, form, or image.

Understand

Get the meaning, not just the words.

Write & create

Generate reports, emails, code, and summaries.

Reason

Draw conclusions, follow logic, weigh tradeoffs.

Sort & prioritize

Triage by urgency, category, or relevance.

Search & find

Find related info across thousands of documents — by meaning, not keywords.

Transform

Convert data from one format or system to another.

Play a role

Act as an auditor, writer, analyst, or any expert.

Remember

Carry preferences and context across interactions.

Plan

Break big tasks into steps and run them in order — the line between chatbot and agent.

Self-check

Review its own work, find errors, and fix them.

HOW THEY COMPOSE INTO CAPABILITIES

Automate documentation Read + Understand + Write + Transform

Find, analyze, report Search + Understand + Reason + Write

Route / triage incoming Read + Sort + Reason + Transform

Cross-system data sync Read + Transform + Plan + Self-check

Capture a retiring expert's knowledge Read + Understand + Remember + Search + Role

Continuous compliance monitoring Read + Sort + Search + Reason + Write

Find the workflows where humans act like machines. Put machines there. Let humans be humans.

ENGAGE

Now map it to your workflows.

The vocabulary is the easy part. The value is in running it against the real work your business does — which is exactly what the session is for.

Request a session Read the method