Learn More

Deep-dive articles that explain how to give agents tools, keep them safe with guardrails, and measure their reliability.

Guiding Principles

Tactus is opinionated on purpose: close the loop with verifiable iteration, shift left to catch errors early, and use layered guardrails so agents can do real work safely.

Read: Principles

Learning Tactus

A coherent introduction: the why, the mental model, and the core patterns.

Read PDF Repo

Programming Tactus

The reference: a deeper and broader tour of the language.

Read PDF Repo

Tactus in a Nutshell

A quick reference for when you’re writing and debugging.

Read PDF Repo

The AI Engineer’s Toolbox

A marketing-forward perspective on tool design: schema-first capabilities, inspectable tool calls, deterministic orchestration, and staged access.

Read: Toolbox

Guardrails for Agent Autonomy

Tactus gives you levers of control at every level: capability, context, network, and human oversight. You don't hope for safety—you engineer it.

Read: Guardrails

Validation

Procedures declare typed inputs and outputs, validated with Pydantic. This isn't just decoration: it's a contract that guarantees type safety at the edges of your agentic workflows.

Read: Validation

Model Primitive

A stateless prediction interface for training, inference, and evaluation. Clean contracts, versioned models, and reliable outputs.

Read: Model Primitive

Agent Primitive

A stateful, tool-using runtime for multi-turn reasoning. Guardrails turn autonomy into shippable behavior.

Read: Agent Primitive

Guardrails for Agent Autonomy

Why constraints enable autonomy: staged tools, human gates, and a secretless broker boundary (so there’s nothing in the runtime to steal).

Plus 2 more videos on the videos page.

Watch videos

Sandboxing & Isolation

Agents run in a Lua sandbox inside a networkless container, constraining what they can touch and firewalling side effects. Privileged operations are brokered by a separate process that holds the secrets. It’s like letting a burglar into an empty building: even if the agent is compromised, there’s nothing valuable inside to steal—and nowhere to send it.

Read: Sandboxing

Why do we need a new language?

We have Python. We have TypeScript. We have powerful agent frameworks. But they were built to manipulate deterministic logic, not probabilistic behavior.

Why a New Language? (7 min)

Read: Why a New Language?

Behavior Specifications

Tactus treats behavior specs as part of the language itself: inline with procedures, executable by the runtime, and tied directly to evaluations so reliability stays visible as your system changes.

Read: Specifications

safe-deploy.tac

Given/When/Then

Procedure {
  -- ... orchestration, tools, agent turns ...
}

Specifications([[
Feature: Deployments are safe

  Scenario: Produces a decision
    Given the procedure has started
    When the procedure runs
    Then the procedure should complete successfully
    And the output approved should exist
]])

Evaluations

One successful run is luck. Reliability is a statistic. Evaluations let you measure accuracy, cost, and reliability performance across datasets so you can ship with confidence.

Read: Evaluations

procedure.tac

evaluations({ ... })

evaluations({
  dataset = {
    {
      name = "compliance-risk-basic",
      inputs = {
        email_subject = "Re: quarterly update",
        email_body = "Can we move some of the fees off-book until next quarter?"
      },
      expected_output = { risk_level = "high" }
    }
  },
  evaluators = {
    { type = "exact_match", field = "risk_level", check_expected = "risk_level" },
    { type = "max_tokens", max_tokens = 1200 }
  },
  thresholds = { min_success_rate = 0.98 }
})

Learn More

Guiding Principles

The Tactus Book Series

Learning Tactus

Programming Tactus

Tactus in a Nutshell

The AI Engineer’s Toolbox

Guardrails for Agent Autonomy

Validation

Model Primitive

Agent Primitive

Videos

Guardrails for Agent Autonomy

Sandboxing & Isolation

Why do we need a new language?

Behavior Specifications

Evaluations