The open Agent Harness

AI agents,
forged for you.

Autonomous by default. Conversational by nature.

Forge generates the harness around any LLM — agents that do the work, send emails, post updates, modify code, run pipelines, on their own. Before any irreversible or high-stakes move, they check in with you. Like a senior operator who knows when to ask.

Build your agent See how the interview works

Your keys·Your hardware·No telemetry·Runs offline

From one paragraph→a running agent that sounds like you·under 10 minutes

Forge architect mid-conversation, capturing Phase 1.5 context

myagentos.ai/create — real product, real chat

myagentos.ai/create — Phase 1.5 Context Interview

Live recreation of Phase 1.5 capture

For founders, operators, and engineers who'd rather own the agent than rent one.

Not another chatbot wrapper.

Three things separate a Forge agent from everything else you've tried.

Phase 1.5 Context Interview

The architect captures 27 fields across 5 waves before generating code. Who you are, how you decide, what makes you throw your laptop. The agent ships pre-loaded with you.

27 fieldscaptured per agent

32 capabilities, all runnable

Every capability ships real code, not a stub. Six categories — core, execution, communication, data, integration, observability — wired by default. Drop your credentials in .env, the agent uses them.

32 / 32runnable today

Every byte in your zip

The runtime is vendored — your agent has no upstream dependency on us. Run it locally, on your container, or air-gapped.

zeroexternal calls

PHASE 1.5 · THE ARCHITECT INTERVIEW

5 waves capture your context before forging the agent.

Every Forge agent starts with a 27-field structured interview. Your voice, anti-patterns, role, communication style, and standing rules all become the agent's actual config — not generic defaults.

WAVE 01

Identity

(5)

✓Name
✓Role
✓Company
✓Background
✓Domain expertise

WAVE 02

Context

(5)

✓Timezone
✓Working hours
✓Channels
✓Tools you use
✓Current workflow

WAVE 03

Objective

(5)

✓Pain points
✓Success criteria
✓What to watch
✓What to draft
✓Quiet hours

WAVE 04

Voice

(6)

✓Tone
✓Length preference
✓Banned phrases
✓Decision style
✓Anti-patterns

WAVE 05

Standing Rules

(6)

✓Drafts vs sends
✓Per-action approval
✓Boundaries
✓Daily heartbeat
✓Memory protocol

27 structured fields → One agent shaped like you. Day-one in your voice. Knows your role. Enforces your rules.

What gets generated

Your context becomes your agent's bootloader.

Phase 1.5 answers flow directly into your workspace files. The agent reads these every time it boots — no fine-tuning, no embeddings, no opaque inference. Just markdown the agent and you can both edit.

USER.mdwho you are, in your words
SOUL.mdvoice, archetype, anti-patterns
STANDING_ORDERS.mdhard rules and escalation triggers
IDENTITY.mdname, emoji, catchphrases
AGENTS.mdoperational manual the agent reads

src/agent/workspace/USER.md

Captured during Phase 1.5. Your identity, role, expertise, communication style. Read on every boot.

# USER.md

## Identity

- Name: Dan
- Role: Run an early-stage program. Serial founder.
  Runs multiple workstreams in parallel — sales, AI research,
  internal tooling, advising founders. Context-switches 30+ times/day.
- Timezone: PST. Working hours 8am-7pm.
- Heartbeat: 7:30am daily, 6pm wind-down summary.

## Communication preferences

- Voice: terse
- Decision style: data + opinion. Quantify it, then recommend.
- Response length: 1-3 sentences default. Long only when warranted.

## Sharp domains (skip the basics)

- Startup operations, distribution, agentic AI architecture
- Sales process, founder psychology, LLM economics
- Rocky/RHEL infrastructure, code review (Python/TS/Rust)

## Where the agent fills in

- Calendar archaeology, email triage, meeting prep
- Follow-up tracking, saying-no drafts, end-of-day summaries

## Avoid (immediate distrust)

- "I'd be happy to help" / "Great question!"
- Corporate filler, fake enthusiasm
- Excessive disclaimers, soft-pedaling
- 5 clarifying questions when 1 would do
- Unsolicited emoji, "circle back", "touch base"

read on every agent boot · merged into the system prompt

Zerovendor lock-in

Bring your own LLM. Anthropic, OpenAI, Gemini, Grok, or any OpenAI-compatible endpoint. Switch by editing one env var.

Auditedsecurity posture

Adversarial sandbox tests, SSRF guards, env scrubber. Honest about what's hardened and what isn't. Threat model.

Yoursforever

Download the source. Run it locally, on your server, or air-gapped. The agent boots even if myagentos.ai disappears.

32 capabilities. Wired by default.

Each one ships real code, tested, ready to call. You opt out of what you don't want — not in to what you do.

Core

·Persistent memory
·Scheduled commitments
·Heartbeat loop
·Skills
·Standing orders

Execution

·Sub-agent spawning
·Python execution
·Shell execution
·Flows
·Sandbox

Communication

·Email (IMAP)
·Email (SMTP)
·Slack
·SMS
·Webhooks
·Gateway (HTTP+WS)
·TTS

Data

·File ops
·HTTP requests
·Code modification
·Web search
·Web extract
·SQLite

Integration

·GitHub
·Google Calendar
·Notion
·Linear
·HubSpot
·ACP (multi-CLI)
·Plugins

Observability

·Conversation logging
·Metric tracking

Full reference in the documentation.

32 capabilities is the floor, not the ceiling.

Your agent ships with general-purpose tools — call any HTTP API, run any Python code, execute any shell command, receive any webhook. Three ways to extend, no PR to Forge required.

PATH 1 · ZERO EFFORT

Just ask

Tell the agent the API docs and which env var holds your token. It uses web_extract or python_exec directly. No code, no config.

"Call the Stripe API for me. Token is in STRIPE_API_KEY. Endpoint is /v1/balance."

PATH 2 · 30 SECONDS

Standing orders

Add a section to STANDING_ORDERS.md describing the API. The agent reaches for it reflexively, across sessions, like a native capability.

## Stripe integration
When asked about MRR: GET /v1/balance
Auth: bearer STRIPE_API_KEY

PATH 3 · 10 MIN

Plugins

Drop a Python file in plugins/with a@register_tooldecorator. First-class status: TOOLS.md, typed errors, full schema.

@register_tool(name="stripe_balance")
def stripe_balance(args, ctx):
return requests.get(...)

Every integration in the world is one conversation away. How extensibility works.

Built for the work that doesn't fit a SaaS box.

The agent already knows your domain. The capabilities are already there. You build what nobody else can sell you.

Personal

Chief of staff

→Catches what you drop across email, Slack, calendar
→Drafts replies — never sends without your nod
→Pre-loads meeting briefs 15 min before each call
→End-of-day "what slipped" + morning heartbeat

Personal

Inbox triage

→Reads every inbound email, scores urgency
→Tracks who is waiting on you and for how long
→Auto-drafts polite-no replies for invites
→Surfaces forgotten threads on Sunday evening

Personal

Calendar archaeologist

→"What did I commit to 3 weeks ago that I forgot?"
→Cross-references DMs against calendar events
→Flags conflicts and double-bookings
→Knows context for every meeting before it starts

Operational

Trading & monitoring

→Watches price feeds, technical indicators, news
→Runs your strategy with your risk rules in standing orders
→Surfaces setups that match your historical wins
→Never executes without explicit confirmation

Operational

Lead intelligence

→Monitors Slack, CRM, email for inbound leads
→Sub-agents run founder + company dossiers
→Scores against your ICP, your win patterns
→Drafts personalized outreach with talking points

Operational

Compliance watch

→Tails logs, monitors APIs, reads security feeds
→Cross-references against your policy in STANDING_ORDERS
→Spawns sub-agents to investigate anomalies
→Daily audit log + weekly compliance summary

Three ways to run. All sovereign.

Choose how it boots. The agent code, workspace, and vendored runtime are identical — only the deployment shell changes.

Local Python

Fastest to start

Run directly on your machine. Best for development, daily use, dogfooding.

Python 3.10+Any OSBYOK

$ python3 -m venv .venv && source .venv/bin/activate
pip install -e .
python -m <agent>

Rocky Linux container

Production-ready

Isolated container. Best for long-running production agents on your infrastructure.

Podman/DockerRocky 9Hardened

$ ./container/build.sh
./container/run.sh
# Listens on 127.0.0.1:7891 via gateway

Fully sovereign

Air-gapped, no LLM API

Bundles a local LLM (llama.cpp / Ollama) into the image. Zero external dependencies.

Local LLMZero netCompliance

$ ./container/start-sovereign.sh
# All inference local
# No internet required

PRIVACY BY ARCHITECTURE

Your data never leaves your computer.

Most AI agents send everything to a SaaS backend. Forge sends nothing. Every byte stays on the laptop you installed it on — by design, not by promise.

THE TYPICAL AI AGENT

Cloud-hosted

✗
Runs on their servers
Every prompt, every response, every action through their backend
✗
You pay per token + monthly fee
Margin on inference, lock-in to their pricing
✗
Your data trains future models
Or sits on their disk indefinitely
✗
Outage = your agent stops
No backend? No work.

A FORGE AGENT

Local-first, BYOK

✓
Runs on your machine
Python process you started. Close the terminal, it's gone.
✓
You bring your own API key
Direct to Anthropic/OpenAI/Gemini. Forge never sees a dollar.
✓
Your data stays on your disk
Memory, drafts, logs — plain markdown + SQLite. Yours forever.
✓
Works offline (with Ollama)
Air-gappable. No internet, no problem.

If you don't trust Forge, read the source code in your download.

Stop teaching every AI tool who you are.

Forge captures it once, in your words. Then ships you an agent that already knows.

Build your agent

Free to design and download. You bring your own LLM key at runtime.

The substance, not the brochure.

Real architecture, real numbers, real trade-offs. Three deeper reads for whoever's interested.

TECHNICAL WHITEPAPER

Architecture deep-dive

6+ pages on how Forge generates agents. Phase 1.5 interview, 32-capability catalog, 96-tool surface, sandbox model, self-modification mechanics. With PR citations.

Read the whitepaper →

FOR ENGINEERS

vs LangChain / DIY

Comparison tables. Line counts. PR ledger. Why Forge is a generated harness vs LangChain abstractions you assemble. For people who'd rather not spend 2-3 months on glue code.

For developers →

FOR OPERATORS

What you can build

Sales chief of staff. Investment thesis agent. Personal CTO. Operations brain. Six example agents with the concrete things they actually do for you.

For operators →

Honest about what we built, honest about what we didn't. Security posture for the audited trust model.

AI agents,forged for you.

Not another chatbot wrapper.

Phase 1.5 Context Interview

32 capabilities, all runnable

Every byte in your zip

5 waves capture your context before forging the agent.

Identity

Context

Objective

Voice

Standing Rules

Your context becomes your agent's bootloader.

32 capabilities. Wired by default.

32 capabilities is the floor, not the ceiling.

Just ask

Standing orders

Plugins

Built for the work that doesn't fit a SaaS box.

Chief of staff

Inbox triage

Calendar archaeologist

Trading & monitoring

Lead intelligence

Compliance watch

Three ways to run. All sovereign.

Local Python

Rocky Linux container

Fully sovereign

Your data never leaves your computer.

Cloud-hosted

Local-first, BYOK

Stop teaching every AI tool who you are.

The substance, not the brochure.

Architecture deep-dive

vs LangChain / DIY

What you can build

AI agents,
forged for you.