Autonomous by default. Conversational by nature.
Forge generates the harness around any LLM — agents that do the work, send emails, post updates, modify code, run pipelines, on their own. Before any irreversible or high-stakes move, they check in with you. Like a senior operator who knows when to ask.

myagentos.ai/create — real product, real chat
Live recreation of Phase 1.5 capture
For founders, operators, and engineers who'd rather own the agent than rent one.
Three things separate a Forge agent from everything else you've tried.
The architect captures 27 fields across 5 waves before generating code. Who you are, how you decide, what makes you throw your laptop. The agent ships pre-loaded with you.
Every capability ships real code, not a stub. Six categories — core, execution, communication, data, integration, observability — wired by default. Drop your credentials in .env, the agent uses them.
The runtime is vendored — your agent has no upstream dependency on us. Run it locally, on your container, or air-gapped.
Every Forge agent starts with a 27-field structured interview. Your voice, anti-patterns, role, communication style, and standing rules all become the agent's actual config — not generic defaults.
27 structured fields → One agent shaped like you. Day-one in your voice. Knows your role. Enforces your rules.
Phase 1.5 answers flow directly into your workspace files. The agent reads these every time it boots — no fine-tuning, no embeddings, no opaque inference. Just markdown the agent and you can both edit.
USER.mdwho you are, in your wordsSOUL.mdvoice, archetype, anti-patternsSTANDING_ORDERS.mdhard rules and escalation triggersIDENTITY.mdname, emoji, catchphrasesAGENTS.mdoperational manual the agent reads# USER.md
## Identity
- Name: Dan
- Role: Run an early-stage program. Serial founder.
Runs multiple workstreams in parallel — sales, AI research,
internal tooling, advising founders. Context-switches 30+ times/day.
- Timezone: PST. Working hours 8am-7pm.
- Heartbeat: 7:30am daily, 6pm wind-down summary.
## Communication preferences
- Voice: terse
- Decision style: data + opinion. Quantify it, then recommend.
- Response length: 1-3 sentences default. Long only when warranted.
## Sharp domains (skip the basics)
- Startup operations, distribution, agentic AI architecture
- Sales process, founder psychology, LLM economics
- Rocky/RHEL infrastructure, code review (Python/TS/Rust)
## Where the agent fills in
- Calendar archaeology, email triage, meeting prep
- Follow-up tracking, saying-no drafts, end-of-day summaries
## Avoid (immediate distrust)
- "I'd be happy to help" / "Great question!"
- Corporate filler, fake enthusiasm
- Excessive disclaimers, soft-pedaling
- 5 clarifying questions when 1 would do
- Unsolicited emoji, "circle back", "touch base"Bring your own LLM. Anthropic, OpenAI, Gemini, Grok, or any OpenAI-compatible endpoint. Switch by editing one env var.
Adversarial sandbox tests, SSRF guards, env scrubber. Honest about what's hardened and what isn't. Threat model.
Download the source. Run it locally, on your server, or air-gapped. The agent boots even if myagentos.ai disappears.
Each one ships real code, tested, ready to call. You opt out of what you don't want — not in to what you do.
Full reference in the documentation.
Your agent ships with general-purpose tools — call any HTTP API, run any Python code, execute any shell command, receive any webhook. Three ways to extend, no PR to Forge required.
Tell the agent the API docs and which env var holds your token. It uses web_extract or python_exec directly. No code, no config.
Add a section to STANDING_ORDERS.md describing the API. The agent reaches for it reflexively, across sessions, like a native capability.
Drop a Python file in plugins/with a@register_tooldecorator. First-class status: TOOLS.md, typed errors, full schema.
Every integration in the world is one conversation away. How extensibility works.
The agent already knows your domain. The capabilities are already there. You build what nobody else can sell you.
Choose how it boots. The agent code, workspace, and vendored runtime are identical — only the deployment shell changes.
Fastest to start
Run directly on your machine. Best for development, daily use, dogfooding.
$python3 -m venv .venv && source .venv/bin/activate pip install -e . python -m <agent>
Production-ready
Isolated container. Best for long-running production agents on your infrastructure.
$./container/build.sh ./container/run.sh # Listens on 127.0.0.1:7891 via gateway
Air-gapped, no LLM API
Bundles a local LLM (llama.cpp / Ollama) into the image. Zero external dependencies.
$./container/start-sovereign.sh # All inference local # No internet required
Most AI agents send everything to a SaaS backend. Forge sends nothing. Every byte stays on the laptop you installed it on — by design, not by promise.
If you don't trust Forge, read the source code in your download.
Forge captures it once, in your words. Then ships you an agent that already knows.
Build your agentFree to design and download. You bring your own LLM key at runtime.
Real architecture, real numbers, real trade-offs. Three deeper reads for whoever's interested.
6+ pages on how Forge generates agents. Phase 1.5 interview, 32-capability catalog, 96-tool surface, sandbox model, self-modification mechanics. With PR citations.
Comparison tables. Line counts. PR ledger. Why Forge is a generated harness vs LangChain abstractions you assemble. For people who'd rather not spend 2-3 months on glue code.
Sales chief of staff. Investment thesis agent. Personal CTO. Operations brain. Six example agents with the concrete things they actually do for you.
Honest about what we built, honest about what we didn't. Security posture for the audited trust model.