As engineering velocity multiplies with AI, the bottleneck shifts to every team that can't write code — compliance, marketing, operations, support. The expertise to unblock them already exists inside the organization. It's just locked in a few people's heads. Off-the-shelf tools can't help because the value isn't generic automation — it's company-specific judgment.
Gravity is a platform for encoding that expertise into agents and distributing it to everyone who needs it. Inspired by teams like OpenAI's Leverage Engineering, the approach is amplification, not automation: sit with the best operator in a function, learn how they think, and build an agent that makes their judgment available to the whole org.
The platform is designed so a small central team can service an entire organization. Every agent shares the same primitives, so each one ships faster than the last and the marginal cost of maintaining one more agent goes down, not up. Experts author and refine skills directly — no eng cycle required — and agents improve over time through usage and feedback.
In Gravity, an agent is a composition, not a monolith:
Agent = Capabilities + Surfaces + Triggers + Executor + Memory
With one expansion:
Capabilities = Skills + Tools + Resources
- Skills: reusable operating playbooks and judgment patterns.
- Tools: actions the agent is allowed to execute.
- Resources: docs, data, and systems the agent can load into context.
- Surfaces: where the agent appears and communicates (for example, Slack listeners and delivery routes).
- Triggers: when the agent runs (for example, slash commands, mentions, thread replies, DMs, cron, heartbeat).
- Executor: how tool calls run at runtime (host or sandbox, per agent).
- Memory: what persists across runs so behavior compounds over time.
If you ask "where does this agent run?", map that to surfaces; if you ask "when does it run?", map that to triggers.
This model keeps new agents compositional and predictable: you assemble known parts instead of introducing bespoke runtime behavior each time.
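The composition above can be sketched as a TypeScript shape. This is a minimal illustrative sketch, not Gravity's real API: every type name and field here (`AgentDefinition`, `Skill`, `Tool`, `Resource`, and the string encodings for surfaces and triggers) is an assumption made for the example.

```ts
// Hypothetical sketch of the composition model. All names are
// illustrative assumptions, not the platform's actual types.
type Skill = { id: string; playbook: string };          // reusable judgment patterns
type Tool = { id: string; execute: (input: unknown) => Promise<unknown> }; // allowed actions
type Resource = { id: string; uri: string };            // docs/data loadable into context

interface AgentDefinition {
  capabilities: { skills: Skill[]; tools: Tool[]; resources: Resource[] };
  surfaces: string[];               // where it appears, e.g. "slack:#ops"
  triggers: string[];               // when it runs, e.g. "mention", "cron:0 9 * * *"
  executor: "host" | "sandbox";     // how tool calls run at runtime
  memory: Record<string, unknown>;  // what persists across runs
}

const example: AgentDefinition = {
  capabilities: { skills: [], tools: [], resources: [] },
  surfaces: ["slack:#ops"],
  triggers: ["mention", "cron:0 9 * * *"],
  executor: "sandbox",
  memory: {},
};
```

Keeping every agent inside one shape like this is what makes the "assemble known parts" claim concrete: a new agent is a new value of the same type, not new runtime behavior.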
The platform keeps boundaries explicit:
- runtime orchestration handles routing, scheduling, and agent execution flow,
- Postgres stores queryable, auditable operational state,
- `store/` keeps durable versioned knowledge (skills, resources, memory),
- `workspace/` holds ephemeral per-session runtime artifacts.
This split keeps the system observable and replaceable as the platform grows.
To assess an agent's security risk, use Simon Willison's "lethal trifecta" framing:
- access to private data,
- exposure to untrusted content,
- ability to externally communicate (exfiltration channel).
Reference: https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/
If an agent combines all three, treat it as high risk and sandbox by default.
- Choose `runtime: "sandbox"` when an agent can touch private systems/data and may process attacker-controlled or user-submitted content, especially if it can post/send/call out externally.
- Keep `runtime: "host"` for low-risk internal agents where data is non-sensitive, content is trusted, and outbound actions are tightly constrained.
- Before production rollout, move any agent that approaches the full trifecta into sandboxed execution plus a stricter tool/resource policy.
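The policy above can be expressed as a small helper. This is a hedged sketch, not part of Gravity: `RiskProfile` and `defaultRuntime` are hypothetical names, and the rule encoded is the one stated above (private data meeting untrusted content forces the sandbox; an external channel completes the full trifecta).

```ts
// Illustrative helper (assumed, not a real Gravity API) applying the
// lethal-trifecta framing to pick a default runtime.
interface RiskProfile {
  accessesPrivateData: boolean;      // trifecta leg 1
  processesUntrustedContent: boolean; // trifecta leg 2
  canCommunicateExternally: boolean; // trifecta leg 3 (exfiltration channel)
}

function defaultRuntime(risk: RiskProfile): "host" | "sandbox" {
  // Sandbox whenever private data meets untrusted content; an external
  // communication channel completes the full trifecta (highest risk).
  return risk.accessesPrivateData && risk.processesUntrustedContent
    ? "sandbox"
    : "host";
}
```

A check like this makes the sandbox decision auditable at review time instead of relying on each author's intuition.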
- Runtime selection is per agent in `defineAgent(...)` via `runtime: "host" | "sandbox"`.
- Sandbox does not need a global "on" switch to be usable per agent. It is available by default and selected per agent declaration.
- Emergency force-host mode remains global: `GRAVITY_SANDBOX_FORCE_HOST=true` makes sandbox-declared runs fail closed and be denied (it does not silently downgrade them to host execution).
- The current sandbox implementation uses Anthropic SRT wrapping for bash command execution paths.
- The execution boundary (`ExecutorManager`) is intentionally structured so it can evolve into stronger isolation profiles (for example, Docker/container dispatch) without changing agent contracts.
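The fail-closed behavior described above can be sketched as follows. The function name `resolveRuntime` and the explicit `env` parameter are assumptions for illustration; only the `GRAVITY_SANDBOX_FORCE_HOST` variable name comes from the doc.

```ts
// Hedged sketch of the emergency force-host switch: a sandbox-declared
// run is refused outright rather than silently downgraded to host.
function resolveRuntime(
  declared: "host" | "sandbox",
  env: Record<string, string | undefined>
): "host" | "sandbox" {
  if (env.GRAVITY_SANDBOX_FORCE_HOST === "true" && declared === "sandbox") {
    // Fail closed: deny the run instead of executing it on the host.
    throw new Error(
      "sandbox runtime unavailable: GRAVITY_SANDBOX_FORCE_HOST is set"
    );
  }
  return declared;
}
```

Failing closed here is the safer default: a silent downgrade would quietly strip the isolation that the agent's author explicitly declared it needs.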
Example:

```ts
export const analystAgent = defineAgent({
  id: "analyst",
  name: "Analyst",
  model: "claude-sonnet-4-5-20250929",
  runtime: "sandbox", // opt-in per agent
  // ...
});
```

Current scope is platform-first:
- shared foundations for multi-agent development,
- reusable primitives across domains,
- clear contracts for growth without agent-by-agent rewrites.
Planned next:
- Phoenix integrations for eval tracing, comparisons, and regression review
- central management views for cross-agent performance and operational health
- scheduled "sleep-window" compute to review per-agent session quality and surface follow-ups
- stronger sandbox profiles (including containerized execution modes) on top of current executor boundary
- deeper permissioning for agent actions, self-authoring controls, and team-scoped authority
- rollout strategies from limited cohorts to broad release, with explicit promotion gates
- Postgres-backed job queueing for durable execution, retries, and predictable throughput
- concurrency controls that keep multi-agent workloads stable as volume grows
```sh
npm install
npm run db:up
npm run db:apply
npm run dev
```

```sh
npm run check
```