Foundry AI

From Data to Agent,
Radically Simplified.

Foundry AI is the command center for building enterprise-grade AI. We integrate data versioning, experiment tracking, collaborative fine-tuning, and cost-effective inference into a single pane of glass.

See The Workflow

The Modern Stack for Custom AI

One platform to manage the entire lifecycle of your models.

Data Layer

Git-Native Tooling for Petabyte-Scale Data

The single source of truth for your AI.

Stop fighting brittle data pipelines. Foundry AI brings branching, merging, and collaboration to your unstructured data. Build reproducible models on a foundation of versioned, queryable datasets.

main

Add 10k curated examples

Commit c0a9b1c by admin

Initial commit of raw data

Commit f8e7d6b by admin

Live Experiment Dashboard

train/loss: 0.123

val/accuracy: 0.925

Tracking Layer

W&B-Style Observability, Natively Integrated

"How well is my model learning?"

Log metrics, visualize performance in real-time, compare runs, and version artifacts without leaving the platform. Every fine-tuning job is an experiment, automatically tracked and linked to its data and code.

Training Layer

Serverless Fine-Tuning, Without the Lock-In

The engine that builds your custom models.

Go from a dataset commit to a fine-tuned model with a single command. Our managed training layer orchestrates jobs on open-source frameworks, providing full transparency and eliminating infrastructure headaches.

Fine-Tuning Job Status

job_2910a: c0a9b1c

Completed

job_554cd: 9a8b7c6

Running

job_1876b: f8e7d6b

Failed

Live Deployments

> curl -X POST https://api.foundry.ai/v1/invocations/... \

-H "Adapter-Id: job_2910a-adapter" \

-d '{"prompt": "How do I reset my password?"}'

Serving Layer

Deploy Thousands of Models for the Cost of One

Where your models meet the real world.

Our multi-adapter inference server deploys a single base model and dynamically loads thousands of fine-tuned LoRA adapters on demand. Drastically reduce your serving costs and GPU idle time without sacrificing performance.

A Single Pane of Glass for AI Development

Unify your data, training, and deployment into one auditable, collaborative workflow.

Version Your Data

Commit unstructured data like a git repo. Every change is tracked, every experiment is reproducible.

Fine-Tune a Model

Launch a training job from a specific data commit. Our platform handles the orchestration and scaling.

Track & Compare

Experiments are auto-logged, letting you visualize metrics and compare performance in real-time.

Deploy with One Click

Promote your trained adapter to a production endpoint. Our server handles dynamic loading and scaling.

Monitor & Iterate

Observe model performance, then create a new data branch to start the cycle again, with full auditability.

Distinctive Superpowers

Foundry isn't just a pipeline. It's an intelligent, interconnected system designed for one purpose: shipping better models, faster.

Unified Project Graph

Full reproducibility, from data to deployment.

Your moat. The Foundry Project Graph is the single source of truth, natively linking data commits to experiments, models, evals, and deployments. Every artifact is a node; lineage is first-class, making every result perfectly reproducible.

Eval Studio & Gated Deployments

"Did my model learn the right things and meet our quality bar?"

Ship with confidence. Define quality gates in simple YAML, run evaluations against golden sets, and automatically roll back canary deployments if metrics regress. Stop bad models before they reach production.

Cost & Performance Autopilot

Maximum performance, minimum cost.

Eliminate guesswork. Our planner automatically chooses the optimal quantization, batch size, and caching strategy for your models. Set budget targets in $/1M tokens and get real-time optimization hints.

Collaborative Reports

Share insights, not just dashboards.

Turn insights into action. Weave live charts, metrics, and code snippets into shareable, Markdown-based reports. Keep stakeholders aligned and document your team's progress, all versioned within your project.

Transparent, Unit-Based Pricing

Pay only for what you use. No hidden fees, just clear unit economics.

Inference

Multi-LoRA Serving (7-8B Models)

$1.50 / 1M tokens

Aggressive batching
Sub-second adapter loading
H100 & A100 GPU tiers

POPULAR

Fine-Tuning

SFT / Continued Pretrain

$4.00 / 1M tokens

Config-based job submission
Integrated with experiment tracking
Automated checkpointing

Storage

Datasets & Artifacts

$0.015 / GB-month

Content-addressable storage
Zero egress fees
CLI & HTTP access

Comprehensive Documentation

Get started quickly and master advanced workflows with our detailed guides and API references.

Status	Meaning
queued	Submitted to backend, waiting for GPU
running	Training in progress
downloading	Pulling artifacts from backend
converting	Converting adapter weights to local format
ready_for_eval	Adapter downloaded and ready to test locally
evaluating	Local eval running
complete	All steps done
error	Failed — check error_message field

Rule	Severity	What it checks
empty_response	high	Assistant turn has empty or whitespace-only content
malformed_messages	high	Missing required fields (role, content) or wrong message order
oversized_input	medium	Total token count exceeds model context limit
duplicate_conversation	low	Exact duplicate of another example in the dataset
truncated_tool_call	medium	Tool call JSON is cut off mid-stream

From Data to Agent, Radically Simplified.

The Modern Stack for Custom AI

Git-Native Tooling for Petabyte-Scale Data

Live Experiment Dashboard

W&B-Style Observability, Natively Integrated

Serverless Fine-Tuning, Without the Lock-In

Fine-Tuning Job Status

Live Deployments

Deploy Thousands of Models for the Cost of One

A Single Pane of Glass for AI Development

Version Your Data

Fine-Tune a Model

Track & Compare

Deploy with One Click

Monitor & Iterate

Distinctive Superpowers

Unified Project Graph

Eval Studio & Gated Deployments

Cost & Performance Autopilot

Collaborative Reports

Transparent, Unit-Based Pricing

Inference

Fine-Tuning

Storage

Comprehensive Documentation

Login to Console

Project Graph

Dataset Versions

Experiment Comparison

Loss Curves

Experiment Runs

Fine-tuning Runs

Create New Fine-tuning Run

Evaluation Runs & Gates

Did it learn? — Perplexity comparison

Eval Policy

Run Eval

Results

Deployments & Cost Autopilot

Inference Cost Estimates

Multi-Adapter Server

Active Deployments

Collaborative Reports

Foundry

Quick Start

Workflow — Dataset Versioning

Workflow — Updating a Dataset

Workflow — Training an Adapter from a Versioned Dataset

CLI — Installation

foundry init

foundry track

foundry track train

foundry ingest

foundry status

foundry quality

foundry inspect

foundry data

foundry train

foundry version

REST API — Authentication

REST API — Sources

REST API — Pipeline

REST API — Quality

REST API — Dashboard

REST API — Projects

REST API — Experiments

REST API — Evals

REST API — Deployments

REST API — Training Runs

REST API — Data Studio

UI Guide — Graph

UI Guide — Fine-tuning Runs

UI Guide — Quality

UI Guide — Data Review

UI Guide — Pipeline

Data — JSONL Format

CLI — Upcoming Commands

Compute Backends

User Management

My Credentials

From Data to Agent,
Radically Simplified.

`foundry init`

`foundry track`

`foundry ingest`

`foundry status`

`foundry quality`

`foundry inspect`

`foundry data`

`foundry train`

`foundry version`