Making AI Outputs Trustworthy with Ceramic's Supervised Generation Powered by Nemotron 3 Nano

Published / Last Updated Mar 17, 2026

Today at NVIDIA GTC 2026, we're pulling back the curtain on something we've been building: Supervised Generation — a system that makes AI outputs trustworthy by grounding every response in real-time evidence, with inline citations and confidence signals.

We're live at booth #4035 all week. Here's what we're showing.

The Problem We're Solving

LLMs are great at thinking, but not at knowing. In enterprise settings, the difference matters enormously.

A hallucinated fact in a financial report, a misattributed claim in a legal brief or an unverifiable assertion in a medical summary can erode trust, create liability and undermine the value of AI at the moment it's needed most. The question isn't whether your LLM is capable. It's whether you can trust what it tells you.

What Supervised Generation Does

Ceramic's Supervised Generation doesn't replace your primary LLM — it enhances it. Acting as a trust layer that operates alongside any existing model, the system evaluates LLM outputs in real time, grounding responses in verified web evidence before they reach the user.

The experience is straightforward: submit a query, receive a response where each claim carries a clear trust signal. Claims that are successfully grounded receive a confidence score and a citation traceable to a verified source. Claims that cannot be verified are flagged inline, giving users an immediate, honest signal about what the model knows versus what it doesn't.
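To make that concrete, here is a minimal sketch of what a claim-level response could look like and how it might be rendered with inline signals. This is an illustration only, not Ceramic's actual API or response schema: the `Claim` class, its fields, the `render` helper and the example URL are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Claim:
    """Hypothetical shape for one claim in a supervised response."""
    text: str
    confidence: Optional[float] = None  # None when no supporting evidence was found
    source_url: Optional[str] = None    # citation for grounded claims

    @property
    def grounded(self) -> bool:
        return self.confidence is not None and self.source_url is not None


def render(claims: List[Claim]) -> str:
    """Render claims with numbered citations; flag ungrounded claims inline."""
    parts: List[str] = []
    sources: List[str] = []
    for claim in claims:
        if claim.grounded:
            sources.append(claim.source_url)
            parts.append(
                f"{claim.text} [{len(sources)}] (confidence {claim.confidence:.2f})"
            )
        else:
            parts.append(f"{claim.text} [unverified]")
    body = " ".join(parts)
    refs = "\n".join(f"[{i}] {url}" for i, url in enumerate(sources, 1))
    return body + ("\n" + refs if refs else "")


example = [
    Claim("Revenue grew last quarter.", 0.92, "https://example.com/report"),
    Claim("Growth will continue next year."),
]
print(render(example))
```

The point of the sketch is the contract, not the formatting: every claim either carries a score and a traceable source, or is visibly marked as unverified.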

For enterprises, Supervised Generation plugs into existing LLM infrastructure for generation without replacing it. Whether a company runs third-party or internal models, the verification and citation layer sits on top, enhancing trust without disrupting existing workflows or contracts.

Built on a Model-Agnostic Verification Layer

We built Supervised Generation to work with a range of verification models. Enterprises, startups and developers should be able to choose what fits their infrastructure, cost profile and deployment requirements without being locked into a single provider.
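One way to picture a model-agnostic layer is as a small interface that any verification model can sit behind. The sketch below is an assumption about how such a layer could be structured, not a description of Ceramic's implementation: `Verifier`, `Verdict`, `check_claims` and the toy `KeywordOverlapVerifier` are hypothetical names, and the word-overlap scoring is a deliberately naive stand-in for a real model such as Nemotron 3 Nano.

```python
from dataclasses import dataclass
from typing import List, Protocol


@dataclass
class Verdict:
    supported: bool
    confidence: float


class Verifier(Protocol):
    """Anything with this method can be swapped in as the verification model."""
    def verify(self, claim: str, evidence: List[str]) -> Verdict: ...


class KeywordOverlapVerifier:
    """Toy stand-in: scores a claim by word overlap with the best evidence passage."""

    def __init__(self, threshold: float = 0.6) -> None:
        self.threshold = threshold

    def verify(self, claim: str, evidence: List[str]) -> Verdict:
        words = set(claim.lower().split())
        if not words or not evidence:
            return Verdict(False, 0.0)
        best = max(
            len(words & set(passage.lower().split())) / len(words)
            for passage in evidence
        )
        return Verdict(best >= self.threshold, best)


def check_claims(verifier: Verifier, claims: List[str], evidence: List[str]) -> List[Verdict]:
    """Run every claim through whichever verifier the caller supplies."""
    return [verifier.verify(claim, evidence) for claim in claims]


verdicts = check_claims(
    KeywordOverlapVerifier(),
    ["the sky is blue", "cats can fly"],
    ["the sky is blue today"],
)
print(verdicts)
```

Because `check_claims` depends only on the `verify` method, a hosted model, a local model or a rule-based checker could be exchanged without touching the calling code.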

NVIDIA Nemotron 3 Nano is our featured option at GTC. Designed for efficient, high-accuracy reasoning on constrained compute, Nemotron 3 Nano evaluates each generated claim against retrieved search evidence and outputs a grounded verification signal fast enough for production use.

Ceramic's search API returns large volumes of structured evidence per query (source text, metadata and citations), meaning the verification model needs to process long context windows efficiently. Nemotron 3 Nano's hybrid mixture-of-experts architecture is built for exactly this, delivering up to 4x higher throughput than its predecessor and dropping generation cost to $0.013 per 1M tokens at 64k context. Pair that with Ceramic's search API at $0.05 per 1,000 queries, 100x below leading alternatives, and both layers are built to serve at scale together. Nemotron makes long-context reasoning affordable; Ceramic makes the information that feeds it just as affordable.
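For a rough sense of how those two prices combine, here is a back-of-the-envelope sketch. It rests on simplifying assumptions that are ours for illustration: one search call per user query, and the quoted $0.013 per 1M tokens applied to every token in a full 64k-token verification pass. Real billing may differ, and the function name is hypothetical.

```python
# Prices quoted in the post.
SEARCH_USD_PER_1K_QUERIES = 0.05
MODEL_USD_PER_1M_TOKENS = 0.013  # quoted as generation cost at 64k context


def cost_per_verified_query(tokens_per_query: int, searches_per_query: int = 1) -> float:
    """Estimate the combined search + verification cost of one query (assumed model)."""
    search = searches_per_query * SEARCH_USD_PER_1K_QUERIES / 1_000
    model = tokens_per_query * MODEL_USD_PER_1M_TOKENS / 1_000_000
    return search + model


# Assumption: one search and a full 64k-token pass per query.
per_query = cost_per_verified_query(tokens_per_query=64_000)
print(f"per query: ${per_query:.6f}")
print(f"per 1M queries: ${per_query * 1_000_000:,.2f}")
```

Under these assumptions the search call contributes $0.00005 and the verification pass about $0.00083, for a total just under a tenth of a cent per query.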

"Nemotron was built for long-context efficiency. Ceramic was built for cost-effective retrieval at scale. The two systems were made for each other," said Anna Patterson, founder and CEO of Ceramic.

Come See It at GTC — and Join the Waitlist

We're demoing Supervised Generation live at GTC 2026 in San Jose, March 16–19. Stop by booth #4035 to see real-time grounding and verification in action and speak with our team.

Can't make it to GTC? Join the waitlist at ceramic.ai for early access to Supervised Generation when it becomes available.

Ceramic.ai is a member of NVIDIA Inception, a free global program for AI startups that provides developer training and resources, preferred pricing from NVIDIA, valuable offers from NVIDIA and cloud ecosystem partners, and other benefits, such as preferred access to NVIDIA’s AI ecosystem, to help startups grow their businesses.