SearchCrucible: self-optimizing AI assistants for technical products

Why most AI assistants stagnate

It ships at its best. Then it quietly gets worse.

A RAG assistant is only as good as the day it was tuned. Your product keeps moving, and answer quality decays unless someone keeps tuning it. Most teams don’t have that person.

Chart: Typical assistant vs. SearchCrucible

Same corpus, same questions, graded weekly. Continuous optimization is the difference between a line that climbs and one that decays.

Embedding drift

You rename a product. Users search for the new name. The index still speaks last year’s language.

Stale documents

Organizations add content. They almost never remove it. The model surfaces the confident, authoritative, outdated answer.

Retrieval dilution

Every document you add competes for the same retrieval slots. Left untuned, more knowledge means worse retrieval.

Context poisoning

The model isn’t hallucinating. The answer is in the context. It just can’t find it.

How it works

Three steps to deploy. Then it runs itself.

Connect your documentation, shape a retrieval pipeline, and ship it to the edge. From there SearchCrucible tunes and watches itself.

Connect your documentation

Point SearchCrucible at your technical docs: sitemaps, Notion, Google Drive, PDFs, Backstage, Zendesk and more.

Build your retrieval pipeline

Start from an accuracy preset, or open the node editor to shape retrieval, reranking and generation yourself.

Deploy to the edge

Ship a pipeline version and it rolls out to your Rust edge worker, everywhere at once.

Pipeline	Version	Env	Status	Reqs	p50
Support KB Search	a1f9c20	prod	Live	12.4k	180ms
Product Docs RAG	7b3e0d4	prod	Live	8.1k	210ms
Legal Clause Finder	c44ab19	preview	Building	—	—

04 automatic

It improves itself

From there SearchCrucible runs the tuning loop for you, analyzing chunking and prompts and keeping what scores better. Nothing to re-tune by hand.

05 automatic

Watch every answer

Every query is traced end to end, with quality metrics over time and the gaps in your docs surfaced for you.

Continuous optimization

Your assistant gets better over time. Automatically.

SearchCrucible runs the tuning loop you’d otherwise hire for. It experiments with retrieval strategies, chunking, and prompts, grades answers against your own questions, and keeps whatever wins. Quality climbs instead of decaying.

Knowledge base: Your docs, guides & API refs

Retrieval: Finds the right passages

Answer: Grounded, with citations

Evaluation: Graded on your questions

Automatic improvement: Keeps whatever wins

↺ The cycle repeats continuously. No prompt engineer, no notebooks, no manual re-tuning.
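
The "keeps whatever wins" step can be pictured as a small selection loop. The sketch below is illustrative only: `PipelineConfig`, its fields, `keep_what_wins`, and the toy scores are hypothetical names and numbers, not SearchCrucible's actual API or results.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class PipelineConfig:
    # Hypothetical knobs, chosen for illustration only.
    chunk_size: int
    top_k: int
    prompt_version: str


def keep_what_wins(
    current: PipelineConfig,
    candidates: list[PipelineConfig],
    grade: Callable[[PipelineConfig], float],
) -> tuple[PipelineConfig, float]:
    """Grade every candidate and keep `current` unless one strictly beats it."""
    best, best_score = current, grade(current)
    for candidate in candidates:
        score = grade(candidate)
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score


# Toy grader with made-up scores, standing in for grading answers
# against your own evaluation questions.
TOY_SCORES = {"v13": 0.90, "v14": 0.92}


def toy_grade(config: PipelineConfig) -> float:
    return TOY_SCORES[config.prompt_version]


current = PipelineConfig(chunk_size=512, top_k=10, prompt_version="v13")
candidate = PipelineConfig(chunk_size=512, top_k=10, prompt_version="v14")
winner, score = keep_what_wins(current, [candidate], toy_grade)
print(winner.prompt_version, score)
```

Requiring a strict improvement means a candidate that only ties the current configuration is never promoted, so the pipeline does not churn between equivalent versions.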

Global performance

Edge-deployed by default.

Answers are generated close to your users on a globally distributed edge network, not from a single central region. You don't pick regions or run servers. The experience is fast wherever your customers are.

Within 50ms of most users

Reaches 95% of the world’s internet-connected population in under 50ms.

Hundreds of cities

The retrieval worker runs from edge locations worldwide, near where traffic starts.

Close to your users

Retrieval and generation run at the edge nearest each request. Less distance, less latency.

No region to pick

Deployed everywhere at once. You never choose or manage a region.

Consistent worldwide

Customers on every continent get the same fast, grounded answers.

Dedicated infrastructure, without the ops

Your assistant. Your infrastructure.

Your assistant runs on infrastructure provisioned for your company alone, not a shared multi-tenant pool. You get the isolation enterprises expect, and we handle the operations.

Diagram: three separate environments (foo.io, your company, bar.dev), each with its own runtime, index, and data.

Your environment runs alone. Separate compute, retrieval index, and data, with nothing shared.

Dedicated compute

Provisioned for your company alone. No shared pool, no noisy neighbours.

Isolated workloads

No shared-tenant retrieval layer. Your data and traffic never mix with anyone else’s.

Procurement-ready

A clear isolation boundary shortens the security review and eases the path through procurement.

Full retrieval tracing

No black boxes.

Most AI assistants hide what happens between question and answer. SearchCrucible shows every chunk retrieved, every score, every prompt version, and the full evaluation history. When an answer is weak, you see why.
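
The sample trace in this section labels its ranking "hybrid" with a topK of 10. The page does not say how the two rankings are combined, so the sketch below uses reciprocal rank fusion, one common way to merge a keyword ranking with a vector ranking, purely as an assumption. The function name, the document ids, and the constant `k=60` are illustrative, and the fused scores are not comparable to the scores shown in the trace.

```python
def reciprocal_rank_fusion(
    rankings: list[list[str]], top_k: int = 10, k: int = 60
) -> list[tuple[str, float]]:
    """Merge several ranked lists of document ids into one ranked list.

    Each document earns 1 / (k + rank) from every list it appears in;
    documents ranked highly by several retrievers rise to the top.
    """
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Sort by score descending, then by id so ties are deterministic.
    fused = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return fused[:top_k]


# Hypothetical rankings from a keyword retriever and a vector retriever.
keyword_ranking = ["api-keys/rotation", "migration-guide", "auth/overview"]
vector_ranking = ["api-keys/rotation", "migration-guide"]

fused = reciprocal_rank_fusion([keyword_ranking, vector_ranking], top_k=10)
for doc_id, fused_score in fused:
    print(doc_id, round(fused_score, 4))
```

Rank-based fusion avoids having to normalize keyword and vector scores onto a shared scale, which is why it is a frequent default for hybrid search.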

trace · how do I rotate an API key without downtime? · answered

Retrieved chunks · ranked hybrid · topK 10

#1 · 0.88 · used · API Keys → Rotation
"…create a second key, deploy it, then revoke the old key once traffic has fully drained…"
#2 · 0.71 · used · Migration guide
"…keys created before v3 must be migrated with `keys:migrate` before they can be rotated…"
#3 · 0.54 · dropped · Authentication → Overview
"…every request must include a bearer token in the Authorization header…"

Prompt version v14 · promoted 2h ago

# system
You answer questions about Acme only from the
provided sources. Cite each claim inline. If the
sources don't cover it, say so and link the docs.

# context
{{ retrieved_chunks }}  · 2 of 3 used

The optimizer promoted v14 after it beat v13 by 1.4 percentage points on your evaluation set. Every prompt the assistant has ever run is versioned and diffable.
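
To make "versioned and diffable" concrete, here is a minimal sketch using Python's standard `difflib`. The v14 text is the system prompt shown above; the v13 text is invented for illustration, since the page does not show it, and `diff_prompts` is a hypothetical helper rather than part of the product.

```python
import difflib


def diff_prompts(old: str, new: str, old_name: str, new_name: str) -> str:
    """Return a unified diff between two prompt versions."""
    lines = difflib.unified_diff(
        old.splitlines(),
        new.splitlines(),
        fromfile=old_name,
        tofile=new_name,
        lineterm="",
    )
    return "\n".join(lines)


# Hypothetical earlier version: the real v13 is not shown on this page.
PROMPT_V13 = (
    "You answer questions about Acme only from the\n"
    "provided sources. Cite each claim inline."
)

# The v14 system prompt as shown in the trace.
PROMPT_V14 = (
    "You answer questions about Acme only from the\n"
    "provided sources. Cite each claim inline. If the\n"
    "sources don't cover it, say so and link the docs."
)

prompt_diff = diff_prompts(PROMPT_V13, PROMPT_V14, "v13", "v14")
print(prompt_diff)
```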

Evaluation history · answer quality · weekly

Run	When	Score	Change
#142	today	96.0%	+0.8
#141	2 days ago	95.2%	+0.5
#140	4 days ago	94.7%	+1.1
#139	6 days ago	93.6%	+0.4
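
Each change figure in the history above is the gap, in percentage points, between a run and the one before it. A short sketch of that arithmetic, using the scores listed above (the helper name `run_deltas` is illustrative):

```python
# Scores copied from the evaluation history above, keyed by run number.
scores = {139: 93.6, 140: 94.7, 141: 95.2, 142: 96.0}


def run_deltas(scores: dict[int, float]) -> dict[int, float]:
    """Return each run's change in percentage points versus the previous run."""
    runs = sorted(scores)
    return {
        later: round(scores[later] - scores[earlier], 1)
        for earlier, later in zip(runs, runs[1:])
    }


deltas = run_deltas(scores)
print(deltas)
```

Run #139 has no entry in the result because its predecessor is not listed in the history.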

Deploy an assistant that improves itself.

SearchCrucible is still in development. Leave your email and we’ll reach out when early access opens.