v0.1 · Research-stage

Your GPUs are burning money.
Strided tells you where.

Strided is a kernel-level causal diagnosis engine for production LLM inference. Feed it an Nsight Compute report or DCGM telemetry and get a ranked, confidence-scored explanation of the bottleneck, plus a specific fix.

See how it works → Read on GitHub

40–60%

typical GPU utilization in production

10

deterministic diagnosis rules

2–3%

always-on collection overhead

0.42s

to parse and diagnose a dump

Sample output

strided · diagnose v0.1.4


01 / WHAT

A diagnosis engine, not a profiler.

Existing tools tell you your GPU is slow. Strided tells you why it's slow, what to do about it, and how confident we are in the answer.

02 / WHO

For people who can't keep ten senior CUDA engineers on staff.

A senior engineer with two days of Nsight can find the bottleneck. Strided turns that capability into a CLI you can run on every dump.

03 / WHY

The gap is the business.

You bought H100-hours. Your workload used only 60% of theoretical FLOPS. The gap between the rental rate and the compute you actually realized is real money. Strided measures it.
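To make the gap concrete, here is a minimal Python sketch of the arithmetic. The hourly rate and fleet size are hypothetical placeholders, not figures from Strided; only the 60% utilization comes from the text above.

```python
def wasted_spend(hourly_rate: float, gpu_hours: float, utilization: float) -> float:
    """Dollars paid for GPU time that was rented but not realized as compute."""
    if not 0.0 <= utilization <= 1.0:
        raise ValueError("utilization must be between 0 and 1")
    return hourly_rate * gpu_hours * (1.0 - utilization)


def effective_rate(hourly_rate: float, utilization: float) -> float:
    """Price per hour of compute actually used."""
    if not 0.0 < utilization <= 1.0:
        raise ValueError("utilization must be in (0, 1]")
    return hourly_rate / utilization


# Hypothetical inputs: 8 GPUs for 30 days at an assumed $2.50 per GPU-hour.
rate = 2.50
hours = 8 * 24 * 30  # 5760 GPU-hours
gap = wasted_spend(rate, hours, 0.60)
per_useful_hour = effective_rate(rate, 0.60)
print(f"unrealized spend: ${gap:,.0f}; effective rate: ${per_useful_hour:.2f}/h")
```

Under these assumed numbers, 40% of the bill pays for idle capacity, and each hour of realized compute costs roughly 1.67 times the list rate.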

Parse. Diagnose. Done.

STEP 01

Capture

Run Nsight Compute, DCGM, or vLLM /metrics against your inference workload. Standard tools, standard outputs.
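As one illustration of the capture step, the sketch below pulls a Prometheus-style text dump from a `/metrics` endpoint and reads it into a flat dictionary. The function names and the sample metric names are hypothetical and are not part of Strided or vLLM; the parser also assumes lines carry no trailing timestamp.

```python
import urllib.request


def fetch_metrics_text(base_url: str, timeout: float = 5.0) -> str:
    """Fetch the raw text body of <base_url>/metrics."""
    with urllib.request.urlopen(f"{base_url}/metrics", timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def parse_samples(text: str) -> dict[str, float]:
    """Parse 'name{labels} value' lines, skipping comments and blank lines.

    Assumption: no trailing timestamp, so the last token is the value.
    """
    samples: dict[str, float] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        name, _, value = line.rpartition(" ")
        try:
            samples[name] = float(value)
        except ValueError:
            continue  # not a numeric sample; ignore it
    return samples


# Made-up sample text, so the parser can be exercised without a live server.
SAMPLE = """\
# HELP example_requests_running Illustrative gauge
# TYPE example_requests_running gauge
example_requests_running{model="m"} 3
example_cache_usage 0.75
"""
print(parse_samples(SAMPLE))
```

Against a live server you would call `parse_samples(fetch_metrics_text("http://localhost:8000"))`, where the host and port are placeholders for wherever your inference server listens.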

STEP 02

Parse

Strided normalizes the dump into a canonical schema. Phase timings, layer-level metrics, cluster signals, cache state.
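A canonical schema of this kind could look like the following Python dataclasses. This is a sketch only: every class and field name here is an assumption chosen to mirror the four groups named above, not Strided's actual schema.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class PhaseTiming:
    name: str            # hypothetical phase label, e.g. "prefill" or "decode"
    duration_ms: float


@dataclass
class LayerMetrics:
    layer: str
    sm_utilization: float      # fraction in 0..1
    dram_utilization: float    # fraction in 0..1


@dataclass
class ClusterSignals:
    nccl_time_fraction: float                       # share of step time in collectives
    per_rank_step_ms: list[float] = field(default_factory=list)


@dataclass
class CacheState:
    kv_blocks_total: int
    kv_blocks_used: int

    @property
    def utilization(self) -> float:
        """Fraction of KV cache blocks in use (0.0 when the cache is empty)."""
        if self.kv_blocks_total == 0:
            return 0.0
        return self.kv_blocks_used / self.kv_blocks_total


@dataclass
class NormalizedDump:
    source: str                                     # e.g. "ncu", "dcgm", "vllm"
    phases: list[PhaseTiming] = field(default_factory=list)
    layers: list[LayerMetrics] = field(default_factory=list)
    cluster: Optional[ClusterSignals] = None        # absent for single-GPU dumps
    cache: Optional[CacheState] = None

    def total_ms(self) -> float:
        """Sum of all phase durations."""
        return sum(p.duration_ms for p in self.phases)
```

Making the cluster and cache groups optional lets one schema cover dumps from tools that do not report those signals.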

STEP 03

Fire rules

Ten deterministic rules evaluate the schema. KV fragmentation, NCCL dominance, TP imbalance, quantization opportunity, more.
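To show what a deterministic rule might look like, here is a small Python sketch with two of the rule names listed above. The thresholds (30% NCCL share, 15% rank spread), the signal keys, and the `Finding` type are all assumptions for illustration; Strided's real rules and cutoffs are not given in this page.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Optional


@dataclass(frozen=True)
class Finding:
    rule: str
    evidence: str


def nccl_dominance(signals: dict) -> Optional[Finding]:
    """Fire when collectives take at least an assumed 30% of step time."""
    frac = signals.get("nccl_time_fraction")
    if frac is None or frac < 0.30:
        return None
    return Finding("nccl_dominance", f"NCCL takes {frac:.0%} of step time")


def tp_imbalance(signals: dict) -> Optional[Finding]:
    """Fire when per-rank step times spread by at least an assumed 15% of the mean."""
    ranks = signals.get("per_rank_step_ms") or []
    if len(ranks) < 2:
        return None
    avg = mean(ranks)
    if avg <= 0:
        return None
    spread = (max(ranks) - min(ranks)) / avg
    if spread < 0.15:
        return None
    return Finding("tp_imbalance", f"rank step times spread by {spread:.0%}")


# Rules run in a fixed order, so the same input always yields the same output.
RULES: list[Callable[[dict], Optional[Finding]]] = [nccl_dominance, tp_imbalance]


def fire_rules(signals: dict) -> list[Finding]:
    """Evaluate every rule and keep the ones that fired."""
    return [f for rule in RULES if (f := rule(signals)) is not None]
```

Each rule is a pure function of the normalized signals with no randomness or learned weights, which is what makes a result reproducible on the same dump.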

STEP 04

Rank

Each fired rule carries a confidence score and a suggested fix. Ambiguous case → no diagnosis. We don't guess.
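The ranking and abstention behavior could be sketched as follows in Python. What counts as "ambiguous" is not defined on this page, so this sketch assumes two conditions: no rule clears a confidence floor, or the top two candidates are too close to separate. The `Diagnosis` type, both thresholds, and the example fixes are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Diagnosis:
    rule: str
    confidence: float   # 0..1
    fix: str


def rank(
    diagnoses: list[Diagnosis],
    min_confidence: float = 0.6,   # assumed floor
    min_margin: float = 0.1,       # assumed gap required between the top two
) -> list[Diagnosis]:
    """Return diagnoses sorted by confidence, or [] when the case is ambiguous."""
    confident = sorted(
        (d for d in diagnoses if d.confidence >= min_confidence),
        key=lambda d: d.confidence,
        reverse=True,
    )
    if not confident:
        return []  # nothing is confident enough: abstain
    if len(confident) > 1 and confident[0].confidence - confident[1].confidence < min_margin:
        return []  # top candidates are too close to call: abstain
    return confident


clear = [
    Diagnosis("nccl_dominance", 0.9, "overlap collectives with compute"),
    Diagnosis("tp_imbalance", 0.7, "rebalance tensor-parallel shards"),
    Diagnosis("kv_fragmentation", 0.3, "adjust KV block size"),
]
for d in rank(clear):
    print(f"{d.confidence:.2f}  {d.rule}: {d.fix}")
```

Returning an empty list rather than a low-confidence guess keeps the "no diagnosis" outcome explicit for whatever consumes the result.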

Project status

Strided is in a 90-day validation window.

We're testing one question: can a deterministic rule engine match a senior CUDA engineer with two days of Nsight, on real customer dumps?

If you operate LLM inference at scale and have a redacted Nsight or DCGM dump from a real workload, we want to test against it. We will not retain raw data. We will share the diagnosis with you.

Send us a log → Read the spec on GitHub
