Markell Rawls MarkellR-RedHat

AI Advocate @ Red Hat

llm-d-interactive-demo Public

Interactive demo showcasing llm-d distributed inference scaling — from single vLLM to production-grade architecture, with industry-adaptive content

TypeScript
llm-d-benchmark Public

llm-d KV-cache-aware routing benchmark: up to 47.5x faster TTFT on NVIDIA H200 GPUs

HTML
llm-d-prefix-cache-routing Public

How llm-d's prefix-cache-aware routing eliminates redundant GPU compute — interactive web app and blog companion

HTML
llm-d-agent Public

Ask llm-d — A domain-specific AI agent for llm-d. Explain, deploy, and simulate LLM inference infrastructure tailored to your industry.

TypeScript
llm-d-monitoring Public

llm-d Monitoring & Observability on Red Hat OpenShift — interactive dashboard simulator with setup guide, PromQL reference, and alert rules

HTML
llm-d-inference-routing-landscape Public

Inference Routing Landscape — interactive comparison of routing approaches with business view toggle

CSS