Site Reliability Engineer with 12+ years specializing in distributed systems, cloud infrastructure, and large-scale Kubernetes environments (CKS/CKA certified). Focused on building platform tooling that drives engineering productivity and eliminates toil. Currently bridging SRE and AI by scaling MLOps pipelines and model serving infrastructure. Deep expertise in observability, performance tuning, and infrastructure automation.





An autonomous Root Cause Analysis (RCA) agent designed specifically for Kubernetes clusters. Applies LLM reasoning to system telemetry to diagnose pod failures, resource exhaustion, and network bottlenecks, reducing operational MTTR.
A centralized observability and anomaly detection platform for distributed microservices. Engineered to aggregate telemetry, system metrics, and distributed traces into actionable infrastructure insights.
The source code behind this platform. A Next.js (React) application runs a Python RAG agent pipeline that parses unstructured job descriptions and produces a statically generated frontend bundle targeted to each one.
M Bottlang, J Doornink, DC Fitzpatrick, SM Madey
M Bottlang, M Lesser, J Koerber, J Doornink, S Mueller, DC Fitzpatrick...
M Bottlang, J Doornink, TJ Lujan, DC Fitzpatrick, PV Marsh...
J Doornink, DC Fitzpatrick, SM Madey, M Bottlang
J Doornink, DC Fitzpatrick, S Boldhaus, SM Madey, M Bottlang