22 posts in categories infrastructure
What is a (D)DoS - technical
Technical (D)DoS attack vectors: Layer 3/4 floods, BGP hijacking, Layer 7 application DDoS, and operational resilience strategies to keep services available.
What is a (D)DoS
Non-technical (D)DoS overview: how denial-of-service attacks operate at the Business, Social, and Informational level, and what protection looks like at each.
(D)DoS and Application Security: The Complete Guide
Index for a three-part (D)DoS and Application Security series: business, social, and technical denial-of-service, plus SQL Injection, Log4Shell, and OWASP risks.
How I work with Claude Code
My practical workflow for using Claude Code: project-level rules, persistent memory across sessions, plan mode, and what needs scaffolding to work well.
LLM Inference on OVH MKS: Connect IDEs and Web UIs
Connect Continue.dev, Zed, Cline, Open WebUI, and ownCloud Infinite Scale to a self-hosted vLLM endpoint on OVH MKS. Per-client setup guide. Part 5 of 6.
LLM Inference on OVH MKS: Terraform, Ansible, and Deployment
Provision an OVH MKS GPU node pool with Terraform, deploy vLLM, Istio, and cert-manager with Ansible, and walk through a first deployment. Part 2 of 6.
LLM Inference on OVH MKS: LiteLLM API Gateway
LiteLLM gateway on top of vLLM: per-user API keys, budget limits, and automatic fallback to commercial APIs when the local GPU node is cold. Part 6 of 6.
LLM Inference on OVH MKS: The Complete Guide
Index and reading guide for a six-part series on self-hosting LLM inference on OVH MKS — vLLM, GPU node pools, Terraform, observability, clients, and a gateway.
LLM Inference on OVH MKS: Introduction
When to self-host an LLM on Kubernetes, why vLLM, and what the stack looks like on OVH MKS. Covers use cases, cost framing, and architecture. Part 1 of 6.
LLM Inference on OVH MKS: Prometheus, Grafana, and KEDA
Scrape vLLM and DCGM metrics with kube-prometheus-stack, visualise TTFT and tokens/s in Grafana, and autoscale to zero with KEDA. Part 4 of 6.