Alek's Blog
Architecture diagram: LLM inference pipeline on OVH Managed Kubernetes Service

LLM Inference on OVH MKS: Terraform, Ansible, and Deployment

Provision an OVH MKS GPU node pool with Terraform, deploy vLLM, Istio, and cert-manager with Ansible, and walk through a first deployment. Part 2 of 6.

Architecture diagram: LiteLLM API gateway in front of LLM inference services

LLM Inference on OVH MKS: LiteLLM API Gateway

LiteLLM gateway on top of vLLM: per-user API keys, budget limits, and automatic fallback to commercial APIs when the local GPU node is cold. Part 6 of 6.

Architecture diagram: LLM inference pipeline on OVH Managed Kubernetes Service

LLM Inference on OVH MKS: Introduction

When to self-host an LLM on Kubernetes, why vLLM, and what the stack looks like on OVH MKS. Covers use cases, cost framing, and architecture. Part 1 of 6.

Diagram: Prometheus, Grafana, and KEDA observability stack for LLM inference

LLM Inference on OVH MKS: Prometheus, Grafana, and KEDA

Scrape vLLM and DCGM metrics with kube-prometheus-stack, visualise TTFT and tokens/s in Grafana, and autoscale to zero with KEDA. Part 4 of 6.

Architecture diagram: LLM inference pipeline on OVH Managed Kubernetes Service

LLM Inference on OVH MKS: Models, AWQ, and OpenAI API

Which models fit on a 16 GB GPU, why AWQ is required for 7B+ models on the RTX5000-28, and how to use the OpenAI-compatible API from Python. Part 3 of 6.

Diagram: application attack surface layered from untrusted input through libraries/dependencies to classical vulnerabilities and AI-era extensions like prompt injection

What is Application Security

Application security: SQL Injection, Log4Shell, OWASP Top 10, and Input Validation — attacks that affect confidentiality and integrity, not availability.
