kubernetes - Laytoun' thoughts!

Building Production-Grade RAG Systems: Kubernetes, Autoscaling & LLMs

By Mohammed Aboullaite in kubernetes on 22 Nov 2025

Why Kubernetes for LLM workloads: GPU scheduling, autoscaling, and serving models like Gemma in a production-grade Java RAG system. Part 3 of the series.…

Building Production-Grade RAG Systems: Architecture Deep Dive

By Mohammed Aboullaite in Java on 16 Nov 2025

Architecture deep dive of a production RAG system in Java 25 and Spring Boot WebFlux: service boundaries, retriever design, and tradeoffs explained.…

Building Production-Grade RAG Systems: Understanding the Problem Space

By Mohammed Aboullaite in Java on 10 Nov 2025

The real production challenges of RAG systems: latency, reliability, cost, quality, and observability. Part 1 of building production-grade RAG in Java.…

Pixie, the missing developer observability tool!

By Mohammed Aboullaite in kubernetes on 28 May 2023

How Pixie brings instant, eBPF-powered observability to Kubernetes: debug services, spot bottlenecks, and profile apps without changing code.…

Skaffold, OKE & OCIR!

By Mohammed Aboullaite in Docker on 06 Mar 2020

Speed up Kubernetes development with Skaffold on Oracle Kubernetes Engine (OKE) and OCIR: automated build, push, and deploy loops for containers.…