Building Production-Grade RAG Systems: Architecture Deep Dive

In the first part, we explored the production challenges of RAG systems: latency, reliability, cost, quality, and observability. Now let's get our hands dirty with the actual architecture and implementation. The codebase uses Java 25, Spring Boot 3.5.7, reactive programming with WebFlux, and follows production patterns…

A look into Deep Java Library!

When you think about building machine learning apps, Java is not the first language that comes to mind, probably not even in the top 3 or 5! But Java has proved time and again that it is capable of modernising itself, and even if it's not the first…

Pixie, the missing developer observability tool!

Needless to say how important monitoring and observability is, especially in a cloud native, distributed world! No system should got to production without having monitoring tools in place. On the other hand, the devops movement and cloud native era introduced a plethora of tools to run, deploy and monitor our…

What the CRaC ?!

If you've been following the news lately in the Java ecosystem (aside from Java 28th anniversary), you should've heard of CRaC. Two big announcements were revealed this week: * Azul announced earlier this week the general availability of and commercial support for Azul Zulu Builds of OpenJDK…

My home office setup!

Hello dear reader 👋 Let me set come context some context first before diving into how I set up my home office.  I am a software engineer, a meticulous one you can say! I am sharing my own setup because many friends asked me to do so (and I truly thank…