top of page

Profile

Join date: Aug 21, 2025

Posts (1)

Sep 4, 2025 ∙ 4 min

AI in Kubernetes – how to get started without expensive GPUs

Do you want to run AI models without expensive GPUs? In this guide, we show you how to run lightweight LLMs in Kubernetes locally – with CPU and ONNX Runtime. Perfect for testing, development, and prototyping, and a great starting point before scaling up in the cloud with GCP or AWS.

Jasmina Dimitrievska

Jasmina Dimitrievska

Writer

DevOps Consultant

bottom of page