KRATOS Documentation

KRATOS is an academic Kubernetes operator project for studying application-aware GPU scheduling of CUDA workloads on heterogeneous clusters.

The current goal is to let users declare the requirements of a CUDA workload, such as GPU memory, compute capability, priority, replica count, and constraints on distributed execution. The controller can then use profiling data from previous runs to score the eligible nodes for subsequent executions.
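
To make the filter-then-score idea concrete, here is a minimal Python sketch. It is not KRATOS code: the `Workload` and `Node` types, their field names, and the scoring rule (inverse of the mean runtime observed in earlier runs) are all assumptions made for illustration, and the real controller may model requirements and profiles quite differently.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Workload:
    """Hypothetical description of a CUDA workload's requirements."""
    name: str
    gpu_memory_mib: int
    min_compute_capability: tuple[int, int]  # (major, minor), e.g. (7, 0)
    priority: int = 0
    replicas: int = 1


@dataclass
class Node:
    """Hypothetical view of a GPU node plus runtimes profiled on it."""
    name: str
    gpu_memory_mib: int
    compute_capability: tuple[int, int]
    # Assumed profile store: workload name -> runtimes in seconds.
    past_runtimes_s: dict[str, list[float]] = field(default_factory=dict)


def is_eligible(workload: Workload, node: Node) -> bool:
    """A node is eligible if it meets the hard GPU requirements."""
    return (
        node.gpu_memory_mib >= workload.gpu_memory_mib
        and node.compute_capability >= workload.min_compute_capability
    )


def score(workload: Workload, node: Node) -> float:
    """Assumed scoring rule: inverse mean past runtime; 0.0 if no profile."""
    runs = node.past_runtimes_s.get(workload.name, [])
    if not runs:
        return 0.0
    return 1.0 / (sum(runs) / len(runs))


def pick_node(workload: Workload, nodes: list[Node]) -> Optional[Node]:
    """Filter by hard requirements, then return the highest-scoring node."""
    candidates = [n for n in nodes if is_eligible(workload, n)]
    if not candidates:
        return None
    return max(candidates, key=lambda n: score(workload, n))


# Made-up example data, not measurements.
nodes = [
    Node("a", 16384, (7, 5), {"matmul": [10.0, 12.0]}),
    Node("b", 40960, (8, 0), {"matmul": [4.0, 6.0]}),
    Node("c", 8192, (8, 6), {"matmul": [1.0]}),  # too little GPU memory
]
workload = Workload("matmul", gpu_memory_mib=12000, min_compute_capability=(7, 0))

best = pick_node(workload, nodes)
print(best.name if best else "no eligible node")  # prints "b"
```

Node `c` has the fastest recorded runtime but is filtered out because it lacks the requested GPU memory, which is why the hard requirements are checked before any profile-based score is applied. Priority, replica count, and distributed constraints are left out of this sketch.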

Pages