Vllm multi gpu inference tutorial. yaml: Kubernetes service configuration.