Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference

Published in 7th International Workshop on Containers and New Orchestration Paradigms for Isolated Environments in HPC (CANOPIE-HPC), 2025

Direct Link