Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference
Published in 7th International Workshop on Containers and New Orchestration Paradigms for Isolated Environments in HPC (CANOPIE-HPC), 2025