For those that have the need, we'll make it possible for sure! Otherwise, the models are ready for inference directly through Augento - say you’ve been working with the OpenAI Chat completion API, you'll just have to change the model string, e.g. to "augento:v2"