It's all mostly just inference, though some train LoRAs directly on quantized mo...

Gracana · on Feb 23, 2024

> and also includes prompt templates

That sounds very convenient. What software makes use of the built-in prompt template?

moffkalast · on Feb 23, 2024

Of the ones I commonly use, I've only seen it read by text-generation-webui, in the GGML days it had a long hardcoded list of known models and which templates they use so they could be auto-selected (which was often wrong), but now it just grabs it from any model directly and sets it when it's loaded.