tau@lemmings.worldtoTechnology@lemmy.world•Local AI is one step closer through Mistral-NeMo 12BEnglish
0·
4 months agoJust beware that like AMD, Intel GPUs suffer a performance hit when using LLMs because of the CUDA specific optimizations in frameworks like llama.cpp
Do you have any tips (or examples) using quadlets? I tried using them but I couldn’t wrap my head around them.