Do you host your own ML / AI / LLM? What do you use, and what do you use it for?

  • Domi@lemmy.secnd.me
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 hours ago

    Curious about the quant tho.

    Q8 from unsloth.

    Something like Qwen3.5-122b

    My go to model for knowledge. Definitely much faster at Q5 but it lacks the tool calling quality of the Qwen3.6 models. Really hoping we see a Qwen3.6-122b soon…