Do you host your own ML / AI / LLM? What do you use, and what do you use it for?

  • robber@lemmy.ml
    link
    fedilink
    English
    arrow-up
    1
    ·
    3 days ago

    You can control how much context should be fitted with --fit-ctx and how much space the algorithm should leave unallocated (even on a per-GPU basis) with --fit-target.