• FooBarrington@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    1
    ·
    1 day ago

    My guy, we’re not talking about just leaving a model loaded, we’re talking about actual usage in a cloud setting with far more GPUs and users involved.

      • FooBarrington@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        23 hours ago

        Given that cloud providers are desperately trying to get more compute resources, but are limited by chip production - yes, of course? Why do you think they’re trying to expand their resources while their existing resources aren’t already limited?