• dragonfly4933@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    1
    ·
    10 hours ago

    If the product costs that much to run, and most users aren’t abusing their access, it’s possible the product isn’t profitable at any price that enough users are willing to pay.

    • masterspace@lemmy.ca
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      3
      ·
      8 hours ago

      This is dumb. Moore’s law may be mostly dead, but chips are still progressing at an absurd pace. In 6 years you’ll be able to run the o1 model on a raspberry Pi with no internet access.

      • wewbull@feddit.uk
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        1 hour ago

        Nvidias latest gen looks to be 30% faster after 2 years of development with about the same power usage increase. So no reduction in Joules per GOP, just a speed increase.

        In 6 years they might go 2x the speed of today but need double the watts (to deliver the same energy in half the time).

      • dragonfly4933@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        8 hours ago

        Maybe, but i never mentioned years into the future. Of course technology will improve. The hardware will get better and more effcient, and the algorithms and techniques will improve.

        But as it stands now, i still think what i said is true. We obviously don’t have exact numbers, so i can only speculate.

        Having lots of memory is a big part of inference, so I was going to reply to you that prices of memory stopped going down at a similar historical rate, but i found this, which is interesting

        https://ourworldindata.org/grapher/historical-cost-of-computer-memory-and-storage?time=2020..latest

        The cost when down by about 0.1x from 2000 to 2010. 2010-2020 it was only about 0.23x. 2020-2023 shows roughly another halving of the price, which is still a pretty good rate.

        The available memory is still only one part. The speed of the memory and the compute connected to it also plays a big part in how these current systems work.