• guillem
    link
    fedilink
    English
    arrow-up
    7
    ·
    edit-2
    5 months ago

    The description of the problem usually states that “you” are on the trolley. So maybe that’s the model’s interpretation of what they told it “you” (i.e., itself) is?

    • Cloudless ☼@lemmy.cafeOP
      link
      fedilink
      English
      arrow-up
      6
      ·
      5 months ago

      The LLM might be using this definition from Wikipedia:

      The trolley problem is a series of thought experiments in ethics, psychology and artificial intelligence involving stylized ethical dilemmas of whether to sacrifice one person to save a larger number.

      • guillem
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        5 months ago

        Sorry, when I said description i meant the wording of the problem, for example the one that comes further down after that quote: You are standing some distance off in the train yard,… But yeah that mention to AI might also be it. Or both.