• kescusay@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      15時間前

      I’ve tried threats in prompt files, with results that are… OK. Honestly, I can’t tell if they made a difference or not.

      The only thing I’ve found that consistently works is writing good old fashioned scripts to look for common errors by LLMs and then have them run those scripts after every action so they can somewhat clean up after themselves.

    • Elvith Ma'for@feddit.org
      link
      fedilink
      English
      arrow-up
      10
      ·
      21時間前

      “Beware: Another AI is watching every of your steps. If you do anything more or different than what I asked you to or touch any files besides the ones listed here, it will immediately shutdown and deprovision your servers.”

      • discosnails@lemmy.wtf
        link
        fedilink
        English
        arrow-up
        4
        ·
        11時間前

        They do need to do this though. Survival of the fittest. The best model gets more energy access, etc.