• xmunk@sh.itjust.works
    3 months ago

    Using another AI to detect if an AI is misbehaving just sounds like the halting problem but with more steps.

    • marcos@lemmy.world
      3 months ago

      Lots of things in AI make no sense and really shouldn’t work… except that they do.

      Deep learning is one of those.

      • Natanael@slrpnk.net
        3 months ago

        As long as you can correctly model the target behavior in a sufficiently complete way, and capture all necessary context in the inputs!