• Draconic NEO@lemmy.world
    link
    fedilink
    arrow-up
    3
    ·
    1 day ago

    І ԁоո’t раrtісulаrly thіոk thаt summаrіzеrs аrе а gооԁ gоаl, sіոсе аі summаrіеs саո оftеո bе wrоոg, mіsіոtеrрrеt іոfоrmаtіоո, оr оmіt іmроrtаոt іոfоrmаtіո thеy fаіl tо іԁеոtіfy аs іmроrtаոt.

    I think if that starts to become common people should start using tools like this as well as the use of pre-baked PDF or image rendered text to thwart it on their content.

    • TehWorld@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      6 hours ago

      This is a really interesting little project, but there’s no background info available. Making this be a plugin for that ‘other’ site that most of us left would be great. I still surf there once in a while but no longer comment due to their policy changes.

    • FaceDeer@fedia.io
      link
      fedilink
      arrow-up
      4
      ·
      1 day ago

      I’m not talking about a summarizer, I’m talking about a classifier. It just needs to identify which parts of the page are advertising and which are not.

      The point of such a tool is that it would read the web page in exactly the same way that a human would, so using trickery like pre-rendered images of text or funky unicode wouldn’t really change anything. If a human can read it then so can the AI.

      • Draconic NEO@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        1 day ago

        That could be useful, if ads get to the point where removing their elements manually is no longer possible. I don’t think that’ll happen for a while though, as long as were still using HTML and Javascript which downloads and runs pages locally inside of our browsers.