“Suno’s training data includes essentially all music files of reasonable quality that are accessible on the open internet.”

“Rather than trying to argue that Suno was not trained on copyrighted songs, the company is instead making a Fair Use argument to say that the law should allow for AI training on copyrighted works without permission or compensation.”

Archived (also bypass paywall): https://archive.ph/ivTGs

  • mikwee@thebrainbin.org
    link
    fedilink
    arrow-up
    52
    arrow-down
    31
    ·
    3 months ago

    If you post stuff online for people to view it, don’t be surprised when computers see it too.

    • stiephelando@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      39
      arrow-down
      4
      ·
      3 months ago

      Posting doesn’t mean anyone can exploit it for commercial gain. Also a lot of music is pirated against the Creator’s wishes.

    • diffusive@lemmy.world
      link
      fedilink
      English
      arrow-up
      32
      arrow-down
      7
      ·
      3 months ago

      Copyright doesn’t work like that.

      The fact that you put online (e.g. in an online shop or, heck, even for free on your website) doesn’t imply anyone can use it for anything they please.

      For example an mp3 on a indie musician website for making people know their music, doesn’t mean people can start making CD out of it and selling them

      You may say that piracy exists but it is illegal and AI training is pretty much for profit piracy (using something outside the intended scope defined by the author for profit)

      • ɔiƚoxɘup@infosec.pub
        link
        fedilink
        English
        arrow-up
        9
        arrow-down
        2
        ·
        3 months ago

        I think that the argument is that it’s not actually copying it and instead learning from it.

        I feel like they really have to pay something. Maybe their argument is right and they shouldn’t pay full price given that if a human did the same thing to learn to make music, they also wouldn’t be billed in that way, given it’s openly available.

        Maybe they are required to give a portion of profits back to the source materials creators?

        The purpose of the copyright system is that someone should not be allowed to take your material and sell it to someone else passing it off as your own and make the profit thereby stealing the profit from you. I don’t think that’s what’s happening here but also at the same time it doesn’t seem like what they’re doing is right because they’re making profit off of somebody else’s work without paying anything.

        It feels to me like the answer should be somewhere in the middle.

        I certainly don’t have the answers though.

        • leftzero@lemmynsfw.com
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          3 months ago

          the argument is that it’s not actually copying it and instead learning from it.

          It’s not learning, though.

          Saying that these models have learnt the data they’ve been built with is akin to saying a zip file has learnt the data it’s storing.

          It’s not AI or anything remotely resembling AI, it’s just a new form of storage that’s very good at classifying the data that’s been stored in it and retrieving stored data that shares similar classification properties.

          Its only difference from straight up copying are the classifications it builds. Whether that’s transformative enough to count as fair use, though, is open to debate.

          If you make a software that takes random pictures off the web and randomly makes collages out of them, would it be fair use? Would those collages be copyrightable?

          Whatever you answer should be your same answer for these models (I believe current copyright law would say that the software is copyrightable, but its works aren’t, and don’t fall under fair use, but I might be wrong).

          • ɔiƚoxɘup@infosec.pub
            link
            fedilink
            English
            arrow-up
            1
            ·
            3 months ago

            A collage would be fair use, yes I believe that’s already been established but again I’m not like a copyright lawyer or anything like that. I’ll leave that for others to research and prove in the courtroom.

            I would say that your first example, as it file is not really accurate but the collage may be more so. It’s a statistical model that calculates statistically what a thing should be. Is it learning? Maybe maybe not.

            • leftzero@lemmynsfw.com
              link
              fedilink
              English
              arrow-up
              1
              ·
              3 months ago

              A collage would be fair use

              A collage made by a human, sure. One made automatically by a piece of software, though, I’m not so sure.

      • GBU_28@lemm.ee
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        2
        ·
        3 months ago

        Sure, this company will burn for this, but Pandora’s box is wide open now.

        I’m not condoning anything, but the original comment is unfortunately 100% true.

      • ArmokGoB@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        7
        arrow-down
        2
        ·
        3 months ago

        UMG can go to hell. Musicians already make almost next to nothing from song sales because the labels take so much.

        • The Quuuuuill@slrpnk.net
          link
          fedilink
          English
          arrow-up
          1
          ·
          3 months ago

          Yeah but the fucking labels aren’t even being that resistant to AI. AI is robbing the labels a little, and the artists a lot. It’s like how in 1984 george orwell points out the proletariat and the middle class should be allies, but the middle class always installs itself as the upper class and leaves the proletariat behind. The labels are willing to ditch the artists so long as they get to stay in one of the upper two classes. So they’re willing to sell access to their back catalog and their lost masters to the AI companies so long as that pays them a little bit better than paying out for new artists to join the label. They’re the fucking overseers or the small whites in Haiti. They hurt the people they should be working with because they’re too short sighted to see what’s gonna happen once everything shakes out. And you’re over here rooting the AI robbery on because it will hurt the labels instead of realizing who it’s REALLY gonna hurt