• mountainriver@awful.systems
    English
    arrow-up
    15
    ·
    29 days ago

    So they named the product sucking the data after the Facehugger? At least they know that they are in the abomination business. Will they be releasing an AI named Bursting Chest?

    • Architeuthis@awful.systems
      English
      arrow-up
      16
      ·
      29 days ago

      The company was named after the U+1F917 🤗 HUGGING FACE emoji.

      HF is more of a platform for publishing this sort of thing, as well as the neural networks themselves and a specialized cloud service to train and deploy them, I think. They are not primarily a tool vendor, and they were around well before the LLM hype cycle.

      • self@awful.systems
        English
        arrow-up
        17
        ·
        29 days ago

        to be honest, they give me a lot of mtgox vibes:

        • extremely stupid name
        • technically predates the worst excesses of the AI bubble
        • very eager to enable the worst excesses of the AI bubble
        • David Gerard@awful.systems OP M
          English
          arrow-up
          13
          ·
          29 days ago

          they also went full DARVO, complaining about how mean everyone had been to them over their abuse of personal data, and then telling us we needed to get with the program

        • Architeuthis@awful.systems
          English
          arrow-up
          8
          ·
          29 days ago

          It’s also the place where you go to download models to run yourself instead of sending all your data to the most unscrupulous people possible, so at least they’ve got that going for them.

    • self@awful.systems
      English
      arrow-up
      13
      ·
      27 days ago

      my colleagues are kind, caring people & they were attacked (idc if I get attacked so long as it doesn’t touch my company/colleagues) we’ve always seen love for our work, this incident shocked me

      we’ll keep shipping 📦💗 can’t satisfy all

      Don’t take out your frustration from election results on them, LOSERS

      it’s really jarring seeing one of the biggest hosts for generative AI projects simultaneously do “we’re just an uwu smol bean open source passion project why are you attacking us” while boosting and officially supporting chan-coded fash shit from an e/acc account

    • flere-imsaho@awful.systems
      English
      arrow-up
      6
      ·
      28 days ago

      i bet that bluesky worked with them to get their company and all their personal accounts established at bluesky as the primary social media channel.

      i don’t expect they will get anything more than a slap on the wrist, if that.

      • bizarroland@fedia.io
        arrow-up
        12
        ·
        29 days ago

        The legal system is the perfect place for working out nuanced issues like this.

        If I were a lawyer bringing this lawsuit, I would argue that “publicly available” does not mean “public domain”, and that without acquiring usage rights for the data you don’t have the right to use it.

        If the courts rule against an argument like this, that would mean any website hosting material that can be accessed without an account must provide that material free of charge to anyone who accesses it, which is a gigantic consequence for such a nuanced issue.

  • Architeuthis@awful.systems
    English
    arrow-up
    11
    ·
    29 days ago

    404 added an update that the dataset was removed:

    Update: Following the publication of this article on Tuesday evening, van Strien removed the dataset. “I’ve removed the Bluesky data from the repo,” he wrote on Bluesky. “While I wanted to support tool development for the platform, I recognize this approach violated principles of transparency and consent in data collection. I apologize for this mistake.”