The ComfyUI prompt and workflow are attached to the image: https://files.catbox.moe/s3qufb.png

You can’t just copy-paste this prompt. The workflow requires a few nodes that are specific to SD3.
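If you want to see which nodes the workflow needs before loading it, here is a rough sketch that pulls the embedded graph out of the PNG using only the standard library. It assumes ComfyUI stored the graph as JSON in a PNG `tEXt` chunk keyed `prompt`, with one `class_type` per node; the function names `read_png_text_chunks` and `node_types` are made up for this example, and images saved another way (or re-encoded by an image host) may not carry the chunk at all.

```python
import json
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict:
    """Return the tEXt chunks of a PNG as a {keyword: text} dict."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    pos = len(PNG_SIGNATURE)
    # Each chunk is: 4-byte length, 4-byte type, body, 4-byte CRC.
    while pos + 8 <= len(data):
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            # tEXt body is "keyword\0text", both Latin-1 encoded.
            keyword, _, text = body.partition(b"\x00")
            chunks[keyword.decode("latin-1")] = text.decode("latin-1")
        if ctype == b"IEND":
            break
        pos += 12 + length
    return chunks


def node_types(png_bytes: bytes) -> list:
    """List the distinct node class names in the embedded prompt graph."""
    text = read_png_text_chunks(png_bytes)
    # Assumption: the graph lives under the "prompt" key as a JSON object
    # mapping node ids to {"class_type": ..., "inputs": ...}.
    prompt = json.loads(text["prompt"])
    return sorted({node["class_type"] for node in prompt.values()})
```

To try it, download the image and run `node_types(open("s3qufb.png", "rb").read())`; any class name your ComfyUI install does not recognize is a node you still need to add.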

EDIT: more proof of the chain that led to this image. They were not all this good. I'm cherry-picking for sure, and these are just WebPs without workflows attached:

  • Fubarberry · 5 months ago

    No, the version they released isn’t the full parameter set, and it’s leading to really bad results on a lot of prompts. You get dramatically better results using their API version, so the full SD3 model is good, but the version we have is not.

    Here’s an example from the SD3 API version: SD3 API

    And here’s the same prompt on the local weights version they released: SD3 local weights 2B

    People think Stability AI censored NSFW content in the released model, which has crippled its ability to understand a lot of poses and how anatomy works in general.

    For more examples of the issues with SD3, I’d recommend checking this Reddit thread.

    • istanbullu@lemmy.ml · 5 months ago

      Thanks, I’m sticking to SDXL finetunes for now. I expect the community will uncensor the model fairly quickly.

    • j4k3@lemmy.world (OP) · 5 months ago

      I think the difference is typical of any base model. I have several base models on my computer, and the behavior of SD3 is quite typical. I fully expect their website hosts a fine-tuned version.

      There are a lot of cultural expectations that any given group around the world has about generative AI and far more use cases than any of us can imagine. The base models have an unbiased diversity that reflects their general use; much is possible, but much is hard.

      If “woman lying in grass” was truly filtered, what I showed here would not be possible. If you haven’t seen it, I edited the post with several of the images in the chain I used to get to the main post image here. The post image is not an anomaly that got through a filter; it is the result of an iterative chain. It is not an easy path to find, but it does exist in the base training corpus.

      Personally, I think the real secret sauce is the middle CLIP agent and how it relates to the T5 agent.