• MentalEdge
    link
    fedilink
    English
    arrow-up
    10
    ·
    edit-2
    1 month ago

    And to have conversation, behind the scenes, each prompt gets the entire conversation so far tacked on.

    The model itself is static, it doesn’t work like a brain that changes in response to stimulus, or form memories.

    To converse about something, the entirety of an exchange is fed back into the model all over again each time it needs to produce a response. In fact, this can happen several times over for each word in a response.

    It’s basically an attempt at duct-taping the ability to form memories onto an otherwise static system. It works, but I don’t see how that way of doing it could ever land LLMs in the land of real consciousness.

    It basically means these models “think” in frames, but each frame gets exponentially heavier to process, as it has to ingest every frame that came before.

    • mozz@mbin.grits.dev
      link
      fedilink
      arrow-up
      3
      ·
      1 month ago

      OpenAI at least is now attempting to bolt on a “memory” by having the LLM spit out short snippets of what it might need to know later, which it then has access to when completing later prompts. Like everything else post-GPT-4, it seems fine but doesn’t work really all that well at what it is intended to do.