• olicvb@lemmy.ca
    7 months ago

    Yeah, they exist. I think they are mostly merges, optimizations, derivatives, or fine-tunes of the models released by Facebook, OpenAI, etc.

    Based on what I know from image generation models, they take a lot of computing power to make or tune, but not as much to run.

    I was able to run this Mistral-7B model made by the Mistral team, and there are many others available here (these are re-released models that have been tuned by a third party for use with GPT4All).

    While this 7B model runs, it definitely doesn't give the same results as one from Big AI. I did manage to run (albeit very slowly, at about 1 token/s) a model that is said to surpass GPT-4 (https://huggingface.co/WizardLM/WizardCoder-Python-34B-V1.0), made by the WizardLM team.
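
    To put rough numbers on why the 34B model crawls where the 7B one runs fine, here is a back-of-the-envelope sketch. It assumes the weights dominate memory use and ignores context cache and runtime overhead. The 16-bit and 4-bit precisions are my assumptions for illustration (I'm not stating which quantization I actually ran), and the function names are made up.

```python
# Back-of-the-envelope sketch: rough estimates, not measurements.
# Assumption: memory is dominated by the weights; overhead is ignored.

def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory needed just to hold the weights, in GB (10^9 bytes)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


def generation_time_s(num_tokens: int, tokens_per_second: float) -> float:
    """Time to generate num_tokens at a steady tokens_per_second rate."""
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    # Hypothetical precisions: 16-bit (unquantized) and 4-bit (quantized).
    for name, size in [("7B", 7), ("34B", 34)]:
        for bits in (16, 4):
            print(f"{name} at {bits}-bit: ~{weight_memory_gb(size, bits):.1f} GB")

    # At about 1 token/s, a 300-token answer takes about five minutes.
    minutes = generation_time_s(300, 1.0) / 60
    print(f"300 tokens at 1 token/s: ~{minutes:.0f} minutes")
```

    If the weights don't fit in your GPU or RAM, the runtime has to fall back to slower memory, which is one plausible reason for single-digit tokens per second.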

    AI chatbots aside, there are also image generation models created by small/indie dev teams. Almost everything on Civitai is made by third parties (so A LOT of models). Stable Diffusion has been keeping up with the big guys, and in some cases even surpassing them, mostly thanks to indie dev work.