I’m looking for an open-source alternative to ChatGPT which is community-driven. I have seen some open-source large language models, but they’re usually still made by some organizations and published after the fact. Instead, I’m looking for one where anyone can participate: discuss ideas on how to improve the model, write code, or donate computational resources to build it. Is there such a project?
@lily33 Yes. There are a number I believe that fit these criteria hosted on Hugging Face I feel. Bloom is the first one that came to mind.
Ironically I asked ChatGPT this question and it responded to check out EleutherAI. I do not know anything about that group but looks like they may have helped worked on Bloom, so maybe they are worthy of consideration. Anyway here is Bloom.
https://huggingface.co/blog/bloom-megatron-deepspeed
HuggingFace looks to me like it’s a corporation. Like, when I click on “about > join us”, I’m sent to their job offer page.
@lily33 like Github, HuggingFace is a private company that can host public models. I’m pretty sure this one is fully public. But you’re right that it does look like someone from HF started it so perhaps it does not meet your criteria after all. My apologies if so.
I was (am?) of the understanding though that Bloom is being researched openly such that it can be reproduced locally (and contributed to on HF)
To add to this, here is another model that seems to aim to to be a poor man’s chatgpt: https://huggingface.co/togethercomputer/GPT-NeoXT-Chat-Base-20B