Mark Zuckerberg open sources 3 new LLMs

lidd1ejimmy@lemmy.ml · edit-2 5 months ago

Fisch@discuss.tchncs.de · 5 months ago

The low quant versions of a 70B model are still way better than a high quant version of an 8B model tho. But yeah, performance might be ass. I don’t have anything like a 4090, so I couldn’t tell you. The main thing I do with these locally run models is use them with SillyTavern, which lets you kinda do roleplay with fictional characters. That’s kinda fun sometimes, but I don’t really use them much besides that. Just testing how well different models perform and what I can run on my GPU is kinda fun in itself too tho.
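
For anyone wondering what "what I can run on my GPU" works out to, here is a rough back-of-the-envelope sketch. It only counts the weights (parameters times bits per weight), and the `overhead_gb` allowance for context and runtime buffers is a made-up placeholder, not a measured number. The function names are hypothetical too, and real quant formats store some extra metadata, so treat the results as ballpark figures.

```python
def weight_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


def fits_in_vram(n_params: float, bits_per_weight: float,
                 vram_gb: float, overhead_gb: float = 2.0) -> bool:
    """Rough check: weights plus an assumed overhead must fit in VRAM.

    overhead_gb is a hypothetical allowance for context (KV cache) and
    runtime buffers; the real figure depends on context length and backend.
    """
    return weight_size_gb(n_params, bits_per_weight) + overhead_gb <= vram_gb


if __name__ == "__main__":
    # 24 GB is used here as the VRAM of a 4090-class card.
    for name, params, bits in [("70B @ 4-bit", 70e9, 4),
                               ("70B @ 2-bit", 70e9, 2),
                               ("8B @ 8-bit", 8e9, 8)]:
        size = weight_size_gb(params, bits)
        print(f"{name}: ~{size:.1f} GB weights, "
              f"fits in 24 GB: {fits_in_vram(params, bits, 24)}")
```

Under these assumptions a 70B model only squeezes onto a 24 GB card at very low bit widths, while an 8B model fits comfortably even at 8 bits, which is the trade-off being compared above.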

MudMan@fedia.io · edit-2 5 months ago

For sure, it’s a bit of technical curiosity and an opportunity for tinkering.

And given the absolute flood of misinformation around and about machine learning and “AI”, I also find it to be a hygiene thing to be able to identify bullshit from both the corporate camp and the terminally online critics. Because man, do people say a lot of wild stuff that doesn’t make sense about this subject. Looking under the hood seems like a good thing to do.
