Easy to setup locally hosted LLM with access to file system

youreusingitwrong@programming.dev · 1 day ago

Easy to setup locally hosted LLM with access to file system

nocteb@feddit.org · edit-2 1 day ago

Look into setting up the “continue” plugin in vs code. It supports an ollama backend and can even do embeddings if setup correctly. That means it will try to select files itself based on your question which helps with prompt size. Here is a link to get started, you might need to choose smaller models with your card.

https://ollama.com/blog/continue-code-assistant

JASN_DE@feddit.org · 1 day ago

with similar capabilities

What’s your budget?

youreusingitwrong@programming.dev · 1 day ago

Zero, as said I’d prefer to self host.

JASN_DE@feddit.org · 1 day ago

What hardware do you have available then?

youreusingitwrong@programming.dev · 1 day ago

Just a 1080, though it handles just fine with 7b models, could also work with a 14b probably.

webghost0101 · 1 day ago

With sincere honesty i doubt a 7B model will grant you much coherent/usefull results. 14b won’t either.

I can run deepseek 30b on a 4070ti super and i am very not impressed. I can do more but its too slow. 14b is optimal speed size balance.

I am used to clause opus pro though which is one of the best.

You are 100% allowed to proof me wrong. In fact i hope you do and build something small and brilliant but i personally recommend adjusting expectations and upgrading that card.