• 1 Post
  • 162 Comments
Joined 6 months ago
Cake day: August 27th, 2025

  • Yeah, me too. Opus 4.5 is awesome, but my god…om nom nom go my daily/weekly quotas. I probably shouldn't yeet the entire repo at it lol.

    4.6 is supposedly about 2x the token burn for not much better output.

    Viewed against that, Codex 5.3 @ medium is actual daylight robbery of OAI, in our favour.

    I was just looking at benchmarks, and even smaller 8-10B models are now at around 65-70% of Sonnet's level (Qwen3-8B, Nemotron 9B, Critique) and 110-140% of Haiku's.

    If I had the VRAM, I'd switch to local Qwen3-Next (which scores almost 90% of Opus 4.5 on SWE-bench) and just git gud. Probably I'll just stick to smaller models, API calls, and the git gud part.

    An RTX 3060 (probably the minimum you'd need to run Qwen3-Next decently) is $1500 here :(

    For that much $$$ I can probably get 5 years of surgical API calls via OR (OpenRouter) + actual skills.
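
    By "surgical" I just mean one-off, tightly scoped requests straight at the API instead of letting an agent loop graze on my quota. A minimal Python sketch of what I have in mind, assuming the requests library and a placeholder model id (swap in whatever's listed and cheap on OpenRouter that week):

    ```python
    import requests

    # One-off "surgical" call to OpenRouter's OpenAI-compatible endpoint.
    # The model id below is a placeholder; pick whatever is listed/cheap.
    resp = requests.post(
        "https://openrouter.ai/api/v1/chat/completions",
        headers={"Authorization": "Bearer YOUR_OPENROUTER_API_KEY"},
        json={
            "model": "qwen/qwen3-8b",  # placeholder id
            "messages": [{"role": "user", "content": "Explain this traceback: ..."}],
        },
        timeout=60,
    )
    resp.raise_for_status()
    print(resp.json()["choices"][0]["message"]["content"])
    ```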

    PS: how are you using batch processing? How did you set it up?



  • Ah, but a subscription to OpenAI ChatGPT ($20 USD/month) gives you Codex 5.3 bundled in, with some really generous usage allowances (well, compared to Claude).

    I haven’t looked recently, but API calls to Codex 5.2 via OR were silly expensive per million tokens; I can’t imagine 5.3 is any cheaper.

    To be fair to your point: I doubt many people sign up specifically for this (let's say 20%, if we're making up numbers). It's still a good deal though. I can chew thru 30 million tokens in pretty much a day when I'm going hammer and tongs at stuff.
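
    Back-of-envelope on why that matters, with completely made-up prices (none of these are real OAI/OR rates, just illustrating the gap):

    ```python
    # Rough math only: every price here is an assumption, not a quote.
    tokens_per_day = 30_000_000   # the "30 million tokens in a day" above
    usd_per_million = 1.50        # assumed blended input/output API price
    heavy_days = 20               # hammer-and-tongs days per month

    api_monthly = tokens_per_day / 1_000_000 * usd_per_million * heavy_days
    print(f"API: ~${api_monthly:,.0f}/month vs a $20 subscription")  # ~$900 vs $20
    ```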

    Frankly, I don’t understand how OAI remain solvent. They’re eating a lot of shit in their “undercut the competition to take over the market” phase. But hey, if they’re giving it away, sure, I’ll take it.


  • Let’s be fair - not all of the masses are so ignorant.

    If you compare API vs subscription, you probably get more bang for your buck paying $20 USD/month than paying per million tokens via API calls. At least for OAI models. It's legitimately a good deal for heavy users.

    For simpler stuff and/or if you have decent hardware? For sure - go local. Qwen3-4B 2507 Instruct matches or surpasses GPT-4.1 nano and mini on almost all benchmarks…and you can run it on your phone. I know because it (or the abliterated version) is my go-to at home. It's stupidly strong for a 4B.
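
    If you want to kick the tyres yourself, here's a minimal sketch with Hugging Face transformers (assuming the Qwen/Qwen3-4B-Instruct-2507 checkpoint and enough RAM for a 4B; on a phone you'd run a GGUF build via llama.cpp or similar instead):

    ```python
    from transformers import pipeline

    # Downloads the 4B instruct weights on first run.
    pipe = pipeline("text-generation", model="Qwen/Qwen3-4B-Instruct-2507")

    # Recent transformers versions accept chat-style message lists directly.
    out = pipe(
        [{"role": "user", "content": "Write a Python one-liner for FizzBuzz."}],
        max_new_tokens=128,
    )
    print(out[0]["generated_text"][-1]["content"])  # assistant's reply
    ```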

    But if you need SOTA (or near it) and are rocking typical consumer-grade hardware, then $20/month for basically unlimited tokens is the reason to subscribe.





  • I really like Claude, but the way it chews thru tokens def cements it as a “rich man’s” AI. Codex surprised me with how capable it is versus how little it costs to run. Previously, I’d been trying to use ChatGPT + web + project containers…with really sub-par refactoring results.

    Tbf, I’ve only really used Claude Opus 4.5 and GPT Codex 5.3 for code, so pardon my ignorance.

    How well do open-weight models like Kimi et al. stack up? Can I call them via VSCodium to reason over a local mirror of the files in my repo? I’m hardware-bound with limited compute. I’ve played around a bit with OpenRouter before, so I have passing familiarity with things like TNG DeepSeek R1T2, mimo-v2-flash, etc.
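
    What I’m imagining is roughly this: point an OpenAI-compatible client at OpenRouter and stuff local file contents into the prompt. The Kimi model id below is a guess from memory, so it’d need checking against the actual OpenRouter listing:

    ```python
    import os
    from openai import OpenAI

    # OpenRouter speaks the OpenAI API, so the stock client works with a
    # different base_url. Expects OPENROUTER_API_KEY in the environment.
    client = OpenAI(
        base_url="https://openrouter.ai/api/v1",
        api_key=os.environ["OPENROUTER_API_KEY"],
    )

    with open("src/main.py") as f:   # any file from the local repo mirror
        source = f.read()

    resp = client.chat.completions.create(
        model="moonshotai/kimi-k2",  # guessed id, verify before relying on it
        messages=[{"role": "user",
                   "content": "Review this file for bugs:\n\n" + source}],
    )
    print(resp.choices[0].message.content)
    ```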