Hi, I’m Eric and I work at a big chip company making chips and such! I do math for a job, but it’s cold hard stochastic optimization that makes people who know names like Tychonoff and Sylow weep.

My pfp is Hank Azaria in Heat, but you already knew that.

  • 6 Posts
  • 135 Comments
Joined 1 year ago
cake
Cake day: January 22nd, 2024

help-circle



  • I had a similar disc with one of my friends! Anthropic is bragging that the model was not trained to play pokemon, but pokemon red has massive wikis for speed running that based on the reasoning traces are clearly in the training data. Like the model trace said it was “training a nidoran to level 12 b.c. at level 12 nidoran learns double kick which will help against brock’s rock type pokemon”, so it’s not going totally blind in the game. There was also a couple outputs when it got stuck for several hours where it started printing things like “Based on the hint…” which seemed kind of sus. I wouldn’t be surprised if it there is some additional hand holding going on in the back based on the game state (i.e., go to oaks, get a starter, go north to viridian, etc.) that help guide the model. In fact, I’d be surprised if this wasn’t the case.



  • Bruh, Big Yud was yapping that this means the orthogonality thesis is false and mankind is saved b.c. of this. But then he immediately retreated to, “we are all still doomed b.c. recursive self-improvement.” I wonder what it’s like to never have to update your priors.

    Also, I saw other papers that showed almost all prompt rejection responses shared common activation weights and tweeking them can basically jailbreak any model, so what is probably happening here is that by finetuning to intentionally make malicious code, you are undoing those rejection weights + until this is reproduced by nonsafety cranks im pressing x to doubt.


  • Bruh, Anthropic is so cooked. < 1 billion in rev, and 5 billion cash burn. No wonder Dario looks so panicked promising super intelligence + the end of disease in t minus 2 years, he needs to find the world’s biggest suckers to shovel the money into the furnace.

    As a side note, rumored Claude 3.7(12378752395) benchmarks are making rounds and they are uh, not great. Still trailing o1/o3/grok except for in the “Agentic coding benchmark” (kek), so I guess they went all in on the AI swe angle. But if they aren’t pushing the frontier, then there’s no way for them to pull customers from Xcels or people who have never heard of Claude in the first place.

    On second thought, this is a big brain move. If no one is making API calls to Clauderino, they aren’t wasting money on the compute they can’t afford. The only winning move is to not play.



  • Deep thinker asks why?

    Thus spoketh the Yud: “The weird part is that DOGE is happening 0.5-2 years before the point where you actually could get an AGI cluster to go in and judge every molecule of government. Out of all the American generations, why is this happening now, that bare bit too early?”

    Yud, you sweet naive smol uwu babyesian boi, how gullible do you have to be to believe that a) tminus 6 months to AGI kek (do people track these dog shit predictions?) b) the purpose of DOGE is just accountability and definitely not the weaponized manifestation of techno oligarchy ripping apart our society for the copper wiring in the walls?











  • Good news everyone, Dan has released his latest AI safety paper, we are one step closer to alignment. Let’s take a look inside:

    Wow, consistent set of values you say! Quite a strong claim. Let’s take a peek at their rigorous, unbiased experimental set up:

    … ok, this seems like you might be putting your finger on the scales to get a desired outcome. But I’m sure at least your numerical results are stro-

    Even after all this shit, all you could eek out was a measly 60%? C’mon you gotta try harder than that to prove the utility maximizer demon exists. I would say our boi is falling to new levels of crankery to push his agenda, but he did release that bot last year that he said was capable of superhuman prediction, so this really just par for the course at this point.

    The most discerning minds / critical thinkers predictably reeling in terror at another banger drop from Elon’s AI safety toad.

    *** terrifying personal note: I recently found out that Dan was my wife’s roommate’s roommate’s roommate back in college. By the transitive property, I am Dan’s roommate, which explains why he’s living rent free in my head