BigMuffN69

joined 7 months ago
[–] BigMuffN69@awful.systems 3 points 1 day ago

AcerFur (who is quoted in the article) tried them himself and said he got similar answers with a couple guiding prompts on gpt 5.3 and that he was “disappointed”

That said, AcerFur is kind of the goat at this kind of thing 🦊==🐐

[–] BigMuffN69@awful.systems 3 points 1 day ago* (last edited 1 day ago)

Also Martin Hairer is incredibly based. He gave this nice talk 2 months ago if any peeps want to see what he thinks comes next for math.

https://www.youtube.com/watch?v=fbVqc1tPLos

[–] BigMuffN69@awful.systems 5 points 1 day ago* (last edited 1 day ago) (2 children)

This was a very nice problem set. Some were minor alterations to thms in literature but ranged up to problems that were quite involved. It appears that OAI got about 5 (possibly 6) of them but even then, this was accomplished with expert feedback to the model, which is quite different from the models just 1 shotting them on their own.

But I think this is what makes it so well done! A 0/10 or a 10/10 ofc gives very little info, a middling score that they admit they put a shit ton of effort into and tried to coax the right answers out of the models via hints says a lot about how much these systems can currently help prove lemmata.

Side note: I asked a FB friend of mine at one of the math + ai startups if they attempted the problems and he said "they had more pressing issues this week they couldnt be pulled away from" (no comment, :P I want to stay friends with them)

The lack of similar attempts being released by big companies like Google or Anth or X also should be a big red flag that their attempts were not up to snuff of even attempting.

[–] BigMuffN69@awful.systems 5 points 2 weeks ago

“(((We’re))) never beating the allegations, are we?” -my wife

[–] BigMuffN69@awful.systems 30 points 2 weeks ago (41 children)

Gentlemen, it’s been an honour sneering w/ you, but I think this is the top 🫡 . Nothings gonna surpass this (at least until FTX 2 drops)

[–] BigMuffN69@awful.systems 13 points 2 weeks ago

hits blunt

What if we make an ai too based?

[–] BigMuffN69@awful.systems 7 points 2 weeks ago

On one hand as a poor grad student in the past, I could imagine working for a truly repugnant corp. but like if you’ve already made millions from your stock options, wtf are you doing. Idk, i really thought they’d have some shame over it, but they said shit like “our customers really like our deliverables” and i just fucking left with my wife

[–] BigMuffN69@awful.systems 14 points 2 weeks ago* (last edited 2 weeks ago) (2 children)

I have family working there, who told me during the holidays, “Current leadership makes me uncomfortable, but money is good”

Every impression I had of them completely shattered, cannot fathom that level out sell out exists in people I thought I knew.

As a bonus, their former partner was a former employee who became a whistleblower and has now gone full howard hughes

[–] BigMuffN69@awful.systems 6 points 3 weeks ago (1 children)

Without doxxing, my job has a contract with nvidia and my boss said we are doing it to make agi. Can i build a little of a torment nexus as a treat? Ty ans bless

[–] BigMuffN69@awful.systems 5 points 3 weeks ago

Shit like this ^ makes me feel insane when otherwise reputable experts start talking about llms taking over

[–] BigMuffN69@awful.systems 7 points 1 month ago (1 children)

In b4 METR drops the next shoddy study and the promptfondlers go wild

 

"Anthropic cofounder admits he is now "deeply afraid" ... "We are dealing with a real and mysterious creature, not a simple and predictable machine ... We need the courage to see things as they are."

https://www.reddit.com/r/ArtificialInteligence/comments/1o6cow1/anthropic_cofounder_admits_he_is_now_deeply/?share_id=_x2zTYA61cuA4LnqZclvh

There's so many juicy chunks here.

"I came to this position uneasily. Both by virtue of my background as a journalist and my personality, I’m wired for skepticism...

...You see, I am also deeply afraid. It would be extraordinarily arrogant to think working with a technology like this would be easy or simple....

...And let me remind us all that the system which is now beginning to design its successor is also increasingly self-aware and therefore will surely eventually be prone to thinking, independently of us, about how it might want to be designed. Of course, it does not do this today. But can I rule out the possibility it will want to do this in the future? No."

Despite my jests, I gotta say, posts reeks of desperation. Benchmaxxxing just isn't hitting like it used, bubble fears at all time high, and OAI and Google are the ones grabbing headlines with content generation and academic competition wins. The good folks at Anthropic really gotta be huffing their own farts to be believing they're in the race to wi-

"Years passed. The scaling laws delivered on their promise and here we are. And through these years there have been so many times when I’ve called Dario up early in the morning or late at night and said, 'I am worried that you continue to be right'. Yes, he will say. There’s very little time now."

LateNightZoomCallsAtAnthropic dot pee en gee

Bonus sneer: speaking of self aware wolves, Jagoff Clark somehow managed to updoot Doom's post?? Thinking the frog was unironically endorsing his view that the server farm was going to go rogue???? Will Jack achieve self awareness in the future? Of course, he does not do this today. But can I rule out the possibility he will do this in the future? Yes.

view more: next ›