this post was submitted on 09 Sep 2024
27 points (100.0% liked)
TechTakes
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
OpenAI manages to do an entire introduction of a new model without using the word "hallucination" even once.
Apparently it implements chain-of-thought, which means either that they changed the RLHF dataset to force it to explain its 'reasoning' when answering or to do self-questioning loops, or that it reprompts itself multiple times behind the scenes according to some heuristic until it synthesizes a best result. It's not really clear.
Can't wait to waste five pools of drinkable water to be told to use C# features that don't exist, but at least it got like 25.2452323760909304593095% better at solving math olympiads as long as you allow it a few tens of tries for each question.
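For the "reprompts itself behind the scenes" guess, here is a minimal sketch of what such a loop could look like. None of this is confirmed by OpenAI: the majority-vote heuristic is an assumption, and `best_of_n` and `make_fake_model` are hypothetical names, with the fake model standing in for a real API call.

```python
from collections import Counter
from typing import Callable, List


def best_of_n(ask: Callable[[str], str], prompt: str, tries: int) -> str:
    """Call the model `tries` times and return the most frequent answer."""
    answers: List[str] = [ask(prompt) for _ in range(tries)]
    # Heuristic (an assumption, not a known OpenAI detail): majority vote.
    answer, _count = Counter(answers).most_common(1)[0]
    return answer


def make_fake_model(canned: List[str]) -> Callable[[str], str]:
    """Hypothetical stand-in for a model: replays canned answers in order."""
    replies = iter(canned)
    return lambda _prompt: next(replies)


# Five tries at one question; three of the canned replies agree.
fake = make_fake_model(["41", "42", "42", "7", "42"])
result = best_of_n(fake, "What is 6 * 7?", tries=5)
print(result)
```

Every extra try is another full model call, which is where the "few tens of tries for each question" cost comes from.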
Some of my favorite reactions to this paradigm shift in machine intelligence we are witnessing:
bless you Melanie.
Mine olde friend, the log scale, still as beautiful as the day I met you
Weird, the AI that has read every chess book in existence and been trained on more synthetic games than any one human has seen in a lifetime still doesn't understand the rules of chess
^(just an interesting data point from Ernie, + he upvotes pictures of my dogs on FB so I gotta include him)
Dog tax
Would there ever be a way to tell that they didn't just feed the answers into the training data?