People Twitter

Unbothered. In its lane. Flourishing. (media.piefed.world)

submitted 4 months ago by The_Picard_Maneuver@piefed.world to c/whitepeopletwitter@sh.itjust.works

[–] sp3ctr4l@lemmy.dbzer0.com 13 points 4 months ago (1 children)

No, it's much, much more primitive than an LLM.

It scans your last message for keywords, potentially multiple keywords, keywords in some order, etc.: fairly simple patterns you can parse with something like regex.

Then, based on what it detects, it picks from something like a tree of responses, maybe reinserting the specific keyword you used.

Basically, imagine plotting out the entire dialogue tree from some video game.

... It really is not too much more complex than that.
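To make that concrete, here is a minimal sketch of the keyword-and-canned-response idea in Python. The rule table, the response templates, and the names `RULES`, `FALLBACKS`, `respond` and `chat` are all made up for illustration; this is not the code of any particular historical chatbot, just one way the scan-then-pick-a-response loop could look.

```python
import random
import re

# Hypothetical rule table: (regex pattern, list of response templates).
# Rules are checked in order, so more specific patterns go first.
RULES = [
    (re.compile(r"\bi need (.+)", re.IGNORECASE),
     ["Why do you need {0}?", "Would getting {0} really help you?"]),
    (re.compile(r"\bi am (.+)", re.IGNORECASE),
     ["How long have you been {0}?", "Why do you think you are {0}?"]),
    (re.compile(r"\b(mother|father|family)\b", re.IGNORECASE),
     ["Tell me more about your {0}."]),
    (re.compile(r"\b(hello|hi|hey)\b", re.IGNORECASE),
     ["Hello. What's on your mind?"]),
]

# Used when no rule matches at all.
FALLBACKS = ["Please go on.", "I see. Tell me more."]


def respond(message, rng=random):
    """Return a canned reply for the first rule whose pattern matches."""
    for pattern, templates in RULES:
        match = pattern.search(message)
        if match:
            # Reinsert the captured keyword or phrase, minus trailing punctuation.
            captured = match.group(1).rstrip(".!?")
            return rng.choice(templates).format(captured)
    return rng.choice(FALLBACKS)


def chat():
    """Simple terminal loop; type 'quit' to exit."""
    while True:
        line = input("> ")
        if line.strip().lower() == "quit":
            break
        print(respond(line))
```

Calling `chat()` gives you the terminal version. A "dialogue tree" is the same thing with more rules, plus a bit of state about which branch you are currently in.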

An LLM, on the other hand, has been trained on something like trillions of words (tokens) of text, which gets processed through a neural network with dozens of layers and billions of parameters, producing a very complex set of relationships between characters and words that it then uses to generate responses.

And when I say 'very complex' I mean that the results of parsing all the training data are not human readable, even by experts. It's a gibberish mass of matrices holding billions of numbers, something like that... it's not even really code that you could read and then say 'oh! that part is causing this problem!'

So tldr:

I could probably teach you how to write a simple oldschool chatbot that works in a terminal or on IRC, in like, a week or two, even if you have literally 0 prior coding experience. You could easily make a simple chatbot fit in under a megabyte of code, even under a tenth or hundredth of a megabyte, for the actual chat parts of it.

... I absolutely could not teach you how to make an LLM from scratch, and even if I could, we'd have to rent some server clusters to process even a tiny training data set, for a very primitive version of an LLM. And it would take up gigabytes of local space, and that's with the finished, condensed, 'trained' model. Could easily be thousands of times more data that would go into the training.
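For a rough sense of where "gigabytes" comes from: a trained model's file size is roughly the number of parameters times the bytes stored per parameter. The sketch below just does that arithmetic. The parameter counts and the 2- and 4-byte precisions are hypothetical round numbers picked for illustration, not figures for any specific model, and the function names are made up.

```python
def model_size_bytes(num_parameters, bytes_per_parameter=2):
    """Approximate on-disk size: one stored number per parameter."""
    return num_parameters * bytes_per_parameter


def human(n_bytes):
    """Format a byte count using decimal units (1 KB = 1000 B)."""
    n = float(n_bytes)
    for unit in ["B", "KB", "MB", "GB", "TB"]:
        if n < 1000:
            return f"{n:.1f} {unit}"
        n /= 1000
    return f"{n:.1f} PB"


# Hypothetical model sizes, chosen only to show the scale difference.
toy = model_size_bytes(10_000_000, bytes_per_parameter=4)       # small hobby model
large = model_size_bytes(7_000_000_000, bytes_per_parameter=2)  # multi-billion-parameter model

print(human(toy))    # 40.0 MB
print(human(large))  # 14.0 GB
```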

[–] dave@feddit.uk 7 points 4 months ago (1 children)

That tldr needs a tldr…

But also you absolutely can learn & build small versions of LLMs on a regular laptop. I did it on my old 2017 Dell XPS, and trained it on the complete works of Shakespeare. It learnt to write almost passable Shakespeare hallucination in a couple of hours. There’s a good tutorial online if you search for it.
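To show the general shape of "learn from text, then generate one character at a time", here is a much simpler stand-in: a character-level n-gram model in plain Python. To be clear, this is not a neural network and not the approach from the tutorial mentioned above; it only counts which character tends to follow each short context. The function names and the `order` parameter are assumptions made for this sketch.

```python
import random
from collections import Counter, defaultdict


def train(text, order=3):
    """Count which character follows each `order`-character context."""
    counts = defaultdict(Counter)
    for i in range(len(text) - order):
        context = text[i:i + order]
        counts[context][text[i + order]] += 1
    return counts


def generate(counts, seed, order=3, length=200, rng=random):
    """Extend `seed` by sampling one character at a time from the counts.

    `order` must match the value used in train(), and `seed` should be
    at least `order` characters long.
    """
    out = seed
    for _ in range(length):
        followers = counts.get(out[-order:])
        if not followers:
            break  # context never seen in training; stop early
        chars = list(followers)
        weights = [followers[c] for c in chars]
        # Pick the next character in proportion to how often it was seen.
        out += rng.choices(chars, weights=weights)[0]
    return out
```

Reading a plain-text file of Shakespeare into a string, passing it to `train`, and then calling `generate` with a short seed from the text gives output that looks vaguely like the source. A small neural model does the same "predict the next character" job, but learns the statistics in its weights instead of a lookup table.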

[–] sp3ctr4l@lemmy.dbzer0.com 1 points 4 months ago (1 children)

Well shit.

I've only figured out how to just run one locally on a Steam Deck, not build and train one.

Still though, even for this more primitive one you built, I'm guessing the overall file footprint of it was orders of magnitude greater than what you could fit into a megabyte of a simpler chatbot that runs in a local terminal, right?

[–] dave@feddit.uk 3 points 4 months ago

Oh sure. The training data is about 5 MB, so easily manageable on a relatively modern machine, but not on the kind of thing that was used for ELIZA.