Technology

42935 readers

196 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 4 years ago

MODERATORS

alyaza@beehaw.org

TheRtRevKaiser@beehaw.org

gyrfalcon@beehaw.org

rs5th@beehaw.org

coldredlight@beehaw.org

SemioticStandard@beehaw.org

TheRtRevKaiser@kbin.social

remington@beehaw.org

100

A jargon-free explanation of how AI large language models work (arstechnica.com)

submitted 2 years ago by Gaywallet@beehaw.org to c/technology@beehaw.org

14 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] CarbonIceDragon@pawb.social 5 points 2 years ago (1 children)

It does make me vaguely curious what happens if you try to make one of these on the more powerful end explain step by step how its own program works. I dont really expect it to be accurate, given that if people dont know how the thing works, it probably wont find much about that in it's training data, but if what it learns ultimately enables it to make connections about how the real world works to some degree, could it figure out enough to give even marginally useful hints?

[–] Czorio@kbin.social 3 points 2 years ago (2 children)

Not really, it’s super fucking expensive to train one of these, on-line training would simply not be economically feasible.

Even if it was, the models don’t really have any agency. You prompt, they respond. There’s not much prompting going on from the model, and if there was, you can choose to not respond, which the model can’t really do.

[–] PenguinTD@lemmy.ca 1 points 2 years ago

Wrong, the cat is out of bag, it takes one leak to do some serious impact to the whole industry.

https://www.semianalysis.com/p/google-we-have-no-moat-and-neither

You can try the various free open source version trained by community here: https://chat.lmsys.org/

[–] shanghaibebop@beehaw.org 1 points 2 years ago

You can train an effective one for a few hundred bucks now.

https://crfm.stanford.edu/2023/03/13/alpaca.html