this post was submitted on 23 May 2024
236 points (99.2% liked)
Firefox
A place to discuss the news and latest developments on the open-source browser Firefox
It's local. You're not sending data to their servers.
At least use the whole quote.
Yeah, of course it's gonna look like it's not local if you take out the part where it says it's local.
That's somewhat awkward phrasing but I think the visual processing will also be done on-device. There are a few small multimodal models out there. Mozilla's llamafile project includes multimodal support, so you can query a language model about the contents of an image.
Even just a few months ago I would have thought this was not viable, but the newer models are surprisingly good at very small sizes, small enough to run on any decent laptop or even a phone.