Technology
I don’t know much about running this on my own computer other than using Ollama. Is that what you mean by running it on my own?
I haven't tried it with Ollama, but it can download GGUF files directly if you point it to a Hugging Face repo. There are a few other runners like vLLM and llama.cpp, and you can also just run the project directly with Python. I expect the whole Product of Experts algorithm is going to get adopted by all models going forward, since it's such a huge improvement and you can just swap it in for the current approach.
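For anyone unfamiliar with the term, here's a rough sketch of the generic product-of-experts idea: multiply the probabilities that several "experts" assign to each candidate and renormalize. This is only the textbook version, not the paper's actual implementation. The function name and the toy distributions are made up for illustration, and it assumes every expert gives each token a strictly positive probability.

```python
import math

def product_of_experts(expert_probs):
    """Combine distributions over the same tokens by multiplying and renormalizing.

    Hypothetical helper for illustration; assumes all probabilities are > 0.
    """
    vocab = expert_probs[0].keys()
    # Summing log-probabilities is equivalent to multiplying probabilities.
    log_scores = {
        tok: sum(math.log(p[tok]) for p in expert_probs)
        for tok in vocab
    }
    # Subtract the max before exponentiating for numerical stability.
    peak = max(log_scores.values())
    unnorm = {tok: math.exp(s - peak) for tok, s in log_scores.items()}
    total = sum(unnorm.values())
    return {tok: v / total for tok, v in unnorm.items()}

# Toy distributions (made-up numbers).
expert_a = {"cat": 0.6, "dog": 0.3, "fish": 0.1}
expert_b = {"cat": 0.5, "dog": 0.1, "fish": 0.4}
combined = product_of_experts([expert_a, expert_b])
```

A token only scores well in the combined distribution if every expert gives it reasonable probability, so one expert can effectively veto another's choice.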
So is this a huge breakthrough that’s going to be adopted by AI companies across the board? Or is there some downside?
Almost certainly, given that it drastically reduces the cost of running models. Whether you run them locally or a company is selling them as a service, the benefits here are pretty clear.
It just sounds too good to be true. So, no critics have claimed downsides to this?
I mean, the paper and code are published. This isn't a heuristic, so there's no loss of accuracy. I'm not sure why you're saying this is too good to be true. The whole tech is very new, and there's a lot of low-hanging fruit for optimizations that people are discovering. Right now, some discovery like this is made every few months. Eventually, people will pluck all the easy wins and it's going to get harder to dramatically improve performance, but for the foreseeable future we'll be seeing a lot more stuff like this.