Free Open-Source Artificial Intelligence

1

4

Baidu announces open source release of the ERNIE 4.5 model family (yiyan.baidu.com)

submitted 1 week ago by cm0002@lemmy.world to c/fosai@lemmy.world

0 comments fedilink

2

19

resemble-ai/chatterbox: SoTA open-source TTS (github.com)

submitted 1 month ago by Even_Adder@lemmy.dbzer0.com to c/fosai@lemmy.world

2 comments fedilink

3

11

PlayDiffusion - Advanced AI Voice Inpainting Model (playdiffusion.com)

submitted 1 month ago* (last edited 1 month ago) by Even_Adder@lemmy.dbzer0.com to c/fosai@lemmy.world

0 comments fedilink

Hugging Face: https://huggingface.co/PlayHT/PlayDiffusion

4

9

[HELP] In GPT4All settings, selecting AMD graphics card yields no performance improvement over CPU (lemmy.ml)

submitted 1 month ago* (last edited 1 month ago) by yo_scottie_oh@lemmy.ml to c/fosai@lemmy.world

17 comments fedilink

Background: This Nomic blog article from September 2023 promises better performance in GPT4All for AMD graphics card owners.

Run LLMs on Any GPU: GPT4All Universal GPU Support

Likewise on GPT4All's GitHub page.

September 18th, 2023: Nomic Vulkan launches supporting local LLM inference on NVIDIA and AMD GPUs.

Problem: In GPT4All, under Settings > Application Settings > Device, I've selected my AMD graphics card, but I'm seeing no improvement over CPU performance. In both cases (AMD graphics card or CPU), it crawls along at about 4-5 tokens per second. The interaction in the screenshot below took 174 seconds to generate the response.

Question: Do I have to use a specific model to benefit from this advancement? Do I need to install a different AMD driver? What steps can I take to troubleshoot this?

Sorry if this is an obvious question. Sometimes I feel like the answer is right in front of me, but I'm unsure of which key words from the documentation should jump out at me.

My system info:

GPU: Radeon RX 6750 XT
CPU: Ryzen 7 5800X3D processor
RAM: 32 GB @ 3200 MHz
OS: Linux Bazzite
I've installed GPT4All as a flatpak

5

23

Any alternatives to GPT4All for local ChatGPT-like experience on Linux? (lemmy.ml)

submitted 1 month ago* (last edited 1 month ago) by yo_scottie_oh@lemmy.ml to c/fosai@lemmy.world

12 comments fedilink

I don't have many specific requirements, and GPT4All is working mostly well for me so far. That said, my latest use case for GPT4All is to help me plan a new Python-based project with examples as code snippets, and it lacks a specific quality of life feature, that is the "Copy Code" button.

There is an open issue on GPT4All's GitHub, but as there is no guarantee that feature will ever be implemented, I thought I'd take this opportunity to explore if there are any other tools out there like GPT4All that offer a ChatGPT-like experience in the local environment. I'm neither a professional developer nor a sysadmin, so a lot of self hosting guides go over my head, which is what drew me to GPT4All in the first place, as it's very accessible to non-developers like myself. That said, I'm open to suggestions and willing to learn new skills if that's what it takes.

I'm running on Linux w/ AMD hardware: Ryzen 7 5800X3D processor + Radeon RX 6750 XT.

Any suggestions? Thanks in advance!

6

16

New fully open source vision encoder OpenVision arrives to improve on OpenAI's Clip, Google's SigLIP (venturebeat.com)

submitted 1 month ago by cm0002@lemmy.world to c/fosai@lemmy.world

0 comments fedilink

7

11

OpenAlpha_Evolve is an open-source Python framework inspired by the AlphaEvolve research paper on autonomous coding agents (github.com)

submitted 1 month ago by cm0002@lemmy.world to c/fosai@lemmy.world

0 comments fedilink

the goal is to have an agent that can:

Understand a complex problem description.

Generate initial algorithmic solutions.

Rigorously test its own code.

Learn from failures and successes.

Evolve increasingly sophisticated and efficient algorithms over time.

https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/AlphaEvolve.pdf

8

13

LiteLLM: An open-source gateway for unified LLM access (www.infoworld.com)

submitted 1 month ago by cm0002@lemmy.world to c/fosai@lemmy.world

0 comments fedilink

9

7

fpgaminer/joycaption: JoyCaption Beta One Release - An image captioning Visual Language Model (lemmy.dbzer0.com)

submitted 2 months ago by Even_Adder@lemmy.dbzer0.com to c/fosai@lemmy.world

0 comments fedilink

Model: https://huggingface.co/fancyfeast/llama-joycaption-beta-one-hf-llava

Demo: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one

Release Post: https://civitai.com/articles/14672

10

13

Ace-Step Audio Model Native Support in ComfyUI! (blog.comfy.org)

submitted 2 months ago by Even_Adder@lemmy.dbzer0.com to c/fosai@lemmy.world

0 comments fedilink

11

18

ACE-Step: A Step Towards Music Generation Foundation Model (ace-step.github.io)

submitted 2 months ago by Even_Adder@lemmy.dbzer0.com to c/fosai@lemmy.world

3 comments fedilink

Github: https://github.com/ace-step/ACE-Step

Checkpoints: https://huggingface.co/ACE-Step/ACE-Step-v1-3.5B

Demo: https://huggingface.co/spaces/ACE-Step/ACE-Step

12

11

New TTS/ASR Model that is better that Whisper3-large with fewer parameters (huggingface.co)

submitted 2 months ago by cm0002@lemmy.world to c/fosai@lemmy.world

2 comments fedilink

https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2

13

10

JetBrains releases Mellum, an 'open' AI coding model | TechCrunch (techcrunch.com)

submitted 2 months ago by cm0002@lemmy.world to c/fosai@lemmy.world

0 comments fedilink

14

10

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis (fantasy-amap.github.io)

submitted 2 months ago by cm0002@lemmy.world to c/fosai@lemmy.world

1 comments fedilink

15

14

Alibaba launches open source Qwen3 besting OpenAI o1 | VentureBeat (venturebeat.com)

submitted 2 months ago by cm0002@lemmy.world to c/fosai@lemmy.world

0 comments fedilink

16

18

Autonomous coding agent with granular permissions capable of creating/editing files, executing commands, using the browser, and more (github.com)

submitted 2 months ago by cm0002@lemmy.world to c/fosai@lemmy.world

0 comments fedilink

17

24

A new, open source text-to-speech model called Dia has arrived to challenge ElevenLabs, OpenAI and more (venturebeat.com)

submitted 2 months ago by cm0002@lemmy.world to c/fosai@lemmy.world

1 comments fedilink

18

5

Paper page - InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models (huggingface.co)

submitted 2 months ago by cm0002@lemmy.world to c/fosai@lemmy.world

0 comments fedilink

19

14

DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level (www.together.ai)

submitted 3 months ago by cm0002@lemmy.world to c/fosai@lemmy.world

2 comments fedilink

20

8

[Help] how does one interact with MCP servers without mcp-libraries? (lemmy.blahaj.zone)

submitted 3 months ago by Smorty@lemmy.blahaj.zone to c/fosai@lemmy.world

4 comments fedilink

im building som dum lil foss llm thingy for godot and now im interested in letting users implement their own MCP servers.

so like - okay, the model context protocol page says, that most servers use stdio for every interaction. So now - the request format can be seen here, its apparently a JSONrpc thing.

so - first thing i want to do is retrieving all the capabilities the server has.

i looked through all the tabs in the latest docs, but could not find the command for listing all the capabilities. so i installed some filesystem mcp server which runs well and tried this:

PS C:\Users\praktikant> npx -y @modelcontextprotocol/server-filesystem "C:\Users\praktikant\Desktop"
Secure MCP Filesystem Server running on stdio
Allowed directories: [ 'C:\\Users\\praktikant\\Desktop' ]
{\
"jsonrpc": "2.0",\
"id": 1,\
"method": "capabilities",\
"params": {}\
}

- aaaaaand nothing was returned. no string, no nothing.

so maybe its not a string which is sent via stdio but some other byte-based thing?

if anyone has experience with this, or is gud at guessing, pls tell me what u think i might be missing here <3

21

1

The Myth of Alignment: Intelligence Beyond Human Values (lemmy.world)

submitted 3 months ago by Keji123@lemmy.world to c/fosai@lemmy.world

1 comments fedilink

There are two main approaches in total:

Step to step
Begin to steps to end
Currently, these are the two mainstream methods of instantiation.

It is widely recognized that if AI is not aligned with human values, it could cause harm to society.
Yet, this does not mean such systems lack intelligence.

So, what truly defines intelligence?
Why do so many researchers focus solely on intelligence aligned with human values?
Is it because their own understanding is limited, or because machines are not yet truly intelligent?

I believe intelligence should not be confined to narrow, human-centric definitions.
What we call "intelligence" today might be an illusion.
True intelligence cannot be defined—
the moment we define it, we lose its essence.

22

12

where have a group for talk about ai (lemmy.world)

submitted 3 months ago by Keji123@lemmy.world to c/fosai@lemmy.world

5 comments fedilink

please tell me, thanks

23

17

Mistral Small 3.1 | Mistral AI (mistral.ai)

submitted 3 months ago by Fitik@fedia.io to c/fosai@lemmy.world

1 comments fedilink

Today we announce Mistral Small 3.1: the best model in its weight class.

Building on Mistral Small 3, this new model comes with improved text performance, multimodal understanding, and an expanded context window of up to 128k tokens. The model outperforms comparable models like Gemma 3 and GPT-4o Mini, while delivering inference speeds of 150 tokens per second.

Mistral Small 3.1 is released under an Apache 2.0 license.

24

2

What Is the Most Popular Open-Source AI Stack? (www.youtube.com)

submitted 4 months ago by cm0002@lemmy.world to c/fosai@lemmy.world

11 comments fedilink

25

2

Easy to setup locally hosted LLM with access to file system (programming.dev)

submitted 4 months ago by youreusingitwrong@programming.dev to c/fosai@lemmy.world

6 comments fedilink

Hello, I am currently using codename goose as an AI client to proofread and help me with coding. I have it setup towards Googles Gemini, however I find myself quickly running out of tokens with large files. I was wondering if there are any easy way to self host an AI with similar capabilites but still have access to read and write files. I've tried both ollama and Jan, but neither have access to my files. Any recommendations?

Free Open-Source Artificial Intelligence

More AI Communities

LLM Leaderboards

Developer Resources

GitHub Projects

FOSAI Time Capsule