Technology

40600 readers
111 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 6 years ago
MODERATORS
1
 
 

PE-AV - Audiovisual Perception with Code

  • Meta's perception encoder for audio-visual understanding with open code release.
  • Processes both visual and audio information to isolate sound sources.
  • Paper | Code

https://preview.redd.it/k6lp7cgbou8g1.png?width=1456&format=png&auto=webp&s=f928bbd8d184e9094e7130cb36adff5f51830a80

T5Gemma 2 - Open Encoder-Decoder

  • Next generation encoder-decoder model with full open-source weights.
  • Combines bidirectional understanding with flexible text generation.
  • Blog | Model

Qwen-Image-Layered - Open Image Decomposition

  • Decomposes images into editable RGBA layers with full model release.
  • Each layer can be independently manipulated for precise editing.
  • Hugging Face | Paper | Demo

https://reddit.com/link/1ptg2x9/video/72skjufkou8g1/player

N3D-VLM - Open 3D Vision-Language Model

  • Native 3D spatial reasoning with open weights and code.
  • Understands depth and spatial relationships without 2D distortions.
  • GitHub | Model

https://reddit.com/link/1ptg2x9/video/h1npuq1mou8g1/player

Generative Refocusing - Open Depth Control

  • Controls depth of field in images with full code release.
  • Simulates camera focus changes through 3D scene inference.
  • Website | Demo | Paper | GitHub

StereoPilot - Open 2D to 3D Conversion

  • Converts 2D videos to stereo 3D with open model and code.
  • Full source release for VR content creation.
  • Website | Model | GitHub | Paper

https://reddit.com/link/1ptg2x9/video/homrv9tmou8g1/player

Chatterbox Turbo - MIT Licensed TTS

  • State-of-the-art text-to-speech under permissive MIT license.
  • No commercial restrictions or cloud dependencies.
  • Hugging Face

https://reddit.com/link/1ptg2x9/video/iceqr03jou8g1/player

FunctionGemma - Open Function Calling

  • Lightweight 270M parameter model for function calling with full weights.
  • Creates specialized function calling models without commercial restrictions.
  • Model

FoundationMotion - Open Motion Analysis

  • Labels spatial movement in videos with full code and dataset release.
  • Automatic motion pattern identification without manual annotation.
  • Paper | GitHub | Demo | Dataset

DeContext - Open Image Protection

  • Protects images from unwanted AI edits with open-source implementation.
  • Adds imperceptible perturbations that block manipulation while preserving quality.
  • Website | Paper | GitHub

EgoX - Open Perspective Transformation

  • Transforms third-person videos to first-person with full code release.
  • Maintains spatial coherence during viewpoint conversion.
  • Website | Paper | GitHub

https://reddit.com/link/1ptg2x9/video/2h8x59qpou8g1/player

Step-GUI - Open GUI Automation

  • SOTA GUI automation with self-evolving pipeline and open weights.
  • Full code and model release for interface control.
  • Paper | GitHub | Model

IC-Effect - Open Video Effects

  • Applies video effects through in-context learning with code release.
  • Learns effect patterns from examples without fine-tuning.
  • Website | GitHub | Paper
2
 
 

Here's the corresponding youtube video by Benn Jordan https://www.youtube.com/watch?v=vU1-uiUlHTo

3
4
5
 
 

A tape-based piece of unique Unix history may have been lying quietly in storage at the University of Utah for 50+ years. The question is whether researchers will be able to take this piece of middle-aged media and rewind it back to the 1970s to get the data off.

See also

https://archive.org/details/utah_unix_v4_raw

TAR file

http://squoze.net/UNIX/v4/

6
7
8
9
10
11
12
13
14
15
16
17
 
 

I have an old QNAP and I hate it. It's full of proprietary software that can't be removed, and is slow, probably because it doesn't have SSDs. It's, I think, RAID1.

Basically just need something to back up my data on my local network that has encryption. Open source is always nice as well. Simple and fast!

18
19
20
21
22
23
24
25
view more: next ›