this post was submitted on 17 Mar 2024
231 points (98.7% liked)

Science Memes

19737 readers
2100 users here now

Welcome to c/science_memes @ Mander.xyz!

A place for majestic STEMLORD peacocking, as well as memes about the realities of working in a lab.



Rules

  1. Don't throw mud. Behave like an intellectual and remember the human.
  2. Keep it rooted (on topic).
  3. No spam.
  4. Infographics welcome, get schooled.

This is a science community. We use the Dawkins definition of meme.



Research Committee

Other Mander Communities

Science and Research

Biology and Life Sciences

Physical Sciences

Humanities and Social Sciences

Practical and Applied Sciences

Memes

Miscellaneous

founded 3 years ago
MODERATORS
 
top 7 comments
sorted by: hot top controversial new old
[–] Spiralvortexisalie@lemmy.world 24 points 2 years ago (1 children)

Not sure if you have tried/heard of Whisper. It automatically transcribes audio, I use it for meetings/lectures that don’t come with Closed Captioning, it supports audio/video files and a few languages. I had tried a few solutions with mixed results (e.g. Google is slow, many places limit lengths/sizes), IBM is supposed to be the best free/low cost cloud model but they would never approve my accounts. In the end locally with whisper in an Anaconda/Python environment was best cheap option for me.

[–] weariedfae@lemmy.world 3 points 2 years ago* (last edited 2 years ago) (2 children)

Not OP but I've been looking for one to help me with meetings and disorganized notes. How well would you say it works? Does it only transcribe or will it help organize notes (create categories, cluster analysis, tags, action items, whatever)?

[–] Spiralvortexisalie@lemmy.world 6 points 2 years ago (1 children)

Only transcription, it outputs to a few formats that amount to plain text with or without time coding including srt subtitles. It transcribes really well, one bit of note is that sometimes with more technical discussions I find better results using the smaller models. My best theory is the technical words are less likely to be assumed to be an accent/variation.

[–] weariedfae@lemmy.world 2 points 2 years ago

Thanks for posting, I'll check it out :)

[–] kakes@sh.itjust.works 3 points 2 years ago

I haven't used it much, but I ran a podcast through it once to test it, and I was honestly impressed by the accuracy.

[–] FiniteBanjo@lemmy.today 9 points 2 years ago

The worst part is when people don't sign off on your use of the interview because they're afraid of their signature being used against them, which is an understandable concern of course.

[–] weariedfae@lemmy.world 3 points 2 years ago* (last edited 2 years ago)

Hahahaha

I just vaguely cited the gist in text and then did the "so-and-so, personal communication" in the references but then again mine was only a M.S. thesis.