this post was submitted on 07 Feb 2025
465 points (98.1% liked)

Programmer Humor

34289 readers
165 users here now

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

founded 5 years ago
MODERATORS
 
top 50 comments
sorted by: hot top controversial new old
[–] LGTM@discuss.tchncs.de 128 points 1 month ago (1 children)

No need, there's an unmaintained javascript library for that (written by a 12-yr old)

[–] NocturnalMorning@lemmy.world 44 points 1 month ago* (last edited 1 month ago) (1 children)

Omg, sign me up! I'm gonna put that script in production for a server used by millions of customers around the world!

[–] ogeist@lemmy.world 20 points 1 month ago (1 children)

Oh no, now there is a security audit and the pdf generated is insecure, the unpaid developer that has not logged in since 2015 has to fix this ASAP

[–] umbrella@lemmy.ml 14 points 1 month ago (1 children)

what? invest money to pay for open source software? are you nuts?

[–] GenosseFlosse@feddit.org 4 points 1 month ago* (last edited 1 month ago)

Just replace it with AI!!!

[–] HStone32@lemmy.world 100 points 1 month ago (1 children)

The secret to success in software engineering:

  1. Lie and say that there is
  2. Write or use a conversion algorithm
  3. Boss won't know the difference
  4. Collect bonus at performance evaluation
  5. Put "AI engineer" on resume
[–] pastel_de_airfryer@lemmy.eco.br 37 points 1 month ago (1 children)
  1. Boss thinks AI can code at senior developer level and fires you and the entire team
[–] HStone32@lemmy.world 16 points 1 month ago
  1. Never plan on staying at a SE job for longer than a few years. Not in a market that volitile.
[–] traches@sh.itjust.works 72 points 1 month ago (2 children)

Yes it’s an LLM called pandoc, you can run it locally

[–] CaptDust@sh.itjust.works 44 points 1 month ago (1 children)

You don't need a private nuclear plant to run it? Wow very efficient.

[–] avidamoeba@lemmy.ca 9 points 1 month ago* (last edited 1 month ago)

Black magic software.

[–] SnotFlickerman@lemmy.blahaj.zone 59 points 1 month ago (1 children)

Initially, I didn't think these kids were fall guys.

Now I think they're fall guys.

[–] lka1988@sh.itjust.works 55 points 1 month ago (1 children)

That was my thought. Young kids fresh out of school are really easy to manipulate into delusions of grandeur, especially when said delusions are offered by the richest person in the world. He's gonna leave them out for the wolves.

[–] SnotFlickerman@lemmy.blahaj.zone 46 points 1 month ago (2 children)

Either that or Musk himself is truly so incompetent he thinks these kids are true geniuses. Honestly, with how things are going, that's a fiddy-fiddy chance, because Musk is somehow almost as unbelievably stupid as Trump.

[–] BartyDeCanter@lemmy.sdf.org 19 points 1 month ago

Why not both?

[–] lka1988@sh.itjust.works 9 points 1 month ago

You're probably not wrong, what with him awkwardly hopping around onstage at multiple trump rallies.

[–] Opisek@lemmy.world 56 points 1 month ago (3 children)

Technically OCR is an application of machine learning.

Not an LLM, though.

[–] SwordInStone@lemmy.world 17 points 1 month ago

A world of difference

load more comments (2 replies)
[–] tsiad_mordecai_miktros@lemmy.world 50 points 1 month ago (1 children)

yes me send me what you want me to parse and i will get back to you in 3-4 business days

[–] eldavi@lemmy.ml 20 points 1 month ago (1 children)

be sure to include the metadata too. lol

[–] OwlPaste@lemmy.world 15 points 1 month ago (5 children)
load more comments (5 replies)
[–] NicolaHaskell@lemmy.ml 48 points 1 month ago

This is that special blend of Tablet Kid "I don't need to know things I can google them" and Rich Kid "I don't need to do things I can crowdsource them" that makes for that Distinctively VP "I don't know what I'm doing and nobody can tell 👈😎👉"

[–] mesamunefire@lemmy.world 45 points 1 month ago (3 children)

Imagine getting a job like this and now half the nation knows your name...thats terrifying. being an intern may mean you have no idea of the true scope of what they are asking you to do.

[–] Pieisawesome@lemmy.dbzer0.com 22 points 1 month ago

They are public employees who are changing things at the core of our government. Why wouldn’t we know their names?

Government employees names aren’t secret (asides from a few exceptions) nor is their pay

[–] GrumpyDuckling@sh.itjust.works 18 points 1 month ago

We know that his dad is an engineering professor at university of Nebraska too. Really calls into question his credentials. I checked the other day and they had already removed his contact info from their website.

[–] crusa187@lemmy.ml 15 points 1 month ago (1 children)

Yeah, seems that’s the point. Old enough to competently perform what they’re told, but too young to realize the gravity of the situation and how wrong it is to partake in it.

[–] eldavi@lemmy.ml 18 points 1 month ago (1 children)

that's why we have 18 year soldiers ....

[–] ChapulinColorado@lemmy.world 11 points 1 month ago

It’s ok, with the experienced gained from being forced to grow up, some will come home and use their savings to buy a dodge ram on a 7 year loan at 18% apr.

[–] CapriciousDay@lemmy.ml 25 points 1 month ago

Oh yeah the hosted DeepSeek has that

[–] tetris11@lemmy.ml 22 points 1 month ago* (last edited 1 month ago) (3 children)

I have to admit, PDF parsing being such a hot and profitable topic in computer science was really something I never saw coming.

PDFs? The things you can select text from? And when not, there's decent OCR? And when not, you just ask the person to send you an email or a word doc?

It sounds like LLMs are looking for a new unpolluted source of historical data that they can learn from, and this source exists in the form of old scanned-in paper documents. That's the only reason I can fathom as to why this is such a big thing now.

[–] chicken@lemmy.dbzer0.com 10 points 1 month ago (1 children)

Every time I try to convert a PDF to epub or something, or OCR one that doesn't actually have selectable text, it turns out shit. I assume the real reason people would want to get LLMs involved is that there is actually a lot of ambiguity in what a correct conversion would be, and there are a lot of PDFs out there.

load more comments (1 replies)
[–] sudo@programming.dev 6 points 1 month ago

Training the most insane AI model on classified federal documents.

[–] MonkderVierte@lemmy.ml 5 points 1 month ago

Selecting text doesn't work in most multi-column pdfs and good OCR cost money. And if the original source is lost and you want an exact copy in word, the OCR tools need to be really good at guessing whitespace-to-line ratio, because pdf is only an output format and not a processing format.

For most other converting needs, there's pandoc, imagemagick and ffmpeg.

[–] GissaMittJobb@lemmy.ml 19 points 1 month ago (1 children)

Is this fake?

For context, this is the guy who figured out how to see what's written on some ancient Greek Scrolls without destroying them. It seems slightly far-fetched that he wouldn't know better.

[–] SwordInStone@lemmy.world 5 points 1 month ago (1 children)
[–] GissaMittJobb@lemmy.ml 11 points 1 month ago (1 children)

Ok so they were apparently in Greek but not from Greece. Source: https://news.unl.edu/article-2

[–] Telorand@reddthat.com 10 points 1 month ago (1 children)

Even granting the quality or contribution he had to that effort, there's a huge difference between, "I can make a computer read burnt scrolls," and, "I can make government software with industry-standard protocols and security."

By way of comparison, just because I can write automation software for my company's apps doesn't mean I could just jump into doing Linus Torvalds' job maintaining the Linux kernel.

[–] GissaMittJobb@lemmy.ml 11 points 1 month ago (2 children)

Sure, but the difference between "I win awards for figruing out how to decipher ancient scrolls that no one has been able to do despite their best efforts" and "I can't research well enough to discover appropriate existing tools for document conversion" is very hard to reconcile.

[–] SwordInStone@lemmy.world 8 points 1 month ago

It's not even a matter of research. The whole question is demented. It's like asking what is the best pizza cutter to write with.

[–] Telorand@reddthat.com 4 points 1 month ago

Not particularly, in my book. Like I said, a specific skillset that allowed him to decipher ancient scrolls doesn't equate to practical skills that translate to other things like document conversion.

By way of another analogy, I can build a fine cabinet out of wood, but that doesn't mean I can also build a wooden cuckoo clock.

[–] rustydrd@sh.itjust.works 17 points 1 month ago (1 children)

Ah yes, the famed document <-> JSON converter.

[–] xthexder@l.sw0.com 12 points 1 month ago

It's so easy! Watch:

{"contents": "<garbled .docx contents goes here>"}
[–] cmgvd3lw@discuss.tchncs.de 14 points 1 month ago (3 children)

Actually this is what they do.

[–] sevenapples@lemmygrad.ml 5 points 1 month ago

Not at all. The only similarity is that LLMs work with text, and the document formats can also represent text.

Each format (E.g pdf, json, excel) has a defined standard, so all you have to do to change between each other is to map one format's fields to the others. You don't need (and won't get good results) from having an LLM produce the new format from scratch.

What he's asking is the equivalent of asking if there's an LLM made specifically for solving arithmetic problems. Why would you try to solve addition using an LLM?

load more comments (2 replies)
[–] ininewcrow@lemmy.ca 11 points 1 month ago

Just use Deepseek for US government data .... what could go wrong?

[–] sudo@programming.dev 5 points 1 month ago

Like opening source code in Word.

load more comments
view more: next ›