

I used the pdfimages command in poppler-utils.
https://packages.debian.org/trixie/poppler-utils
I did some digging and it looks like lossless images aren't really stored as PNGs, per se:
https://en.wikipedia.org/wiki/PDF#Raster_images
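To make that a bit more concrete, here's a rough Python sketch. It isn't what poppler or mutool do internally, just an illustration. As far as I understand it, a lossless image in a PDF is typically a stream of raw pixel samples compressed with Flate (zlib), with the width, height and colorspace kept in the PDF object's dictionary rather than in the image data. To hand you an actual .png, an extractor has to unpack the samples and re-encode them: add a filter byte per row, recompress, and wrap everything in PNG chunks. The function names and the tiny 2x2 image are made up for the example.

```python
import struct
import zlib


def png_chunk(kind: bytes, data: bytes) -> bytes:
    """Build one PNG chunk: length, type, data, then a CRC over type + data."""
    crc = zlib.crc32(kind + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + kind + data + struct.pack(">I", crc)


def raw_rgb_to_png(samples: bytes, width: int, height: int, level: int = 9) -> bytes:
    """Wrap raw 8-bit RGB samples (what a Flate image stream decodes to) as a PNG."""
    stride = width * 3
    if len(samples) != stride * height:
        raise ValueError("sample data does not match width x height")
    # PNG wants a filter-type byte in front of every row; 0 means "no filter".
    rows = b"".join(
        b"\x00" + samples[y * stride:(y + 1) * stride] for y in range(height)
    )
    # IHDR: width, height, bit depth 8, color type 2 (RGB), then three zero fields.
    ihdr = struct.pack(">IIBBBBB", width, height, 8, 2, 0, 0, 0)
    return (
        b"\x89PNG\r\n\x1a\n"
        + png_chunk(b"IHDR", ihdr)
        + png_chunk(b"IDAT", zlib.compress(rows, level))
        + png_chunk(b"IEND", b"")
    )


# Hypothetical 2x2 image: red, green / blue, white.
pixels = bytes([255, 0, 0, 0, 255, 0, 0, 0, 255, 255, 255, 255])

# Roughly what sits inside the PDF: zlib-compressed samples, no PNG framing.
pdf_style_stream = zlib.compress(pixels, 9)

# What an extractor has to produce to give you a .png file.
png_bytes = raw_rgb_to_png(zlib.decompress(pdf_style_stream), 2, 2)
```

The compression level and the row filter are up to whoever writes the PNG, which is one reason two extractors can end up with different file sizes from the same image data.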
pdfimages hints at this: all the other image output options say things like "write JPEG images as JPEG files", but the PNG output option says "change the default output format to PNG" (if you don't supply any arguments it spits out raw PPM files).
In fact, if you look at the size of the original PDF, it's 385 kB, which is more in line with the optimized filesize I ended up with. My guess is that mutool extract simply makes a bit more of an effort to recompress the image than pdfimages, but in both cases they're falling short of the original compression (at least for this PDF).
(completely unrelated, but I found it funny that the PDF uses the woke sans-serif font Helvetica)
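If anyone wants to repeat the size comparison, here's a minimal Python sketch. It assumes pdfimages from poppler-utils is on your PATH. The helper names, the file name document.pdf and the output directory are placeholders I made up. The -png and -all options are the ones listed in the pdfimages man page.

```python
import subprocess
from pathlib import Path
from typing import Optional


def pdfimages_argv(pdf: str, prefix: str, fmt: Optional[str] = "png") -> list:
    """Build the pdfimages command line; fmt=None keeps the default PPM/PBM output."""
    argv = ["pdfimages"]
    if fmt is not None:
        # e.g. -png, or -all to keep JPEGs and friends in their native format
        argv.append(f"-{fmt}")
    return argv + [pdf, prefix]


def total_size(paths) -> int:
    """Sum of the file sizes in bytes."""
    return sum(Path(p).stat().st_size for p in paths)


def compare(pdf: str, out_dir: str) -> None:
    """Extract images as PNG and print their combined size next to the PDF's size."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    subprocess.run(pdfimages_argv(pdf, str(out / "img")), check=True)
    # pdfimages names its output <prefix>-NNN.<ext>
    images = sorted(out.glob("img-*"))
    print(f"{pdf}: {Path(pdf).stat().st_size} bytes")
    print(f"{len(images)} extracted image(s): {total_size(images)} bytes")
```

Calling compare("document.pdf", "out") would then print both numbers side by side. If the extracted total comes out larger than the PDF itself, that matches what I saw here.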