untitled_backer

joined 1 month ago
[–] untitled_backer@lemmy.ml 0 points 1 month ago (2 children)

... So you did. I guess I was responding to your comment, forgot to read the thread. That's embarrassing. Not sure why you had problems. Are you still having trouble?

[–] untitled_backer@lemmy.ml 1 points 1 month ago (6 children)
 

I'm compiling a list of metadata (https://github.com/monotype-favorably/ep_files_list) and remember at one point seeing someone who had compiled a Google Doc of all the files which weren't actually PDFs. Anyone know of that?

To be clear, this is not request for where to find the files themselves but where to find a list of the filenames.

Happy to receive any other datasets in my search for compilation!

[–] untitled_backer@lemmy.ml 0 points 1 month ago (8 children)

Good link, thank you, that's awesome! I'm going to use that

[–] untitled_backer@lemmy.ml 3 points 1 month ago

Awesome, thank you! Great search-ability on that one

[–] untitled_backer@lemmy.ml 1 points 1 month ago

Thank you! Bates numbers, didn't know that's what they were called. Neat.

 

cross-posted from: https://lemmy.ml/post/43038910

I see a lot of fragmented datasets out there, does anyone know of something comprehensive (e.g. all files from all datasets) who is annotating the files and accepting submissions?

 

I see a lot of fragmented datasets out there, does anyone know of something comprehensive (e.g. all files from all datasets) who is annotating the files and accepting submissions?