WhatCD

joined 2 days ago
[–] WhatCD@lemmy.world 3 points 1 day ago* (last edited 1 day ago) (1 children)

Sounds good, thank you, just thinking we should avoid platforms like discord and look for something more respectful of privacy

[–] WhatCD@lemmy.world 1 points 1 day ago (4 children)

I would really like a chat of some kind, matrix maybe?

[–] WhatCD@lemmy.world 1 points 1 day ago

Perfect I'm on --startByte 134211436544 --endByte 144472801279 (9.56 GB)

[–] WhatCD@lemmy.world 2 points 1 day ago (11 children)

This would be the largest three gaps from what I have:

Three largest gaps:

--startByte 49981423616 --endByte 60299411455 (9.61 GB) --startByte 110131937280 --endByte 120424759295 (9.59 GB) --startByte 134211436544 --endByte 144472801279 (9.56 GB)

[–] WhatCD@lemmy.world 1 points 1 day ago (12 children)

The next question is who goes after what part.

[–] WhatCD@lemmy.world 0 points 1 day ago (15 children)

Ok updated the script. Added --startByte and --endByte and --totalFileBytes

https://pastebin.com/9Dj2Nhyb

Using --totalFileBytes 192613274080 avoids an HTTP head request at the beginning of the script making it slightly less brittle.

To grab the last 5 GB of the file you would add the following to your command:

--startByte 187244564960 --endByte 192613274079 --totalFileBytes 192613274080
[–] WhatCD@lemmy.world 1 points 1 day ago

Great idea, let me see what I can do!

[–] WhatCD@lemmy.world 4 points 1 day ago (20 children)

I don't know exactly, but seems about an hour or two if you get a 401 unauthorized.

Would you be interested in joining out effort here? I'm hoping to crowd source these chunks and then combine our effort.

[–] WhatCD@lemmy.world 2 points 1 day ago (1 children)

Yeah when I run into this I’ve switched browsers and it’s helped. I’ve also switched IP addresses and it’s helped.

[–] WhatCD@lemmy.world 3 points 1 day ago (24 children)

Updated the script to display information better: https://pastebin.com/S4gvw9q1

It has one library dependency so you'll have to do:

pip install rich

I haven't been getting blocked with this:

python script.py 'https://www.justice.gov/epstein/files/DataSet%209.zip' -o 'DataSet 9.zip' --cookies cookie.txt --retries 2 --referer 'https://www.justice.gov/age-verify?destination=%2Fepstein%2Ffiles%2FDataSet+9.zip' --ua '<set-this>' --timeout 90 -t 16 -c auto

The new script can auto set threads and chunks, I updated the main comment with more info about those.

I'm setting the --ua option which let's you override the user agent header. I'm making sure it matches the browser that I use to request the cookie.

[–] WhatCD@lemmy.world 2 points 1 day ago (30 children)

What happens when you go to https://www.justice.gov/epstein/files/DataSet%209.zip in your browser?

[–] WhatCD@lemmy.world 2 points 1 day ago

I would be interested in obtaining the chunks that you gathered and stitch them to what I gathered.

view more: next ›