Epstein Files Jan 30, 2026
Data hoarders on Reddit have been hard at work archiving the latest Epstein Files release from the U.S. Department of Justice. Below is a compilation of their work with download links.
Please seed all torrent files to distribute and preserve this data.
Epstein Files Data Sets 1-8: INTERNET ARCHIVE LINK
Epstein Files Data Set 1 (2.47 GB): TORRENT MAGNET LINK
Epstein Files Data Set 2 (631.6 MB): TORRENT MAGNET LINK
Epstein Files Data Set 3 (599.4 MB): TORRENT MAGNET LINK
Epstein Files Data Set 4 (358.4 MB): TORRENT MAGNET LINK
Epstein Files Data Set 5 (61.5 MB): TORRENT MAGNET LINK
Epstein Files Data Set 6 (53.0 MB): TORRENT MAGNET LINK
Epstein Files Data Set 7 (98.2 MB): TORRENT MAGNET LINK
Epstein Files Data Set 8 (10.67 GB): TORRENT MAGNET LINK
Epstein Files Data Set 9 (Incomplete). Contains only 49 GB of 180 GB. Multiple reports of the DOJ server cutting off downloads at byte offset 48995762176.
ORIGINAL JUSTICE DEPARTMENT LINK
- TORRENT MAGNET LINK (removed due to reports of CSAM)
/u/susadmin's More Complete Data Set 9 (96.25 GB)
De-duplicated merger of the 45.63 GB and 86.74 GB versions
- TORRENT MAGNET LINK (removed due to reports of CSAM)
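A note on the sizes quoted for Data Set 9: the cutoff offset above works out to about 49 GB in decimal units and about 45.63 in binary units (GiB). That may be why the same partial download is described with two different sizes, though this reading is an assumption and not something the original reports state. A minimal sketch of the conversion (the function names are illustrative):

```python
# Byte offset at which downloads were reportedly cut off (taken from the post).
CUTOFF_BYTES = 48_995_762_176


def to_gb(n_bytes: int) -> float:
    """Convert bytes to decimal gigabytes (1 GB = 10**9 bytes)."""
    return n_bytes / 10**9


def to_gib(n_bytes: int) -> float:
    """Convert bytes to binary gibibytes (1 GiB = 2**30 bytes)."""
    return n_bytes / 2**30


print(f"{to_gb(CUTOFF_BYTES):.2f} GB")    # 49.00 GB
print(f"{to_gib(CUTOFF_BYTES):.2f} GiB")  # 45.63 GiB
```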
Epstein Files Data Set 10 (78.64 GB)
ORIGINAL JUSTICE DEPARTMENT LINK
- TORRENT MAGNET LINK (removed due to reports of CSAM)
- INTERNET ARCHIVE FOLDER (removed due to reports of CSAM)
- INTERNET ARCHIVE DIRECT LINK (removed due to reports of CSAM)
Epstein Files Data Set 11 (25.55 GB)
ORIGINAL JUSTICE DEPARTMENT LINK
SHA1: 574950c0f86765e897268834ac6ef38b370cad2a
Epstein Files Data Set 12 (114.1 MB)
ORIGINAL JUSTICE DEPARTMENT LINK
SHA1: 20f804ab55687c957fd249cd0d417d5fe7438281
MD5: b1206186332bb1af021e86d68468f9fe
SHA256: b5314b7efca98e25d8b35e4b7fac3ebb3ca2e6cfd0937aa2300ca8b71543bbe2
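The checksums above can be used to confirm that a downloaded archive matches the one described here. A minimal sketch using Python's standard `hashlib`; the helper names and the file name `DataSet12.zip` are assumptions for illustration, so substitute the name of your local copy:

```python
import hashlib


def file_digests(path, algorithms=("md5", "sha1", "sha256"), chunk_size=1 << 20):
    """Compute several digests of a file in one pass, reading it in chunks."""
    hashers = {name: hashlib.new(name) for name in algorithms}
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}


def verify(path, expected):
    """Return {algorithm: True/False} for each expected checksum."""
    actual = file_digests(path, tuple(expected))
    return {name: actual[name] == value.lower() for name, value in expected.items()}


# Checksums listed in this post for Data Set 12.
DATA_SET_12 = {
    "sha1": "20f804ab55687c957fd249cd0d417d5fe7438281",
    "md5": "b1206186332bb1af021e86d68468f9fe",
    "sha256": "b5314b7efca98e25d8b35e4b7fac3ebb3ca2e6cfd0937aa2300ca8b71543bbe2",
}

# Usage (hypothetical file name):
#   print(verify("DataSet12.zip", DATA_SET_12))
```

Reading in chunks keeps memory use flat, which matters for the multi-gigabyte sets. For Data Set 11 only a SHA1 is listed, so pass a dictionary with just that key.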
This list will be edited as more data becomes available, particularly with regard to Data Set 9 (EDIT: NOT ANYMORE)
EDIT [2026-02-02]: After being made aware of potential CSAM in the original Data Set 9 releases and seeing confirmation in the New York Times, I will no longer support any effort to maintain links to archives of it. There is suspicion of CSAM in Data Set 10 as well. I am removing links to both archives.
Some in this thread may be upset by this action. It is right to be distrustful of a government that has not shown signs of integrity. However, I do trust journalists who hold the government accountable.
I am abandoning this project and removing any links to content that commenters here and on Reddit have suggested may contain CSAM.
Ref 1: https://www.nytimes.com/2026/02/01/us/nude-photos-epstein-files.html
Ref 2: https://www.404media.co/doj-released-unredacted-nude-images-in-epstein-files
DOJ Epstein Files: I found what's around those 3 missing files (Part 2)
Follow-up to my Dataset 9 indexing post. I pulled the adjacent files from my local copy of the torrent. What I found is... notable.
TLDR
The 3 missing files aren't random corruption. They all cluster around one event: Epstein's girlfriend Karyna Shuliak leaving St. Thomas (the island) in April 2016. And one of the gaps sits directly next to an email where Epstein recommends her a novel about a sympathetic pedophile—two days before the book was publicly released.
The Big Finding: Duplicate Processing Batches
Two of the missing files (326497 and 534391) are the same document processed twice—once with redactions, once without—208,000 files apart in the index.
Random file corruption hitting the same logical document in two separate processing runs, 208,000 positions apart? That's not how corruption works. That's how removal works.
What's Actually In These Files
I pulled everything around the gaps. It's all one email chain from April 10, 2016:
The event: Karyna Shuliak (Epstein's girlfriend) booked on Delta flight from Charlotte Amalie, St. Thomas → JFK on April 13, 2016.
St. Thomas is where you fly in/out to reach Little St. James. She was leaving the island.
The chain:
The unredacted batch (534xxx) reveals the email addresses that are blacked out in the redacted batch (326xxx):
The Epstein Email (EFTA00534392)
The document immediately after missing file 534391:
He's telling her to buy a book. The same day she's being booked to leave his island.
The Book
"Undone" by John Colapinto (Soft Skull Press)
On-sale date: April 12, 2016
Epstein's email: April 10, 2016
He recommended it two days before public release.
Publisher's description:
The protagonist is a pedophile who resents society for judging him.
The author (John Colapinto) is a New Yorker staff writer, former Vanity Fair and Rolling Stone contributor. Exactly the media circles Epstein cultivated.
What's Missing
So now we know the context:
EFTA00326497 — Between AmEx confirmation and Groff's forward. Probably the PDF ticket attachment referenced in the emails.
EFTA00326501 — Between the forward chain and Shuliak's reply. Unknown.
EFTA00534391 — Immediately before Epstein's personal email about the pedo book. Unknown, but its position is notable.
Open Questions
How did Epstein have this book before release? Advance copy? Knows the author?
What is 534391? It sits between staff logistics emails and Epstein's direct correspondence. Another Epstein email? An attachment?
Are there other Shuliak travel records with similar gaps? Is April 2016 unique or part of a pattern?
What else is in the corpus from jeevacation@gmail.com?
Verify It Yourself
Try the DOJ links (all return errors):
Check the torrent: Pull the EFTA numbers I listed. Confirm the gaps. Confirm the adjacencies.
Grep the corpus: Search for "QWURMO" (booking reference), "Shuliak", "jeevacation", "Colapinto"
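For the gap check, a minimal sketch of one way to find missing numbers in a run of EFTA identifiers. It works on a plain list of names (for example, lines from an index file) and is not the code from the linked repository. The `.pdf` extension and the function names are assumptions, and the sample listing below is synthetic, built only to illustrate the two gaps in the 326xxx batch:

```python
import re

# EFTA identifiers as written in this post: "EFTA" followed by 8 digits.
EFTA_RE = re.compile(r"EFTA(\d{8})")


def efta_numbers(names):
    """Extract the numeric part of every EFTA identifier found in the names."""
    found = set()
    for name in names:
        match = EFTA_RE.search(name)
        if match:
            found.add(int(match.group(1)))
    return found


def find_gaps(numbers, start, end):
    """Return every number in [start, end] that is absent from `numbers`."""
    return [n for n in range(start, end + 1) if n not in numbers]


def neighbours(numbers, missing, radius=2):
    """Return the identifiers present within `radius` positions of a gap."""
    window = range(missing - radius, missing + radius + 1)
    return sorted(n for n in window if n in numbers)


# Synthetic listing: a short run with the two 326xxx gaps left out.
names = [
    f"EFTA{n:08d}.pdf"
    for n in range(326495, 326504)
    if n not in (326497, 326501)
]
present = efta_numbers(names)
print(find_gaps(present, 326495, 326503))  # [326497, 326501]
print(neighbours(present, 326497))         # [326495, 326496, 326498, 326499]
print(534391 - 326497)                     # 207894, the "208,000 apart" figure
```

Running `find_gaps` over the full range of your own index should reproduce the three missing numbers if your copy matches mine, and `neighbours` lists which adjacent files to open.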
Summary
Three files missing from 531,256. All three cluster around one girlfriend's April 2016 departure from St. Thomas. Same gaps appear in two processing batches 208,000 files apart. One gap sits adjacent to Epstein personally recommending a novel about a sympathetic pedophile, sent before the book was even publicly available.
This isn't random corruption.
Full analysis + all code: https://github.com/degenai/Dataset9
If anyone has the torrent and wants to grep for Colapinto connections or other Shuliak trips, please do. This is open source for a reason.
Just skimming through, and I do have file 534391, but it shows 'No Images Produced'. Not sure if that was the reason for you as well, and apologies in advance! Here's an image of said file (https://lemmy.world/pictrs/image/d840f280-5e32-4417-a92e-ff281582080a.png)
That is new information! I wasn't even able to get that 'No Images Produced' page, so that's good to know, thank you. I just hit a file corruption error when I tried to download it from the DOJ. I guess this means the content is still missing in a way, but at least it's accounted for.
Yeah, for sure! I may be in over my head with this, but I want to believe that the DOJ uploaded a few different Data Set 9 zips. My theory is that with the first upload they just pushed it out without checking most of the unredacted files, then put up another set with more files redacted while also removing some files. Then came a set like mine, where some files show 'No Images Produced', and finally sets like yours that are missing those files completely. Then you've got the news outlets, which probably have the complete dataset. With the DOJ taking everything down due to CSAM, they could possibly try to dox or charge people for even having it, and that's possibly one of the many reasons OP left/stopped. Again, just a theory, call it a conspiracy lol