
DOJ Epstein Files: I found what’s around those 3 missing files (Part 2)
Follow-up to my Dataset 9 indexing post. I pulled the adjacent files from my local copy of the torrent. What I found is… notable.
TLDR
The 3 missing files aren’t random corruption. They all cluster around one event: Epstein’s girlfriend Karyna Shuliak leaving St. Thomas (the island) in April 2016. And one of the gaps sits directly next to an email where Epstein recommends her a novel about a sympathetic pedophile—two days before the book was publicly released.
The Big Finding: Duplicate Processing Batches
Two of the missing files (326497 and 534391) are the same document processed twice—once with redactions, once without—208,000 files apart in the index.
| Redacted Batch | Unredacted Batch | Content |
|---|---|---|
| 326494-326496 | 534388-534390 | AmEx travel booking, staff emails |
| 326497 - MISSING | 534391 - MISSING | ??? |
| 326498-326500 | — | Email chain continues |
| 326501 - MISSING | — | ??? |
| 326502-326506 | — | Reply + Invoice |
| — | 534392 | Epstein personal email |
Random file corruption hitting the same logical document in two separate processing runs, 208,000 positions apart? That’s not how corruption works. That’s how removal works.
What’s Actually In These Files
I pulled everything around the gaps. It’s all one email chain from April 10, 2016:
The event: Karyna Shuliak (Epstein’s girlfriend) booked on Delta flight from Charlotte Amalie, St. Thomas → JFK on April 13, 2016.
St. Thomas is where you fly in/out to reach Little St. James. She was leaving the island.
The chain:
- 11:31 AM — AmEx Centurion (black card) sends confirmation to lesley.jee@gmail.com
- 11:33 AM — Lesley Groff (Epstein’s executive assistant) forwards to Shuliak, CC’s staff
- 11:35 AM — Shuliak replies “Thanks so much”
- 3:52 PM — Epstein personally emails Shuliak
- Next day — AmEx sends invoice
The unredacted batch (534xxx) reveals the email addresses that are blacked out in the redacted batch (326xxx):
- Lesley Groff: lesley.jee@gmail.com
- Ann Rodriquez: annrodriquez@yahoo.com
- Bella Klein: bklein575@gmail.com
- Karyna Shuliak: karynashuliak@icloud.com
The Epstein Email (EFTA00534392)
The document immediately after missing file 534391:
From: "jeffrey E." <jeevacation@gmail.com>
To: Karyna Shuliak
Date: Sun, 10 Apr 2016 19:52:13 +0000
order http://softskull.com/dd-product/undone/
He’s telling her to buy a book. The same day she’s being booked to leave his island.
The Book
“Undone” by John Colapinto (Soft Skull Press)
On-sale date: April 12, 2016
Epstein’s email: April 10, 2016
He recommended it two days before public release.
Publisher’s description:
“Dez is a former lawyer and teacher—an ephebophile with a proclivity for teenage girls, hiding out in a trailer park with his latest conquest, Chloe. Having been in and out of courtrooms (and therapists’ offices) for a number of years, Dez is at odds with a society that persecutes him over his desires.”
The protagonist is a pedophile who resents society for judging him.
The author (John Colapinto) is a New Yorker staff writer, former Vanity Fair and Rolling Stone contributor. Exactly the media circles Epstein cultivated.
What’s Missing
So now we know the context:
-
EFTA00326497 — Between AmEx confirmation and Groff’s forward. Probably the PDF ticket attachment referenced in the emails.
-
EFTA00326501 — Between the forward chain and Shuliak’s reply. Unknown.
-
EFTA00534391 — Immediately before Epstein’s personal email about the pedo book. Unknown, but its position is notable.
Open Questions
-
How did Epstein have this book before release? Advance copy? Knows the author?
-
What is 534391? It sits between staff logistics emails and Epstein’s direct correspondence. Another Epstein email? An attachment?
-
Are there other Shuliak travel records with similar gaps? Is April 2016 unique or part of a pattern?
-
What else is in the corpus from jeevacation@gmail.com?
Verify It Yourself
Try the DOJ links (all return errors):
- https://www.justice.gov/epstein/files/DataSet 9/EFTA00326497.pdf
- https://www.justice.gov/epstein/files/DataSet 9/EFTA00326501.pdf
- https://www.justice.gov/epstein/files/DataSet 9/EFTA00534391.pdf
Check the torrent: Pull the EFTA numbers I listed. Confirm the gaps. Confirm the adjacencies.
Grep the corpus: Search for “QWURMO” (booking reference), “Shuliak”, “jeevacation”, “Colapinto”
Summary
Three files missing from 531,256. All three cluster around one girlfriend’s April 2016 departure from St. Thomas. Same gaps appear in two processing batches 208,000 files apart. One gap sits adjacent to Epstein personally recommending a novel about a sympathetic pedophile, sent before the book was even publicly available.
This isn’t random corruption.
Full analysis + all code: https://github.com/degenai/Dataset9
If anyone has the torrent and wants to grep for Colapinto connections or other Shuliak trips, please do. This is open source for a reason.
We definitely need a crowdsourced method for going through all the files. I am currently building a solo cytoscape tool to try out making an affiliation graph, but expanding this to be a tool for a community, with authorization to just allow whitelisted individuals work on it, that’s beyond my scope and I can’t volunteer to make such an important tool, but I am happy to offer my help building it. I can convert my existing tool to a prototype if anyone wants to collaborate with me on it. I am an amateur, but I will spend all the Cursor Credits on this.