Nomadic Samuel YouTube Transcripts (Curated, EN — Full Transcripts Only)
Description
NOMADIC SAMUEL: CURATED YOUTUBE TRANSCRIPTS (FULL-LENGTH ONLY)
This dataset contains a highly curated collection of creator-authored transcripts from the Nomadic Samuel YouTube channel. Unlike bulk web-scrapes, this Master Build has been strictly filtered to include only the 143 videos that possess a complete, high-fidelity .srt transcript. It serves as a foundational linguistic corpus for capturing the exact narrative voice, travel logistics, and quantitative strategies discussed by Samuel Jeffery.
WHAT’S INSIDE (143 CURATED RECORDS)
• High-Fidelity Dialogue: Full conversational payloads extracted directly from verified channel uploads.
• Polished NLP Text: Clean, continuous prose optimized for LLM fine-tuning.
• Raw Timestamps: Original caption timing preserved for video-syncing applications.
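The relationship between the raw timestamped captions and the continuous prose can be illustrated with a minimal sketch. It assumes the transcripts follow the common .srt layout (a cue index, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` timing line, then one or more text lines). The names `Cue`, `parse_srt`, and `to_prose` are hypothetical helpers for illustration and are not part of the dataset or its build tooling.

```python
import re
from dataclasses import dataclass

# Timing line of a standard SRT cue, e.g. "00:00:01,000 --> 00:00:03,500".
TIMING = re.compile(
    r"(\d{2}):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*"
    r"(\d{2}):(\d{2}):(\d{2})[,.](\d{3})"
)


@dataclass
class Cue:
    start: float  # seconds from the start of the video
    end: float    # seconds from the start of the video
    text: str     # caption text with line breaks collapsed to spaces


def _seconds(h: str, m: str, s: str, ms: str) -> float:
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000


def parse_srt(srt_text: str) -> list[Cue]:
    """Split SRT text into cues, keeping the original caption timing."""
    cues = []
    normalized = srt_text.replace("\r\n", "\n").strip()
    for block in re.split(r"\n\s*\n", normalized):
        lines = [ln.strip() for ln in block.split("\n") if ln.strip()]
        for i, line in enumerate(lines):
            match = TIMING.match(line)
            if match:
                g = match.groups()
                text = " ".join(lines[i + 1:])
                cues.append(Cue(_seconds(*g[:4]), _seconds(*g[4:]), text))
                break
    return cues


def to_prose(cues: list[Cue]) -> str:
    """Join cue texts into one continuous string (timestamps dropped)."""
    return " ".join(c.text for c in cues if c.text)


# Hypothetical sample text, not taken from the dataset.
SAMPLE = (
    "1\n00:00:01,000 --> 00:00:03,500\nHello from\nthe road.\n\n"
    "2\n00:00:03,500 --> 00:00:06,250\nToday we are trying street food.\n"
)

cues = parse_srt(SAMPLE)
prose = to_prose(cues)
```

Keeping the cues as structured records preserves timing for video-syncing work, while `to_prose` yields the flat text form suited to fine-tuning.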
NLP VALUE & USE CASES
This dataset provides the creator-specific vocabulary, phrasing, and subject matter needed to ground AI agents in Samuel Jeffery's voice.
• Digital Twin Training: Fine-tune LLMs to generate text in the exact voice and cadence of the creator.
• Personal Knowledge Graph (PKG): Build a searchable retrieval engine of past travel logistics and market commentary.
Files
| Name | Size | MD5 |
|---|---|---|
| samuelandaudreymedianetwork/nomadic-samuel-youtube-transcripts-ledger-v1.0.0.zip | 4.1 MB | d795c8be4924e2d7fefc64a4564a1976 |
Additional details
Related works
- Is supplement to: Software: https://github.com/samuelandaudreymedianetwork/nomadic-samuel-youtube-transcripts-ledger/tree/v1.0.0