Invisible Threads Skill

cdeistopened 8 2 Updated 5mo ago

Install

npx skillscat add cdeistopened/content-os/skills-writing-invisible-threads

Install via the SkillsCat registry.

SKILL.md

Discovers non-obvious thematic connections ("invisible threads") across a corpus of essays, articles, or transcripts.

Adapted from invisible-threads.

When to Use

Editing an anthology and looking for thematic connections
Analyzing a body of work for recurring ideas
Finding quotes that support a theme across multiple sources
Writing editorial commentary that weaves sources together

Quick Start

# 1. Prepare your corpus as markdown files in a source directory

# 2. Chunk the corpus into a database
python chunk_corpus.py --source /path/to/markdown/files --output corpus.db

# 3. Extract insights using Gemini
python extract_insights.py --db corpus.db --backend gemini

# 4. Find threads
python find_threads.py --input data/insights_*.json

# 5. Review threads_*.json for editorial use
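Step 2 can be pictured roughly as follows. This is a minimal sketch of paragraph-based chunking into SQLite, not the actual contents of chunk_corpus.py: the function names, the `chunks` table layout, and the `max_chars` limit are all assumptions made for illustration.

```python
import sqlite3
from pathlib import Path


def split_into_chunks(text: str, max_chars: int = 1500) -> list[str]:
    """Group paragraphs into chunks of at most max_chars.

    A single paragraph longer than max_chars is kept whole.
    """
    chunks, current = [], ""
    for para in (p.strip() for p in text.split("\n\n")):
        if not para:
            continue
        candidate = f"{current}\n\n{para}" if current else para
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = para
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def chunk_corpus(source_dir: str, db_path: str, max_chars: int = 1500) -> int:
    """Store the chunks of every .md file in source_dir; return the chunk count."""
    conn = sqlite3.connect(db_path)
    # Hypothetical schema: the real database layout is not documented here.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS chunks ("
        "id INTEGER PRIMARY KEY, source TEXT, position INTEGER, text TEXT)"
    )
    total = 0
    for path in sorted(Path(source_dir).glob("*.md")):
        pieces = split_into_chunks(path.read_text(encoding="utf-8"), max_chars)
        conn.executemany(
            "INSERT INTO chunks (source, position, text) VALUES (?, ?, ?)",
            [(path.name, i, piece) for i, piece in enumerate(pieces)],
        )
        total += len(pieces)
    conn.commit()
    conn.close()
    return total
```

Splitting on blank lines keeps paragraphs intact, which tends to matter when the extracted insights are meant to be quotable.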

Adapting for Your Project

The key file to modify is the extraction prompt in extract_insights.py.

Default Prompt Categories

The default is set up for Catholic agrarian content (Cross & Plough):

industrialism, land, family, property, craft, liturgy
organic-farming, distributism, eugenics, totalitarianism
natural-law, peasantry, economics, spirituality

For Other Projects

Change the EXTRACTION_PROMPT in extract_insights.py:

Context section: Describe what the corpus is
Examples section: Give 3 examples of genuine insights from this domain
Categories: List 10-15 thematic categories relevant to your content

See references/prompt-templates.md for examples.
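As a rough illustration of the three parts listed above, the sketch below assembles a prompt from a context line, three examples, and a category list. The actual format of EXTRACTION_PROMPT is not shown in this document, so `build_extraction_prompt`, the section labels, and the `{chunk}` placeholder are assumptions.

```python
def build_extraction_prompt(
    context: str, examples: list[str], categories: list[str]
) -> str:
    """Assemble a prompt with context, examples, and categories sections.

    Hypothetical helper: the real EXTRACTION_PROMPT may be laid out differently.
    """
    if not 10 <= len(categories) <= 15:
        raise ValueError("expected 10-15 thematic categories")
    example_lines = "\n".join(f"{n}. {ex}" for n, ex in enumerate(examples, 1))
    return (
        f"CONTEXT\n{context}\n\n"
        f"EXAMPLES OF GENUINE INSIGHTS\n{example_lines}\n\n"
        f"CATEGORIES (choose one per insight)\n{', '.join(categories)}\n\n"
        # Literal placeholder, filled in per chunk at extraction time.
        "CHUNK\n{chunk}"
    )


# The default categories listed earlier, used here as sample input.
DEFAULT_CATEGORIES = [
    "industrialism", "land", "family", "property", "craft", "liturgy",
    "organic-farming", "distributism", "eugenics", "totalitarianism",
    "natural-law", "peasantry", "economics", "spirituality",
]

prompt = build_extraction_prompt(
    context="Catholic agrarian essays (Cross & Plough).",
    examples=["<example insight 1>", "<example insight 2>", "<example insight 3>"],
    categories=DEFAULT_CATEGORIES,
)
# Fill the placeholder with str.replace so braces in the corpus text are safe.
filled = prompt.replace("{chunk}", "<text of one chunk>")
```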

Output

insights_*.json

Each insight includes:

insight_text: The extracted quote/idea
category: Thematic category
novelty_score: How surprising the insight is, on a 1-10 scale
specificity_score: How quotable the insight is, on a 1-10 scale
source: Which document it came from
raw_chunk: Original context
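One way to use these fields is to shortlist insights that score high on both scales. The sketch below assumes an insights file is a JSON array of objects with the fields listed above; the sample records, the thresholds, and `top_insights` are made up for illustration.

```python
import json

# Hypothetical sample in the shape described above; real records come
# from extract_insights.py and carry actual quotes and context.
SAMPLE = json.loads("""
[
  {"insight_text": "<quote A>", "category": "land", "novelty_score": 8,
   "specificity_score": 9, "source": "essay_a.md", "raw_chunk": "<context A>"},
  {"insight_text": "<quote B>", "category": "craft", "novelty_score": 4,
   "specificity_score": 9, "source": "essay_b.md", "raw_chunk": "<context B>"},
  {"insight_text": "<quote C>", "category": "land", "novelty_score": 9,
   "specificity_score": 7, "source": "essay_c.md", "raw_chunk": "<context C>"}
]
""")


def top_insights(insights, min_novelty=7, min_specificity=7):
    """Keep insights meeting both thresholds, best combined score first."""
    keep = [
        item for item in insights
        if item["novelty_score"] >= min_novelty
        and item["specificity_score"] >= min_specificity
    ]
    return sorted(
        keep,
        key=lambda item: item["novelty_score"] + item["specificity_score"],
        reverse=True,
    )


for item in top_insights(SAMPLE):
    print(item["source"], item["category"], item["insight_text"])
```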

threads_*.json

Each thread includes:

thread_id: Identifier
category: Dominant theme
size: Number of connected insights
num_sources: How many different documents
years_spanned: Timeline (if applicable)
insights: All insights in this thread
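For editorial review, threads that draw on several documents are usually the interesting ones. The sketch below ranks threads by `num_sources` and then `size`, assuming a threads file is a JSON array of objects with the fields listed above. The sample values, the integer `thread_id`, and both helper functions are hypothetical, and the remaining fields are omitted for brevity.

```python
# Hypothetical sample threads; years_spanned and insights are left out.
SAMPLE_THREADS = [
    {"thread_id": 0, "category": "craft", "size": 5, "num_sources": 1},
    {"thread_id": 1, "category": "land", "size": 4, "num_sources": 3},
    {"thread_id": 2, "category": "family", "size": 6, "num_sources": 3},
]


def cross_source_threads(threads, min_sources=2):
    """Return threads spanning at least min_sources documents, widest first."""
    picked = [t for t in threads if t["num_sources"] >= min_sources]
    return sorted(
        picked, key=lambda t: (t["num_sources"], t["size"]), reverse=True
    )


def summarize(thread):
    """One-line summary suitable for an editor's working notes."""
    return (
        f"Thread {thread['thread_id']} ({thread['category']}): "
        f"{thread['size']} insights across {thread['num_sources']} sources"
    )


for thread in cross_source_threads(SAMPLE_THREADS):
    print(summarize(thread))
```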

Editorial Applications

Reorganize structure around discovered threads
Write headnotes referencing how themes recur
Add footnotes pointing to related pieces
Find unused quotes that connect to themes
Write bridge paragraphs between sections

Scripts

Script	Purpose
`chunk_corpus.py`	Split markdown files into a database
`extract_insights.py`	Extract insights using an LLM (Gemini/Claude/Ollama)
`find_threads.py`	Cluster insights into thematic threads

Requirements

google-generativeai>=0.3.0  # For Gemini
sentence-transformers>=2.2.0
networkx>=3.0
python-louvain>=0.16
scikit-learn>=1.0
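The requirements hint at an embedding-plus-graph approach to clustering, though how find_threads.py actually combines these libraries is not shown here. The stdlib-only sketch below illustrates the general idea under loud assumptions: hand-made toy vectors stand in for sentence-transformers embeddings, and plain connected components stand in for Louvain community detection. The function names and the threshold are hypothetical.

```python
import math
from itertools import combinations


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def group_by_similarity(vectors, threshold=0.9):
    """Link vectors whose similarity meets the threshold, then return the
    connected components as sorted lists of indices (union-find)."""
    parent = list(range(len(vectors)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(vectors)), 2):
        if cosine(vectors[i], vectors[j]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(vectors)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Toy 2-D "embeddings": two near the x-axis, two near the y-axis.
toy = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
print(group_by_similarity(toy))
```

Connected components merge anything reachable through a chain of similar pairs, so a real pipeline would likely prefer community detection to keep loosely linked themes apart.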

Cost

Backend	Cost per ~1000 chunks	Speed
Gemini Flash	Free tier	Fast
Gemini Pro	~$1-2	Fast
Claude Haiku	~$0.50	Fast
Ollama	Free	Slow
