parallel-web

parallel-web-extract

"URL content extraction. Use for fetching any URL - webpages, articles, PDFs, JavaScript-heavy sites. Token-efficient: runs in forked context. Prefer over built-in WebFetch."

parallel-web 53 Updated 3mo ago

Install

npx skillscat add parallel-web/parallel-agent-skills/parallel-web-extract

Install via the SkillsCat registry.

SKILL.md

URL Extraction

Extract content from: $ARGUMENTS

Command

parallel-cli extract "$ARGUMENTS" --json

Options if needed:

  • --objective "focus area" to narrow extraction to specific content
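The command and its optional flag can be combined as in the sketch below. The wrapper name `extract_url` is hypothetical and only for illustration; `parallel-cli extract`, `--json`, and `--objective` are the only parts taken from this skill, and the flag ordering shown is an assumption.

```shell
# Hypothetical wrapper (not part of parallel-cli): runs the extract command
# for one URL and adds --objective only when a focus area is given.
extract_url() {
  url=$1
  objective=${2:-}
  if [ -n "$objective" ]; then
    parallel-cli extract "$url" --json --objective "$objective"
  else
    parallel-cli extract "$url" --json
  fi
}

# Example calls (they require parallel-cli to be installed and authenticated):
#   extract_url "https://example.com/pricing"
#   extract_url "https://example.com/pricing" "pricing tiers"
```

Quoting "$url" and "$objective" keeps URLs with query strings and multi-word objectives intact as single arguments.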

Response format

Return content as:

Page Title

Then the extracted content verbatim, with these rules:

  • Keep content verbatim - do not paraphrase or summarize
  • Parse lists exhaustively - extract EVERY numbered/bulleted item
  • Strip only obvious noise: nav menus, footers, ads
  • Preserve all facts, names, numbers, dates, quotes

Setup

If parallel-cli is not found, install it:

curl -fsSL https://parallel.ai/install.sh | bash

If that installer does not work, install via pipx instead:

pipx install "parallel-web-tools[cli]"
pipx ensurepath

Then authenticate:

parallel-cli login

Or set an API key: export PARALLEL_API_KEY="your-key"
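Before running an extraction, the setup state can be checked with a short pre-flight function. This is a sketch under stated assumptions: `check_parallel_setup` is a hypothetical helper, and an unset PARALLEL_API_KEY does not prove the CLI is unauthenticated, since `parallel-cli login` may already have stored credentials.

```shell
# Hypothetical pre-flight check (not part of parallel-cli): reports which
# Setup step, if any, still looks outstanding.
check_parallel_setup() {
  # command -v succeeds only if parallel-cli resolves on the current PATH
  if ! command -v parallel-cli >/dev/null 2>&1; then
    echo "parallel-cli not found: run the install step first"
    return 1
  fi
  # Assumption: an empty variable means the user relies on 'parallel-cli login'
  if [ -z "${PARALLEL_API_KEY:-}" ]; then
    echo "PARALLEL_API_KEY is not set: relying on 'parallel-cli login' credentials, if any"
    return 0
  fi
  echo "parallel-cli found and PARALLEL_API_KEY is set"
}
```

Calling `check_parallel_setup` returns a non-zero status only when the CLI itself is missing, so it can gate an install step in a script.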