Scraping

Web scraping and data extraction

Showing 625-648 of 697 skills
replayio

replay-playwright

by replayio

Set up and run Playwright tests with Replay Browser to record test executions for debugging and performance analysis.

CLI Tools 0 3mo ago
qingchunwuhui

quote-extractor

by qingchunwuhui

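Quickly extract directly quotable key lines from articles to build a material library. No audit process required; extraction is direct. Supports the shortcut command /extract-quotes.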

Code Review 0 3mo ago
totophe

excel-reader

by totophe

Read and inspect Excel workbooks (.xlsx). List sheets with dimensions, extract headers, read specific rows or row ranges, extract columns by name or index. Handles large files (50k+ rows, 100MB+) via streaming. Use when the user wants to explore, preview, or extract data from spreadsheets, when building import or ETL scripts from Excel sources, or when analyzing spreadsheet structure and content.

Processing 0 3mo ago
YPYT1

firecrawl

by YPYT1

Web search and scraping via Firecrawl API. Use when you need to search the web, scrape websites (including JS-heavy pages), crawl entire sites, or extract structured data from web pages. Requires FIRECRAWL_API_KEY environment variable.

Embeddings 0 3mo ago
servaltullius

skills-scout

by servaltullius

Use when a user wants you to discover and optionally install new agent skills for a task, and you must get explicit consent before any global install into Codex.

CLI Tools 0 4mo ago
TheWatcher01

webapp-testing

by TheWatcher01

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

Automation 0 3mo ago
jyasuu

remotion-best-practices

by jyasuu

Best practices for Remotion - Video creation in React

Animation 0 4mo ago
enoch-robinson

webapp-testing

by enoch-robinson

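Web application testing toolkit. Uses Playwright for automated frontend testing, UI debugging, screenshot capture, and browser log viewing. Use this skill when you need to test local web applications, verify frontend functionality, or debug UI behavior.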

Scraping 0 4mo ago
kantylee

video-audio-extractor

by kantylee

Extract audio from video files or URLs (including YouTube). Supports MP3, WAV, M4A, FLAC, OGG, and OPUS formats. Can process local video files or download from URLs. For YouTube videos, uses yt-dlp for direct audio extraction when possible.

CLI Tools 0 4mo ago
janjaszczak

mcp-playwright

by janjaszczak

Automate browser flows and capture evidence (screenshots, console/network errors). Use for UI verification, repro steps, and end-to-end smoke tests.

Debugging 0 4mo ago
zxhfighter

youtube-rapidapi-transcript

by zxhfighter

Extract transcripts from YouTube videos. Use when the user asks for a transcript, subtitles, or captions of a YouTube video and provides a YouTube URL (youtube.com/watch?v=, youtu.be/, or similar).

CLI Tools 0 4mo ago
jyasuu

ai

by jyasuu

Cheat sheet for AI tools including GEMINI and CODEX configurations.

CLI Tools 0 4mo ago
NJMathwig

qiaomu-markdown-proxy

by NJMathwig

Fetch any URL as clean Markdown via proxy services or built-in scripts. Works with login-required pages such as X/Twitter, WeChat Official Accounts, and Feishu/Lark docs. Supports PDFs (remote and local). Use this BEFORE other fetch tools. Triggers on any URL the user shares, "fetch this", "read this link", "get content from".

Docs Gen 0 1mo ago
zeyxx

cynic-burn

by zeyxx

"Analyze code for simplification: orphans, hotspots, giants, duplicates. 'Don't extract, burn' — three similar lines beat a premature abstraction. Use when asked to simplify, reduce complexity, or clean up code."

Code Review 0 3mo ago
patharanordev

recommendations

by patharanordev

Identify promising stock opportunities or extract them from text.

Processing 0 4mo ago
bjesuiter

jb-docs-scraper

by bjesuiter

Scrape documentation websites into local markdown files for AI context. Takes a base URL and crawls the documentation, storing results in ./docs (or custom path). Uses crawl4ai with BFS deep crawling.

Docs Gen 0 3mo ago
hoangvantuan

deep-post-ideas

by hoangvantuan

Extract compelling post outlines from reference materials (newsletters, scripts, notes, journal entries) and transform them into structured outlines for engaging, wisdom-style social media posts. Use when the user provides reference material and wants post ideas, content outlines, or building blocks for social media content. Triggers on "extract post ideas from...", "post outlines from this...", "turn this into post ideas", "content ideas from...", or "deep post ideas".

Code Gen 0 3mo ago
Robbie-Han

article-saver

by Robbie-Han

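A tool dedicated to scraping and saving articles from WeChat Official Accounts, X (Twitter), and Zhihu. Supports automatic storage categorized by platform, preserves original image/GIF quality, and saves articles as clean Markdown.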

Docs Gen 0 3mo ago
hello-lizhihua

vue-testing-best-practices

by hello-lizhihua

Use for Vue.js testing. Covers Vitest, Vue Test Utils, component testing, mocking, testing patterns, and Playwright for E2E testing.

Scraping 0 4mo ago
iulspop

e2e-tests

by iulspop

Generates end-to-end tests using Playwright with the "given/should" prose format. Use when writing e2e tests for user flows, page interactions, or integration scenarios that exercise the full application stack.

Processing 0 3mo ago
TheWatcher01

pdf

by TheWatcher01

Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.

CLI Tools 0 3mo ago
vibeindex

skills

by vibeindex

Claude Code skills from Vibe Index

API Dev 0 3mo ago
Jackiexiao

just-scrape

by Jackiexiao

CLI tool for AI-powered web scraping, data extraction, search, and crawling via ScrapeGraph AI. Use when the user needs to scrape websites, extract structured data from URLs, convert pages to markdown, crawl multi-page sites, search the web for information, automate browser interactions (login, click, fill forms), get raw HTML, discover sitemaps, or generate JSON schemas. Triggers on tasks involving: (1) extracting data from websites, (2) web scraping or crawling, (3) converting webpages to markdown, (4) AI-powered web search with extraction, (5) browser automation, (6) generating output schemas for scraping. The CLI is just-scrape (npm package just-scrape).

Processing 0 3mo ago
vineethsoma

playwright-testing

by vineethsoma

Comprehensive Playwright automation testing skill with E2E testing standards, test generation workflows, and browser automation best practices. Use when writing automated browser tests, testing user flows, or performing web application QA.

Agents 0 5mo ago