Scraping
Web scraping and data extraction
replay-playwright
by replayio
Set up and run Playwright tests with Replay Browser to record test executions for debugging and performance analysis.
quote-extractor
by qingchunwuhui
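Quickly extract directly quotable key lines from articles to build a material library. No audit process is required; quotes are extracted directly. Supports the shortcut command /extract-quotes.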
excel-reader
by totophe
Read and inspect Excel workbooks (.xlsx). List sheets with dimensions, extract headers, read specific rows or row ranges, extract columns by name or index. Handles large files (50k+ rows, 100MB+) via streaming. Use when the user wants to explore, preview, or extract data from spreadsheets, when building import or ETL scripts from Excel sources, or when analyzing spreadsheet structure and content.
firecrawl
by YPYT1
Web search and scraping via Firecrawl API. Use when you need to search the web, scrape websites (including JS-heavy pages), crawl entire sites, or extract structured data from web pages. Requires FIRECRAWL_API_KEY environment variable.
skills-scout
by servaltullius
Use when a user wants you to discover and optionally install new agent skills for a task, and you must get explicit consent before any global install into Codex.
webapp-testing
by TheWatcher01
Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.
remotion-best-practices
by jyasuu
Best practices for Remotion - Video creation in React
webapp-testing
by enoch-robinson
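Web application testing toolkit. Uses Playwright for automated frontend testing, UI debugging, screenshot capture, and browser log viewing. Use this skill when you need to test a local web application, verify frontend functionality, or debug UI behavior.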
video-audio-extractor
by kantylee
Extract audio from video files or URLs (including YouTube). Supports MP3, WAV, M4A, FLAC, OGG, and OPUS formats. Can process local video files or download from URLs. For YouTube videos, uses yt-dlp for direct audio extraction when possible.
mcp-playwright
by janjaszczak
Automate browser flows and capture evidence (screenshots, console/network errors). Use for UI verification, repro steps, and end-to-end smoke tests.
youtube-rapidapi-transcript
by zxhfighter
Extract transcripts from YouTube videos. Use when the user asks for a transcript, subtitles, or captions of a YouTube video and provides a YouTube URL (youtube.com/watch?v=, youtu.be/, or similar).
ai
by jyasuu
Cheat sheet for AI tools including GEMINI and CODEX configurations.
qiaomu-markdown-proxy
by NJMathwig
Fetch any URL as clean Markdown via proxy services or built-in scripts. Works with login-required pages like X/Twitter, WeChat Official Accounts, and Feishu/Lark docs. Supports PDFs (remote and local). Use this BEFORE other fetch tools. Triggers on any URL the user shares, "fetch this", "read this link", "get content from".
cynic-burn
by zeyxx
Analyze code for simplification: orphans, hotspots, giants, duplicates. "Don't extract, burn": three similar lines beat a premature abstraction. Use when asked to simplify, reduce complexity, or clean up code.
recommendations
by patharanordev
Identify promising stock opportunities or extract them from text.
jb-docs-scraper
by bjesuiter
Scrape documentation websites into local markdown files for AI context. Takes a base URL and crawls the documentation, storing results in ./docs (or custom path). Uses crawl4ai with BFS deep crawling.
deep-post-ideas
by hoangvantuan
Extract compelling post outlines from reference materials (newsletters, scripts, notes, journal entries) and transform them into structured outlines for engaging, wisdom-style social media posts. Use when the user provides reference material and wants post ideas, content outlines, or building blocks for social media content. Triggers on "extract post ideas from...", "post outlines from this...", "turn this into post ideas", "content ideas from...", or "deep post ideas".
article-saver
by Robbie-Han
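A tool dedicated to scraping and saving articles from WeChat Official Accounts, X (Twitter), and Zhihu. Supports automatic storage categorized by platform, preserves original image/GIF quality, and saves articles as clean Markdown.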
vue-testing-best-practices
by hello-lizhihua
Use for Vue.js testing. Covers Vitest, Vue Test Utils, component testing, mocking, testing patterns, and Playwright for E2E testing.
e2e-tests
by iulspop
Generates end-to-end tests using Playwright with the "given/should" prose format. Use when writing e2e tests for user flows, page interactions, or integration scenarios that exercise the full application stack.
by TheWatcher01
Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.
skills
by vibeindex
Claude Code skills from Vibe Index
just-scrape
by Jackiexiao
CLI tool for AI-powered web scraping, data extraction, search, and crawling via ScrapeGraph AI. Use when the user needs to scrape websites, extract structured data from URLs, convert pages to markdown, crawl multi-page sites, search the web for information, automate browser interactions (login, click, fill forms), get raw HTML, discover sitemaps, or generate JSON schemas. Triggers on tasks involving: (1) extracting data from websites, (2) web scraping or crawling, (3) converting webpages to markdown, (4) AI-powered web search with extraction, (5) browser automation, (6) generating output schemas for scraping. The CLI is just-scrape (npm package just-scrape).
playwright-testing
by vineethsoma
Comprehensive Playwright automation testing skill with E2E testing standards, test generation workflows, and browser automation best practices. Use when writing automated browser tests, testing user flows, or performing web application QA.