Scraping

Web scraping and data extraction

Showing 601-624 of 697 skills
ianphil

Browser Automation with playwright-cli (Extension Mode)

by ianphil

CLI Tools 0 3mo ago
Lichens-Innovation

react-single-responsibility

by Lichens-Innovation

Strategies to simplify components, hooks, and methods: decomposition order (utilities, hooks, sub-components), early returns, control flow, parameter design, and code smell fixes. Use when the user says: ungodify this method/function/component, simplify this method/function/component, make this method/function/component less complex; or when refactoring a large component, hook, or function, reducing complexity, applying single responsibility, or asking how to simplify a component, hook, or method.

File Ops 0 3mo ago
viktor-silakov

auditing-bdd-tests

by viktor-silakov

Analyzes BDD (Gherkin) + Playwright test solutions. Produces aspect scoring (0/5/10), A–F grade, issues by severity, and Markdown + HTML report formats for deep test-solution analysis.

Code Review 0 4mo ago
spider-rs

spider-cli-extraction

by spider-rs

Use Spider Rust CLI for crawling, scraping, and extraction tasks from the terminal. Trigger this skill when a user asks to run or compose spider CLI commands, tune crawl/scrape options, export links or HTML, or control runtime browser mode with --headless or --http.

API Dev 0 3mo ago
danbars

writing-meeting-notes

by danbars

Use when a meeting just occurred and notes need to be turned into a clear summary with decisions, action items, owners, and dates.

Code Gen 0 4mo ago
InfQuest

audio-extract

by InfQuest

Extract audio from video files. Use when the user wants to extract audio, convert video to audio, export audio, or get the audio track out of a video (Chinese trigger phrases: 提取音频, 抽取音频, 视频转音频, 导出音频, 把视频的声音提取出来).

CLI Tools 0 4mo ago
RomainJeff

web-fetch-linkup

by RomainJeff

Fetch and extract clean content from any web page using Linkup API

Docs Gen 0 3mo ago
Victory-Hugo

pdf

by Victory-Hugo

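A comprehensive PDF toolkit for extracting text and tables, creating new PDFs, merging or splitting documents, and handling forms. Use when you need to fill in PDF forms, or to programmatically batch-process, generate, or analyze PDF documents.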

CLI Tools 0 4mo ago
ychoi-kr

pdf-toc-bookmarks

by ychoi-kr

Extract table of contents from PDF pages visually and create clickable bookmarks. Use when user wants to add bookmarks/navigation to PDFs based on printed table of contents pages, or needs to convert TOC pages to navigable PDF bookmarks.

Processing 0 7mo ago
nanzhipro

youtube-podcast-extraction

by nanzhipro

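Geek-grade YouTube podcast extraction and visualization. Follows a cinematic subtitle visual standard (Cinema Style), deduplicates subtitles with a word-window overlap algorithm, and uses Playwright to render high-quality quote cards. Suited to turning long videos into high-value content for social sharing.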

Processing 0 4mo ago
hello-lizhihua

vue-testing-best-practices

by hello-lizhihua

For Vue.js testing. Covers Vitest, Vue Test Utils, component testing, mocking, testing patterns, and Playwright for E2E testing.

Scraping 0 4mo ago
avantmedialtd

e2e-testing

by avantmedialtd

E2E and visual regression testing with Playwright. Use when writing tests, running E2E tests, debugging test failures, or working with visual baselines. Contains test commands, patterns, and debugging tips.

Debugging 0 3mo ago
zhongjis

webapp-testing

by zhongjis

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

Automation 0 3mo ago
Alexu0317-FATHER

sync

by Alexu0317-FATHER

"Read extract-buffer.md, distribute signals to Domain State, surface thinking patterns and core candidates in report, then clear buffer. Must run in a NEW dedicated session."

Performance 0 3mo ago
danbars

writing-email-subjects

by danbars

Use when an email draft exists but the subject line is unclear, too long, or needs options tailored to a specific audience and tone.

Code Gen 0 4mo ago
Crawlio-app

observe

by Crawlio-app

Use this skill when the user asks to "check observations", "what did Crawlio see", "show crawl timeline", "query the observation log", or wants to review what happened during a crawl session. Queries the append-only observation log with filtering by host, source, operation, and time range.

Code Review 0 3mo ago
Jackiexiao

pdf

by Jackiexiao

(Chinese) Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.

CLI Tools 0 3mo ago
mmbmf1

postgis-extract-xy

by mmbmf1

Extract longitude and latitude from PostGIS geometries using ST_X and ST_Y safely.

Processing 0 3mo ago
spoonbobo

doc-convert

by spoonbobo

Convert and extract text from PDFs, DOCX, images (OCR), and other document formats using the gateway's built-in document processing stack.

Docker 0 2mo ago
qingchunwuhui

writing-analyzer

by qingchunwuhui

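Quickly breaks down an article's writing structure and extracts reusable writing templates. No audit process required; it analyzes directly. Suited to learning writing techniques and building a template library. Supports the shortcut command /analyze-writing.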

Code Review 0 3mo ago
alpex-ai

ui-extractor

by alpex-ai

Analyze screen recordings and websites to extract implementation specs, design systems, and UI patterns.

Code Review 0 4mo ago
zhongjis

pdf

by zhongjis

Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.

CLI Tools 0 3mo ago
wottpal

deep-research-firecrawl

by wottpal

Conducts citation-backed research using Firecrawl MCP search, scrape, map, crawl, and agent tools with selectable quick, standard, deep, and ultradeep modes. Use for multi-source comparisons, technical evaluations, market research, and high-stakes decision support.

Academic 0 3mo ago
Crawlio-app

crawl-site

by Crawlio-app

Use this skill when the user asks to "crawl a site", "download a website", "mirror a site", "scrape a site", or wants to download web pages for offline access or analysis. Configures Crawlio settings based on site type, starts the crawl, monitors progress, and reports results.

Scraping 0 3mo ago