LHX200013

abstract-annotation

Extract exact relevant mentions from academic abstracts, group them into dimensions and subdimensions, and generate review-ready article and dimension outputs.

LHX200013 0 Updated 2w ago

Resources

6
GitHub

Install

npx skillscat add lhx200013/abstractannotation-skill

Install via the SkillsCat registry.

SKILL.md

Abstract Annotation

Use this skill for literature-analysis workflows over CSV files containing academic abstracts.

Purpose

This skill helps a researcher annotate abstracts by extracting exact text spans for relevant variables, indicators, indices, metrics, factors, constructs, measures, predictors, methods, data sources, or other user-specified items, then grouping extracted items into review-ready dimensions.

Core Rules

  • Extracted values must be exact spans copied from the abstract.
  • Do not paraphrase, translate, standardize, or infer missing concepts.
  • Keep article-level extraction separate from later interpretive grouping.
  • Assign every unique extracted item to exactly one dimension and one subdimension.
  • Use binary article columns where 1 means the article mentions at least one extracted item in that dimension.

Expected Inputs

  • One or more CSV files.
  • An abstract column.
  • Optional language, year, and source-title columns for filtering.
  • A user-approved extraction prompt.

Expected Outputs

  • final_articles_with_dimensions.csv
  • dimension_original_variables.csv
  • dimension_mention_ratio_barplot.png
  • dimension_mention_ratio_table.png

Suggested Workflow

  1. Inspect CSV headers.
  2. Identify the abstract column and optional filter columns.
  3. Ask the user to approve a complete extraction prompt.
  4. Merge and filter CSV rows.
  5. Extract exact spans from each abstract.
  6. Validate that every extracted span appears literally in its abstract.
  7. Draft dimensions and subdimensions.
  8. Ask the user to approve the dimension structure.
  9. Generate final outputs.

Categories