"Perform autonomous, multi-step research using the Gemini Deep Research Agent (Interactions API). Supports web search, file/directory context, and resilient streaming."
Resources
2Install
npx skillscat add cnemri/google-genai-skills/deep-research Install via the SkillsCat registry.
Deep Research
Use this skill to conduct deep, autonomous research tasks that require planning, searching, reading, and synthesizing information.
This skill uses the Gemini Deep Research Agent (deep-research-pro-preview-12-2025) via the Interactions API.
Prerequisites
GOOGLE_API_KEY: Required for accessing the Interactions API (currently in Preview via AI Studio).google-genaiSDK v0.3.0+
Usage
Basic Research
Start a research task and stream the results to the console.
uv run skills/deep-research/scripts/research.py "Research the history of RISC-V architecture."Research with Context (Files or Directories)
Provide local files (PDFs, text) or entire directories for the agent to read and incorporate into its research.
Direct Upload (Best for specific docs):
# Single file
uv run skills/deep-research/scripts/research.py "Analyze this report" --file report.pdf
# Entire directory
uv run skills/deep-research/scripts/research.py "Summarize these meeting notes" --file ./notes/File Search Store (Best for large corpora):
Use --use-file-store to index files into a searchable store (RAG) instead of uploading them into the context window.
uv run skills/deep-research/scripts/research.py "Find trends in these 1000 PDFs" --file ./large_corpus/ --use-file-storeSaving the Report
Save the final Markdown report to a file.
uv run skills/deep-research/scripts/research.py "Competitive landscape of EV batteries" --output report.mdContinuing Research (Follow-up)
Ask follow-up questions to an existing research session using the Interaction ID (displayed at the start of the previous run).
uv run skills/deep-research/scripts/research.py "Elaborate on the second point about lithium supply." --follow-up "INTERACTION_ID_HERE"References
How it Works
- Planning: The agent breaks down your prompt into steps.
- Execution: It autonomously searches the web and reads your provided files.
- Resilience: The script automatically reconnects if the long-running stream drops.
- Synthesis: It produces a comprehensive, cited report.