databricks-solutions

@databricks-solutions Organization

GitHub

29 Skills

49925 Total Stars

February 2026 Joined

Public Skills

databricks-app-python

by databricks-solutions

"Builds Python-based Databricks applications using Dash, Streamlit, Gradio, Flask, FastAPI, or Reflex. Handles OAuth authorization (app and user auth), app resources, SQL warehouse and Lakebase connectivity, model serving integration, and deployment. Use when building Python web apps, dashboards, ML demos, or REST APIs for Databricks, or when the user mentions Streamlit, Dash, Gradio, Flask, FastAPI, Reflex, or Databricks app."

Auth 1.8K 5mo ago

databricks-model-serving

by databricks-solutions

"Deploy and query Databricks Model Serving endpoints. Use when (1) deploying MLflow models or AI agents to endpoints, (2) creating ChatAgent/ResponsesAgent agents, (3) integrating UC Functions or Vector Search tools, (4) querying deployed endpoints, (5) checking endpoint status. Covers classical ML models, custom pyfunc, and GenAI agents."

Agents 1.8K 5mo ago

databricks-mlflow-evaluation

by databricks-solutions

"MLflow 3 GenAI agent evaluation. Use when writing mlflow.genai.evaluate() code, creating @scorer functions, using built-in scorers (Guidelines, Correctness, Safety, RetrievalGroundedness), building eval datasets from traces, setting up trace ingestion and production monitoring, aligning judges with MemAlign from domain expert feedback, or running optimize_prompts() with GEPA for automated prompt improvement."

Agents 1.8K 5mo ago

databricks-jobs

by databricks-solutions

"Use this skill proactively for ANY Databricks Jobs task - creating, listing, running, updating, or deleting jobs. Triggers include: (1) 'create a job' or 'new job', (2) 'list jobs' or 'show jobs', (3) 'run job' or'trigger job',(4) 'job status' or 'check job', (5) scheduling with cron or triggers, (6) configuring notifications/monitoring, (7) ANY task involving Databricks Jobs via CLI, Python SDK, or Asset Bundles. ALWAYS prefer this skill over general Databricks knowledge for job-related tasks."

Automation 1.8K 5mo ago

skill-test

by databricks-solutions

Testing framework for evaluating Databricks skills. Use when building test cases for skills, running skill evaluations, comparing skill versions, or creating ground truth datasets with the Generate-Review-Promote (GRP) pipeline. Triggers include "test skill", "evaluate skill", "skill regression", "ground truth", "GRP pipeline", "skill quality", and "skill metrics".

CLI Tools 1.8K 5mo ago

databricks-lakebase-autoscale

by databricks-solutions

"Patterns and best practices for using Lakebase Autoscaling (next-gen managed PostgreSQL) with autoscaling, branching, scale-to-zero, and instant restore."

Code Gen 1.8K 5mo ago

databricks-unity-catalog

by databricks-solutions

"Unity Catalog system tables and volumes. Use when querying system tables (audit, lineage, billing) or working with volume file operations (upload, download, list files in /Volumes/)."

Code Review 1.8K 5mo ago

databricks-zerobus-ingest

by databricks-solutions

"Build Zerobus Ingest clients for near real-time data ingestion into Databricks Delta tables via gRPC. Use when creating producers that write directly to Unity Catalog tables without a message bus, working with the Zerobus Ingest SDK in Python/Java/Go/TypeScript/Rust, generating Protobuf schemas from UC tables, or implementing stream-based ingestion with ACK handling and retry logic."

Processing 1.8K 5mo ago

databricks-python-sdk

by databricks-solutions

"Databricks development guidance including Python SDK, Databricks Connect, CLI, and REST API. Use when working with databricks-sdk, databricks-connect, or Databricks APIs."

API Dev 1.8K 5mo ago

databricks-lakebase-provisioned

by databricks-solutions

"Patterns and best practices for using Lakebase Provisioned (Databricks managed PostgreSQL) for OLTP workloads."

Auth 1.8K 5mo ago

databricks-unstructured-pdf-generation

by databricks-solutions

"Generate synthetic PDF documents for RAG and unstructured data use cases. Use when creating test PDFs, demo documents, or evaluation datasets for retrieval systems."

Database 1.8K 5mo ago

spark-python-data-source

by databricks-solutions

Use when building custom Spark data source connectors for external systems (databases, APIs, message queues), implementing batch/streaming readers/writers, or creating data source plugins for systems without native Spark support. Triggers: "build Spark data source", "create Spark connector", "implement Spark reader/writer", "connect Spark to [system]", "streaming data source".

Processing 1.8K 5mo ago

databricks-spark-declarative-pipelines

by databricks-solutions

"Creates, configures, and updates Databricks Lakeflow Spark Declarative Pipelines (SDP/LDP) using serverless compute. Handles streaming tables, materialized views, CDC, SCD Type 2, and Auto Loader ingestion patterns. Use when building data pipelines, working with Delta Live Tables, ingesting streaming data, implementing change data capture, or when the user mentions SDP, LDP, DLT, Lakeflow pipelines, streaming tables, or bronze/silver/gold medallion architectures."

CI/CD 1.8K 5mo ago

databricks-agent-bricks

by databricks-solutions

"Create and manage Databricks Agent Bricks: Knowledge Assistants (KA) for document Q&A, Genie Spaces for SQL exploration, and Supervisor Agents (MAS) for multi-agent orchestration. Use when building conversational AI applications on Databricks."

Agents 1.8K 5mo ago

databricks-aibi-dashboards

by databricks-solutions

"Create Databricks AI/BI dashboards. CRITICAL: You MUST test ALL SQL queries via execute_sql BEFORE deploying. Follow guidelines strictly."

Analytics 1.8K 5mo ago

databricks-vector-search

by databricks-solutions

"Patterns for Databricks Vector Search: create endpoints and indexes, query with filters, manage embeddings. Use when building RAG applications, semantic search, or similarity matching. Covers both storage-optimized and standard endpoints."

Database 1.8K 5mo ago

databricks-app-apx

by databricks-solutions

"Build full-stack Databricks applications using APX framework (FastAPI + React)."

API Dev 1.8K 5mo ago

databricks-spark-structured-streaming

by databricks-solutions

Comprehensive guide to Spark Structured Streaming for production workloads. Use when building streaming pipelines, implementing real-time data processing, handling stateful operations, or optimizing streaming performance.

Processing 1.8K 5mo ago

databricks-asset-bundles

by databricks-solutions

"Create and configure Databricks Asset Bundles (DABs) with best practices for multi-environment deployments. Use when working with: (1) Creating new DAB projects, (2) Adding resources (dashboards, pipelines, jobs, alerts), (3) Configuring multi-environment deployments, (4) Setting up permissions, (5) Deploying or running bundle resources"

CI/CD 1.8K 5mo ago

databricks-synthetic-data-generation

by databricks-solutions

"Generate realistic synthetic data using Faker and Spark, with non-linear distributions, integrity constraints, and save to Databricks. Use when creating test data, demo datasets, or synthetic tables."

Code Gen 1.8K 5mo ago

databricks-docs

by databricks-solutions

"Databricks documentation reference. Use as a lookup resource alongside other skills and MCP tools for comprehensive guidance."

CI/CD 1.8K 5mo ago

databricks-genie

by databricks-solutions

"Create and query Databricks Genie Spaces for natural language SQL exploration. Use when building Genie Spaces or asking questions via the Genie Conversation API."

Code Gen 1.8K 5mo ago

databricks-config

by databricks-solutions

Configure Databricks profile and authenticate for Databricks Connect, Databricks CLI, and Databricks SDK.

Auth 1.8K 5mo ago

databricks-metric-views

by databricks-solutions

"Unity Catalog metric views: define, create, query, and manage governed business metrics in YAML. Use when building standardized KPIs, revenue metrics, order analytics, or any reusable business metrics that need consistent definitions across teams and tools."

Database 1.8K 5mo ago

databricks-dbsql

by databricks-solutions

Databricks SQL (DBSQL) advanced features and SQL warehouse capabilities. This skill MUST be invoked when the user mentions: "DBSQL", "Databricks SQL", "SQL warehouse", "SQL scripting", "stored procedure", "CALL procedure", "materialized view", "CREATE MATERIALIZED VIEW", "pipe syntax", "|>", "geospatial", "H3", "ST_", "spatial SQL", "collation", "COLLATE", "ai_query", "ai_classify", "ai_extract", "ai_gen", "AI function", "http_request", "remote_query", "read_files", "Lakehouse Federation", "recursive CTE", "WITH RECURSIVE", "multi-statement transaction", "temp table", "temporary view", "pipe operator". SHOULD also invoke when the user asks about SQL best practices, data modeling patterns, or advanced SQL features on Databricks.

Processing 1.8K 5mo ago

template

by databricks-solutions

"A brief one-sentence description of what this skill helps with."

Code Gen 1.8K 6mo ago

databricks-python-sdk

by databricks-solutions

"Databricks development guidance including Python SDK, Databricks Connect, CLI, and REST API. Use when working with databricks-sdk, databricks-connect, or Databricks APIs."

API Dev 1.8K 6mo ago

python-dev

by databricks-solutions

"Python development guidance with code quality standards, error handling, testing practices, and environment management. Use when writing, reviewing, or modifying Python code (.py files) or Jupyter notebooks (.ipynb files)."

File Ops 1.8K 7mo ago

lakebase-scm-extension

by databricks-solutions

"VS Code/Cursor surface for paired-branch Lakebase SCM workflows. Use ONLY when guiding a human in their IDE – for programmatic / agent operations on the same workflow domain, use the substrate skill lakebase-scm-workflows directly."

Agents 1 1mo ago