ML Ops

Machine learning operations

Showing 145-168 of 1792 skills
sachacoldiq

cold-call-scripts

by sachacoldiq

1-minute cold call script (5 steps) and no-show phone script. Use when training SDRs on calls, building call frameworks, or handling common phone scenarios.

Automation 158 4mo ago
openclaw

sag

by openclaw

ElevenLabs text-to-speech with mac-style say UX.

CLI Tools 376.6K 4mo ago
borghei

cfo-advisor

by borghei

Financial leadership advisor for CFOs on financial planning, fundraising, investor reporting, unit economics, cash management, and financial operations.

Code Gen 214 3mo ago
Orchestra-Research

sparse-autoencoder-training

by Orchestra-Research

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.

Caching 9.3K 5mo ago
openclaw

openai-whisper

by openclaw

Local speech-to-text with the Whisper CLI (no API key).

API Dev 376.5K 4mo ago
Orchestra-Research

awq-quantization

by Orchestra-Research

Activation-aware weight quantization for 4-bit LLM compression with 3x speedup and minimal accuracy loss. Use when deploying large models (7B-70B) on limited GPU memory, when you need faster inference than GPTQ with better accuracy preservation, or for instruction-tuned and multimodal models. MLSys 2024 Best Paper Award winner.

ML Ops 9.3K 6mo ago
Orchestra-Research

training-llms-megatron

by Orchestra-Research

Trains large language models (2B-462B parameters) using NVIDIA Megatron-Core with advanced parallelism strategies. Use when training models >1B parameters, need maximum GPU efficiency (47% MFU on H100), or require tensor/pipeline/sequence/context/expert parallelism. Production-ready framework used for Nemotron, LLaMA, DeepSeek.

CLI Tools 9.3K 6mo ago
Orchestra-Research

nnsight-remote-interpretability

by Orchestra-Research

Provides guidance for interpreting and manipulating neural network internals using nnsight with optional NDIF remote execution. Use when needing to run interpretability experiments on massive models (70B+) without local GPU resources, or when working with any PyTorch architecture.

Debugging 9.3K 5mo ago
Orchestra-Research

hqq-quantization

by Orchestra-Research

Half-Quadratic Quantization for LLMs without calibration data. Use when quantizing models to 4/3/2-bit precision without needing calibration datasets, for fast quantization workflows, or when deploying with vLLM or HuggingFace Transformers.

ML Ops 9.3K 6mo ago
Orchestra-Research

pytorch-lightning

by Orchestra-Research

High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from laptop to supercomputer with same code. Use when you want clean training loops with built-in best practices.

Automation 9.3K 6mo ago
Orchestra-Research

fine-tuning-with-trl

by Orchestra-Research

Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when need RLHF, align model with preferences, or train from human feedback. Works with HuggingFace Transformers.

CI/CD 9.3K 6mo ago
Orchestra-Research

unsloth

by Orchestra-Research

Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization

i18n 9.3K 6mo ago
mitsuhiko

openscad

by mitsuhiko

"Create and render OpenSCAD 3D models. Generate preview images from multiple angles, extract customizable parameters, validate syntax, and export STL files for 3D printing platforms like MakerWorld."

CLI Tools 2.6K 4mo ago
Orchestra-Research

ray-data

by Orchestra-Research

Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from single machine to 100s of nodes. Use for batch inference, data preprocessing, multi-modal data loading, or distributed ETL pipelines.

Automation 9.3K 4mo ago
Orchestra-Research

ML Training Recipes

by Orchestra-Research

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepower. Maintained by Orchestra Research.

Automation 9.3K 2mo ago
Orchestra-Research

lambda-labs-gpu-cloud

by Orchestra-Research

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node clusters for large-scale training.

API Dev 9.3K 6mo ago
Orchestra-Research

gguf-quantization

by Orchestra-Research

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.

CLI Tools 9.3K 6mo ago
Orchestra-Research

sentencepiece

by Orchestra-Research

Language-independent tokenizer treating text as raw Unicode. Supports BPE and Unigram algorithms. Fast (50k sentences/sec), lightweight (6MB memory), deterministic vocabulary. Used by T5, ALBERT, XLNet, mBART. Train on raw text without pre-tokenization. Use when you need multilingual support, CJK languages, or reproducible tokenization.

i18n 9.3K 6mo ago
proffesor-for-testing

DDD Domain Mapping (from QCSD-AGENTIC-QE-MAPPING-FRAMEWORK.md)

by proffesor-for-testing

Agentic QE Fleet is an open-source AI-powered quality engineering platform designed for use with Claude Code, featuring specialized agents and skills to support testing activities for a product at any stage of the SDLC. Free to use, fork, build, and contribute. Based on the Agentic QE Framework created by Dragan Spiridonov.

CI/CD 372 3mo ago
Orchestra-Research

huggingface-tokenizers

by Orchestra-Research

Fast tokenizers optimized for research and production. Rust-based implementation tokenizes 1GB in <20 seconds. Supports BPE, WordPiece, and Unigram algorithms. Train custom vocabularies, track alignments, handle padding/truncation. Integrates seamlessly with transformers. Use when you need high-performance tokenization or custom tokenizer training.

Git & VCS 9.3K 6mo ago
Yeachan-Heo

ultrawork

by Yeachan-Heo

Parallel execution engine for high-throughput task completion

Agents 30.3K 3mo ago
openclaw

summarize

by openclaw

Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).

Processing 376.5K 4mo ago
Yeachan-Heo

omc-help

by Yeachan-Heo

Guide on using oh-my-claudecode plugin

Code Review 35.7K 3mo ago
Orchestra-Research

rwkv-architecture

by Orchestra-Research

RNN+Transformer hybrid with O(n) inference. Linear time, infinite context, no KV cache. Train like GPT (parallel), infer like RNN (sequential). Linux Foundation AI project. Production at Windows, Office, NeMo. RWKV-7 (March 2025). Models up to 14B parameters.

ML Ops 9.3K 6mo ago