Data Science & AI Models Prompts
3 categories · 13 workflows. The best community-tested data science & ai models prompts — proof-backed and ready to fork.
Top prompts in Data Science & AI Models
RAG system prompt that refuses to hallucinate and cites sources
A retrieval-augmented system prompt that answers only from context and returns inline citations or 'I don't know'.
Pandas data-cleaning pipeline for a messy CSV
Produce a reproducible Pandas cleaning pipeline: types, missing values, dedupe, outliers.
Scaffold a clean PyTorch training loop with eval and early stopping
Gives you a reproducible, well-structured PyTorch training script — config, model, dataloaders, train/eval loop, metrics, checkpointing, and early stopping — tuned to your task.
Pick the right Ollama model and generate an install plus run script for your hardware
Produces a hardware-aware Ollama model recommendation for your task plus a ready-to-run install and run script with VRAM checks, instead of guessing a model name and hoping it fits.
Wire a local RAG pipeline to Ollama with a doc loader and vector store
Produces a complete, local-first RAG pipeline with document loading, chunking, Ollama embeddings, a vector store, retrieval, and a grounded answer step with citations, requiring no cloud APIs.
Design a privacy-first local chat setup with quantization guidance
Produces a privacy-first local chat configuration with model and quantization choice for your hardware, a system prompt, conversation settings, and a data-leakage audit checklist so nothing leaves your machine.
Build a robust PyTorch Dataset and DataLoader with an augmentation pipeline
Produces a custom PyTorch Dataset with correct transforms, a tuned DataLoader, and a debuggable augmentation pipeline that handles edge cases instead of throwing on the first weird sample.
Fine-tune a pretrained model in PyTorch with a deliberate layer-freezing strategy
Produces a transfer-learning script that swaps the right head, freezes the right layers, and uses distinct learning rates so you adapt a backbone instead of nuking its pretrained weights.
Audit a messy DataFrame against an expected schema with dtype coercion
Produces a reusable schema-validation and dtype-coercion script that flags every column that drifted from spec, coerces what it safely can, and quarantines what it cannot instead of producing silent NaNs.