Vidilearn — Teach your AI using YouTube videos

Engineered for AI Autonomy

A simple, lightweight CLI designed to bypass API restrictions and feed structured data straight to RAG databases, MCP servers, and LLM coding assistants.

No API Keys Required

Zero configuration. Bypasses complex GCP Project setups, YouTube OAuth verification, and query quota limits. Just install and run.

Structured JSON Output

Get transcripts, metadata, channel details, custom timestamps, and chapters in a single clean JSON payload optimized for vector storage.

Zero Browser Automation

No Puppeteer, Playwright, or Selenium required. Extremely low RAM and CPU footprint, meaning it will run smoothly inside serverless functions or containerized jobs.

Built for AI Workflows

Perfect fit for OpenAI Agents, MCP Server implementations, Codex CLI workflows, CrewAI, AutoGen, and LangGraph structures. Ingest whole channels or tutorials as technical contexts.

Local-First Processing

Extract and process content entirely on your machine without relying on external APIs or cloud processing pipelines. Full control over privacy, speed, and reliability.

Production CLI Experience

Designed for modern developer workflows with clean commands, structured outputs, automation-ready pipelines, and seamless integration into CI/CD systems.

Flexible CLI Integration

From simple metadata extraction to automated developer workflows, customize Vidilearn to feed your pipeline needs.

Basic Extraction Examples

# Standard complete extraction (prints title, transcript, chapters, etc.)
vidilearn extract "https://youtube.com/watch?v=VIDEO_ID"

# Pretty-print formatted JSON
vidilearn extract "https://youtube.com/watch?v=VIDEO_ID" --pretty

# Save structured JSON to file directly
vidilearn extract "https://youtube.com/watch?v=VIDEO_ID" > context.json

# Specialized extractions
vidilearn extract "https://youtube.com/watch?v=VIDEO_ID" --transcript
vidilearn extract "https://youtube.com/watch?v=VIDEO_ID" --chapters
vidilearn extract "https://youtube.com/watch?v=VIDEO_ID" --metadata

RULE OF THUMB

Always wrap URLs in double quotes (e.g. "https://youtube.com/...") to prevent terminal shells from parsing special characters like & or ? as background job triggers.

Example JSON Schema Response

{
  "title": "Build AI Agents",
  "channel": "AI Academy",
  "duration": "12:45",
  "description": "Learn how to build and orchestrate multi-agent architectures...",
  "transcript": "In this video, we will design and code multi-agent LLM teams...",
  "chapters": [
    {
      "title": "Introduction",
      "timestamp": "00:00"
    },
    {
      "title": "Agent Architecture Design",
      "timestamp": "03:42"
    },
    {
      "title": "Code Implementation",
      "timestamp": "07:15"
    }
  ]
}

MCP Integration & Pipelines

# Feed structured data straight into local vector database loading scripts
vidilearn extract "https://youtube.com/watch?v=VIDEO_ID" --json | node ingest-vector-store.js

# Use directly inside MCP (Model Context Protocol) tool configurations:
{
  "mcpServers": {
    "vidilearn-server": {
      "command": "npx",
      "args": ["-y", "vidilearn", "extract", "{{url}}", "--json"]
    }
  }
}

Contextual Learning for Coding Agents

# Extract technical documentation from standard video tutorials
vidilearn extract "https://youtube.com/watch?v=VIDEO_ID" > /tmp/context-learning.json

# Feed the output as context injection to Codex or Gemini commands
codex run "implement a backend auth system using the rules in context-learning.json" \
  --context=/tmp/context-learning.json

Command Reference

Quick look up of all available flags, arguments, and roadmap features.

Command / Flag	Description	Default Behavior
`vidilearn extract <url>`	Core parsing command for a YouTube video.	Extracts all available data
`--pretty`	Formats JSON output with spaces for human readability.	Minified JSON string
`--transcript`	Extracts only the transcription text.	Extracts full dataset
`--chapters`	Returns video chapters & timestamps.	Extracts full dataset
`--metadata`	Returns core title, description, channel, and duration.	Extracts full dataset
`--help`	Prints standard usage, examples, and flags information.	-

Roadmap & Future Modules

Streaming transcript extraction (Real-time console printing)
Interactive MCP server compatibility layer
Batch playlist processing & full channel extraction
Local embedding generation pipelines (Ready-to-query chunks)
Multi-language subtitle translations and support
Vector Database upload integrations (Pinecone, Chroma, pgvector)

Teach your AI using
YouTube Videos

Engineered for AI Autonomy

No API Keys Required

Structured JSON Output

Zero Browser Automation

Built for AI Workflows

Local-First Processing

Production CLI Experience

Flexible CLI Integration

High-Performance Statistics

Extraction Speed (s)

Startup Overhead / Install Size

Command Reference

Roadmap & Future Modules

Teach your AI usingYouTube Videos

Engineered for AI Autonomy

No API Keys Required

Structured JSON Output

Zero Browser Automation

Built for AI Workflows

Local-First Processing

Production CLI Experience

Flexible CLI Integration

High-Performance Statistics

Extraction Speed (s)

Startup Overhead / Install Size

Command Reference

Roadmap & Future Modules

Support Vidilearn

Teach your AI using
YouTube Videos