Introduction

What HeadlessX ships today, how the platform is shaped, and where to start.

HeadlessX is a self-hosted scraping platform built around a TypeScript API, a Next.js operator dashboard, queue-backed crawl workflows, a Python YouTube engine, a published CLI, and a remote MCP endpoint for AI clients.

Recommended first path

Start with Quick Start if you want the fastest path to a running instance. Jump to Self-Hosting Overview when you are deciding between local services, mixed infrastructure, or a full Docker deployment. For production proxy capacity, see Proxy Integrations.

What ships today

Area	Current surface
Website scraping	HTML, JS-rendered HTML, content extraction, screenshots, map, crawl, and SSE progress
Search tools	Google AI Search, Tavily, and Exa workspaces plus API endpoints
YouTube	Metadata extraction, formats, subtitles, preview, and temporary save packaging
Operations	API keys, logs, settings, jobs, proxies, and runtime status
AI integrations	Remote MCP over `/mcp`, the published `headlessx` CLI, and the repository agent skill

Product shape

HeadlessX is split into a few clear layers:

the API backend exposes the authenticated /api/* HTTP surface and the /mcp endpoint
the web dashboard provides the operator interface and playground
the YouTube engine handles metadata extraction and temporary media packaging
the HTML-to-Markdown service supports content workflows
Redis and the worker process power queued crawl and background jobs

Where to start

Use this reading order if you are new to the platform:

Quick Start

Get the platform running, create an API key, and make your first request.

CLI

Use the published `headlessx` command against the same operator API used by the dashboard.

Self-Hosting Overview

Compare mixed local, full Docker, and fully local runtime modes.

Environment Variables

Review the required runtime variables for local, mixed, and self-hosted deployments.

Agent Skills

Install the HeadlessX skill into supported AI coding agents.

Resources

Open the core references for setup, routes, release history, and support.

Who this documentation is for

This docs set is written for three audiences:

operators running HeadlessX in local, mixed, or Docker setups
developers extending the API, dashboard, or runtime services
automation users connecting via HTTP, workflow tools, CLI, skills, or MCP clients