Skrapp

New Job

Create crawl

Start URL

The root page Skrapp should start from.

Max depth

Depth 0 keeps only the root page.

Max pages

Upper bound for accepted pages.

Optional settings

Allowed path prefix

Restrict crawling to one subtree. Leave empty to infer from the start URL.

Ignore path prefixes

Comma-separated prefixes to skip during discovery.

Timeout (seconds)

Overview

Runs a breadth-first (BFS) crawl, keeps only the pages that are in scope, and stores the page tree.
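The crawl described above can be sketched roughly as follows. This is a minimal illustration, not Skrapp's actual code: the names `bfs_crawl`, `in_scope`, and `get_links` are hypothetical, and it assumes that "Max depth", "Max pages", "Allowed path prefix", and "Ignore path prefixes" behave as the form text suggests. Link extraction is passed in as a function so the sketch stays independent of any fetching or rendering library.

```python
from collections import deque
from urllib.parse import urlparse


def in_scope(url, allowed_prefix, ignore_prefixes):
    """True if the URL path is under allowed_prefix and under no ignored prefix."""
    path = urlparse(url).path or "/"
    if not path.startswith(allowed_prefix):
        return False
    return not any(path.startswith(p) for p in ignore_prefixes)


def bfs_crawl(start_url, get_links, max_depth, max_pages,
              allowed_prefix="/", ignore_prefixes=()):
    """Breadth-first crawl; returns {url: parent_url} for accepted pages.

    get_links is a hypothetical callable: url -> iterable of absolute URLs.
    """
    tree = {start_url: None}           # the root page is always accepted
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        if depth >= max_depth:         # depth 0 keeps only the root page
            continue
        for link in get_links(url):
            if len(tree) >= max_pages:  # upper bound for accepted pages
                return tree
            if link in tree or not in_scope(link, allowed_prefix, ignore_prefixes):
                continue
            tree[link] = url           # record the parent to form the page tree
            queue.append((link, depth + 1))
    return tree
```

Storing each page's parent is one simple way to keep a page tree; because the queue is first-in, first-out, every page is recorded at the shallowest depth at which it was discovered.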

Renders each page, captures the main content, and preserves the raw markdown for review.

Scores blocks generically by page type, position, density, and repetition across the crawl.
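As an illustration of the repetition signal only, the sketch below gives each block a score based on how many crawled pages contain it. The formula, the function name `repetition_scores`, and the input shape (a mapping from URL to a list of block strings) are all assumptions made for this example; how Skrapp actually combines page type, position, density, and repetition is not stated here.

```python
from collections import Counter


def repetition_scores(pages):
    """Score each block from 0.0 (on every page) to 1.0 (on exactly one page).

    pages: hypothetical mapping of url -> list of block strings.
    """
    n = len(pages)
    page_counts = Counter()
    for blocks in pages.values():
        page_counts.update(set(blocks))   # count each block once per page
    if n <= 1:
        # With a single page there is nothing to compare against.
        return {block: 1.0 for block in page_counts}
    return {block: (n - count) / (n - 1) for block, count in page_counts.items()}
```

Under this assumed scoring, a navigation bar or footer that appears on every page scores 0.0, while a paragraph unique to one page scores 1.0.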

Recent Jobs

Job	Status	URL	Discovered	Succeeded	Created	Actions
No jobs yet.