Exclusions

Most websites contain dynamic-page traps that would otherwise generate thousands of pointless page loads during a crawl - event calendars with date pickers, faceted search filters, paginated archives, tag pages, author pages. Orbit's exclusion system lets you skip these patterns so the crawl focuses on pages that actually have media worth capturing.

Default exclusions

Orbit ships with a sensible default set ported from Vesper Audit. These match the URL patterns most commonly found on Drupal, WordPress, and civic CMS sites.

Path skips

Query string skips

Universal always-skip patterns

The following patterns are always skipped regardless of your settings, because they cannot contribute to a media crawl:

These cannot be turned off. They are guardrails against trivial mistakes.

Editing your exclusions

Click the Edit link on the idle screen to open the exclusions panel. The panel has three categories - Path, Query string, and Regex - each with its own list of pattern chips.

How patterns match

Substring, not glob
Path and query string patterns are simple substring matches - no glob characters, no anchors. Use the regex category if you need anchoring or wildcards.

Import & export

Click Export to save your exclusions as a JSON file. Click Import to load a JSON file - imported patterns replace your current set.

Importing Vesper Audit settings

If you already have a tuned scan profile in Vesper Audit, you can import its exclusions directly. In Vesper Audit, go to File → Export Settings, then in Orbit click Import and select that JSON file. Orbit will pull the path, query, and regex exclusions out and replace its built-in set.

Reset to defaults

Click Reset in the exclusions panel to restore the original Vesper Audit-style default set. This wipes any custom patterns you've added.

See also