Skip to content

Katana

Purpose

Katana crawls URL assets and emits discovered URLs and path assets.

Plugin Information

Plugin ID: katana

Category: Web Discovery

Plugin Type: crawler

Execution: active CLI crawler

Default State: enabled

Default Profiles:

  • deep
  • web_discovery

Input Scope

Accepted asset types:

  • url

Required metadata:

  • None

Produces targets:

  • A temporary URL list passed with -list.

Output

Creates assets:

  • url
  • path

Creates vulnerabilities:

  • None

May enrich:

  • URL assets with parent URL, scheme, host, and port metadata.
  • Path assets with parent URL and path metadata.

Metadata:

  • parent_url: source URL from Katana output.
  • path: parsed path for path assets.
  • scheme: parsed URL scheme.
  • host: parsed hostname.
  • port: parsed or inferred port.
  • source: katana.

Graph Relations

The worker derives crawl relationships when parent_url matches an existing URL asset:

url -> has_path -> path
url -> discovered_url -> url

Files / Artifacts

Produces:

  • None

Dependencies

Required binary: katana

Required installer entry: tools.katana

Command model:

katana -jsonl -silent -list <target-file>

Example Flow

url
  -> katana
path + url

Notes

The parser accepts JSONL and plain URL lines. Path assets are created when a discovered URL has a path and a parent URL is known.