Katana
Purpose
Katana crawls URL assets and emits discovered URLs and path assets.
Plugin Information
Plugin ID: katana
Category: Web Discovery
Plugin Type: crawler
Execution: active CLI crawler
Default State: enabled
Default Profiles:
deepweb_discovery
Input Scope
Accepted asset types:
url
Required metadata:
- None
Produces targets:
- A temporary URL list passed with
-list.
Output
Creates assets:
urlpath
Creates vulnerabilities:
- None
May enrich:
- URL assets with parent URL, scheme, host, and port metadata.
- Path assets with parent URL and path metadata.
Metadata:
parent_url: source URL from Katana output.path: parsed path for path assets.scheme: parsed URL scheme.host: parsed hostname.port: parsed or inferred port.source:katana.
Graph Relations
The worker derives crawl relationships when parent_url matches an existing URL asset:
url -> has_path -> path
url -> discovered_url -> url
Files / Artifacts
Produces:
- None
Dependencies
Required binary: katana
Required installer entry: tools.katana
Command model:
katana -jsonl -silent -list <target-file>
Example Flow
url
-> katana
path + url
Notes
The parser accepts JSONL and plain URL lines. Path assets are created when a discovered URL has a path and a parent URL is known.