Kreuzcrawl (kreuzcrawl v0.1.0-rc.9)
High-level API for kreuzcrawl.
Summary
Functions
Crawl multiple seed URLs concurrently, each following links to configured depth.
Scrape multiple URLs concurrently.
Method
Crawl a website starting from url, following links up to the configured depth.
Method
Validate the configuration, returning an error if any values are invalid.
Returns the count of unique normalized URLs encountered during crawling.
Create a new crawl engine with the given configuration.
Create a new crawl engine with the given configuration.
Discover all pages on a website by following links and sitemaps.
Scrape a single URL, returning extracted page data.
Functions
Crawl multiple seed URLs concurrently, each following links to configured depth.
Scrape multiple URLs concurrently.
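As a rough illustration of what concurrent scraping involves, the sketch below fans a list of URLs out to a caller-supplied function using `Task.async_stream/3` from the Elixir standard library. It does not call Kreuzcrawl and is not how the library is implemented: `BatchSketch`, `batch/3`, and `scrape_fun` are hypothetical names introduced only for this example.

```elixir
defmodule BatchSketch do
  # Hypothetical helper, not part of Kreuzcrawl.
  # Applies `scrape_fun` to every URL concurrently and pairs each URL
  # with either {:ok, result} or {:error, reason}.
  def batch(urls, scrape_fun, max_concurrency \\ 4) do
    urls
    |> Task.async_stream(scrape_fun,
      max_concurrency: max_concurrency,
      timeout: 30_000,
      # A task that exceeds the timeout is killed and emitted as {:exit, :timeout}.
      on_timeout: :kill_task
    )
    # async_stream preserves input order by default, so zipping is safe.
    |> Enum.zip(urls)
    |> Enum.map(fn
      {{:ok, result}, url} -> {url, {:ok, result}}
      {{:exit, reason}, url} -> {url, {:error, reason}}
    end)
  end
end
```

A stand-in function such as `&String.upcase/1` is enough to try it out: `BatchSketch.batch(["a", "b"], &String.upcase/1)`. Note that the tasks are linked to the caller, so a `scrape_fun` that raises will take the caller down unless it traps exits; returning error tuples from `scrape_fun` avoids that.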
@spec browserconfig_default() :: String.t() | nil
Method
Crawl a website starting from url, following links up to the configured depth.
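To make "following links up to the configured depth" concrete, here is a minimal breadth-first sketch over an in-memory link graph. It is an assumption about the general technique, not Kreuzcrawl's actual algorithm: `DepthCrawlSketch`, `crawl/3`, and `links_of` are hypothetical, and no network requests are made.

```elixir
defmodule DepthCrawlSketch do
  # Hypothetical illustration, not part of Kreuzcrawl.
  # `links_of` is a function returning the outgoing links of a URL.
  # Returns the sorted list of URLs reachable within `max_depth` hops.
  def crawl(start_url, max_depth, links_of) do
    [start_url]
    |> do_crawl(MapSet.new([start_url]), 0, max_depth, links_of)
    |> MapSet.to_list()
    |> Enum.sort()
  end

  # Stop when there is nothing left to visit or the depth limit is reached.
  defp do_crawl([], visited, _depth, _max, _links_of), do: visited
  defp do_crawl(_frontier, visited, depth, max, _links_of) when depth >= max, do: visited

  defp do_crawl(frontier, visited, depth, max, links_of) do
    # Collect links from the current level, skipping URLs already seen.
    next =
      frontier
      |> Enum.flat_map(links_of)
      |> Enum.uniq()
      |> Enum.reject(&MapSet.member?(visited, &1))

    do_crawl(next, MapSet.union(visited, MapSet.new(next)), depth + 1, max, links_of)
  end
end
```

In this sketch a depth of 0 visits only the start URL, and each additional level adds the pages linked from the previous level. Whether Kreuzcrawl counts depth the same way is not stated in this page.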
@spec crawlconfig_default() :: String.t() | nil
Method
Validate the configuration, returning an error if any values are invalid.
@spec crawlresult_unique_normalized_urls(map()) :: non_neg_integer()
Returns the count of unique normalized URLs encountered during crawling.
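The sketch below shows one way a count of unique normalized URLs could be computed, using only the standard `URI` module. The normalization rules here (lowercase scheme and host, drop the fragment, strip trailing slashes from the path) are assumptions chosen for illustration; this page does not say which rules Kreuzcrawl applies. `UrlCountSketch` and its functions are hypothetical names.

```elixir
defmodule UrlCountSketch do
  # Hypothetical illustration, not part of Kreuzcrawl.
  # Assumed rules: lowercase scheme and host, drop the fragment,
  # strip trailing slashes from the path (an empty path becomes "/").
  def normalize(url) do
    uri = URI.parse(url)

    path =
      case String.trim_trailing(uri.path || "", "/") do
        "" -> "/"
        trimmed -> trimmed
      end

    URI.to_string(%URI{
      uri
      | scheme: downcase(uri.scheme),
        host: downcase(uri.host),
        path: path,
        fragment: nil
    })
  end

  # Counts distinct URLs after normalization.
  def count_unique(urls) do
    urls
    |> Enum.map(&normalize/1)
    |> Enum.uniq()
    |> length()
  end

  defp downcase(nil), do: nil
  defp downcase(value), do: String.downcase(value)
end
```

Under these assumed rules, `https://Example.com/docs/`, `https://example.com/docs`, and `https://example.com/docs#intro` all collapse to the same entry, so they contribute one URL to the count rather than three.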
Create a new crawl engine with the given configuration.
Create a new crawl engine with the given configuration.
Discover all pages on a website by following links and sitemaps.
Scrape a single URL, returning extracted page data.