Kreuzcrawl (kreuzcrawl v0.3.0-rc.37)
Copy MarkdownHigh-level API for kreuzcrawl
Summary
Functions
Crawl multiple seed URLs concurrently, each following links to configured depth.
Streaming batch_crawl_stream — returns an Enumerable of decoded chunk maps.
Scrape multiple URLs concurrently.
Crawl a website starting from url, following links up to the configured depth.
Streaming crawl_stream — returns an Enumerable of decoded chunk maps.
Create a new crawl engine with the given configuration.
Create a new crawl engine with the given configuration.
Convert markdown links to numbered citations.
Execute browser actions on a single page.
Discover all pages on a website by following links and sitemaps.
Scrape a single URL, returning extracted page data.
Functions
Crawl multiple seed URLs concurrently, each following links to configured depth.
Streaming batch_crawl_stream — returns an Enumerable of decoded chunk maps.
Scrape multiple URLs concurrently.
Crawl a website starting from url, following links up to the configured depth.
Streaming crawl_stream — returns an Enumerable of decoded chunk maps.
Create a new crawl engine with the given configuration.
Create a new crawl engine with the given configuration.
Convert markdown links to numbered citations.
@spec interact_async(reference(), String.t(), [map()]) :: {:ok, map()} | {:error, atom(), String.t()}
Execute browser actions on a single page.
Discover all pages on a website by following links and sitemaps.
Scrape a single URL, returning extracted page data.