API Reference html_to_markdown v#3.4.0
Copy MarkdownModules
High-level API for html_to_markdown
The type of an inline text annotation.
Code block fence style in Markdown output.
Main conversion options for HTML to Markdown conversion.
Partial update for ConversionOptions.
The primary result of HTML conversion and extraction.
Document-level metadata extracted from <head> and top-level elements.
A single node in the document tree.
A structured document tree representing the semantic content of an HTML document.
A single cell in a table grid.
Header element metadata with hierarchy tracking.
Heading style options for Markdown output.
Highlight rendering style for <mark> elements.
Comprehensive metadata extraction result from HTML document.
Image metadata with source and dimensions.
Image source classification for proper handling and processing.
Hyperlink metadata with categorization and attributes.
Link rendering style in Markdown output.
Link classification based on href value and document context.
List indentation character type.
Line break syntax in Markdown output.
The semantic content type of a document node.
Context information passed to all visitor methods.
Node type enumeration covering all HTML element types.
Output format for conversion.
HTML preprocessing options for document cleanup before conversion.
Partial update for PreprocessingOptions.
HTML preprocessing aggressiveness level.
A non-fatal warning generated during HTML processing.
Structured data block (JSON-LD, Microdata, or RDFa).
Structured data format type.
A top-level extracted table with both structured data and markdown representation.
A structured table grid with cell-level data including spans.
An inline text annotation with byte-range offsets.
Text directionality of document content.
Result of a visitor callback.
Categories of processing warnings.
Whitespace handling strategy during conversion.