Extract content blocks and metadata from HTML source
Extract ordered article metadata, content-bearing blocks, and embedded artifact references from static source HTML. This skill defines the structural backbone that downstream generation consumes.
PageStructure plus block completeness metrics.articleUrlsourceHtmlOutput shapes are defined in:
TRR-APP/apps/web/src/lib/admin/design-docs-pipeline-types.tsPageStructureblockCompletenessheaderFidelityRequirementsnoteTextRequirementsPageStructure payload with stable ordering and block indexes.blockCompleteness from matched versus expected content-bearing blocks.blockCompleteness indicates missing content.Return:
page_structuremetadata_summaryembed_summaryblock_completenesswarnings