process
Process a document and output the extracted structured dataset.
Parses a structured data document (JSON-LD, RDFa, or Microdata) and outputs the extracted dataset as a JSON response.
Source offset tracking is enabled by default, recording the position of each statement within the input document. This is useful for tooling that needs to correlate extracted triples with their source locations.
Input is read from a file or stdin when no --in file is specified. Output is written to stdout or a file.
schemamarkup process [flags]Examples
# Process a file and write the dataset to stdout
schemamarkup process -i index.html
# Process a document piped from stdin
cat index.html | schemamarkup process
# Save the results to a file
schemamarkup process -i index.html -o results.json
# Process without capturing source offsets
schemamarkup process -i index.html --capture-offsets=false
# Canonicalize the results
schemamarkup process -i index.html --canonicalizeFlags
- bool
- bool
- string
- string
- string
Global Flags
- string