| HTML Tools: |
Parse, Search, Scrape & Update |
| Also Includes: |
JavaDoc Upgrader, News-Site Scraper, Browser Control (CDP), JSON Package,
Build Tool, CSS Parser
|
| |
| Newest: |
May 2026 |
| Compiled With: |
JDK 11 |
HTML PARSE, SEARCH & SCRAPE
The HTML Parser produces a Vector<HTMLNode> structure for easy rewrite and analysis.
NodeSearch utilities simplify sub-list queries, attribute edits, and table or list extraction.
These same utilities power the JavaDoc Upgrader and Browser scraping workflows.
BROWSER CONTROL, CDP, WEBSOCKETS & JSON
Includes a Headless / Headful Browser Controller built on the Chrome DevTools Protocol (CDP).
Communication uses WebSockets and JSON envelopes to issue commands and receive events.
Typed Java classes represent domains, commands and events, making browser automation and scraping
predictable and straightforward.
JSON PACKAGE
A pragmatic JSON library complements javax.json with clear readers, writers, and
validation helpers. Used throughout CDP generation and toolchains, it emphasizes predictable typing
and meaningful error reporting.
BUILD TOOL — Torello.Java.Build
A multi‑stage build system orchestrates compilation, JAR/TAR packaging, and Javadoc upgrades.
It supports the project’s custom source layout and automates inlining of external HTML into
JavaDoc output.
JAVADOC UPGRADER
Continuously improved and used daily by the author. Not fully documented for external use, so
expect evolving behavior as upgrades roll out.
CSS PARSER
Implements CSS Syntax Level 3 tokenization and parsing into higher‑level nodes, useful for style
normalization and integrated HTML processing.