HTML Tools: Parse, Search, Scrape & Update
Also Includes: JavaDoc Upgrader, News-Site Scraper, Browser Control (CDP), JSON Package, Build Tool, CSS Parser
 
Newest: May 2026
Compiled With: JDK 11
VIEW DOCS DOWNLOAD DOCS (~205 MB) DOWNLOAD JAR (~7.8 MB)

HTML PARSE, SEARCH & SCRAPE

The HTML Parser produces a Vector<HTMLNode> structure for easy rewrite and analysis. NodeSearch utilities simplify sub-list queries, attribute edits, and table or list extraction. These same utilities power the JavaDoc Upgrader and Browser scraping workflows.

BROWSER CONTROL, CDP, WEBSOCKETS & JSON

Includes a Headless / Headful Browser Controller built on the Chrome DevTools Protocol (CDP). Communication uses WebSockets and JSON envelopes to issue commands and receive events. Typed Java classes represent domains, commands and events, making browser automation and scraping predictable and straightforward.

JSON PACKAGE

A pragmatic JSON library complements javax.json with clear readers, writers, and validation helpers. Used throughout CDP generation and toolchains, it emphasizes predictable typing and meaningful error reporting.

BUILD TOOL — Torello.Java.Build

A multi‑stage build system orchestrates compilation, JAR/TAR packaging, and Javadoc upgrades. It supports the project’s custom source layout and automates inlining of external HTML into JavaDoc output.

JAVADOC UPGRADER

Continuously improved and used daily by the author. Not fully documented for external use, so expect evolving behavior as upgrades roll out.

CSS PARSER

Implements CSS Syntax Level 3 tokenization and parsing into higher‑level nodes, useful for style normalization and integrated HTML processing.