Skip to content

Latest commit

 

History

History
20 lines (14 loc) · 698 Bytes

README.md

File metadata and controls

20 lines (14 loc) · 698 Bytes

Uncaring HTML Parser

HTML parser that intend to be fast, but I didn't benchmark it yet. And it's yet to be documented. And it doesn't parse all HTML. It searches given HTML for matching tags without parsing whole HTML or building it's DOM.

Example usage

String html = "<a href='http://example.org/' class='link'>";

new Html(html).css("a.link").filter(a -> a.attr("href").contains("http:")).forEach(e -> {
    System.out.println(e.getTextContent() + " : " + e.attr("href"));
});

Repositories

I'm not planning to upload it to any public Maven repositories, but it can be linked using JitPack repository.