Parse HTML in Java using the best library

Asked By 20 points N/A Posted on -

Hello everyone

I want to parse HTML in Java using a library. Can anyone tell me which library is best for parsing HTML in Java?

Thanks

Answered By 0 points N/A #109877


JTidy could be the one you are looking for. Here's what you get with that library:

You can use it to clean up faulty HTML, and it gives you a DOM interface to the document being processed. Overall, it's a good choice for parsing HTML in Java.
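To make the JTidy route a bit more concrete, here is a rough sketch of what that could look like. It assumes the JTidy jar (the org.w3c.tidy package) is on your classpath. The class name JTidyExample, the helper titleOf, and the sample HTML are made up for illustration only.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;

import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.w3c.tidy.Tidy;

// Hypothetical example class; assumes JTidy is available on the classpath.
public class JTidyExample {

    // Lets JTidy repair the HTML, then reads the <title> text through the W3C DOM interface.
    static String titleOf(String html) {
        Tidy tidy = new Tidy();
        tidy.setQuiet(true);            // suppress the summary output
        tidy.setShowWarnings(false);    // do not report every repaired problem
        tidy.setInputEncoding("UTF-8");

        // Passing null as the output stream: we only want the DOM, not the cleaned markup.
        Document doc = tidy.parseDOM(
                new ByteArrayInputStream(html.getBytes(StandardCharsets.UTF_8)), null);

        NodeList titles = doc.getElementsByTagName("title");
        if (titles.getLength() == 0) {
            return "";
        }
        // Read the text node under <title>; it may be absent if the title is empty.
        Node text = titles.item(0).getFirstChild();
        return text == null ? "" : text.getNodeValue();
    }

    public static void main(String[] args) {
        // Deliberately sloppy HTML: <head>, <p> and <html> are never closed.
        String html = "<html><head><title>Hi</title><body><p>unclosed paragraph";
        System.out.println(titleOf(html));
    }
}
```

From the returned Document you can walk the tree with the usual org.w3c.dom calls, the same way you would with any XML DOM.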

jsoup is also a good choice for HTML processing. It does much the same job as JTidy, except that you query the parsed document with tag queries, i.e. a CSS-style selector syntax.
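For comparison, here is a small sketch of the selector style in jsoup. It assumes the jsoup jar is on your classpath. The class name JsoupExample, the helper linkTargets, and the sample HTML are just placeholders for illustration.

```java
import java.util.ArrayList;
import java.util.List;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// Hypothetical example class; assumes jsoup is available on the classpath.
public class JsoupExample {

    // Collects the href value of every <a> element that has an href attribute.
    static List<String> linkTargets(String html) {
        Document doc = Jsoup.parse(html);
        List<String> hrefs = new ArrayList<>();
        // "a[href]" is a CSS-style selector: anchor tags carrying an href attribute.
        for (Element link : doc.select("a[href]")) {
            hrefs.add(link.attr("href"));
        }
        return hrefs;
    }

    public static void main(String[] args) {
        String html = "<p>See <a href=\"https://example.com/docs\">the docs</a>"
                + " and <a name=\"x\">an anchor without a link</a>";
        System.out.println(linkTargets(html));
    }
}
```

The selector string is the main difference from the DOM approach: instead of walking nodes yourself, you describe what you want and let the library find it.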

In my view, I'd go for JTidy, as you might say it is less technical, or more flexible, to work with.
