jTidy and TagSoup documentation

2019-04-28 12:04发布

站内文章 / Java

88 0

祖国的老花朵

女 | 书童

私信

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效，请关闭广告屏蔽插件后再试):

问题:

I'm looking for documentation (officially documentation if it is possible) for TagSoup and jTidy libraries.

I want use this libraries to manipulate html "tagsoup" files that include xml tags with different namespaces mixed between html (html, xhtml or html5) tags.

I have tested HTMLCleaner, NekoHTML and Jericho, but i don't find documentation for jTidy and TagSoup, apart from simplest examples to clear a file.

I need documentation about manipulate contents, replace tags, extract info, etc...

Thanks

Note: After test all options, I used StAX / Woodstox :

http://wiki.fasterxml.com/WoodstoxHome
https://en.wikipedia.org/wiki/StAX
https://docs.oracle.com/javase/tutorial/jaxp/stax/using.html

回答1:

The answer to a similar question on the tagsoup-friends google group may help:

Documentation for TagSoup

You've probably already seen them, but the javadoc for JTidy is available here: http://jtidy.sourceforge.net/apidocs/index.html

标签： java jtidy tag-soup jericho-html-parser

祖国的老花朵

女 | 书童

私信

收藏的人(0)

Ta的文章更多文章

0条评论

还没有人评论过~

jTidy and TagSoup documentation

问题:

回答1:

收藏的人(0)

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮