How do you parse and process HTML/XML in PHP?-第2页回答

How can one parse HTML/XML and extract information from it?

标签： php xml parsing xml-parsing html-parsing

29条回答

2楼-- · 2018-12-31 00:35

JSON and array from XML in three lines:

$xml = simplexml_load_string($xml_string);
$json = json_encode($xml);
$array = json_decode($json,TRUE);

Ta da!

0人赞添加讨论(0) 举报

与君花间醉酒

3楼-- · 2018-12-31 00:37

For 1a and 2: I would vote for the new Symfony Componet class DOMCrawler ( DomCrawler ). This class allows queries similar to CSS Selectors. Take a look at this presentation for real-world examples: news-of-the-symfony2-world.

The component is designed to work standalone and can be used without Symfony.

The only drawback is that it will only work with PHP 5.3 or newer.

0人赞添加讨论(0) 举报

皆成旧梦

4楼-- · 2018-12-31 00:37

This is commonly referred to as screen scraping, by the way. The library I have used for this is Simple HTML Dom Parser.

0人赞添加讨论(0) 举报

浅入江南

5楼-- · 2018-12-31 00:37

You could try using something like HTML Tidy to cleanup any "broken" HTML and convert the HTML to XHTML, which you can then parse with a XML parser.

0人赞添加讨论(0) 举报

永恒的永恒

6楼-- · 2018-12-31 00:40

I recommend PHP Simple HTML DOM Parser.

It really has nice features, like:

foreach($html->find('img') as $element)
       echo $element->src . '<br>';

0人赞添加讨论(0) 举报

余生无你

7楼-- · 2018-12-31 00:40

Advanced Html Dom is a simple HTML DOM replacement that offers the same interface, but it's DOM-based which means none of the associated memory issues occur.

It also has full CSS support, including jQuery extensions.

0人赞添加讨论(0) 举报

How do you parse and process HTML/XML in PHP?

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间