Simple HTML Dom: How to remove elements?

2019-01-05 03:17发布

I would like to use Simple HTML DOM to remove all images in an article so I can easily create a small snippet of text for a news ticker but I haven't figured out how to remove elements with it.

Basically I would do

  1. Get content as HTML string
  2. Remove all image tags from content
  3. Limit content to x words
  4. Output.

Any help?

6条回答
看我几分像从前
2楼-- · 2019-01-05 03:52

This is working for me:

foreach($html->find('element') as $element){
   $element = NULL;
}
查看更多
成全新的幸福
3楼-- · 2019-01-05 03:56

The supposed solutions are quite expensive and practically unusable in a big loop or other kind of repetition.

I prefer to use "soft deletes":

foreach($html->find('somecondition'),$item){
    if (somecheck) $item->setAttribute('softDelete', true); //<= set marker to check in further code
    $item->outertext='';


   foreach($foo as $bar){
       if(!baz->getAttribute('softDelete'){
           //do something 
        }
    }

}
查看更多
聊天终结者
4楼-- · 2019-01-05 03:58

There is no dedicated methods for removing elements. You just find all the img elements and then do

$e->outertext = '';
查看更多
够拽才男人
5楼-- · 2019-01-05 03:58

when you only delete the outer text you delete the HTML content itself, but if you perform another find on the same elements it will appear in the result. the reason is that the simple HTML DOM object still has it's internal structure of the element, only without its actual content. what you need to do in order to really delete the element is simply reload the HTML as string to the same variable. this way the object will be recreated without the deleted content, and the simple HTML DOM object will be built without it.

here is an example function:

public function removeNode($selector)
{
    foreach ($this->find($selector) as $node)
    {
        $node->outertext = '';
    }

    $this->load($this->save());        
}

put this function inside the simple_html_dom class and you're good.

查看更多
乱世女痞
6楼-- · 2019-01-05 04:14

I could not figure out where to put the function so I just put the following directly in my code:

$html->load($html->save());

It basically locks changes made in the for loop back into the html per above.

查看更多
Melony?
7楼-- · 2019-01-05 04:15

I think you have some difficulties because you forgot to save(dump the internal DOM tree back into string).

Try this:

$html = file_get_html("http://example.com");

foreach($html ->find('img') as $item) {
    $item->outertext = '';
    }

$html->save();

echo $html;
查看更多
登录 后发表回答