Is there a web crawler library available for PHP o - 码农岛

Is there a web crawler library available for PHP o

2019-04-14 00:58发布

站内问答 / PHP

820 5

做自己的国王

女 | 书童

私信

Is there a web crawler library available for PHP or Ruby? a library that can do it depth first or breadth first... and handle the links even when href="../relative_path.html" and base url is used.

标签： php ruby web-crawler

5条回答

2楼-- · 2019-04-14 01:22

If you need to scrape web pages that use javascript you can use Capybara with a driver which will spin up a real browser, such as poltergeist. Its usually used with a testing framework for acceptance testing, but can also be used outside a testing framework.

查看更多

0人赞添加讨论(0) 举报

3楼-- · 2019-04-14 01:27

If you'd like to learn basic web crawler & search things, you can start look at "luna engine".

查看更多

0人赞添加讨论(0) 举报

叼着烟拽天下

4楼-- · 2019-04-14 01:31

http://phpcrawl.cuab.de/

查看更多

0人赞添加讨论(0) 举报

5楼-- · 2019-04-14 01:33

Check this page out for a Ruby library: Ruby Mechanize

I'd like to mention that you would still be responsible for the way in which your crawler traverses sites.

查看更多

0人赞添加讨论(0) 举报

干净又极端

6楼-- · 2019-04-14 01:44

you can go for webrat or watir in ruby, much easier than mechanize

查看更多

0人赞添加讨论(0) 举报

相关问题

相关文章

收藏的人(6)