how to create URL extractor like facebook share

i need to extract data from url like title , description ,and any vedios images in the given url like facebook share button

like this : http://www.facebook.com/sharer.php?u=http://www.wired.com&t=Test

regards

标签： php python facebook

5条回答

在下西门庆

2楼-- · 2020-06-20 05:11

If the web site has support for oEmbed, that's easier and more robust than scraping HTML:

oEmbed is a format for allowing an embedded representation of a URL on third party sites. The simple API allows a website to display embedded content (such as photos or videos) when a user posts a link to that resource, without having to parse the resource directly.

oEmbed is supported by sites like YouTube and Flickr.

0人赞添加讨论(0) 举报

爷、活的狠高调

3楼-- · 2020-06-20 05:19

Use something like cURL to get the page and then something like Simple HTML DOM to parse it and extract the elements you want.

0人赞添加讨论(0) 举报

老娘就宠你

4楼-- · 2020-06-20 05:20

Embed.ly has a nice api for exactly this purpose. Their api returns the site's oEmbed data if available - otherwise, it attempts to extract a summary of the page like Facebook.

0人赞添加讨论(0) 举报

啃猪蹄的小仙女

5楼-- · 2020-06-20 05:21

While I was looking for a similar functionality, I came across a jQuery + PHP demo of the url extract feature of Facebook messages: http://www.99points.info/2010/07/facebook-like-extracting-url-data-with-jquery-ajax-php/

Instead of using an HTML DOM parser, it works with simple regular expressions. It looks for title, description and img tags. Hence, the image extraction doesn't perform well with a lot of websites, which use CSS for images. Also, Facebook looks first at its own meta tags and then at the classic description tag of HTML but it illustrates well the principe.

0人赞添加讨论(0) 举报

Lonely孤独者°

6楼-- · 2020-06-20 05:24

I am working on a project for this issue, it is not as easy as writing an html parser and expecting sites to be 'semantical'. Especially extracting videos and finding auto-play parameters are killing. You can check the project in http://www.embedify.me, which has also fb-style url preview script. As I see, embed.ly and oembed are passive parser, they need the sites to support them, so called providers, the approach is quite different than fb does.

0人赞添加讨论(0) 举报

how to create URL extractor like facebook share

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间