R readHTMLTable failed to load external entity [du

Posted 2019-02-15 18:07

Question:

This question already has an answer here:

  • R Error using readHTMLTable 2 answers

When I run the line on my laptop,

table500 <- readHTMLTable('http://en.wikipedia.org/wiki/List_of_S%26P_500_companies')[[1]]

it gets the data. When I run it on my desktop, I receive the error

Error: failed to load external entity "http://en.wikipedia.org/wiki/List_of_S%26P_500_companies".

So I'm guessing the problem has something to do with the network settings on my desktop, but I haven't the slightest idea what it could be. Any suggestions?

Answer 1:

The link I mentioned in the comment has solutions using the RCurl and httr packages. Here is a solution using the rvest package.

    library(rvest)

    # read_html() replaces html(), which is deprecated in current rvest releases
    kk <- read_html("http://en.wikipedia.org/wiki/List_of_S%26P_500_companies") %>%
      html_table(fill = TRUE) %>%
      .[[1]]  # table 1 only

head(kk)
  Ticker symbol            Security SEC filings            GICS Sector                GICS Sub Industry Address of Headquarters
1           MMM          3M Company     reports            Industrials         Industrial Conglomerates     St. Paul, Minnesota
2           ABT Abbott Laboratories     reports            Health Care Health Care Equipment & Services North Chicago, Illinois
3          ABBV              AbbVie     reports            Health Care                  Pharmaceuticals North Chicago, Illinois
4           ACN       Accenture plc     reports Information Technology   IT Consulting & Other Services         Dublin, Ireland
5           ACE         ACE Limited     reports             Financials    Property & Casualty Insurance     Zurich, Switzerland
6           ACT         Actavis plc     reports            Health Care                  Pharmaceuticals         Dublin, Ireland
  Date first added     CIK
1                    66740
2                     1800
3       2012-12-31 1551152
4       2011-07-06 1467373
5       2010-07-15  896159
6                   884629
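
For comparison, the httr route mentioned at the top of this answer could look roughly like the sketch below. It downloads the page with httr and hands the HTML text to the XML package, so readHTMLTable never opens the URL itself. The helper names `tables_from_html` and `tables_from_url` are made up for illustration, and whether this avoids the error on the desktop depends on its cause, which the question does not establish.

```r
library(httr)  # HTTP client used for the download step
library(XML)   # provides htmlParse() and readHTMLTable()

# Hypothetical helper: parse HTML that is already in memory as a string.
tables_from_html <- function(html_text) {
  doc <- htmlParse(html_text, asText = TRUE)
  readHTMLTable(doc, stringsAsFactors = FALSE)
}

# Hypothetical helper: fetch the page with httr, then parse the text.
tables_from_url <- function(url) {
  resp <- GET(url)
  stop_for_status(resp)  # turn HTTP errors into R errors
  tables_from_html(content(resp, as = "text", encoding = "UTF-8"))
}

# Offline check on a tiny inline table (made-up sample, not the real page).
demo_html <- paste0(
  "<table><tr><th>Ticker</th><th>Security</th></tr>",
  "<tr><td>MMM</td><td>3M Company</td></tr></table>"
)
demo <- tables_from_html(demo_html)[[1]]

# With network access, the call from the question would become:
# table500 <- tables_from_url("http://en.wikipedia.org/wiki/List_of_S%26P_500_companies")[[1]]
```

Keeping the download and the parsing in separate functions makes it easier to tell whether a failure comes from the network request or from the HTML parsing.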


Tags: xml r connection