How to retrieve/calculate citation counts and/or c

Posted 2020-05-31 03:55

Question:

I have a list of authors. I wish to automatically retrieve/calculate a citation index (h-index, m-quotient, g-index, HCP indicator or ...) for each author, ideally per year.

Author Year Index
first  2000   1
first  2001   2
first  2002   3

I can calculate all of these metrics given the citation counts for each paper of each researcher.

Author Paper Year Citation_count
first    1    2000   1
first    2    2000   2
first    3    2002   3
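
For reference, here is a minimal sketch of how the h-index and g-index could be computed from rows shaped like the table above. The function names and the tuple layout are assumptions made for illustration. The g-index here is capped at the number of papers, which is one common convention but not the only one.

```python
from collections import defaultdict


def h_index(citations):
    """Largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def g_index(citations):
    """Largest g such that the top g papers total at least g*g citations.

    Assumption: g is capped at the number of papers.
    """
    ranked = sorted(citations, reverse=True)
    total = 0
    g = 0
    for rank, count in enumerate(ranked, start=1):
        total += count
        if total >= rank * rank:
            g = rank
    return g


def metrics_by_author(rows):
    """Group (author, paper, year, citation_count) rows and score each author."""
    by_author = defaultdict(list)
    for author, _paper, _year, count in rows:
        by_author[author].append(count)
    return {
        author: {"h": h_index(counts), "g": g_index(counts)}
        for author, counts in by_author.items()
    }


# The three example rows from the table above.
rows = [
    ("first", 1, 2000, 1),
    ("first", 2, 2000, 2),
    ("first", 3, 2002, 3),
]
print(metrics_by_author(rows))  # {'first': {'h': 2, 'g': 2}}
```

A yearly index would additionally need the citation count of each paper as of each year, which the table above does not contain.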

Despite my efforts, I have not found an API/scraping method capable of this.

My institution has access to a number of services including Web of Science.

Answer 1:

Effectively, the main problem is building the citation graph. Once you have that, you can compute any metric you want (e.g. h-index, g-index, PageRank).

Supposing you have a collection of papers (that you've retrieved in some way), you can extract the citations from each of them and build the citation graph. You might find ParsCit useful: it is an open-source CRF-based reference string and logical document structure parsing package, which is also used by CiteSeerX and works great.
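
As a rough sketch of the graph step, assume the extracted references have already been resolved to paper IDs (the hard part, which this example skips). The citation count of a paper is then its in-degree in the graph. The `references` and `paper_author` dictionaries below are made-up data, not output from ParsCit, and the function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical citation graph: each paper ID maps to the paper IDs it cites.
references = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": [],
    "D": ["A", "C"],
}

# Hypothetical mapping from paper ID to author.
paper_author = {"A": "first", "B": "first", "C": "second", "D": "second"}


def citation_counts(references):
    """Count incoming citations (in-degree) for every paper in the graph."""
    counts = {paper: 0 for paper in references}
    for citing, cited_list in references.items():
        # A citing paper counts once per cited paper, even if listed twice.
        for cited in set(cited_list):
            if cited != citing:  # assumption: ignore a paper citing itself
                counts[cited] = counts.get(cited, 0) + 1
    return counts


def counts_by_author(counts, paper_author):
    """Collect the per-paper citation counts of each author."""
    grouped = defaultdict(list)
    for paper, n in counts.items():
        if paper in paper_author:
            grouped[paper_author[paper]].append(n)
    return dict(grouped)


counts = citation_counts(references)
print(counts)  # {'A': 1, 'B': 1, 'C': 3, 'D': 0}
print(counts_by_author(counts, paper_author))
```

The per-author lists of counts are what any of the metrics mentioned above (h-index, g-index and so on) take as input.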