One explanation for this discrepancy is that the histogram records when citations occur, not the publication years of the papers being cited. For example, if a paper published in 2010 receives a citation in 2016, the histogram counts that citation under 2016.
As for the crawling issue, I have resolved it in a Python scraper. I will link to it in a subsequent comment.
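To make that concrete, here is a minimal sketch with made-up data. The `citations` list and its tuple layout are assumptions for illustration only, not the format the profile page or any scraper actually uses.

```python
from collections import Counter

# Hypothetical records: (publication_year, year_the_citation_occurred).
citations = [
    (2010, 2016),  # paper from 2010, cited in 2016
    (2010, 2012),
    (2005, 2012),  # paper from 2005, first cited in 2012
]

# The histogram buckets each citation by the year it occurred,
# regardless of when the cited paper was published.
histogram = Counter(cited_in for _, cited_in in citations)

earliest_in_histogram = min(histogram)
earliest_publication = min(pub for pub, _ in citations)
```

With this data the earliest histogram year is later than the earliest publication year, which is the kind of gap described above.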
Noted by @dragomirradev
The "year" column is based on the earliest year in the citation count histogram, which in fact is not the earliest year in terms of publications.
For example:
[Screenshot: Screen Shot 2019-08-24 at 10 26 34 AM]
But see:
One reasonable hypothesis is that the histogram is capped at 20 years... but here's a counterexample:
No idea what's going on.
From a crawling perspective, the histogram is easy to get. Getting the actual earliest year requires sorting the publications by time and then "scrolling".
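A rough sketch of the second approach, assuming the publication list has already been scraped into dicts. The `year` field name and the possibility of missing years are assumptions, and the function name is hypothetical.

```python
def earliest_publication_year(publications):
    """Return the earliest known publication year, or None if none is known.

    `publications` is a hypothetical list of dicts with an optional "year" key.
    """
    # Skip entries with no year, since they cannot be ordered by time.
    years = [p["year"] for p in publications if p.get("year") is not None]
    if not years:
        return None
    # Sorting mirrors "sort pubs by time"; the first entry is the earliest.
    return sorted(years)[0]


pubs = [
    {"title": "A", "year": 2003},
    {"title": "B", "year": None},
    {"title": "C", "year": 1998},
]
first_year = earliest_publication_year(pubs)
```

The cost here is in the crawl itself: every page of the publication list has to be loaded before the minimum is trustworthy, whereas the histogram comes with the first page.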