Ticket #1259 (new defect)

Opened 3 years ago

Last modified 3 years ago

Google Scholar link returning incorrect result set

Reported by: npeterson Assigned to: josowski
Priority: unassigned Milestone: 0.9.3_rc2
Component: ambra Version: 0.9.3
Keywords: Cc:
Blocking: Blocked By:

Description

When I select the GS links (either the related articles or look for citations) for two of the articles published on Feb 2nd, the title of the article is not indicated - instead GS says:

Sorry, we didn't find any articles related to PLoS: 0 notes 0 comments.

see also enclosed screenshot.

http://www.plosone.org/article/related/info%3Adoi%2F10.1371%2Fjournal.pone.0004310 http://www.plosone.org/article/related/info%3Adoi%2F10.1371%2Fjournal.pone.0004298

I looked at other articles in Feb - eg on Feb 3rd, and the links seemed to be working properly. Is there something wrong just with the articles published on Feb 2nd?

Dependency Graph

Change History

04/06/09 16:03:46 changed by npeterson

eg this article

http://plosmedicine-demo.plos.org/article/info%3Adoi%2F10.1371%2Fjournal.pmed.0020124

has no citations on GS when you click the link but I think that's because it is searching with the wrong search string ie it searches with

http://dx.plos.org/10.1371/journal.pmed.0020124.

if you search with just the doi

doi:10.1371/journal.pmed.0020124 you get 353 citations

Andy thinks this is also causing a problem on the searching of google blogs

04/06/09 16:14:39 changed by npeterson

Not sure if it's been already reported or considered, but I'm not convinced that the Google Scholar functionality is working as well as it could.

For example, for http://plosgenetics-demo.plos.org/article/related/info%3Adoi%2F10.1371%2Fjournal.pgen.0030050 [] (http://scholar.google.com/scholar?hl=en&lr=&cites=http%3A%2F%2Fdx.plos.org%2F10.1371%2Fjournal.pgen.0030050 [] ), the first result (based on DOI in the search term?) is a Genetics Soc America article.

However, if you pull in author and title within the Scholar search term, like Bio and Medicine, you seem to get the correct article at the top more often: http://medicine.plosjournals.org/perlserv/?request=get-document&doi=10.1371/journal.pmed.0040125 [] (http://scholar.google.com/scholar?q=author:M+author:Nomura+Polymorphisms,+Mutations,+and+Amplification+of+the+EGFR+Gene+in+Non-Small+Cell+Lung+Cancers [] )

A similar search mechanism (author + title, not doi) applied to the first article above only pulls up the relevant article (http://scholar.google.com/scholar?q=author:M+author:Johnson-Schlitz+Multiple-Pathway+Analysis+of+Double-Strand+Break+Repair+Mutations+in+Drosophila [] )

Is this something to consider changing (to bring up what appear to be more accurate results)?

04/06/09 16:22:44 changed by npeterson

  • version changed from 0.9.1_rc2-SNAPSHOT to 0.9.2.

04/10/09 14:56:32 changed by josowski

We might switch to title/author:

http://scholar.google.com/scholar?hl=en&lr=&q=%22Modulation+of+the+%CE%B2-Catenin+Signaling+Pathway+by+the+Dishevelled-Associated+Protein+Hipk1%22+author%3A%22Sarah+H.+Louie%22&btnG=Search

BUT we would have no direct link to citations or related articles at this point. That is, the user would have to go to scholar by the above link and then click on citations or related. But this won't fix the issue with newly published articles.

The root of the issue here is the Google search index is out of date. To my knowledge there is no way to force google to crawl our sites more frequently, we just have to wait it out.

04/10/09 15:35:16 changed by josowski

  • owner changed from dragisak to josowski.

05/06/09 09:23:16 changed by npeterson

  • version changed from 0.9.2 to 0.9.3.
  • milestone changed from 0.9.3_rc1 to 0.9.3_rc2.