Ticket #556 (closed clarification: fixed)

Opened 5 years ago

Last modified 4 years ago

Some reference/citation volume "numbers" are not actually numbers

Reported by: ebrown Assigned to: susanne
Priority: high Milestone:
Component: topaz Version: 0.8
Keywords: Cc:
Blocking: Blocked By:

Description (Last modified by ebrown)

Multiple articles have non-integer volume "numbers" in their citations. For example:

  • Article 5: 86-A
  • Article 7: 296B & 302B

See http://dobby.topazproject.org:9090/fedora/get/doi:10.1371%2Fjournal.pone.0000007/XML ...

<ref id="pone.0000007-Hallgrmsson2">
  <label>35</label>
  <citation citation-type="journal" xlink:type="simple">
    <person-group person-group-type="author">
      <name name-style="western">
        <surname>Hallgrímsson</surname>
        <given-names>B</given-names>
      </name>
      <name name-style="western">
        <surname>Miyake</surname>
        <given-names>T</given-names>
      </name>
      <name name-style="western">
        <surname>Willmore</surname>
        <given-names>K</given-names>
      </name>
      <name name-style="western">
        <surname>Hall</surname>
        <given-names>BK</given-names>
      </name>
    </person-group>
    <year>2003</year>
    <article-title>
      Embryological origins of developmental stability: size, shape and fluctuating asymmetry in prenatal random bred mice.
    </article-title>
    <source>J Exp Zool</source>
    <volume>296B</volume>
    <fpage>40</fpage>
    <lpage>57</lpage>
  </citation>
</ref>

I think for now I'm going to log something and truncate to just the digits for now so I can move on and find any other problems.

The bibtex owl for hasVolume specifies this as a nonNegativeInteger.

There are 3 possibilities:

  • Change the definition of Citaiton.volume (bibtex:hasVolume) to be a string
  • Truncate the values to be just integers
  • Fix the data

Dependency Graph

Change History

08/16/07 15:01:41 changed by ebrown

  • description changed.

08/16/07 15:55:56 changed by amit

  • owner changed from amit to susanne.
  • priority changed from unassigned to high.

We are finding problems in references now that we are looking at extracting the information to store in mulgara. We might be able to fix some of them, but looks like the upstream process with regards to references elements needs tweaking.

08/16/07 16:22:48 changed by ronald

Ah yes, this is one of the mismatches between NLM and the Bibtex RDF. NLM allows even simple markup in the volume. Sounds like we need to create our own predicate (or at least take it from somewhere else - maybe prism's prism:volume would be better suited here).

08/21/07 10:02:08 changed by ebrown

This issue as well as two others is summarized at the bottom of MigrationScript.

09/02/07 02:26:41 changed by amit

  • status changed from new to closed.
  • resolution set to fixed.

Nasty fix in r3628 based on discussion with PLoS. The problem has been migrated to a different level, and is not a real fix. The real fix will have to be upstream with more stringent data quality checks.

07/16/08 11:00:34 changed by

  • milestone deleted.

Milestone 0.8 deleted