Ticket #212 (closed defect: fixed)

Opened 2 years ago

Last modified 1 year ago

Search indexing is loosing whitespace between words sometimes

Reported by: ebrown Assigned to: ebrown
Priority: unassigned Milestone:
Component: topaz Version: 0.5-SNAPSHOT
Keywords: search Cc:
Blocking: Blocked By:

Description

See bottom of source:head/topazproject/webservices/search/topaz-lucene-impl/src/main/resources/org/topazproject/fedoragsearch/topazlucene/topazFoxmlToLucene.xslt. This thing brings in all the text of the body of the article sans xml. But if two words are separated by markup, when the markup is stripped, the words are frequently placed next to each other with no whitespace between. This will result in improper indexing and inaccurate search results.

Dependency Graph

Change History

11/29/06 19:03:21 changed by ebrown

  • status changed from new to closed.
  • resolution set to fixed.

Should be fixed by [1469]. Thanks Ronald!

08/07/07 16:25:51 changed by

  • milestone deleted.

Milestone Bugs deleted