Details
-
Bug
-
Resolution: Fixed
-
Major
-
4.0-rc-1, 3.5.1
-
None
-
Windows XP, JRE 1.6, reproduced with the install version (hsql) and war version (PostgreSQL 9.1)
-
word, doc, docx, attachment, index, lucene
-
Unknown
-
N/A
-
N/A
-
Description
When a Word is attached to a page, it's content is indexed properly by Lucene. However, after attaching a new version of the same document, only the original version is found. In the sample there are 2 document (both in doc and docx format). The first contains the text "1212", the second contains "3434".
To reproduce the bug, attach test.doc from directory 1212. Search will find the document. Attach test.doc from directory 3434 to the same page. Search will find only 1212, but not 3434.
If I delete the attachment from the page, and reattach test.doc from 3434, then it works correctly: will not found 1212, but will found 3434. (however, versions are lost this way)