Loading...

XML

Word

Printable

Details

Type: Improvement
Resolution: Fixed
Priority: Major
Fix Version/s: 6.0-milestone-1, 5.4.3
Affects Version/s: 5.4.2
Component/s: Search - Solr
Labels:
- performance

Development Priority:
High
Difficulty:
Medium
Documentation:
N/A
Documentation in Release Notes:
N/A
Similar issues:

Description

This job is triggered at XWiki startup and does two things:

removes from the Solr index the entries that correspond to XWiki documents that don't exist anymore in the database
indexes the XWiki documents that are not already indexed (or for which a newer version exists in the database)

The goal is to synchronize the Solr index with the XWiki database.

This synchronization is costly when the Solr index or the XWiki database is big because:

for each document entry in the Solr index we need to check if a corresponding XWiki document exists in the database
for each document in the XWiki database we need to check if the Solr index has a corresponding entry (for the current version of the XWiki document).

Basically we need to iterate over the entire Solr Index and XWiki database at startup. Even if this job runs in a daemon thread with minimum priority it still pushes the CPU to 100% for a while, depending on the size of the Solr index and XWiki database. Moreover, queries on both sides are not done with pagination. In other words, currently a considerable part of the Solr Index and XWiki database is loaded into memory.

We can improve this by:

Paginate the queries and leave the CPU after each batch
Stream the Solr query results
Improve the Solr queries by disabling faceting and highlighting

Attachments

Activity

People

Assignee:: Marius Dumitru Florea

Reporter:: Marius Dumitru Florea

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 13/Mar/14 17:13

Updated:: 13/Mar/14 19:00

Resolved:: 13/Mar/14 17:23

Date of First Response:: 13/Mar/14 5:46 PM