Class MergePolicy
- All Implemented Interfaces:
Closeable,AutoCloseable,Cloneable
- Direct Known Subclasses:
LogMergePolicy,NoMergePolicy,SortingMergePolicy,TieredMergePolicy,UpgradeIndexMergePolicy
Expert: a MergePolicy determines the sequence of primitive merge operations.
Whenever the segments in an index have been altered by
IndexWriter, either the addition of a newly
flushed segment, addition of many segments from
addIndexes* calls, or a previous merge that may now need
to cascade, IndexWriter invokes findMerges(org.apache.lucene.index.MergePolicy.MergeTrigger, org.apache.lucene.index.SegmentInfos) to give the MergePolicy a chance to pick
merges that are now required. This method returns a
MergePolicy.MergeSpecification instance describing the set of
merges that should be done, or null if no merges are
necessary. When IndexWriter.forceMerge is called, it calls
findForcedMerges(SegmentInfos,int,Map) and the MergePolicy should
then return the necessary merges.
Note that the policy can return more than one merge at
a time. In this case, if the writer is using SerialMergeScheduler, the merges will be run
sequentially but if it is using ConcurrentMergeScheduler they will be run concurrently.
The default MergePolicy is TieredMergePolicy.
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classA map of doc IDs.static classThrown when a merge was explicity aborted becauseIndexWriter.close(boolean)was called withfalse.static classException thrown if there are any problems while executing a merge.static classA MergeSpecification instance provides the information necessary to perform multiple merges.static enumMergeTrigger is passed tofindMerges(MergeTrigger, SegmentInfos)to indicate the event that triggered the merge.static classOneMerge provides the information necessary to perform an individual primitive merge operation, resulting in a single new segment. -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionclone()abstract voidclose()Release all resources for the policy.abstract MergePolicy.MergeSpecificationfindForcedDeletesMerges(SegmentInfos segmentInfos) Determine what set of merge operations is necessary in order to expunge all deletes from the index.abstract MergePolicy.MergeSpecificationfindForcedMerges(SegmentInfos segmentInfos, int maxSegmentCount, Map<SegmentCommitInfo, Boolean> segmentsToMerge) Determine what set of merge operations is necessary in order to merge to invalid input: '<'= the specified segment count.abstract MergePolicy.MergeSpecificationfindMerges(MergePolicy.MergeTrigger mergeTrigger, SegmentInfos segmentInfos) Determine what set of merge operations are now necessary on the index.final doubleReturns the largest size allowed for a compound file segmentfinal doubleReturns currentnoCFSRatio.voidsetIndexWriter(IndexWriter writer) Sets theIndexWriterto use by this merge policy.final voidsetMaxCFSSegmentSizeMB(double v) If a merged segment will be more than this value, leave the segment as non-compound file even if compound file is enabled.final voidsetNoCFSRatio(double noCFSRatio) If a merged segment will be more than this percentage of the total size of the index, leave the segment as non-compound file even if compound file is enabled.booleanuseCompoundFile(SegmentInfos infos, SegmentCommitInfo mergedInfo) Returns true if a new segment (regardless of its origin) should use the compound file format.
-
Constructor Details
-
MergePolicy
public MergePolicy()Creates a new merge policy instance. Note that if you intend to use it without passing it toIndexWriter, you should callsetIndexWriter(IndexWriter).
-
-
Method Details
-
clone
-
setIndexWriter
Sets theIndexWriterto use by this merge policy. This method is allowed to be called only once, and is usually set by IndexWriter. If it is called more than once,SetOnce.AlreadySetExceptionis thrown.- See Also:
-
findMerges
public abstract MergePolicy.MergeSpecification findMerges(MergePolicy.MergeTrigger mergeTrigger, SegmentInfos segmentInfos) throws IOException Determine what set of merge operations are now necessary on the index.IndexWritercalls this whenever there is a change to the segments. This call is always synchronized on theIndexWriterinstance so only one thread at a time will call this method.- Parameters:
mergeTrigger- the event that triggered the mergesegmentInfos- the total set of segments in the index- Throws:
IOException
-
findForcedMerges
public abstract MergePolicy.MergeSpecification findForcedMerges(SegmentInfos segmentInfos, int maxSegmentCount, Map<SegmentCommitInfo, Boolean> segmentsToMerge) throws IOExceptionDetermine what set of merge operations is necessary in order to merge to invalid input: '<'= the specified segment count.IndexWritercalls this when itsIndexWriter.forceMerge(int)method is called. This call is always synchronized on theIndexWriterinstance so only one thread at a time will call this method.- Parameters:
segmentInfos- the total set of segments in the indexmaxSegmentCount- requested maximum number of segments in the index (currently this is always 1)segmentsToMerge- contains the specific SegmentInfo instances that must be merged away. This may be a subset of all SegmentInfos. If the value is True for a given SegmentInfo, that means this segment was an original segment present in the to-be-merged index; else, it was a segment produced by a cascaded merge.- Throws:
IOException
-
findForcedDeletesMerges
public abstract MergePolicy.MergeSpecification findForcedDeletesMerges(SegmentInfos segmentInfos) throws IOException Determine what set of merge operations is necessary in order to expunge all deletes from the index.- Parameters:
segmentInfos- the total set of segments in the index- Throws:
IOException
-
close
public abstract void close()Release all resources for the policy.- Specified by:
closein interfaceAutoCloseable- Specified by:
closein interfaceCloseable
-
useCompoundFile
Returns true if a new segment (regardless of its origin) should use the compound file format. The default implementation returnstrueiff the size of the given mergedInfo is less or equal togetMaxCFSSegmentSizeMB()and the size is less or equal to the TotalIndexSize *getNoCFSRatio()otherwisefalse.- Throws:
IOException
-
getNoCFSRatio
public final double getNoCFSRatio()Returns currentnoCFSRatio.- See Also:
-
setNoCFSRatio
public final void setNoCFSRatio(double noCFSRatio) If a merged segment will be more than this percentage of the total size of the index, leave the segment as non-compound file even if compound file is enabled. Set to 1.0 to always use CFS regardless of merge size. -
getMaxCFSSegmentSizeMB
public final double getMaxCFSSegmentSizeMB()Returns the largest size allowed for a compound file segment -
setMaxCFSSegmentSizeMB
public final void setMaxCFSSegmentSizeMB(double v) If a merged segment will be more than this value, leave the segment as non-compound file even if compound file is enabled. Set this to Double.POSITIVE_INFINITY (default) and noCFSRatio to 1.0 to always use CFS regardless of merge size.
-