Class TermsEnum
- java.lang.Object
-
- org.apache.lucene.index.TermsEnum
-
- All Implemented Interfaces:
BytesRefIterator
- Direct Known Subclasses:
FilterAtomicReader.FilterTermsEnum
,FilteredTermsEnum
,FuzzyTermsEnum
,MultiTermsEnum
public abstract class TermsEnum extends Object implements BytesRefIterator
Iterator to seek (seekCeil(BytesRef)
,seekExact(BytesRef)
) or step through (BytesRefIterator.next()
terms to obtain frequency information (docFreq()
),DocsEnum
orDocsAndPositionsEnum
for the current term (docs(org.apache.lucene.util.Bits, org.apache.lucene.index.DocsEnum)
.Term enumerations are always ordered by
BytesRefIterator.getComparator()
. Each term in the enumeration is greater than the one before it.The TermsEnum is unpositioned when you first obtain it and you must first successfully call
BytesRefIterator.next()
or one of theseek
methods.
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static class
TermsEnum.SeekStatus
Represents returned result fromseekCeil(org.apache.lucene.util.BytesRef)
.
-
Constructor Summary
Constructors Modifier Constructor Description protected
TermsEnum()
Sole constructor.
-
Method Summary
All Methods Instance Methods Abstract Methods Concrete Methods Modifier and Type Method Description AttributeSource
attributes()
Returns the related attributes.abstract int
docFreq()
Returns the number of documents containing the current term.DocsEnum
docs(Bits liveDocs, DocsEnum reuse)
GetDocsEnum
for the current term.abstract DocsEnum
docs(Bits liveDocs, DocsEnum reuse, int flags)
GetDocsEnum
for the current term, with control over whether freqs are required.DocsAndPositionsEnum
docsAndPositions(Bits liveDocs, DocsAndPositionsEnum reuse)
GetDocsAndPositionsEnum
for the current term.abstract DocsAndPositionsEnum
docsAndPositions(Bits liveDocs, DocsAndPositionsEnum reuse, int flags)
GetDocsAndPositionsEnum
for the current term, with control over whether offsets and payloads are required.abstract long
ord()
Returns ordinal position for current term.abstract TermsEnum.SeekStatus
seekCeil(BytesRef text)
Seeks to the specified term, if it exists, or to the next (ceiling) term.abstract void
seekExact(long ord)
Seeks to the specified term by ordinal (position) as previously returned byord()
.boolean
seekExact(BytesRef text)
Attempts to seek to the exact term, returning true if the term is found.void
seekExact(BytesRef term, TermState state)
Expert: Seeks a specific position byTermState
previously obtained fromtermState()
.abstract BytesRef
term()
Returns current term.TermState
termState()
Expert: Returns the TermsEnums internal state to position the TermsEnum without re-seeking the term dictionary.abstract long
totalTermFreq()
Returns the total number of occurrences of this term across all documents (the sum of the freq() for each doc that has this term).-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.apache.lucene.util.BytesRefIterator
getComparator, next
-
-
-
-
Field Detail
-
EMPTY
public static final TermsEnum EMPTY
An empty TermsEnum for quickly returning an empty instance e.g. inMultiTermQuery
Please note: This enum should be unmodifiable, but it is currently possible to add Attributes to it. This should not be a problem, as the enum is always empty and the existence of unused Attributes does not matter.
-
-
Method Detail
-
attributes
public AttributeSource attributes()
Returns the related attributes.
-
seekExact
public boolean seekExact(BytesRef text) throws IOException
Attempts to seek to the exact term, returning true if the term is found. If this returns false, the enum is unpositioned. For some codecs, seekExact may be substantially faster thanseekCeil(org.apache.lucene.util.BytesRef)
.- Throws:
IOException
-
seekCeil
public abstract TermsEnum.SeekStatus seekCeil(BytesRef text) throws IOException
Seeks to the specified term, if it exists, or to the next (ceiling) term. Returns SeekStatus to indicate whether exact term was found, a different term was found, or EOF was hit. The target term may be before or after the current term. If this returns SeekStatus.END, the enum is unpositioned.- Throws:
IOException
-
seekExact
public abstract void seekExact(long ord) throws IOException
Seeks to the specified term by ordinal (position) as previously returned byord()
. The target ord may be before or after the current ord, and must be within bounds.- Throws:
IOException
-
seekExact
public void seekExact(BytesRef term, TermState state) throws IOException
Expert: Seeks a specific position byTermState
previously obtained fromtermState()
. Callers should maintain theTermState
to use this method. Low-level implementations may position the TermsEnum without re-seeking the term dictionary.Seeking by
TermState
should only be used iff the state was obtained from the sameTermsEnum
instance.NOTE: Using this method with an incompatible
TermState
might leave thisTermsEnum
in undefined state. On a segment levelTermState
instances are compatible only iff the source and the targetTermsEnum
operate on the same field. If operating on segment level, TermState instances must not be used across segments.NOTE: A seek by
TermState
might not restore theAttributeSource
's state.AttributeSource
states must be maintained separately if this method is used.- Parameters:
term
- the term the TermState corresponds tostate
- theTermState
- Throws:
IOException
-
term
public abstract BytesRef term() throws IOException
Returns current term. Do not call this when the enum is unpositioned.- Throws:
IOException
-
ord
public abstract long ord() throws IOException
Returns ordinal position for current term. This is an optional method (the codec may throwUnsupportedOperationException
). Do not call this when the enum is unpositioned.- Throws:
IOException
-
docFreq
public abstract int docFreq() throws IOException
Returns the number of documents containing the current term. Do not call this when the enum is unpositioned.TermsEnum.SeekStatus.END
.- Throws:
IOException
-
totalTermFreq
public abstract long totalTermFreq() throws IOException
Returns the total number of occurrences of this term across all documents (the sum of the freq() for each doc that has this term). This will be -1 if the codec doesn't support this measure. Note that, like other term measures, this measure does not take deleted documents into account.- Throws:
IOException
-
docs
public final DocsEnum docs(Bits liveDocs, DocsEnum reuse) throws IOException
GetDocsEnum
for the current term. Do not call this when the enum is unpositioned. This method will not return null.- Parameters:
liveDocs
- unset bits are documents that should not be returnedreuse
- pass a prior DocsEnum for possible reuse- Throws:
IOException
-
docs
public abstract DocsEnum docs(Bits liveDocs, DocsEnum reuse, int flags) throws IOException
GetDocsEnum
for the current term, with control over whether freqs are required. Do not call this when the enum is unpositioned. This method will not return null.- Parameters:
liveDocs
- unset bits are documents that should not be returnedreuse
- pass a prior DocsEnum for possible reuseflags
- specifies which optional per-document values you require; seeDocsEnum.FLAG_FREQS
- Throws:
IOException
- See Also:
docs(Bits, DocsEnum, int)
-
docsAndPositions
public final DocsAndPositionsEnum docsAndPositions(Bits liveDocs, DocsAndPositionsEnum reuse) throws IOException
GetDocsAndPositionsEnum
for the current term. Do not call this when the enum is unpositioned. This method will return null if positions were not indexed.- Parameters:
liveDocs
- unset bits are documents that should not be returnedreuse
- pass a prior DocsAndPositionsEnum for possible reuse- Throws:
IOException
- See Also:
docsAndPositions(Bits, DocsAndPositionsEnum, int)
-
docsAndPositions
public abstract DocsAndPositionsEnum docsAndPositions(Bits liveDocs, DocsAndPositionsEnum reuse, int flags) throws IOException
GetDocsAndPositionsEnum
for the current term, with control over whether offsets and payloads are required. Some codecs may be able to optimize their implementation when offsets and/or payloads are not required. Do not call this when the enum is unpositioned. This will return null if positions were not indexed.- Parameters:
liveDocs
- unset bits are documents that should not be returnedreuse
- pass a prior DocsAndPositionsEnum for possible reuseflags
- specifies which optional per-position values you require; seeDocsAndPositionsEnum.FLAG_OFFSETS
andDocsAndPositionsEnum.FLAG_PAYLOADS
.- Throws:
IOException
-
termState
public TermState termState() throws IOException
Expert: Returns the TermsEnums internal state to position the TermsEnum without re-seeking the term dictionary.NOTE: A seek by
TermState
might not capture theAttributeSource
's state. Callers must maintain theAttributeSource
states separately- Throws:
IOException
- See Also:
TermState
,seekExact(BytesRef, TermState)
-
-