Uses of Interface

Packages that use TextExtractor

Uses of TextExtractor in org.apache.jackrabbit.core.query.lucene

Classes in org.apache.jackrabbit.core.query.lucene that implement TextExtractor
 class JackrabbitTextExtractor
          Backwards-compatible Jackrabbit text extractor component.
 class PooledTextExtractor
          PooledTextExtractor implements a text extractor that extracts the text using a pool of background threads.
 class TextFilterExtractor
          Adapter class for achieving backwards compatibility with classes implementing the deprecated TextFilter interface.

Fields in org.apache.jackrabbit.core.query.lucene declared as TextExtractor
protected  TextExtractor NodeIndexer.extractor
          Content extractor.

Methods in org.apache.jackrabbit.core.query.lucene that return TextExtractor
protected  TextExtractor SearchIndex.createTextExtractor()
          Factory method to create the TextExtractor instance.
 TextExtractor SearchIndex.getTextExtractor()
          Returns the text extractor in use for indexing.

Constructors in org.apache.jackrabbit.core.query.lucene with parameters of type TextExtractor
NodeIndexer(NodeState node, ItemStateManager stateProvider, NamespaceMappings mappings, TextExtractor extractor)
          Creates a new node indexer.
PooledTextExtractor(TextExtractor extractor, int poolSize, int backLog, long timeout)
          Creates a pooled text extractor based on the given extractor.
TextExtractorFilter(TextExtractor extractor)
          Creates a text filter adapter for the given text extractor.
TextExtractorJob(TextExtractor extractor, InputStream stream, String type, String encoding)
          Creates a new TextExtractorJob with the given extractor on the stream.

Uses of TextExtractor in org.apache.jackrabbit.extractor

Subinterfaces of TextExtractor in org.apache.jackrabbit.extractor
 interface DelegatingTextExtractor
          Interface for text extractors that need to delegate the extraction of parts of content documents to another text extractor.

Classes in org.apache.jackrabbit.extractor that implement TextExtractor
 class AbstractTextExtractor
          Base class for text extractor implementations.
 class CompositeTextExtractor
          Composite text extractor.
 class DefaultTextExtractor
          Composite text extractor that by default contains the standard text extractors found in this package.
 class EmptyTextExtractor
          Dummy text extractor that always returns an empty reader for all documents.
 class HTMLTextExtractor
          Text extractor for HyperText Markup Language (HTML).
 class MsExcelTextExtractor
          Text extractor for Microsoft Excel sheets.
 class MsOutlookTextExtractor
          Text extractor for Microsoft Outlook messages.
 class MsPowerPointTextExtractor
          Text extractor for Microsoft PowerPoint presentations.
 class MsWordTextExtractor
          Text extractor for Microsoft Word documents.
 class OpenOfficeTextExtractor
          Text extractor for OpenOffice documents.
 class PdfTextExtractor
          Text extractor for Portable Document Format (PDF).
 class PlainTextExtractor
          Text extractor for plain text.
 class PngTextExtractor
          Text extractor for PNG/APNG/MNG images.
 class RTFTextExtractor
          Text extractor for Rich Text Format (RTF).
 class XMLTextExtractor
          Text extractor for XML documents.

Methods in org.apache.jackrabbit.extractor with parameters of type TextExtractor
 void CompositeTextExtractor.addTextExtractor(TextExtractor extractor)
          Adds a component text extractor.
 void DelegatingTextExtractor.setDelegateTextExtractor(TextExtractor extractor)
          Sets the text extractor to which this extractor should delegate any partial text extraction tasks.
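
The composite arrangement listed above (component extractors registered through addTextExtractor, with extraction delegated by content type) can be illustrated with a minimal, self-contained sketch. It does not use the Jackrabbit classes: the Extractor interface, PlainSketch, Composite, and the extract helper below are hypothetical stand-ins, and the two-method interface shape (getContentTypes and extractText) as well as the empty-reader fallback for unknown types are assumptions made for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

public class CompositeSketch {

    /** Hypothetical stand-in for a text extractor interface (assumed shape). */
    interface Extractor {
        String[] getContentTypes();
        Reader extractText(InputStream stream, String type, String encoding) throws IOException;
    }

    /** Plays the role of a plain-text extractor: decodes the stream as characters. */
    static class PlainSketch implements Extractor {
        public String[] getContentTypes() {
            return new String[] {"text/plain"};
        }
        public Reader extractText(InputStream stream, String type, String encoding) throws IOException {
            String charset = (encoding != null) ? encoding : "UTF-8";
            return new InputStreamReader(stream, charset);
        }
    }

    /** Composite that dispatches to a registered component by content type. */
    static class Composite implements Extractor {
        private final Map<String, Extractor> byType = new HashMap<>();

        /** Registers the component for every content type it reports. */
        void addTextExtractor(Extractor extractor) {
            for (String type : extractor.getContentTypes()) {
                byType.put(type, extractor);
            }
        }
        public String[] getContentTypes() {
            return byType.keySet().toArray(new String[0]);
        }
        public Reader extractText(InputStream stream, String type, String encoding) throws IOException {
            Extractor component = byType.get(type);
            if (component == null) {
                // Assumption: unsupported types yield an empty reader rather than an error.
                stream.close();
                return new StringReader("");
            }
            return component.extractText(stream, type, encoding);
        }
    }

    /** Helper for the demo: runs an extractor over a string and collects the result. */
    static String extract(Extractor extractor, String content, String type) {
        InputStream in = new ByteArrayInputStream(content.getBytes(StandardCharsets.UTF_8));
        try (Reader reader = extractor.extractText(in, type, "UTF-8")) {
            StringBuilder out = new StringBuilder();
            char[] buf = new char[1024];
            for (int n = reader.read(buf); n != -1; n = reader.read(buf)) {
                out.append(buf, 0, n);
            }
            return out.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Composite composite = new Composite();
        composite.addTextExtractor(new PlainSketch());
        System.out.println(extract(composite, "hello", "text/plain"));
        System.out.println(extract(composite, "%PDF", "application/pdf").isEmpty());
    }
}
```

Keying the components by content type keeps the lookup to a single map access. If two components report the same type, the one registered last wins in this sketch.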

Copyright © 2004-2008 The Apache Software Foundation. All Rights Reserved.