\ZendSearchLuceneTextExtractor
Abstract base class instructing the Lucene module on how to extract text from a given file type.
- Author: Darren Inwood <darren.inwood@chrometoaster.com>
Synopsis
class ZendSearchLuceneTextExtractor
{
- // members
- public static array $extensions = ;
- public static integer $priority = 100;
- // methods
- public abstract static String extract()
Members
public
-
$extensions
An array of strings representing file extensions that can be handled by this TextExtractor. Do not include a dot in your extensions. Extensions should be in lower case, and will detect all case variations on scanned files. -
$priority
Controls the order in which text extractor classes are tried for a specific file extension. Default is 100. To make your custom extractor run before an inbuilt one, set this to less than 100, or to make it run afterwards set it to more than 100.
Methods
public
- extract() — Returns text for a given full filesystem path. If a file cannot be processed, you should return an empty string.