\TxtDocumentExtractor
Extracts text from a TXT text document.
- Author: Darren Inwood <darren.inwood@chrometoaster.com>
Synopsis
class TxtDocumentExtractor
extends ZendSearchLuceneTextExtractor
{
- // members
- public static array $extensions = ;
- // Inherited members from ZendSearchLuceneTextExtractor
- public static array $extensions;
- public static integer $priority;
- // methods
- public static String extract()
- // Inherited methods from ZendSearchLuceneTextExtractor
- public abstract static String extract()
Hierarchy
Extends
Members
public
-
$extensions
The extensions that can be handled by this text extractor. -
$extensions
An array of strings representing file extensions that can be handled by this TextExtractor. Do not include a dot in your extensions. Extensions should be in lower case, and will detect all case variations on scanned files. -
$priority
Controls the order in which text extractor classes are tried for a specific file extension. Default is 100. To make your custom extractor run before an inbuilt one, set this to less than 100, or to make it run afterwards set it to more than 100.
Methods
public
- extract() — Returns a string containing the text in the given TXT document.
Inherited from ZendSearchLuceneTextExtractor
public
- extract() — Returns text for a given full filesystem path. If a file cannot be processed, you should return an empty string.