Classes

\

Name Description
Exception No description available

\SilverStripe\TextExtraction\Cache\FileTextCache

Name Description
Cache Uses SS_Cache with a lifetime to cache extracted content.
Database Caches the extracted content on the record for the file.

\SilverStripe\TextExtraction\Extension

Name Description
FileTextExtractable Decorate File or a File derivative to enable text extraction from the file content. Uses a set of subclasses of FileTextExtractor to do the extraction based on the content type of the file.

\SilverStripe\TextExtraction\Extractor

Name Description
FileTextExtractor A decorator for File or a subclass that provides a method for extracting the full text from the file's external contents.
HTMLTextExtractor Text extractor that uses the PHP function strip_tags to get just the text. OK for indexing, not the best for readable text.
PDFTextExtractor Text extractor that calls pdftotext to do the conversion.
SolrCellTextExtractor Text extractor that calls an Apache Solr instance and extracts content via the "ExtractingRequestHandler" endpoint.
TikaServerTextExtractor Enables text extraction of file content via the Tika REST server.
TikaTextExtractor Enables text extraction of file content via the Tika CLI.

\SilverStripe\TextExtraction\Extractor\FileTextExtractor

Name Description
Exception No description available

\SilverStripe\TextExtraction\Rest

Name Description
TikaRestClient No description available

\SilverStripe\TextExtraction\Tests

Name Description
FileTextCacheDatabaseTest No description available
FileTextExtractableTest No description available
HTMLTextExtractorTest No description available
PDFTextExtractorTest No description available
TikaServerTextExtractorTest No description available
TikaTextExtractorTest Tests the TikaTextExtractor class