SilverStripe\TextExtraction\Extractor\SolrCellTextExtractor
Text extractor that calls an Apache Solr instance and extracts content via the "ExtractingRequestHandler" endpoint.
Does not alter the Solr index itself, but uses it purely
for its file parsing abilities.
- Author: ischommer
Synopsis
class SolrCellTextExtractor
extends FileTextExtractor
{
- // members
- private static $base_url;
- private static integer $priority = 75;
- protected Client $httpClient;
- // Inherited members from FileTextExtractor
- protected static $sorted_extractor_classes;
- // methods
- public Client getHttpClient()
- public $this setHttpClient()
- public string isAvailable()
- public bool supportsExtension()
- public bool supportsMime()
- public string getContent()
- // Inherited methods from FileTextExtractor
- protected static array get_extractor_classes()
- protected static FileTextExtractor get_extractor()
- public static FileTextExtractor|null for_file()
- protected static string getPathFromFile()
- public abstract boolean isAvailable()
- public abstract boolean supportsExtension()
- public abstract boolean supportsMime()
- public abstract string getContent()
Hierarchy
Members
private
- $base_url
—
string
Base URL to use for Solr text extraction. - $priority — int
protected
- $httpClient — GuzzleHttp\Client
- $sorted_extractor_classes
—
array
Cache of extractor class names, sorted by priority
Methods
public
Inherited from SilverStripe\TextExtraction\Extractor\FileTextExtractor
protected
- getPathFromFile() — Some text extractors (like pdftotext) may require a physical file to read from, so write the current file contents to a temp file and return its path
- get_extractor() — Get the text file extractor for the given class
- get_extractor_classes() — Gets the list of prioritised extractor classes
public
- for_file() — Given a File object, decide which extractor instance to use to handle it
- getContent() — Given a File instance, extract the contents as text.
- isAvailable() — Checks if the extractor is supported on the current environment, for example if the correct binaries or libraries are available.
- supportsExtension() — Determine if this extractor supports the given extension.
- supportsMime() — Determine if this extractor supports the given mime type.