SilverStripe\SearchService\Service\PageCrawler
Fetches a page's main content for indexing, which allows more complex templates to be handled. Give the main content a low weight in the index: depending on your front end, the <main> element may contain other information that should not be indexed.
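The idea of selecting main content by XPath can be illustrated with a minimal sketch. It is not the implementation of PageCrawler, whose page-rendering step is not shown on this page. The function name `extractMainContent` and the sample markup are hypothetical; only the default selector `'//main'` comes from the documented class.

```php
<?php

declare(strict_types=1);

/**
 * Hypothetical helper, not part of PageCrawler: returns the text of the
 * first node matching $selector, or an empty string if nothing matches.
 */
function extractMainContent(string $html, string $selector = '//main'): string
{
    // DOMDocument::loadHTML() rejects an empty string, so bail out early.
    if ($html === '') {
        return '';
    }

    $dom = new DOMDocument();

    // Real-world markup is rarely valid; collect parser warnings instead of emitting them.
    $previous = libxml_use_internal_errors(true);
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($dom);
    $nodes = $xpath->query($selector);

    if ($nodes === false || $nodes->length === 0) {
        return '';
    }

    // Only the first match is used; whitespace is collapsed to keep the text compact.
    $text = $nodes->item(0)->textContent;

    return trim(preg_replace('/\s+/', ' ', $text) ?? '');
}

// Assumed sample page: the <nav> text sits outside <main> and is therefore skipped.
$html = '<html><body><nav>Menu</nav><main><h1>Title</h1> <p>Body text.</p></main></body></html>';
echo extractMainContent($html), PHP_EOL;
```

Anything the front end places inside the matched element, such as sidebars or share widgets, ends up in the extracted text as well, which is why a low weight for this field is advisable.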
Synopsis
class PageCrawler
{
    // members
    private $item;
    private static string $content_xpath_selector = '//main';
    // methods
    public string getMainContent()
}
Hierarchy
Uses
- SilverStripe\Core\Config\Configurable
Tasks
Line | Task |
---|---|
22+ | allow filtering |
Members
private
- $content_xpath_selector: string
  Defines the XPath selector for the first element of content that should be indexed.
- $item
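Because the class uses Configurable and $content_xpath_selector is a private static, the selector can presumably be overridden through SilverStripe's YAML configuration. The fragment below is a sketch under that assumption: the block name `app-search-crawler` and the value `'//article'` are illustrative only, and the file it lives in (for example a project `_config` YAML file) is up to the project.

```yaml
---
Name: app-search-crawler   # hypothetical config block name
---
SilverStripe\SearchService\Service\PageCrawler:
  # Illustrative value; the documented default is '//main'
  content_xpath_selector: '//article'
```

A narrower selector like this is one way to keep navigation or sidebar markup inside <main> out of the indexed content.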