\Zend_Search_Lucene_Document_Html
HTML document.
- Copyright: Copyright (c) 2005-2010 Zend Technologies USA Inc. (http://www.zend.com)
- License: http://framework.zend.com/license/new-bsd New BSD License
Synopsis
class Zend_Search_Lucene_Document_Html
extends Zend_Search_Lucene_Document
{
- // members
- private array $_links;
- private array $_headerLinks;
- private DOMDocument $_doc;
- private static boolean $_excludeNoFollowLinks = false;
- private array $_inlineTags;
- // methods
- private void __construct()
- public static void setExcludeNoFollowLinks()
- public static boolean getExcludeNoFollowLinks()
- private void _retrieveNodeText()
- public array getLinks()
- public array getHeaderLinks()
- public static Zend_Search_Lucene_Document_Html loadHTML()
- public static Zend_Search_Lucene_Document_Html loadHTMLFile()
- protected void _highlightTextNode()
- protected void _highlightNodeRecursive()
- public string applyColour()
- public string highlight()
- public string highlightExtended()
- public string getHTML()
- public string getHtmlBody()
- // Inherited methods from Zend_Search_Lucene_Document
- public string __get()
- public Zend_Search_Lucene_Document addField()
- public array getFieldNames()
- public Zend_Search_Lucene_Field getField()
- public string getFieldValue()
- public string getFieldUtf8Value()
Hierarchy
Extends
- Zend_Search_Lucene_Document
Tasks
Line | Task |
---|---|
133 | Add correction of wrong HTML encoding recognition processing |
Members
private
- $_doc (DOMDocument): Stored DOM representation
- $_excludeNoFollowLinks (boolean): Exclude nofollow links flag
- $_headerLinks (array): List of document header links
- $_inlineTags (array): List of inline tags
- $_links (array): List of document links
protected
- $_fields (array): Associative array of Zend_Search_Lucene_Field objects, keyed by field name.
public
- $boost (float): Field boost factor. It is not stored directly in the index, but it affects the normalization factor.
Methods
private
- __construct() — Object constructor
- _retrieveNodeText() — Get node text
protected
- _highlightNodeRecursive() — Highlight words in the content of the specified node
- _highlightTextNode() — Highlight text in a text node
public
- applyColour() — Standard callback method used to highlight words.
- getExcludeNoFollowLinks() — Get exclude nofollow links flag
- getHTML() — Get HTML
- getHeaderLinks() — Get document header links
- getHtmlBody() — Get HTML body
- getLinks() — Get document HREF links
- highlight() — Highlight text with the specified color
- highlightExtended() — Highlight text using the specified view helper or callback function.
- loadHTML() — Load HTML document from a string
- loadHTMLFile() — Load HTML document from a file
- setExcludeNoFollowLinks() — Set exclude nofollow links flag
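The public methods above can be combined roughly as follows. This is a minimal sketch, not an authoritative example: it assumes Zend Framework 1.x is on the include path, the sample markup and URLs are hypothetical, and the argument lists (`loadHTML($data, $storeContent)`, `highlight($words, $colour)`) are taken from the ZF 1.x source rather than from this page, which does not list parameters.

```php
<?php
// Assumption: Zend Framework 1.x is available on the include path.
require_once 'Zend/Search/Lucene/Document/Html.php';

// Hypothetical sample page, used only for illustration.
$html = <<<HTML
<html>
  <head>
    <title>Sample page</title>
    <link rel="stylesheet" href="/css/site.css">
  </head>
  <body>
    <p>Lucene indexes text. See the <a href="/docs">docs</a>
       or this <a href="/ads" rel="nofollow">sponsored link</a>.</p>
  </body>
</html>
HTML;

// The flag is static. Set it before loading, because in ZF 1.x the link
// lists appear to be collected while the document is being parsed.
Zend_Search_Lucene_Document_Html::setExcludeNoFollowLinks(true);

// Second argument (assumed name: $storeContent) asks for the body text
// to be stored in the 'body' field instead of only being indexed.
$doc = Zend_Search_Lucene_Document_Html::loadHTML($html, true);

$links       = $doc->getLinks();        // HREF links found in the document
$headerLinks = $doc->getHeaderLinks();  // <link> elements from the head

// Highlight a word in the stored DOM and get the resulting markup back.
$highlighted = $doc->highlight('lucene', '#ffff00');

// Only the content of the body element, after highlighting.
$bodyOnly = $doc->getHtmlBody();
```

Because the constructor is private, documents are created through the static `loadHTML()` and `loadHTMLFile()` factories. `highlightExtended()` follows the same pattern as `highlight()` but takes a callback instead of a colour.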
Inherited from Zend_Search_Lucene_Document
public
- __get() — Proxy method for getFieldValue(), provides more convenient access to the string value of a field.
- addField() — Add a field object to this document.
- getField() — Returns Zend_Search_Lucene_Field object for a named field in this document.
- getFieldNames() — Return an array with the names of the fields in this document.
- getFieldUtf8Value() — Returns the string value of a named field in UTF-8 encoding.
- getFieldValue() — Returns the string value of a named field in this document.
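The inherited field accessors work on an HTML document in the same way as on any other Zend_Search_Lucene_Document. The sketch below is illustrative only: it assumes Zend Framework 1.x is on the include path, and the `url` field name and its value are hypothetical additions, not something the class creates by itself.

```php
<?php
// Assumption: Zend Framework 1.x is available on the include path.
require_once 'Zend/Search/Lucene/Document/Html.php';
require_once 'Zend/Search/Lucene/Field.php';

// Hypothetical one-line page, used only for illustration.
$doc = Zend_Search_Lucene_Document_Html::loadHTML(
    '<html><head><title>Sample page</title></head><body><p>Hello</p></body></html>',
    true
);

// Attach an extra field to the document ('url' is a hypothetical name).
$doc->addField(Zend_Search_Lucene_Field::Keyword('url', 'http://example.com/sample'));

$names   = $doc->getFieldNames();          // names of all fields in the document
$url     = $doc->getFieldValue('url');     // string value of the named field
$sameUrl = $doc->url;                      // __get() proxies to getFieldValue()
$utf8Url = $doc->getFieldUtf8Value('url'); // same value, in UTF-8 encoding
$field   = $doc->getField('url');          // the Zend_Search_Lucene_Field object
```

The property-style access through `__get()` is a convenience only. `getField()` is the call to use when the field object itself (for example its boost) is needed rather than its string value.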