\Zend_Search_Lucene_Analysis_TokenFilter_StopWords
Token filter that removes stop words. These words must be provided as array (set), example: $stopwords = array('the' => 1, 'an' => '1');
We do recommend to provide all words in lowercase and concatenate this class after the lowercase filter.
- Copyright: Copyright (c) 2005-2010 Zend Technologies USA Inc. (http://www.zend.com)
- License: http://framework.zend.com/license/new-bsd New BSD License
Synopsis
class Zend_Search_Lucene_Analysis_TokenFilter_StopWords
extends Zend_Search_Lucene_Analysis_TokenFilter
{
- // members
- private $_stopSet;
- // methods
- public void __construct()
- public Zend_Search_Lucene_Analysis_Token normalize()
- public void loadFromFile()
- // Inherited methods from Zend_Search_Lucene_Analysis_TokenFilter
- public abstract Zend_Search_Lucene_Analysis_Token normalize()
Hierarchy
Members
private
- $_stopSet
—
array
Stop Words
Methods
public
- __construct() — Constructs new instance of this filter.
- loadFromFile() — Fills stopwords set from a text file. Each line contains one stopword, lines with '#' in the first column are ignored (as comments).
- normalize() — Normalize Token or remove it (if null is returned)
Inherited from Zend_Search_Lucene_Analysis_TokenFilter
public
- normalize() — Normalize Token or remove it (if null is returned)