\Zend_Search_Lucene_Analysis_TokenFilter_StopWords

Token filter that removes stop words. These words must be provided as array (set), example: $stopwords = array('the' => 1, 'an' => '1');

We do recommend to provide all words in lowercase and concatenate this class after the lowercase filter.

Copyright: Copyright (c) 2005-2010 Zend Technologies USA Inc. (http://www.zend.com)
License: http://framework.zend.com/license/new-bsd New BSD License

Synopsis

class Zend_Search_Lucene_Analysis_TokenFilter_StopWords extends Zend_Search_Lucene_Analysis_TokenFilter {

// members
private $_stopSet;

// methods
public void __construct()
public Zend_Search_Lucene_Analysis_Token normalize()
public void loadFromFile()

// Inherited methods from Zend_Search_Lucene_Analysis_TokenFilter
public abstract Zend_Search_Lucene_Analysis_Token normalize()

}

Hierarchy

Extends

Zend_Search_Lucene_Analysis_TokenFilter

Members

private

$_stopSet — array
Stop Words

Methods

public

__construct() — Constructs new instance of this filter.
loadFromFile() — Fills stopwords set from a text file. Each line contains one stopword, lines with '#' in the first column are ignored (as comments).
normalize() — Normalize Token or remove it (if null is returned)

Inherited from Zend_Search_Lucene_Analysis_TokenFilter

public

normalize() — Normalize Token or remove it (if null is returned)