\Zend_Search_Lucene_Analysis_Analyzer

An Analyzer is used to analyze text.

It thus represents a policy for extracting index terms from text.

Note:
Lucene Java implementation is oriented to streams. It provides effective work
with a huge documents (more then 20Mb).
But engine itself is not oriented such documents.
Thus Zend_Search_Lucene analysis API works with data strings and sets (arrays).

Copyright: Copyright (c) 2005-2010 Zend Technologies USA Inc. (http://www.zend.com)
License: http://framework.zend.com/license/new-bsd New BSD License

Synopsis

class Zend_Search_Lucene_Analysis_Analyzer {

// members
private static Zend_Search_Lucene_Analysis_Analyzer $_defaultImpl;
protected $_input = NULL;
protected string $_encoding = '';

// methods
public array tokenize()
public void setInput()
public abstract void reset()
public abstract Zend_Search_Lucene_Analysis_Token|null nextToken()
public static void setDefault()
public static Zend_Search_Lucene_Analysis_Analyzer getDefault()

}

Members

private

$_defaultImpl — Zend_Search_Lucene_Analysis_Analyzer
The Analyzer implementation used by default.

protected

$_encoding — string
Input string encoding
$_input — string
Input string

Methods

public

getDefault() — Return the default Analyzer implementation used by indexing code.
nextToken() — Tokenization stream API Get next token Returns null at the end of stream
reset() — Reset token stream
setDefault() — Set the default Analyzer implementation used by indexing code.
setInput() — Tokenization stream API Set input
tokenize() — Tokenize text to a terms Returns array of Zend_Search_Lucene_Analysis_Token objects