\Zend_Search_Lucene_Analysis_Analyzer
An Analyzer is used to analyze text.
It thus represents a policy for extracting index terms from text.
Note:
Lucene Java implementation is oriented to streams. It provides effective work
with a huge documents (more then 20Mb).
But engine itself is not oriented such documents.
Thus Zend_Search_Lucene analysis API works with data strings and sets (arrays).
- Copyright: Copyright (c) 2005-2010 Zend Technologies USA Inc. (http://www.zend.com)
- License: http://framework.zend.com/license/new-bsd New BSD License
Synopsis
class Zend_Search_Lucene_Analysis_Analyzer
{
- // members
- private static Zend_Search_Lucene_Analysis_Analyzer $_defaultImpl;
- protected $_input = NULL;
- protected string $_encoding = '';
- // methods
- public array tokenize()
- public void setInput()
- public abstract void reset()
- public abstract Zend_Search_Lucene_Analysis_Token|null nextToken()
- public static void setDefault()
- public static Zend_Search_Lucene_Analysis_Analyzer getDefault()
Members
private
- $_defaultImpl
—
Zend_Search_Lucene_Analysis_Analyzer
The Analyzer implementation used by default.
protected
- $_encoding
—
string
Input string encoding - $_input
—
string
Input string
Methods
public
- getDefault() — Return the default Analyzer implementation used by indexing code.
- nextToken() — Tokenization stream API Get next token Returns null at the end of stream
- reset() — Reset token stream
- setDefault() — Set the default Analyzer implementation used by indexing code.
- setInput() — Tokenization stream API Set input
- tokenize() — Tokenize text to a terms Returns array of Zend_Search_Lucene_Analysis_Token objects