java.lang.Object
  org.itc.irst.tcc.sre.util.PorterStemmer
public class PorterStemmer
Stemmer implementing the Porter Stemming Algorithm. The Stemmer class transforms a word into its root form. The input word can be provided one character at a time (by calling add()), or all at once by calling one of the various stem(something) methods.
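The two input styles described above can be sketched as follows. This is a minimal illustration, not the code of this class: MiniStemmer is a hypothetical stand-in that mirrors the documented call pattern (add, stem, toString, reset) and applies only the plural rule of Porter step 1a. The assumption that stem() returns true when the word changed is not confirmed by this page.

```java
// Hypothetical stand-in, NOT org.itc.irst.tcc.sre.util.PorterStemmer.
// It mimics the documented call pattern and applies only the plural rule
// of Porter step 1a: sses -> ss, ies -> i, ss -> ss, s -> (removed).
final class MiniStemmer {
    private final StringBuilder buf = new StringBuilder();

    /** Append one character of the word being stemmed. */
    void add(char ch) { buf.append(ch); }

    /** Clear the buffer so another word can be stemmed. */
    void reset() { buf.setLength(0); }

    /** Stem the buffered word. Assumption: returns true if the word changed. */
    boolean stem() {
        String w = buf.toString();
        String s = w;
        if (w.endsWith("sses")) {
            s = w.substring(0, w.length() - 2);
        } else if (w.endsWith("ies")) {
            s = w.substring(0, w.length() - 2);
        } else if (w.endsWith("ss")) {
            s = w; // "ss" is kept as-is
        } else if (w.endsWith("s")) {
            s = w.substring(0, w.length() - 1);
        }
        buf.setLength(0);
        buf.append(s);
        return !s.equals(w);
    }

    /** One-shot form: reset, load the word, stem it, return the result. */
    String stem(String word) {
        reset();
        for (int n = 0; n < word.length(); n++) {
            add(word.charAt(n));
        }
        stem();
        return toString();
    }

    @Override
    public String toString() { return buf.toString(); }

    public static void main(String[] args) {
        MiniStemmer st = new MiniStemmer();
        // Style 1: feed the word one character at a time, then stem.
        for (char c : "ponies".toCharArray()) {
            st.add(c);
        }
        st.stem();
        System.out.println(st);
        // Style 2: hand over the whole word at once.
        System.out.println(st.stem("caresses"));
    }
}
```

With the real class, the same two patterns would presumably use add(char) followed by stem() and toString(), or stem(java.lang.String) directly.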
Field Summary | |
---|---|
private char[] |
b
|
private boolean |
dirty
|
private static int |
EXTRA
|
private int |
i
|
private static int |
INC
|
private int |
j
|
private int |
k
|
private int |
k0
|
(package private) static org.apache.log4j.Logger |
logger
Defines a static logger variable that references the Logger instance named PorterStemmer. |
private static PorterStemmer |
stemmer
A prototype for a feature factory such that only one instance of the class can ever exist. |
Constructor Summary | |
---|---|
PorterStemmer()
|
Method Summary | |
---|---|
void |
add(char ch)
Add a character to the word being stemmed. |
void |
add(char[] ch)
Add an array of characters to the word being stemmed. |
private boolean |
cons(int i)
|
private boolean |
cvc(int i)
|
private boolean |
doublec(int j)
|
private boolean |
ends(java.lang.String s)
|
char[] |
getResultBuffer()
Returns a reference to a character buffer containing the results of the stemming process. |
int |
getResultLength()
Returns the length of the word resulting from the stemming process. |
static PorterStemmer |
getStemmer()
|
private int |
m()
|
static void |
main(java.lang.String[] args)
Test program for demonstrating the Stemmer. |
(package private) void |
r(java.lang.String s)
|
void |
reset()
reset() resets the stemmer so it can stem another word. |
(package private) void |
setto(java.lang.String s)
|
boolean |
stem()
Stem the word placed into the Stemmer buffer through calls to add(). |
boolean |
stem(char[] word)
Stem a word contained in a char[]. |
boolean |
stem(char[] word,
int wordLen)
Stem a word contained in a leading portion of a char[] array. |
boolean |
stem(char[] wordBuffer,
int offset,
int wordLen)
Stem a word contained in a portion of a char[] array. |
boolean |
stem(int i0)
|
java.lang.String |
stem(java.lang.String s)
Stem a word provided as a String. |
private void |
step1()
|
private void |
step2()
|
private void |
step3()
|
private void |
step4()
|
private void |
step5()
|
private void |
step6()
|
java.lang.String |
toString()
After a word has been stemmed, it can be retrieved by toString(), or a reference to the internal buffer can be retrieved by getResultBuffer() and getResultLength(), which is generally more efficient. |
private boolean |
vowelinstem()
|
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
static org.apache.log4j.Logger logger
Defines a static logger variable that references the Logger instance named PorterStemmer.
private static PorterStemmer stemmer
private char[] b
private int i
private int j
private int k
private int k0
private boolean dirty
private static final int INC
private static final int EXTRA
Constructor Detail |
---|
public PorterStemmer()
Method Detail |
---|
public void reset()
Specified by: reset in interface Stemmer
public void add(char ch)
Specified by: add in interface Stemmer
public void add(char[] ch)
Specified by: add in interface Stemmer
public java.lang.String toString()
Overrides: toString in class java.lang.Object
public int getResultLength()
Specified by: getResultLength in interface Stemmer
public char[] getResultBuffer()
Specified by: getResultBuffer in interface Stemmer
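The summary for toString() notes that reading the result through getResultBuffer() and getResultLength() is generally more efficient. A minimal sketch of why: the caller can inspect the characters in place instead of allocating a new String. ResultBufferSketch, setResult and resultEquals are hypothetical names for illustration, and the idea that the buffer may be longer than the result is an assumption (suggested by the INC and EXTRA fields, not stated on this page).

```java
// Hypothetical stand-in, NOT the real PorterStemmer. It only models a result
// held in a char[] that may be larger than the stemmed word.
final class ResultBufferSketch {
    // Fixed capacity is enough for this sketch; longer words would not fit.
    private final char[] b = new char[16];
    private int len;

    /** Illustration-only helper: pretend this word is the stemming result. */
    void setResult(String s) {
        s.getChars(0, s.length(), b, 0);
        len = s.length();
    }

    /** Reference to the internal buffer; only the first getResultLength() chars are valid. */
    char[] getResultBuffer() { return b; }

    int getResultLength() { return len; }

    /** Allocates a new String on every call. */
    @Override
    public String toString() { return new String(b, 0, len); }

    /** Compare the result with an expected word without allocating a String. */
    static boolean resultEquals(ResultBufferSketch s, String expected) {
        char[] buf = s.getResultBuffer();
        int n = s.getResultLength();
        if (n != expected.length()) {
            return false;
        }
        for (int i = 0; i < n; i++) {
            if (buf[i] != expected.charAt(i)) {
                return false;
            }
        }
        return true;
    }
}
```

Under this assumption, a caller should never use the buffer's own length as the word length; characters beyond getResultLength() are leftover capacity.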
private final boolean cons(int i)
private final int m()
private final boolean vowelinstem()
private final boolean doublec(int j)
private final boolean cvc(int i)
private final boolean ends(java.lang.String s)
void setto(java.lang.String s)
void r(java.lang.String s)
private final void step1()
private final void step2()
private final void step3()
private final void step4()
private final void step5()
private final void step6()
public java.lang.String stem(java.lang.String s)
Specified by: stem in interface Stemmer
public boolean stem(char[] word)
Specified by: stem in interface Stemmer
public boolean stem(char[] wordBuffer, int offset, int wordLen)
Specified by: stem in interface Stemmer
public boolean stem(char[] word, int wordLen)
Specified by: stem in interface Stemmer
public boolean stem()
Specified by: stem in interface Stemmer
public boolean stem(int i0)
Specified by: stem in interface Stemmer
public static void main(java.lang.String[] args)
public static PorterStemmer getStemmer()
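The static stemmer field and getStemmer() suggest a shared-instance accessor. The sketch below shows one common way to write such an accessor and what sharing implies for callers. SharedStemmerSketch is a hypothetical stand-in; whether the real getStemmer() creates its instance lazily or is synchronized is not stated on this page. The listing also shows a public constructor, so the single-instance property presumably holds only for callers that go through getStemmer().

```java
// Hypothetical stand-in, NOT the real PorterStemmer.
final class SharedStemmerSketch {
    private static SharedStemmerSketch instance;
    private final StringBuilder buf = new StringBuilder();

    // Private here to enforce a single instance; the documented class
    // exposes a public constructor instead.
    private SharedStemmerSketch() { }

    /** Assumption: lazy creation guarded by synchronization. */
    static synchronized SharedStemmerSketch getStemmer() {
        if (instance == null) {
            instance = new SharedStemmerSketch();
        }
        return instance;
    }

    void add(char ch) { buf.append(ch); }

    void reset() { buf.setLength(0); }

    @Override
    public String toString() { return buf.toString(); }

    public static void main(String[] args) {
        SharedStemmerSketch a = getStemmer();
        a.add('x');
        SharedStemmerSketch b = getStemmer();
        // Both references point at the same object, so buffered input is shared.
        System.out.println(a == b);
        System.out.println(b);
        // Resetting through one reference clears the state seen by the other.
        b.reset();
        System.out.println(a.toString().isEmpty());
    }
}
```

Because the buffer lives in the shared instance, callers of a shared stemmer would want to call reset() before adding a new word, and concurrent use from several threads would need external coordination unless the real class provides it.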