java.lang.Object
    org.umber.core.text.filters.Normalizer
Text filter to perform whitespace (or custom) normalization.
Constructor Summary
Normalizer()
    Creates a new instance of whitespace Normalizer.
Normalizer(java.lang.String[] tokens, java.lang.String normalText, java.lang.String[][] exclusionTokens)
    Creates a new instance of Normalizer with custom normalization.

Method Summary
java.lang.String filterText(java.lang.String text)
    Sends input document through a modifying filter.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Constructor Detail
public Normalizer()

public Normalizer(java.lang.String[] tokens, java.lang.String normalText, java.lang.String[][] exclusionTokens)
The exclusionTokens parameter must be an array of two-element String[] arrays. Each pair of Strings should contain the start and end delimiter tokens of an area to skip during normalization.
If any of the constructor parameters is null, the default normalization is used for that parameter.
Parameters:
    tokens - tokens to collapse and replace with normalText
    normalText - text that replaces each span of normalized tokens
    exclusionTokens - pairs of delimiters marking regions of text to exclude from normalization

Method Detail
public java.lang.String filterText(java.lang.String text)
Specified by:
    filterText in interface ITextFilter
Parameters:
    text - input text document
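
The behaviour described above (collapse runs of tokens into normalText, copy excluded regions unchanged, fall back to defaults for null parameters) can be illustrated with a small self-contained sketch. This is not the library's implementation. NormalizerSketch is a hypothetical stand-in, and several details are assumptions the page does not confirm: tokens are matched as literal strings, the default tokens are space, tab, CR and LF, the default normalText is a single space, and an excluded region includes its delimiters.

```java
/** Hypothetical stand-in for the documented Normalizer; NOT the library's code. */
class NormalizerSketch {
    // Assumed defaults: the page only says the default normalization is whitespace-based.
    private static final String[] DEFAULT_TOKENS = {" ", "\t", "\r", "\n"};
    private static final String DEFAULT_NORMAL_TEXT = " ";
    private static final String[][] DEFAULT_EXCLUSIONS = {};

    private final String[] tokens;
    private final String normalText;
    private final String[][] exclusionTokens;

    NormalizerSketch() {
        this(null, null, null);
    }

    NormalizerSketch(String[] tokens, String normalText, String[][] exclusionTokens) {
        // A null parameter falls back to the default for that parameter only.
        this.tokens = tokens != null ? tokens.clone() : DEFAULT_TOKENS;
        this.normalText = normalText != null ? normalText : DEFAULT_NORMAL_TEXT;
        this.exclusionTokens = exclusionTokens != null ? exclusionTokens : DEFAULT_EXCLUSIONS;
        // Each entry must be a {start, end} pair with a non-empty start delimiter.
        for (String[] pair : this.exclusionTokens) {
            if (pair == null || pair.length != 2 || pair[0] == null
                    || pair[0].isEmpty() || pair[1] == null) {
                throw new IllegalArgumentException("each exclusion entry must be {start, end}");
            }
        }
    }

    String filterText(String text) {
        StringBuilder out = new StringBuilder();
        int i = 0;
        boolean inRun = false; // true while inside a run of consecutive tokens
        while (i < text.length()) {
            String[] pair = exclusionStartingAt(text, i);
            if (pair != null) {
                // Copy the excluded region verbatim, delimiters included.
                // An unterminated region runs to the end of the text.
                int end = text.indexOf(pair[1], i + pair[0].length());
                int stop = end < 0 ? text.length() : end + pair[1].length();
                out.append(text, i, stop);
                i = stop;
                inRun = false;
                continue;
            }
            int len = tokenLengthAt(text, i);
            if (len > 0) {
                // Emit normalText once per run, then skip the rest of the run.
                if (!inRun) {
                    out.append(normalText);
                }
                inRun = true;
                i += len;
                continue;
            }
            out.append(text.charAt(i++));
            inRun = false;
        }
        return out.toString();
    }

    // Returns the exclusion pair whose start delimiter begins at pos, or null.
    private String[] exclusionStartingAt(String text, int pos) {
        for (String[] pair : exclusionTokens) {
            if (text.startsWith(pair[0], pos)) {
                return pair;
            }
        }
        return null;
    }

    // Returns the length of the longest token beginning at pos, or 0 if none matches.
    private int tokenLengthAt(String text, int pos) {
        int best = 0;
        for (String t : tokens) {
            if (t != null && t.length() > best && text.startsWith(t, pos)) {
                best = t.length();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        System.out.println(new NormalizerSketch().filterText("a  \t b\n\nc"));
        String[][] keepPre = {{"<pre>", "</pre>"}};
        System.out.println(new NormalizerSketch(null, null, keepPre)
                .filterText("x   y <pre>  z  </pre>   w"));
    }
}
```

Going by the signatures documented above, the equivalent calls on the real class would presumably be new Normalizer(tokens, normalText, exclusionTokens) followed by filterText(text). Whether its matching rules and defaults agree with this sketch should be checked against the library itself.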