Package | Description |
---|---|
org.apache.lucene.analysis.core | Basic, general-purpose analysis components. |
org.apache.lucene.analysis.util | Utility functions for text analysis. |
Modifier and Type | Class and Description |
---|---|
class | LetterTokenizer: A LetterTokenizer is a tokenizer that divides text at non-letters. |
class | LowerCaseTokenizer: Deprecated. Use LetterTokenizer followed by LowerCaseFilter. |
class | UnicodeWhitespaceTokenizer: A UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace. |
class | WhitespaceTokenizer: A tokenizer that divides text at whitespace characters as defined by Character.isWhitespace(int). |
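
The deprecation note on LowerCaseTokenizer recommends splitting at non-letters first and lowercasing in a later step. The following plain-JDK sketch illustrates that two-stage idea only; it does not call Lucene, and the class `LetterSplitSketch` and its methods are hypothetical names introduced for illustration. It assumes "letter" means `Character.isLetter(int)` and uses `String.toLowerCase(Locale.ROOT)` as a stand-in for a lowercasing filter, which may differ in detail from what LowerCaseFilter does.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;

// Hypothetical illustration class; it neither uses nor reproduces Lucene code.
class LetterSplitSketch {

    // Stage 1: divide text at non-letters, keeping maximal runs of letters.
    static List<String> splitAtNonLetters(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i); // read whole code points, not chars
            if (Character.isLetter(cp)) {
                current.appendCodePoint(cp);
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
            i += Character.charCount(cp);
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Stage 2: lowercase every token in a separate pass (assumed stand-in
    // for applying a lowercasing filter after tokenization).
    static List<String> lowerCase(List<String> tokens) {
        return tokens.stream()
                .map(t -> t.toLowerCase(Locale.ROOT))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> tokens = splitAtNonLetters("Hello, World-2019 Lucene!");
        System.out.println(tokens);            // [Hello, World, Lucene]
        System.out.println(lowerCase(tokens)); // [hello, world, lucene]
    }
}
```

Keeping the two stages separate means the splitting rule and the normalization rule can be changed independently, which appears to be the motivation behind the deprecation.
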
Modifier and Type | Method and Description |
---|---|
static CharTokenizer | CharTokenizer.fromSeparatorCharPredicate(AttributeFactory factory, java.util.function.IntPredicate separatorCharPredicate): Creates a new instance of CharTokenizer with the supplied attribute factory using a custom predicate, supplied as method reference or lambda expression. |
static CharTokenizer | CharTokenizer.fromSeparatorCharPredicate(AttributeFactory factory, java.util.function.IntPredicate separatorCharPredicate, java.util.function.IntUnaryOperator normalizer): Deprecated. Normalization should be done in a subsequent TokenFilter. |
static CharTokenizer | CharTokenizer.fromSeparatorCharPredicate(java.util.function.IntPredicate separatorCharPredicate): Creates a new instance of CharTokenizer using a custom predicate, supplied as method reference or lambda expression. |
static CharTokenizer | CharTokenizer.fromSeparatorCharPredicate(java.util.function.IntPredicate separatorCharPredicate, java.util.function.IntUnaryOperator normalizer): Deprecated. Normalization should be done in a subsequent TokenFilter. |
static CharTokenizer | CharTokenizer.fromTokenCharPredicate(AttributeFactory factory, java.util.function.IntPredicate tokenCharPredicate): Creates a new instance of CharTokenizer with the supplied attribute factory using a custom predicate, supplied as method reference or lambda expression. |
static CharTokenizer | CharTokenizer.fromTokenCharPredicate(AttributeFactory factory, java.util.function.IntPredicate tokenCharPredicate, java.util.function.IntUnaryOperator normalizer): Deprecated. Normalization should be done in a subsequent TokenFilter. |
static CharTokenizer | CharTokenizer.fromTokenCharPredicate(java.util.function.IntPredicate tokenCharPredicate): Creates a new instance of CharTokenizer using a custom predicate, supplied as method reference or lambda expression. |
static CharTokenizer | CharTokenizer.fromTokenCharPredicate(java.util.function.IntPredicate tokenCharPredicate, java.util.function.IntUnaryOperator normalizer): Deprecated. Normalization should be done in a subsequent TokenFilter. |
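
The factory methods above come in two flavours: one takes a predicate that accepts token characters, the other a predicate that accepts separator characters. A minimal plain-JDK sketch of how the two views relate is given below. It is an illustration under stated assumptions, not Lucene's implementation: `PredicateSplitSketch`, `byTokenChar`, and `bySeparatorChar` are hypothetical names, and the sketch assumes that a separator predicate is simply the logical complement of a token-character predicate.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;

// Hypothetical illustration class; it neither uses nor reproduces Lucene code.
class PredicateSplitSketch {

    // Keep maximal runs of code points accepted by tokenChar.
    static List<String> byTokenChar(String text, IntPredicate tokenChar) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (tokenChar.test(cp)) {
                current.appendCodePoint(cp);
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
            i += Character.charCount(cp);
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Assumption: a separator predicate is the negation of a token predicate.
    static List<String> bySeparatorChar(String text, IntPredicate separatorChar) {
        return byTokenChar(text, separatorChar.negate());
    }

    public static void main(String[] args) {
        String text = "alpha,beta;gamma";
        // Predicate supplied as a method reference.
        System.out.println(byTokenChar(text, Character::isLetter));
        // Predicate supplied as a lambda expression.
        System.out.println(bySeparatorChar(text, cp -> cp == ',' || cp == ';'));
        // Both calls print [alpha, beta, gamma]
    }
}
```

Which form to prefer is mostly a matter of which set is easier to describe: a short list of delimiters suits the separator view, while a character class such as letters suits the token view.
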
Copyright © 2000–2019 The Apache Software Foundation. All rights reserved.