java.lang.Object
org.apache.lucene.util.automaton.UTF32ToUTF8
Converts UTF-32 automata to the equivalent UTF-8 representation.
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionprivate static class
private static class
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate static final int[]
private final UTF32ToUTF8.UTF8Sequence
(package private) static int[]
private static final int[]
private final UTF32ToUTF8.UTF8Sequence
private final UTF32ToUTF8.UTF8Sequence
private final UTF32ToUTF8.UTF8Sequence
(package private) Automaton.Builder
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprivate void
all
(int start, int end, int startCode, int endCode, int left) private void
build
(int start, int end, UTF32ToUTF8.UTF8Sequence startUTF8, UTF32ToUTF8.UTF8Sequence endUTF8, int upto) Converts an incoming utf32 automaton to an equivalent utf8 one.(package private) void
convertOneEdge
(int start, int end, int startCodePoint, int endCodePoint) private void
end
(int start, int end, UTF32ToUTF8.UTF8Sequence endUTF8, int upto, boolean doAll) private void
start
(int start, int end, UTF32ToUTF8.UTF8Sequence startUTF8, int upto, boolean doAll)
-
Field Details
-
startCodes
private static final int[] startCodes -
endCodes
private static final int[] endCodes -
MASKS
static int[] MASKS -
startUTF8
-
endUTF8
-
tmpUTF8a
-
tmpUTF8b
-
utf8
Automaton.Builder utf8
-
-
Constructor Details
-
UTF32ToUTF8
public UTF32ToUTF8()Sole constructor.
-
-
Method Details
-
convertOneEdge
void convertOneEdge(int start, int end, int startCodePoint, int endCodePoint) -
build
private void build(int start, int end, UTF32ToUTF8.UTF8Sequence startUTF8, UTF32ToUTF8.UTF8Sequence endUTF8, int upto) -
start
-
end
-
all
private void all(int start, int end, int startCode, int endCode, int left) -
convert
Converts an incoming utf32 automaton to an equivalent utf8 one. The incoming automaton need not be deterministic. Note that the returned automaton will not in general be deterministic, so you must determinize it if that's needed.
-