Class UnicodeEncoding

    • Field Detail

      • PROPERTY_NAME_MAX_SIZE

        private static final int PROPERTY_NAME_MAX_SIZE
        See Also:
        Constant Field Values
      • UNICODE_ISO_8859_1_CTypeTable

        static final short[] UNICODE_ISO_8859_1_CTypeTable
      • CaseFold_From

        static final int[] CaseFold_From
      • CaseFold_To

        static final int[][] CaseFold_To
      • CaseFold_Locale_From

        static final int[] CaseFold_Locale_From
      • CaseFold_Locale_To

        static final int[][] CaseFold_Locale_To
      • CaseUnfold_11_From

        static final int[] CaseUnfold_11_From
      • CaseUnfold_11_To

        static final int[][] CaseUnfold_11_To
      • CaseUnfold_11_Locale_From

        static final int[] CaseUnfold_11_Locale_From
      • CaseUnfold_11_Locale_To

        static final int[][] CaseUnfold_11_Locale_To
      • CaseUnfold_12

        static final int[][] CaseUnfold_12
      • CaseUnfold_12_Locale

        static final int[][] CaseUnfold_12_Locale
      • CaseUnfold_13

        static final int[][] CaseUnfold_13
      • FoldHash

        static final IntHash<int[]> FoldHash
      • Unfold1Hash

        static final IntHash<int[]> Unfold1Hash
    • Constructor Detail

      • UnicodeEncoding

        protected UnicodeEncoding​(java.lang.String name,
                                  int minLength,
                                  int maxLength,
                                  int[] EncLen)
      • UnicodeEncoding

        protected UnicodeEncoding​(java.lang.String name,
                                  int minLength,
                                  int maxLength,
                                  int[] EncLen,
                                  int[][] Trans)
    • Method Detail

      • isCodeCType

        public boolean isCodeCType​(int code,
                                   int ctype)
        Description copied from class: Encoding
        Perform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)
        Specified by:
        isCodeCType in class Encoding
        Parameters:
        code - a code point of a character
        ctype - a character type to check against Oniguruma equivalent: is_code_ctype
      • ctypeCodeRange

        protected final int[] ctypeCodeRange​(int ctype)
      • propertyNameToCType

        public int propertyNameToCType​(byte[] name,
                                       int p,
                                       int end)
        Description copied from class: AbstractEncoding
        onigenc_minimum_property_name_to_ctype notably overridden by unicode encodings
        Overrides:
        propertyNameToCType in class AbstractEncoding
      • mbcCaseFold

        public int mbcCaseFold​(int flag,
                               byte[] bytes,
                               IntHolder pp,
                               int end,
                               byte[] fold)
        Description copied from class: AbstractEncoding
        onigenc_ascii_mbc_case_fold
        Overrides:
        mbcCaseFold in class AbstractEncoding
        Parameters:
        flag - case fold flag
        pp - an IntHolder that points at character head
        fold - a buffer where to extract case folded character Oniguruma equivalent: mbc_case_fold
      • applyAllCaseFold

        public void applyAllCaseFold​(int flag,
                                     ApplyAllCaseFoldFunction fun,
                                     java.lang.Object arg)
        Description copied from class: AbstractEncoding
        onigenc_ascii_apply_all_case_fold / used also by multibyte encodings
        Overrides:
        applyAllCaseFold in class AbstractEncoding
        Parameters:
        flag - case fold flag
        fun - case folding functor (look at: ApplyCaseFold)
        arg - case folding functor argument (look at: ApplyCaseFoldArg) Oniguruma equivalent: apply_all_case_fold
      • initializeFoldHash

        private static IntHash<int[]> initializeFoldHash()
      • initializeUnfold1Hash

        private static IntHash<int[]> initializeUnfold1Hash()
      • initializeUnfold2Hash

        private static IntArrayHash<int[]> initializeUnfold2Hash()
      • initializeUnfold3Hash

        private static IntArrayHash<int[]> initializeUnfold3Hash()