C# (CSharp) Icu.HasNormalizationBoundaryBeforeの例

プログラミング言語: C# (CSharp)

クラス/型: Icu

メソッド/関数: HasNormalizationBoundaryBefore

hotexamples.comのコード掲載数: 1

C# (CSharp) Icu.HasNormalizationBoundaryBefore - 1件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたC# (CSharp)のIcu.HasNormalizationBoundaryBeforeの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

Normalize(24)

InitIcuDataDir(18)

Cleanup(14)

GetDisplayName(12)

SetDataDirectory(12)

ToLower(7)

GetCharType(5)

IsPunct(5)

IsSymbol(4)

IsSpace(4)

Split(4)

IsControl(3)

IsSeparator(2)

GetLCID(2)

IsNumeric(2)

IsNormalized(2)

GetSortKeyBound(2)

GetSortKey(2)

GetNumericType(2)

GetNextStringInResourceBundleIteration(2)

BeginResourceBundleIteration(2)

GetPrettyICUCharName(2)

GetDecompositionType(2)

GetISO3Country(2)

GetCountryCode(2)

GetAvailable(1)

GetDecompositionFromUtf32(1)

IsPrivateUse(1)

GetAvailableLocale(1)

IsValidCodepoint(1)

CountAvailableLocales(1)

IsDiacritic(1)

CountAvailable(1)

OpenCollator(1)

OpenResourceBundle(1)

CloseResourceBundle(1)

CloseCollator(1)

IsIdeographic(1)

GetIcuNormalizer(1)

GetISO3Language(1)

GetDisplayCountry(1)

IsAlphabetic(1)

GetDisplayLanguage(1)

HasNormalizationBoundaryBefore(1)

GetVariantCode(1)

GetDisplayScript(1)

GetDisplayVariant(1)

GetScriptCode(1)

GetResourceBundleSubsection(1)

GetResourceBundleStringByKey(1)

コード例 #1

ファイルを表示

        /// <summary>
        /// Given an ICU normalizer, enumerate the limit indices of the "segments" of this string.
        /// A "segment" is defined as a group of characters that interact with each other in this
        /// normalization, and which therefore can't be split apart and normalized separately without
        /// changing the result of the normalization. For example, under NFC, if LATIN SMALL LETTER C (U+0063)
        /// is followed by COMBINING CEDILLA (U+0327) which is followed by LATIN SMALL LETTER D (U+0064),
        /// then the c and cedilla will form one "segment": splitting them apart and normalizing them
        /// separately would produce a different result than normalizing them together. So this function
        /// would yield (among other values) the index of LATIN SMALL LETTER D, the first index that is
        /// not part of the segment (that is, the limit index).
        ///
        /// The last index yielded by this function will be equal to the length of the string, and it
        /// will never yield the index 0. (If the string is empty, it will return an empty enumerable).
        /// Therefore, it is always safe to do GetChars(previousIndex, thisIndex) in a foreach loop to get
        /// the "current" segment (assuming previousIndex is set to 0 the first time through the loop).
        /// </summary>
        /// <param name="icuNormalizer">IntPtr to the ICU normalizer to use (get this from Icu.GetIcuNormalizer)</param>
        /// <returns>An enumerable of indexes into "this" TsString, at all the normalization "segment" boundaries, suitable for passing into GetChars(prevIdx, thisIdx)</returns>
        private IEnumerable <int> EnumerateSegmentLimits(IntPtr icuNormalizer)
        {
            if (String.IsNullOrEmpty(Text))
            {
                yield break;
            }
            int i = 0;

            while (i < Text.Length)
            {
                int codepoint = Char.ConvertToUtf32(Text, i);
                if (Icu.HasNormalizationBoundaryBefore(icuNormalizer, codepoint) && i > 0)
                {
                    yield return(i);
                }
                i += codepoint > 0xffff ? 2 : 1;
            }
            yield return(Text.Length);
        }