C# (CSharp) TECFF.Toolkit WordsAndCounts.Add Examples

Programming language: C# (CSharp)

Namespace/package name: TECFF.Toolkit

Class/type: WordsAndCounts

Method/function: Add

Examples at hotexamples.com: 2

C# (CSharp) TECFF.Toolkit WordsAndCounts.Add - 2 examples found. These are top-rated real-world C# (CSharp) examples of TECFF.Toolkit.WordsAndCounts.Add extracted from open source projects.

Frequently used methods

Add(2)

AddAll(2)

RemoveWord(1)

Example #1

        private static void ExtractWordsAround(
            string text, Dictionary<string, bool> allWordForms, string langCode,
            WordsAndCounts wordsAndCounts, List<string> phrasesToKeep)
        {
            string textLowerCase      = text.ToLowerInvariant();
            string textHtmlDecoded    = HttpUtility.HtmlDecode(textLowerCase);
            string textWithoutPhrases = EncodePhrases(textHtmlDecoded, phrasesToKeep);

            string[] allWords;
            if ((langCode == LANG_CODE_BG) || (langCode == LANG_CODE_RU))
            {
                string nonCyrillicChars = "[^А-Яа-я" + BULLET_OPERATOR + "]+";
                allWords = Regex.Split(textWithoutPhrases, nonCyrillicChars);
            }
            else
            {
                string nonLatinChars = "[^A-Za-z" + BULLET_OPERATOR + "]+";
                allWords = Regex.Split(textWithoutPhrases, nonLatinChars);
            }
            DecodePhrases(allWords);

            List<string> validWords;

            if (removeShortAndStopWords)
            {
                // TODO: this does not work correctly for short words like "но"
                validWords = FilterWords(allWords, langCode, allWordForms);
            }
            else
            {
                validWords = new List<string>(allWords);
            }

            // Mark every word that lies within contextSize positions of a known word form.
            bool[] wordsToInclude = new bool[validWords.Count];
            for (int i = 0; i < validWords.Count; i++)
            {
                string currentWord = validWords[i];
                if (allWordForms.ContainsKey(currentWord))
                {
                    int start = Math.Max(0, i - contextSize);
                    int end   = Math.Min(i + contextSize, validWords.Count - 1);
                    for (int contextIndex = start; contextIndex <= end; contextIndex++)
                    {
                        wordsToInclude[contextIndex] = true;
                    }
                }
            }
            for (int i = 0; i < validWords.Count; i++)
            {
                if (wordsToInclude[i])
                {
                    string wordToInclude = validWords[i];
                    wordsAndCounts.Add(wordToInclude, 1);
                }
            }
        }
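
The marking step in Example #1 can be looked at in isolation. The following is a minimal sketch, not the project's code: `ContextWindowSketch` and `WordsAround` are hypothetical names, and a plain set of key words stands in for the `allWordForms` dictionary. It flags every position within `contextSize` of a key word and then collects the flagged words in their original order.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper; it mirrors the two loops at the end of Example #1.
public static class ContextWindowSketch
{
    // Returns the words lying within contextSize positions of any key word.
    public static List<string> WordsAround(
        IReadOnlyList<string> words, ISet<string> keyWords, int contextSize)
    {
        bool[] include = new bool[words.Count];
        for (int i = 0; i < words.Count; i++)
        {
            if (!keyWords.Contains(words[i]))
            {
                continue;
            }
            // Clamp the window to the bounds of the list.
            int start = Math.Max(0, i - contextSize);
            int end = Math.Min(i + contextSize, words.Count - 1);
            for (int j = start; j <= end; j++)
            {
                include[j] = true;
            }
        }

        // Collect flagged words once each, preserving their order.
        var result = new List<string>();
        for (int i = 0; i < words.Count; i++)
        {
            if (include[i])
            {
                result.Add(words[i]);
            }
        }
        return result;
    }
}
```

For the words `a b key c d e`, the key word `key`, and a `contextSize` of 1, the sketch returns `b`, `key`, `c`. Because overlapping windows only set the same flags again, a word that sits near two key words is still collected once.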

Example #2

File: LemmaDictionaryUtils.cs Project: nakov/cognates-and-false-friends-tools

        public static WordsAndCounts GetBasicForms(WordsAndCounts words, string langCode)
        {
            WordsAndCounts basicForms = new WordsAndCounts();

            WordAndCount[] wordsAndCounts = words.AsSortedArray;
            foreach (WordAndCount word in wordsAndCounts)
            {
                List<string> basicFormWords =
                    LemmaDictionaryUtils.GetBasicForms(word.Word, langCode);
                foreach (string basicWord in basicFormWords)
                {
                    //double count = (double)word.Count / basicFormWords.Count;
                    double count = (double)word.Count;
                    basicForms.Add(basicWord, count);
                }
            }
            return basicForms;
        }
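
Both examples call `Add(word, count)` repeatedly for the same word, which suggests that the method accumulates counts. The source of `WordsAndCounts` is not shown here, so the following is only a sketch under that assumption: `WordCountsSketch` and `GetCount` are hypothetical names, and the real class may store or sort its data differently.

```csharp
using System.Collections.Generic;

// Hypothetical stand-in for WordsAndCounts; the real class is not shown on this page.
public class WordCountsSketch
{
    private readonly Dictionary<string, double> counts =
        new Dictionary<string, double>();

    // Adds count to the running total for word, creating the entry if needed.
    // Assumption: repeated calls for the same word accumulate.
    public void Add(string word, double count)
    {
        counts.TryGetValue(word, out double current); // current is 0 when the word is absent
        counts[word] = current + count;
    }

    // Returns the accumulated count, or 0 for a word that was never added.
    public double GetCount(string word)
    {
        return counts.TryGetValue(word, out double value) ? value : 0;
    }
}
```

A `double` count fits both call sites: Example #1 passes the integer literal `1`, which converts implicitly, and Example #2 passes an explicit `double`.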