C# (CSharp) CustomTFIDF MegaDictionary.AddToDictionary примеры использования

Язык программирования: C# (CSharp)

Пространство имен/Пакет: CustomTFIDF

Класс/Тип: MegaDictionary

Метод/Функция: AddToDictionary

Примеров на hotexamples.com: 1

C# (CSharp) CustomTFIDF MegaDictionary.AddToDictionary - 1 пример найден. Это лучшие примеры C# (CSharp) кода для CustomTFIDF.MegaDictionary.AddToDictionary, полученные из open source проектов. Вы можете ставить оценку каждому примеру, чтобы помочь нам улучшить качество примеров.

Основные методы

Показать Скрыть

ReturnKeysList(3)

ReturnTermFrequency(3)

AddToDictionary(1)

CleanseDictionary(1)

Пример #1

Показать файл

Файл: Parser.cs Проект: junsu10291/k-means-clustering

        public Document parseDocument(string line, string id)
        {
            termFreqDict = new Dictionary <string, int>();

            line = line.ToLower();
            line = line.TrimEnd(' ');
            line = Regex.Replace(line, @"\t|\n|\r", "");

            Regex rgx = new Regex("[^a-z0-9 ]"); // keep just alphanumeric characters

            line = rgx.Replace(line, " ");

            line = Regex.Replace(line, string.Format(@"(\p{{L}}{{{0}}})\p{{L}}+", 11), ""); // remove 12 >
            line = Regex.Replace(line, @"\b\w{1,3}\b", "");                                 // remove words that have three letters or fewer
            line = Regex.Replace(line, @"\s+", " ");                                        // remove extra whitespace

            var noSpaces = line.Split(new String[] { " " }, StringSplitOptions.RemoveEmptyEntries);

            HashSet <string> uniqueWords = new HashSet <string>();

            Stemmer stemmer = new Stemmer();

            foreach (var s in noSpaces)
            {
                // stem words
                string word = stemmer.stem(s);
                if (!StopWords.stopWordsSet.Contains(word) && !word.Any(c => char.IsDigit(c)))
                {
                    addToLocalDict(word);

                    if (!uniqueWords.Contains(word))
                    {
                        MegaDictionary.AddToDictionary(word);
                        uniqueWords.Add(word);
                    }
                }
            }

            return(new Document(termFreqDict, id));
        }