public override double ComputeWordProbability(string u, string v, string w)
{
    // Conditional probability of w given the preceding token v.
    //   q(w|v)       = c(v, w) / c(v)
    //   q(w|v)_addK  = (c(v, w) + k) / (c(v) + k|V*|)
    // The actual formula applied is delegated to the configured smoother.
    var previousUnigram = new Unigram { w = v };
    var currentBigram = new Bigram { v = v, w = w };

    // TryGetValue leaves the out-count at 0 for n-grams never seen in training,
    // which is exactly the value the smoothing formula expects.
    this.NGramCounts.TryGetValue(previousUnigram.GetComparisonKey(), out int previousCount);
    this.NGramCounts.TryGetValue(currentBigram.GetComparisonKey(), out int bigramCount);

    return this.Smoother.ComputeSmoothedWordProbability(u, v, w, bigramCount, previousCount, this.UniqueNGramsCount);
}
public override void TrainLanguageModel(Corpus trainingCorpus)
{
    // Accumulate bigram statistics sentence by sentence so that pairs never
    // span a sentence boundary, i.e. no (STOP, w) bigram is ever counted.
    foreach (var sentence in trainingCorpus.AllTokenizedSentences)
    {
        // The token preceding the first word of every sentence is the START symbol.
        var previousToken = "<s>";

        // For each adjacent pair, bump c(v) and c(v, w).
        foreach (var currentToken in sentence)
        {
            var unigramKey = new Unigram { w = previousToken }.GetComparisonKey();
            var bigramKey = new Bigram { v = previousToken, w = currentToken }.GetComparisonKey();

            // A missing entry reads as 0, so the first observation stores 1 as expected.
            this.NGramCounts.TryGetValue(unigramKey, out int unigramCount);
            this.NGramCounts[unigramKey] = unigramCount + 1;

            var seenBefore = this.NGramCounts.TryGetValue(bigramKey, out int bigramCount);
            this.NGramCounts[bigramKey] = bigramCount + 1;

            // Keep a running count of distinct bigram types
            // (consumed as |V*| by the smoothing computation).
            if (!seenBefore)
            {
                this.UniqueNGramsCount++;
            }

            // Slide the window: current token becomes the context for the next one.
            previousToken = currentToken;
        }
    }
}