/// <summary>
/// LookupClass must reject names that do not resolve to a registered tokenizer factory,
/// whether they are plausible-looking identifiers or pure garbage.
/// </summary>
public virtual void TestBogusLookupTokenizerClass()
{
    foreach (string bogusName in new[] { "sdfsdfsdfdsfsdfsdf", "!(**#$U*#$*" })
    {
        try
        {
            TokenizerFactory.LookupClass(bogusName);
            fail();
        }
        catch (Exception expected) when (expected.IsIllegalArgumentException())
        {
            // expected: unknown factory name
        }
    }
}
/// <summary>
/// ForName must reject names that do not resolve to a registered tokenizer factory.
/// </summary>
public virtual void TestBogusLookupTokenizer()
{
    foreach (string bogusName in new[] { "sdfsdfsdfdsfsdfsdf", "!(**#$U*#$*" })
    {
        try
        {
            TokenizerFactory.ForName(bogusName, new Dictionary<string, string>());
            fail();
        }
        catch (Exception expected) when (expected.IsIllegalArgumentException())
        {
            // expected: unknown factory name
        }
    }
}
/// <summary>
/// LookupClass must throw <see cref="ArgumentException"/> for names that do not
/// resolve to a registered tokenizer factory.
/// </summary>
public virtual void TestBogusLookupTokenizerClass()
{
    foreach (string bogusName in new[] { "sdfsdfsdfdsfsdfsdf", "!(**#$U*#$*" })
    {
        try
        {
            TokenizerFactory.LookupClass(bogusName);
            fail();
        }
        catch (ArgumentException)
        {
            // expected: unknown factory name
        }
    }
}
/// <summary>
/// ForName must throw <see cref="System.ArgumentException"/> for names that do not
/// resolve to a registered tokenizer factory.
/// </summary>
public virtual void TestBogusLookupTokenizer()
{
    foreach (string bogusName in new[] { "sdfsdfsdfdsfsdfsdf", "!(**#$U*#$*" })
    {
        try
        {
            TokenizerFactory.ForName(bogusName, new Dictionary<string, string>());
            fail();
        }
        catch (System.ArgumentException)
        {
            // expected: unknown factory name
        }
    }
}
/// <summary>
/// LookupClass must resolve the whitespace tokenizer factory case-insensitively.
/// </summary>
public virtual void TestLookupTokenizerClass()
{
    foreach (string name in new[] { "Whitespace", "WHITESPACE", "whitespace" })
    {
        assertSame(typeof(WhitespaceTokenizerFactory), TokenizerFactory.LookupClass(name));
    }
}
/// <summary>
/// ForName must instantiate the whitespace tokenizer factory case-insensitively.
/// </summary>
public virtual void TestLookupTokenizer()
{
    foreach (string name in new[] { "Whitespace", "WHITESPACE", "whitespace" })
    {
        assertSame(typeof(WhitespaceTokenizerFactory), TokenizerFactory.ForName(name, VersionArgOnly()).GetType());
    }
}
/// <summary>
/// Tokenizes <paramref name="source"/> with the given factory and returns the
/// non-empty terms in the order produced.
/// </summary>
/// <param name="source">Raw text to tokenize.</param>
/// <param name="tokFactory">Factory used to create the tokenizer.</param>
/// <returns>List of non-empty term strings.</returns>
private static IList<string> splitByTokenizer(string source, TokenizerFactory tokFactory)
{
    StringReader reader = new StringReader(source);
    TokenStream ts = loadTokenizer(tokFactory, reader);
    IList<string> tokList = new List<string>();
    try
    {
        // Ported from the Java converter output: .NET Lucene uses the generic
        // AddAttribute<T>()/Reset()/IncrementToken() API, not the Java names.
        ICharTermAttribute termAtt = ts.AddAttribute<ICharTermAttribute>();
        ts.Reset();
        while (ts.IncrementToken())
        {
            if (termAtt.Length > 0)
            {
                tokList.Add(termAtt.ToString());
            }
        }
    }
    finally
    {
        // .NET TextReader has Dispose(), not Java's close()
        reader.Dispose();
    }
    return tokList;
}
/// <summary>
/// Creates a token stream from <paramref name="reader"/> using the supplied factory.
/// Ported from converter output: Java's <c>Reader</c>/<c>create()</c> do not exist
/// in .NET; the factory method is <c>Create(TextReader)</c>.
/// </summary>
private static TokenStream loadTokenizer(TokenizerFactory tokFactory, TextReader reader)
{
    return tokFactory.Create(reader);
}
// a , b c , d e f => [[a],[b,c],[d,e,f]]
/// <summary>
/// Splits <paramref name="str"/> on <paramref name="separator"/>, then splits each
/// piece into tokens — via the tokenizer factory when one is supplied, otherwise on
/// whitespace — producing one token list per piece.
/// </summary>
private static IList<IList<string>> getSynList(string str, string separator, TokenizerFactory tokFactory)
{
    IList<string> pieces = splitSmart(str, separator, false);

    // now split on whitespace to get a list of token strings
    IList<IList<string>> result = new List<IList<string>>();
    foreach (string piece in pieces)
    {
        IList<string> tokens;
        if (tokFactory == null)
        {
            tokens = splitWS(piece, true);
        }
        else
        {
            tokens = splitByTokenizer(piece, tokFactory);
        }
        result.Add(tokens);
    }
    return result;
}
/// <summary>
/// Parses synonym rules into <paramref name="map"/>. Each rule is split on
/// <paramref name="mappingSep"/> into at most two sides; a one-sided rule either
/// expands to all of its terms (<paramref name="expansion"/> true) or reduces to
/// its first term. Tokens are produced via <paramref name="tokFactory"/> when
/// supplied, otherwise by whitespace splitting.
/// </summary>
/// <exception cref="System.ArgumentException">if a rule has more than two sides</exception>
internal static void parseRules(IEnumerable<string> rules, SlowSynonymMap map, string mappingSep, string synSep, bool expansion, TokenizerFactory tokFactory)
{
    int count = 0;
    foreach (string rule in rules)
    {
        // To use regexes, we need an expression that specifies an odd number of chars.
        // This can't really be done with string.split(), and since we need to
        // do unescaping at some point anyway, we wouldn't be saving any effort
        // by using regexes.
        IList<string> mapping = splitSmart(rule, mappingSep, false);

        IList<IList<string>> source;
        IList<IList<string>> target;

        if (mapping.Count > 2)
        {
            throw new System.ArgumentException("Invalid Synonym Rule:" + rule);
        }
        else if (mapping.Count == 2)
        {
            source = getSynList(mapping[0], synSep, tokFactory);
            target = getSynList(mapping[1], synSep, tokFactory);
        }
        else
        {
            source = getSynList(mapping[0], synSep, tokFactory);
            if (expansion)
            {
                // expand to all arguments
                target = source;
            }
            else
            {
                // reduce to first argument
                // BUGFIX: "new List<>(1)" is Java diamond syntax, invalid in C#;
                // the element type must be stated explicitly.
                target = new List<IList<string>>(1);
                target.Add(source[0]);
            }
        }

        bool includeOrig = false;
        foreach (IList<string> fromToks in source)
        {
            count++;
            foreach (IList<string> toToks in target)
            {
                map.add(fromToks, SlowSynonymMap.makeTokens(toToks), includeOrig, true);
            }
        }
    }
}
/// <summary>
/// Wires up the component factories this analyzer is assembled from.
/// Only the tokenizer factory is mandatory; the filter factories may be null.
/// </summary>
internal FactoryAnalyzer(TokenizerFactory tokenizer, TokenFilterFactory tokenfilter, CharFilterFactory charFilter)
{
    Debug.Assert(tokenizer != null);
    this.charFilter = charFilter;
    this.tokenfilter = tokenfilter;
    this.tokenizer = tokenizer;
}
/// <summary>
/// Captures the enclosing factory instance and the tokenizer factory this
/// anonymous-class helper delegates to.
/// </summary>
public AnalyzerAnonymousInnerClassHelper(FSTSynonymFilterFactory outerInstance, TokenizerFactory factory)
{
    this.factory = factory;
    this.outerInstance = outerInstance;
}
/// <summary>
/// Creates a token stream over <paramref name="reader"/> using the supplied factory.
/// </summary>
private static TokenStream LoadTokenizer(TokenizerFactory tokFactory, TextReader reader) => tokFactory.Create(reader);
/// <summary>
/// Runs <paramref name="source"/> through the factory's tokenizer and returns the
/// non-empty terms in the order they were produced.
/// </summary>
private static IList<string> SplitByTokenizer(string source, TokenizerFactory tokFactory)
{
    StringReader input = new StringReader(source);
    TokenStream stream = LoadTokenizer(tokFactory, input);
    IList<string> terms = new List<string>();
    try
    {
        ICharTermAttribute termAtt = stream.AddAttribute<ICharTermAttribute>();
        stream.Reset();
        while (stream.IncrementToken())
        {
            if (termAtt.Length == 0)
            {
                continue; // skip empty terms
            }
            terms.Add(termAtt.ToString());
        }
    }
    finally
    {
        input.Dispose();
    }
    return terms;
}