C# (CSharp) Lucene.Net.Analysis.Core LetterTokenizer - 5 examples found. These are the top rated real world C# (CSharp) examples of Lucene.Net.Analysis.Core.LetterTokenizer extracted from open source projects. You can rate examples to help us improve the quality of examples.
Related
Related in langs
A LetterTokenizer is a tokenizer that divides text at non-letters. That's to say, it defines tokens as maximal strings of adjacent letters, as defined by java.lang.Character.isLetter() predicate.
Note: this does a decent job for most European languages, but does a terrible job for some Asian languages, where words are not separated by spaces.