Normalizer for Persian.

Normalization is done in-place for efficiency, operating on a termbuffer.

Normalization is defined as:

  • Normalization of various heh + hamza forms and heh goal to heh.
  • Normalization of farsi yeh and yeh barree to arabic yeh
  • Normalization of persian keheh to arabic kaf

 public PersianNormalizationFilter(TokenStream input)
     : base(input)
 {
     _normalizer = new PersianNormalizer();
     _termAtt    = AddAttribute <ITermAttribute>();
 }
 public PersianNormalizationFilter(TokenStream input)
     : base(input)
 {
     _normalizer = new PersianNormalizer();
     _termAtt = AddAttribute<ITermAttribute>();
 }
 public PersianNormalizationFilter(TokenStream input)
     : base(input)
 {
     _normalizer = new PersianNormalizer();
     _termAtt = (TermAttribute)AddAttribute(typeof(TermAttribute));
 }