Normalization token filters

Normalization token filters

There are several token filters available which try to normalize special characters of a certain language.

Arabic

arabic_normalization

German

german_normalization

Hindi

hindi_normalization

Indic

indic_normalization

Kurdish (Sorani)

sorani_normalization

Persian

persian_normalization

Scandinavian

scandinavian_normalization, scandinavian_folding

Serbian

serbian_normalization