The size of the character N-grams to use to tokenize Asian text.
NOTE: You must not use NGram with the SentenceBreaking configuration parameter.
If you set NGram for Japanese, you can use SentenceBreakingOptions for normalization.
| Type: | Long |
| Default: | 0 (off) |
| Required: | No |
| Configuration Section: | LanguageTypes or MyLanguage |
| Example: | Encodings=UTF8:JapaneseUTF8
In this example, all text is indexed as N-grams of two characters. |
| See Also: | NGramMultiByteOnly
NGramOrientalOnly SentenceBreaking SentenceBreakingOptions |
NOTE: If you change this setting after you have indexed content into IDOL Server, the new setting applies only to new content, and the server logs a warning. To clear the warning and ensure that your change applies to all your content, you must initialize your index and reindex the content.
|
|