Interface INGramTokenizer
A tokenizer of type nGram.
Namespace: OpenSearch.Client
Assembly: OpenSearch.Client.dll
Syntax
public interface INGramTokenizer : ITokenizer
Properties
CustomTokenChars
Custom characters that should be treated as part of a token. For example, setting this to +-_ makes the tokenizer treat the plus, minus, and underscore signs as part of a token.
Requires setting Custom as part of TokenChars.
Declaration
[DataMember(Name = "custom_token_chars")]
string CustomTokenChars { get; set; }
Property Value
Type | Description |
---|---|
string |
MaxGram
Maximum size, in code points, of a single n-gram. Defaults to 2.
Declaration
[DataMember(Name = "max_gram")]
int? MaxGram { get; set; }
Property Value
Type | Description |
---|---|
int? |
MinGram
Minimum size, in code points, of a single n-gram. Defaults to 1.
Declaration
[DataMember(Name = "min_gram")]
int? MinGram { get; set; }
Property Value
Type | Description |
---|---|
int? |
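To make the effect of MinGram and MaxGram concrete, here is a minimal sketch in plain C# that enumerates the n-grams of a single word. It does not call OpenSearch.Client; the class NGramSketch and the method NGrams are hypothetical names used only for illustration, and the emission order (by start position, then by length) is an assumption about how the server-side tokenizer behaves. For simplicity the sketch counts UTF-16 chars rather than code points, so it only matches the documented sizes for text without surrogate pairs.

```csharp
using System;
using System.Collections.Generic;

public static class NGramSketch
{
    // Hypothetical helper: returns every substring whose length lies between
    // minGram and maxGram, walking the word one start position at a time.
    // Simplification: lengths are counted in UTF-16 chars, not code points.
    public static List<string> NGrams(string word, int minGram = 1, int maxGram = 2)
    {
        var grams = new List<string>();
        for (var start = 0; start < word.Length; start++)
        {
            for (var len = minGram; len <= maxGram && start + len <= word.Length; len++)
            {
                grams.Add(word.Substring(start, len));
            }
        }
        return grams;
    }

    public static void Main()
    {
        // With the documented defaults (min 1, max 2) this prints: f, fo, o, ox, x
        Console.WriteLine(string.Join(", ", NGrams("fox")));
    }
}
```

Raising both values (for example, to 3 and 3) produces fewer, longer grams, which usually means a smaller index at the cost of no matches for shorter query fragments.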
TokenChars
Character classes to keep in the tokens. OpenSearch splits on characters that don't belong to any of these classes.
Declaration
[DataMember(Name = "token_chars")]
IEnumerable<TokenChar> TokenChars { get; set; }
Property Value
Type | Description |
---|---|
IEnumerable<TokenChar> |
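The splitting behaviour described for TokenChars and CustomTokenChars can be sketched in plain C# as follows. This is an illustration only, not the library's implementation: TokenCharsSketch and SplitWords are hypothetical names, the kept classes are hard-coded to letters and digits, and the customTokenChars parameter stands in for the CustomTokenChars property with the Custom class enabled. In the real tokenizer, n-grams would then be produced from each resulting word.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

public static class TokenCharsSketch
{
    // Hypothetical helper: keeps letters, digits, and any character listed in
    // customTokenChars; every other character ends the current word.
    public static List<string> SplitWords(string text, string customTokenChars = "")
    {
        var words = new List<string>();
        var current = new StringBuilder();
        foreach (var c in text)
        {
            var keep = char.IsLetter(c) || char.IsDigit(c) || customTokenChars.IndexOf(c) >= 0;
            if (keep)
            {
                current.Append(c);
            }
            else if (current.Length > 0)
            {
                words.Add(current.ToString());
                current.Clear();
            }
        }
        if (current.Length > 0)
        {
            words.Add(current.ToString());
        }
        return words;
    }

    public static void Main()
    {
        // Letters and digits only: prints "snake | case | x | 1"
        Console.WriteLine(string.Join(" | ", SplitWords("snake_case x+1")));
        // With +-_ treated as token characters: prints "snake_case | x+1"
        Console.WriteLine(string.Join(" | ", SplitWords("snake_case x+1", "+-_")));
    }
}
```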