Class NGramTokenizer
Inherited Members
Namespace: OpenSearch.Client
Assembly: OpenSearch.Client.dll
Syntax
public class NGramTokenizer : TokenizerBase, INGramTokenizer, ITokenizer
Constructors
| Edit this page View SourceNGramTokenizer()
Declaration
public NGramTokenizer()
Properties
| Edit this page View SourceCustomTokenChars
Custom characters that should be treated as part of a token. For example, setting this to +-_ will make the tokenizer treat the plus, minus and underscore sign as part of a token.
Requires setting Custom as part of TokenCharsDeclaration
public string CustomTokenChars { get; set; }
Property Value
Type | Description |
---|---|
string |
MaxGram
Maximum size in codepoints of a single n-gram, defaults to 2.
Declaration
public int? MaxGram { get; set; }
Property Value
Type | Description |
---|---|
int? |
MinGram
Minimum size in codepoints of a single n-gram, defaults to 1.
Declaration
public int? MinGram { get; set; }
Property Value
Type | Description |
---|---|
int? |
TokenChars
Characters classes to keep in the tokens, OpenSearch will split on characters that don’t belong to any of these classes.
Declaration
public IEnumerable<TokenChar> TokenChars { get; set; }
Property Value
Type | Description |
---|---|
IEnumerable<TokenChar> |