Class NGramTokenizerDescriptor
Inheritance
NGramTokenizerDescriptor
Assembly: OpenSearch.Client.dll
Syntax
public class NGramTokenizerDescriptor : TokenizerDescriptorBase<NGramTokenizerDescriptor, INGramTokenizer>, IDescriptor, INGramTokenizer, ITokenizer
Properties
|
Edit this page
View Source
Type
Declaration
protected override string Type { get; }
Property Value
Overrides
Methods
|
Edit this page
View Source
CustomTokenChars(string)
Custom characters that should be treated as part of a token. For example,
setting this to +-_ will make the tokenizer treat the plus, minus and
underscore sign as part of a token.
Requires setting
Custom as part of
TokenChars
Declaration
public NGramTokenizerDescriptor CustomTokenChars(string customTokenChars)
Parameters
Type |
Name |
Description |
string |
customTokenChars |
|
Returns
|
Edit this page
View Source
MaxGram(int?)
Maximum size in codepoints of a single n-gram, defaults to 2.
Declaration
public NGramTokenizerDescriptor MaxGram(int? minGram)
Parameters
Type |
Name |
Description |
int? |
minGram |
|
Returns
|
Edit this page
View Source
MinGram(int?)
Minimum size in codepoints of a single n-gram, defaults to 1.
Declaration
public NGramTokenizerDescriptor MinGram(int? minGram)
Parameters
Type |
Name |
Description |
int? |
minGram |
|
Returns
|
Edit this page
View Source
TokenChars(params TokenChar[])
Characters classes to keep in the tokens, OpenSearch will
split on characters that don’t belong to any of these classes.
Declaration
public NGramTokenizerDescriptor TokenChars(params TokenChar[] tokenChars)
Parameters
Returns
|
Edit this page
View Source
TokenChars(IEnumerable<TokenChar>)
Characters classes to keep in the tokens, OpenSearch will
split on characters that don’t belong to any of these classes.
Declaration
public NGramTokenizerDescriptor TokenChars(IEnumerable<TokenChar> tokenChars)
Parameters
Returns
Implements
Extension Methods