Class NoriTokenizerDescriptor
Tokenizer that ships with the analysis-nori plugin
Inheritance
NoriTokenizerDescriptor
Assembly: OpenSearch.Client.dll
Syntax
public class NoriTokenizerDescriptor : TokenizerDescriptorBase<NoriTokenizerDescriptor, INoriTokenizer>, IDescriptor, INoriTokenizer, ITokenizer
Properties
|
Edit this page
View Source
Type
Declaration
protected override string Type { get; }
Property Value
Overrides
Methods
|
Edit this page
View Source
DecompoundMode(NoriDecompoundMode?)
The regular expression pattern, defaults to \W+.
Declaration
public NoriTokenizerDescriptor DecompoundMode(NoriDecompoundMode? mode)
Parameters
Returns
|
Edit this page
View Source
DiscardPunctuation(bool?)
Whether punctuation should be discarded from the output. Defaults to true
.
Declaration
public NoriTokenizerDescriptor DiscardPunctuation(bool? discard = true)
Parameters
Type |
Name |
Description |
bool? |
discard |
|
Returns
|
Edit this page
View Source
UserDictionary(string)
The Nori tokenizer uses the mecab-ko-dic dictionary by default. A user_dictionary with custom nouns (NNG) may be
appended to
the default dictionary. This property allows you to specify this file on disk
Declaration
public NoriTokenizerDescriptor UserDictionary(string path)
Parameters
Type |
Name |
Description |
string |
path |
|
Returns
|
Edit this page
View Source
UserDictionaryRules(IEnumerable<string>)
The Nori tokenizer uses the mecab-ko-dic dictionary by default. A user_dictionary with custom nouns (NNG)
can be specified inline with this property
Declaration
public NoriTokenizerDescriptor UserDictionaryRules(IEnumerable<string> rules)
Parameters
Returns
|
Edit this page
View Source
UserDictionaryRules(params string[])
The Nori tokenizer uses the mecab-ko-dic dictionary by default. A user_dictionary with custom nouns (NNG)
can be specified inline with this property
Declaration
public NoriTokenizerDescriptor UserDictionaryRules(params string[] rules)
Parameters
Type |
Name |
Description |
string[] |
rules |
|
Returns
Implements
Extension Methods