Class NoriTokenizer
Tokenizer that ships with the analysis-nori plugin
Inherited Members
Namespace: OpenSearch.Client
Assembly: OpenSearch.Client.dll
Syntax
public class NoriTokenizer : TokenizerBase, INoriTokenizer, ITokenizer
Constructors
| Edit this page View SourceNoriTokenizer()
Declaration
public NoriTokenizer()
Properties
| Edit this page View SourceDecompoundMode
The regular expression pattern, defaults to \W+.
Declaration
public NoriDecompoundMode? DecompoundMode { get; set; }
Property Value
Type | Description |
---|---|
NoriDecompoundMode? |
DiscardPunctuation
Whether punctuation should be discarded from the output. Defaults to true
.
Declaration
public bool? DiscardPunctuation { get; set; }
Property Value
Type | Description |
---|---|
bool? |
UserDictionary
The Nori tokenizer uses the mecab-ko-dic dictionary by default. A user_dictionary with custom nouns (NNG) may be appended to the default dictionary. This property allows you to specify this file on disk
Declaration
public string UserDictionary { get; set; }
Property Value
Type | Description |
---|---|
string |
UserDictionaryRules
The Nori tokenizer uses the mecab-ko-dic dictionary by default. A user_dictionary with custom nouns (NNG) can be specified inline with this property
Declaration
public IEnumerable<string> UserDictionaryRules { get; set; }
Property Value
Type | Description |
---|---|
IEnumerable<string> |