Interface INoriAnalyzer
The nori analyzer consists of the following tokenizer and token filters:
- nori_tokenizer
- nori_part_of_speech token filter
- nori_readingform token filter
- nori_number token filter
- lowercase token filter
Namespace: OpenSearch.Client
Assembly: OpenSearch.Client.dll
Syntax
public interface INoriAnalyzer : IAnalyzer
Properties
| Edit this page View SourceDecompoundMode
The regular expression pattern, defaults to \W+.
Declaration
[DataMember(Name = "decompound_mode")]
NoriDecompoundMode? DecompoundMode { get; set; }
Property Value
Type | Description |
---|---|
NoriDecompoundMode? |
StopTags
An array of part-of-speech tags that should be removed.
Declaration
[DataMember(Name = "stoptags")]
IEnumerable<string> StopTags { get; set; }
Property Value
Type | Description |
---|---|
IEnumerable<string> |
UserDictionary
The Nori tokenizer uses the mecab-ko-dic dictionary by default. A user_dictionary with custom nouns (NNG) may be appended to the default dictionary. This property allows you to specify this file on disk
Declaration
[DataMember(Name = "user_dictionary")]
string UserDictionary { get; set; }
Property Value
Type | Description |
---|---|
string |