Interface INoriAnalyzer

The nori analyzer consists of the following tokenizer and token filters:

- nori_tokenizer

- nori_part_of_speech token filter

- nori_readingform token filter

- nori_number token filter

- lowercase token filter

Inherited Members

IAnalyzer.Type

IAnalyzer.Version

Namespace: OpenSearch.Client

Assembly: OpenSearch.Client.dll

Syntax

public interface INoriAnalyzer : IAnalyzer

Properties


DecompoundMode

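Determines how the tokenizer handles compound tokens: none (no decomposition), discard (decompose and drop the original form; the default), or mixed (decompose and keep the original form).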

Declaration

[DataMember(Name = "decompound_mode")]
NoriDecompoundMode? DecompoundMode { get; set; }

Property Value

Type	Description
NoriDecompoundMode?


StopTags

An array of part-of-speech tags that should be removed.

Declaration

[DataMember(Name = "stoptags")]
IEnumerable<string> StopTags { get; set; }

Property Value

Type	Description
IEnumerable<string>


UserDictionary

The Nori tokenizer uses the mecab-ko-dic dictionary by default. A user_dictionary with custom nouns (NNG) may be appended to the default dictionary. This property specifies the path to that dictionary file on disk.

Declaration

[DataMember(Name = "user_dictionary")]
string UserDictionary { get; set; }

Property Value

Type	Description
string
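Example

The three properties above can be set together with an object initializer. The following is a minimal sketch, not a definitive configuration: it assumes the concrete NoriAnalyzer class and the NoriDecompoundMode.Mixed enum member from OpenSearch.Client, and the stop tags and dictionary file name are hypothetical values chosen only for illustration.

```csharp
using System;
using OpenSearch.Client;

// Assumption: NoriAnalyzer is the concrete implementation of INoriAnalyzer
// shipped in OpenSearch.Client.
var analyzer = new NoriAnalyzer
{
    // Serialized as "decompound_mode"; Mixed keeps the original compound
    // alongside its decomposed parts.
    DecompoundMode = NoriDecompoundMode.Mixed,

    // Serialized as "stoptags"; illustrative part-of-speech tags to remove.
    StopTags = new[] { "E", "IC", "J" },

    // Serialized as "user_dictionary"; hypothetical file name.
    UserDictionary = "userdict_ko.txt"
};

// Type is inherited from IAnalyzer.
Console.WriteLine(analyzer.Type);
```

The analyzer instance would then be registered under a name in the index analysis settings; that step is omitted here because it depends on how the index is created.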

Extension Methods

SuffixExtensions.Suffix(object, string)