Search Results for

    Show / Hide Table of Contents

    Interface INoriTokenizer

    Tokenizer that ships with the analysis-nori plugin

    Inherited Members
    ITokenizer.Type
    ITokenizer.Version
    Namespace: OpenSearch.Client
    Assembly: OpenSearch.Client.dll
    Syntax
    public interface INoriTokenizer : ITokenizer

    Properties

    | Edit this page View Source

    DecompoundMode

    The regular expression pattern, defaults to \W+.

    Declaration
    [DataMember(Name = "decompound_mode")]
    NoriDecompoundMode? DecompoundMode { get; set; }
    Property Value
    Type Description
    NoriDecompoundMode?
    | Edit this page View Source

    DiscardPunctuation

    Whether punctuation should be discarded from the output. Defaults to true.

    Declaration
    [DataMember(Name = "discard_punctuation")]
    bool? DiscardPunctuation { get; set; }
    Property Value
    Type Description
    bool?
    | Edit this page View Source

    UserDictionary

    The Nori tokenizer uses the mecab-ko-dic dictionary by default. A user_dictionary with custom nouns (NNG) may be appended to the default dictionary. This property allows you to specify this file on disk

    Declaration
    [DataMember(Name = "user_dictionary")]
    string UserDictionary { get; set; }
    Property Value
    Type Description
    string
    | Edit this page View Source

    UserDictionaryRules

    The Nori tokenizer uses the mecab-ko-dic dictionary by default. A user_dictionary with custom nouns (NNG) can be specified inline with this property

    Declaration
    [DataMember(Name = "user_dictionary_rules")]
    IEnumerable<string> UserDictionaryRules { get; set; }
    Property Value
    Type Description
    IEnumerable<string>

    Extension Methods

    SuffixExtensions.Suffix(object, string)
    • Edit this page
    • View Source
    In this article
    • Properties
      • DecompoundMode
      • DiscardPunctuation
      • UserDictionary
      • UserDictionaryRules
    • Extension Methods
    Back to top Generated by DocFX