Search Results for

    Show / Hide Table of Contents

    Class KuromojiTokenizer

    Inheritance
    object
    TokenizerBase
    KuromojiTokenizer
    Implements
    IKuromojiTokenizer
    ITokenizer
    Inherited Members
    TokenizerBase.Type
    TokenizerBase.Version
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.MemberwiseClone()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: OpenSearch.Client
    Assembly: OpenSearch.Client.dll
    Syntax
    public class KuromojiTokenizer : TokenizerBase, IKuromojiTokenizer, ITokenizer

    Constructors

    | Edit this page View Source

    KuromojiTokenizer()

    Declaration
    public KuromojiTokenizer()

    Properties

    | Edit this page View Source

    DiscardCompoundToken

    Whether original compound tokens should be discarded from the output with Search Mode. Defaults to false.

    Declaration
    public bool? DiscardCompoundToken { get; set; }
    Property Value
    Type Description
    bool?
    | Edit this page View Source

    DiscardPunctuation

    Whether punctuation should be discarded from the output. Defaults to true.

    Declaration
    public bool? DiscardPunctuation { get; set; }
    Property Value
    Type Description
    bool?
    | Edit this page View Source

    Mode

    The tokenization mode determines how the tokenizer handles compound and unknown words.

    Declaration
    public KuromojiTokenizationMode? Mode { get; set; }
    Property Value
    Type Description
    KuromojiTokenizationMode?
    | Edit this page View Source

    NBestCost

    The nbest_cost parameter specifies an additional Viterbi cost. The KuromojiTokenizer will include all tokens in Viterbi paths that are within the nbest_cost value of the best path.

    Declaration
    public int? NBestCost { get; set; }
    Property Value
    Type Description
    int?
    | Edit this page View Source

    NBestExamples

    The nbest_examples can be used to find a nbest_cost value based on examples. For example, a value of /箱根山-箱根/成田空港-成田/ indicates that in the texts, 箱根山 (Mt. Hakone) and 成田空港 (Narita Airport) we’d like a cost that gives is us 箱根 (Hakone) and 成田 (Narita).

    Declaration
    public string NBestExamples { get; set; }
    Property Value
    Type Description
    string
    | Edit this page View Source

    UserDictionary

    The Kuromoji tokenizer uses the MeCab-IPADIC dictionary by default. A user_dictionary may be appended to the default dictionary.

    Declaration
    public string UserDictionary { get; set; }
    Property Value
    Type Description
    string
    | Edit this page View Source

    UserDictionaryRules

    Inline rule version of UserDictionary

    Declaration
    public IEnumerable<string> UserDictionaryRules { get; set; }
    Property Value
    Type Description
    IEnumerable<string>

    Implements

    IKuromojiTokenizer
    ITokenizer

    Extension Methods

    SuffixExtensions.Suffix(object, string)
    • Edit this page
    • View Source
    In this article
    • Constructors
      • KuromojiTokenizer()
    • Properties
      • DiscardCompoundToken
      • DiscardPunctuation
      • Mode
      • NBestCost
      • NBestExamples
      • UserDictionary
      • UserDictionaryRules
    • Implements
    • Extension Methods
    Back to top Generated by DocFX