Search Results for

    Show / Hide Table of Contents

    Class KuromojiTokenizerDescriptor

    Inheritance
    object
    DescriptorBase<KuromojiTokenizerDescriptor, IKuromojiTokenizer>
    TokenizerDescriptorBase<KuromojiTokenizerDescriptor, IKuromojiTokenizer>
    KuromojiTokenizerDescriptor
    Implements
    IDescriptor
    IKuromojiTokenizer
    ITokenizer
    Inherited Members
    TokenizerDescriptorBase<KuromojiTokenizerDescriptor, IKuromojiTokenizer>.Type
    TokenizerDescriptorBase<KuromojiTokenizerDescriptor, IKuromojiTokenizer>.Version(string)
    DescriptorBase<KuromojiTokenizerDescriptor, IKuromojiTokenizer>.Self
    DescriptorBase<KuromojiTokenizerDescriptor, IKuromojiTokenizer>.Assign<TValue>(TValue, Action<IKuromojiTokenizer, TValue>)
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.MemberwiseClone()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: OpenSearch.Client
    Assembly: OpenSearch.Client.dll
    Syntax
    public class KuromojiTokenizerDescriptor : TokenizerDescriptorBase<KuromojiTokenizerDescriptor, IKuromojiTokenizer>, IDescriptor, IKuromojiTokenizer, ITokenizer

    Properties

    | Edit this page View Source

    Type

    Declaration
    protected override string Type { get; }
    Property Value
    Type Description
    string
    Overrides
    TokenizerDescriptorBase<KuromojiTokenizerDescriptor, IKuromojiTokenizer>.Type

    Methods

    | Edit this page View Source

    DiscardCompoundToken(bool?)

    Whether original compound tokens should be discarded from the output with Search Mode. Defaults to false.

    Declaration
    public KuromojiTokenizerDescriptor DiscardCompoundToken(bool? discard = true)
    Parameters
    Type Name Description
    bool? discard
    Returns
    Type Description
    KuromojiTokenizerDescriptor
    | Edit this page View Source

    DiscardPunctuation(bool?)

    Whether punctuation should be discarded from the output. Defaults to true.

    Declaration
    public KuromojiTokenizerDescriptor DiscardPunctuation(bool? discard = true)
    Parameters
    Type Name Description
    bool? discard
    Returns
    Type Description
    KuromojiTokenizerDescriptor
    | Edit this page View Source

    Mode(KuromojiTokenizationMode?)

    The tokenization mode determines how the tokenizer handles compound and unknown words.

    Declaration
    public KuromojiTokenizerDescriptor Mode(KuromojiTokenizationMode? mode)
    Parameters
    Type Name Description
    KuromojiTokenizationMode? mode
    Returns
    Type Description
    KuromojiTokenizerDescriptor
    | Edit this page View Source

    NBestCost(int?)

    The nbest_cost parameter specifies an additional Viterbi cost. The KuromojiTokenizer will include all tokens in Viterbi paths that are within the nbest_cost value of the best path.

    Declaration
    public KuromojiTokenizerDescriptor NBestCost(int? cost)
    Parameters
    Type Name Description
    int? cost
    Returns
    Type Description
    KuromojiTokenizerDescriptor
    | Edit this page View Source

    NBestExamples(string)

    The nbest_examples can be used to find a nbest_cost value based on examples. For example, a value of /箱根山-箱根/成田空港-成田/ indicates that in the texts, 箱根山 (Mt. Hakone) and 成田空港 (Narita Airport) we’d like a cost that gives is us 箱根 (Hakone) and 成田 (Narita).

    Declaration
    public KuromojiTokenizerDescriptor NBestExamples(string examples)
    Parameters
    Type Name Description
    string examples
    Returns
    Type Description
    KuromojiTokenizerDescriptor
    | Edit this page View Source

    UserDictionary(string)

    The Kuromoji tokenizer uses the MeCab-IPADIC dictionary by default. A user_dictionary may be appended to the default dictionary.

    Declaration
    public KuromojiTokenizerDescriptor UserDictionary(string userDictionary)
    Parameters
    Type Name Description
    string userDictionary
    Returns
    Type Description
    KuromojiTokenizerDescriptor
    | Edit this page View Source

    UserDictionaryRules(IEnumerable<string>)

    Inline rule version of UserDictionary

    Declaration
    public KuromojiTokenizerDescriptor UserDictionaryRules(IEnumerable<string> rules)
    Parameters
    Type Name Description
    IEnumerable<string> rules
    Returns
    Type Description
    KuromojiTokenizerDescriptor
    | Edit this page View Source

    UserDictionaryRules(params string[])

    Inline rule version of UserDictionary

    Declaration
    public KuromojiTokenizerDescriptor UserDictionaryRules(params string[] rules)
    Parameters
    Type Name Description
    string[] rules
    Returns
    Type Description
    KuromojiTokenizerDescriptor

    Implements

    IDescriptor
    IKuromojiTokenizer
    ITokenizer

    Extension Methods

    SuffixExtensions.Suffix(object, string)
    • Edit this page
    • View Source
    In this article
    • Properties
      • Type
    • Methods
      • DiscardCompoundToken(bool?)
      • DiscardPunctuation(bool?)
      • Mode(KuromojiTokenizationMode?)
      • NBestCost(int?)
      • NBestExamples(string)
      • UserDictionary(string)
      • UserDictionaryRules(IEnumerable<string>)
      • UserDictionaryRules(params string[])
    • Implements
    • Extension Methods
    Back to top Generated by DocFX