Search Results for

    Show / Hide Table of Contents

    Class CharGroupTokenizerDescriptor

    A tokenizer that breaks text into terms whenever it encounters a character which is in a defined set. It is mostly useful for cases where a simple custom tokenization is desired, and the overhead of use of PatternTokenizer is not acceptable.

    Inheritance
    object
    DescriptorBase<CharGroupTokenizerDescriptor, ICharGroupTokenizer>
    TokenizerDescriptorBase<CharGroupTokenizerDescriptor, ICharGroupTokenizer>
    CharGroupTokenizerDescriptor
    Implements
    IDescriptor
    ICharGroupTokenizer
    ITokenizer
    Inherited Members
    TokenizerDescriptorBase<CharGroupTokenizerDescriptor, ICharGroupTokenizer>.Type
    TokenizerDescriptorBase<CharGroupTokenizerDescriptor, ICharGroupTokenizer>.Version(string)
    DescriptorBase<CharGroupTokenizerDescriptor, ICharGroupTokenizer>.Self
    DescriptorBase<CharGroupTokenizerDescriptor, ICharGroupTokenizer>.Assign<TValue>(TValue, Action<ICharGroupTokenizer, TValue>)
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.MemberwiseClone()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: OpenSearch.Client
    Assembly: OpenSearch.Client.dll
    Syntax
    public class CharGroupTokenizerDescriptor : TokenizerDescriptorBase<CharGroupTokenizerDescriptor, ICharGroupTokenizer>, IDescriptor, ICharGroupTokenizer, ITokenizer

    Properties

    | Edit this page View Source

    Type

    Declaration
    protected override string Type { get; }
    Property Value
    Type Description
    string
    Overrides
    TokenizerDescriptorBase<CharGroupTokenizerDescriptor, ICharGroupTokenizer>.Type

    Methods

    | Edit this page View Source

    MaxTokenLength(int?)

    The maximum token length. If a token is seen that exceeds this length then it is split at MaxTokenLength intervals. Defaults to 255.

    Declaration
    public CharGroupTokenizerDescriptor MaxTokenLength(int? maxTokenLength)
    Parameters
    Type Name Description
    int? maxTokenLength
    Returns
    Type Description
    CharGroupTokenizerDescriptor
    | Edit this page View Source

    TokenizeOnCharacters(IEnumerable<string>)

    A list containing a list of characters to tokenize the string on. Whenever a character from this list is encountered, a new token is started. This accepts either single characters like eg. -, or character groups: whitespace, letter, digit, punctuation, symbol.

    Declaration
    public CharGroupTokenizerDescriptor TokenizeOnCharacters(IEnumerable<string> characters)
    Parameters
    Type Name Description
    IEnumerable<string> characters
    Returns
    Type Description
    CharGroupTokenizerDescriptor
    | Edit this page View Source

    TokenizeOnCharacters(params string[])

    A list containing a list of characters to tokenize the string on. Whenever a character from this list is encountered, a new token is started. This accepts either single characters like eg. -, or character groups: whitespace, letter, digit, punctuation, symbol.

    Declaration
    public CharGroupTokenizerDescriptor TokenizeOnCharacters(params string[] characters)
    Parameters
    Type Name Description
    string[] characters
    Returns
    Type Description
    CharGroupTokenizerDescriptor

    Implements

    IDescriptor
    ICharGroupTokenizer
    ITokenizer

    Extension Methods

    SuffixExtensions.Suffix(object, string)
    • Edit this page
    • View Source
    In this article
    • Properties
      • Type
    • Methods
      • MaxTokenLength(int?)
      • TokenizeOnCharacters(IEnumerable<string>)
      • TokenizeOnCharacters(params string[])
    • Implements
    • Extension Methods
    Back to top Generated by DocFX