Interface IWordDelimiterGraphTokenFilter
Named word_delimiter, it Splits words into subwords and performs optional transformations on subword groups. Unlike the word_delimiter this token filter named word_delimiter_graph correctly handles multi terms expansion at query time.
Namespace: OpenSearch.Client
Assembly: OpenSearch.Client.dll
Syntax
public interface IWordDelimiterGraphTokenFilter : ITokenFilter
Properties
| Edit this page View SourceAdjustOffsets
By default, the filter tries to output subtokens with adjusted offsets to reflect their actual position in the token stream. However, when used in combination with other filters that alter the length or starting position of tokens without changing their offsets (e.g. TrimTokenFilter) this can cause tokens with illegal offsets to be emitted. Setting AdjustOffsets to false will stop WordDelimiterGraphTokenFilter from adjusting these internal offsets.
Declaration
[DataMember(Name = "adjust_offsets")]
bool? AdjustOffsets { get; set; }
Property Value
Type | Description |
---|---|
bool? |
CatenateAll
If true causes all subword parts to be catenated: "wi-fi-4000" ⇒ "wifi4000". Defaults to false.
Declaration
[DataMember(Name = "catenate_all")]
bool? CatenateAll { get; set; }
Property Value
Type | Description |
---|---|
bool? |
CatenateNumbers
If true causes maximum runs of number parts to be catenated: "500-42" ⇒ "50042". Defaults to false.
Declaration
[DataMember(Name = "catenate_numbers")]
bool? CatenateNumbers { get; set; }
Property Value
Type | Description |
---|---|
bool? |
CatenateWords
If true causes maximum runs of word parts to be catenated: "wi-fi" ⇒ "wifi". Defaults to false.
Declaration
[DataMember(Name = "catenate_words")]
bool? CatenateWords { get; set; }
Property Value
Type | Description |
---|---|
bool? |
GenerateNumberParts
If true causes number subwords to be generated: "500-42" ⇒ "500" "42". Defaults to true.
Declaration
[DataMember(Name = "generate_number_parts")]
bool? GenerateNumberParts { get; set; }
Property Value
Type | Description |
---|---|
bool? |
GenerateWordParts
If true causes parts of words to be generated: "PowerShot" ⇒ "Power" "Shot". Defaults to true.
Declaration
[DataMember(Name = "generate_word_parts")]
bool? GenerateWordParts { get; set; }
Property Value
Type | Description |
---|---|
bool? |
IgnoreKeywords
If true, the filter skips tokens with a keyword attribute of true. Defaults to false.
Declaration
[DataMember(Name = "ignore_keywords")]
bool? IgnoreKeywords { get; set; }
Property Value
Type | Description |
---|---|
bool? |
PreserveOriginal
If true includes original words in subwords: "500-42" ⇒ "500-42" "500" "42". Defaults to false.
Declaration
[DataMember(Name = "preserve_original")]
bool? PreserveOriginal { get; set; }
Property Value
Type | Description |
---|---|
bool? |
ProtectedWords
A list of protected words from being delimiter.
Declaration
[DataMember(Name = "protected_words")]
IEnumerable<string> ProtectedWords { get; set; }
Property Value
Type | Description |
---|---|
IEnumerable<string> |
ProtectedWordsPath
protected_words_path which resolved to a file configured with protected words (one on each line). Automatically resolves to config/ based location if exists.
Declaration
[DataMember(Name = "protected_words_path ")]
string ProtectedWordsPath { get; set; }
Property Value
Type | Description |
---|---|
string |
SplitOnCaseChange
If true causes "PowerShot" to be two tokens; ("Power-Shot" remains two parts regards). Defaults to true.
Declaration
[DataMember(Name = "split_on_case_change")]
bool? SplitOnCaseChange { get; set; }
Property Value
Type | Description |
---|---|
bool? |
SplitOnNumerics
If true causes "j2se" to be three tokens; "j" "2" "se". Defaults to true.
Declaration
[DataMember(Name = "split_on_numerics")]
bool? SplitOnNumerics { get; set; }
Property Value
Type | Description |
---|---|
bool? |
StemEnglishPossessive
If true causes trailing "'s" to be removed for each subword: "O’Neil’s" ⇒ "O", "Neil". Defaults to true.
Declaration
[DataMember(Name = "stem_english_possessive")]
bool? StemEnglishPossessive { get; set; }
Property Value
Type | Description |
---|---|
bool? |
TypeTable
A custom type mapping table
Declaration
[DataMember(Name = "type_table")]
IEnumerable<string> TypeTable { get; set; }
Property Value
Type | Description |
---|---|
IEnumerable<string> |
TypeTablePath
A path to a custom type mapping table file
Declaration
[DataMember(Name = "type_table_path")]
string TypeTablePath { get; set; }
Property Value
Type | Description |
---|---|
string |