Interface IFingerprintAnalyzer
The fingerprint analyzer implements a fingerprinting algorithm which is used by the OpenRefine project to assist in clustering.
Namespace: OpenSearch.Client
Assembly: OpenSearch.Client.dll
Syntax
public interface IFingerprintAnalyzer : IAnalyzer
Properties
| Edit this page View SourceMaxOutputSize
The maximum token size to emit. Defaults to 255.
Declaration
[DataMember(Name = "max_output_size")]
int? MaxOutputSize { get; set; }
Property Value
Type | Description |
---|---|
int? |
PreserveOriginal
If true, emits both the original and folded version of tokens that contain extended characters. Defaults to false
Declaration
[DataMember(Name = "preserve_original")]
bool? PreserveOriginal { get; set; }
Property Value
Type | Description |
---|---|
bool? |
Separator
The character that separates the tokens after concatenation. Defaults to a space.
Declaration
[DataMember(Name = "separator")]
string Separator { get; set; }
Property Value
Type | Description |
---|---|
string |
StopWords
A list of stop words to use. Defaults to an empty list
Declaration
[DataMember(Name = "stopwords")]
StopWords StopWords { get; set; }
Property Value
Type | Description |
---|---|
StopWords |
StopWordsPath
A path(either relative to config location, or absolute) to a stopwords file configuration.Each stop word should be in its own "line" (separated by a line break). The file must be UTF-8 encoded.
Declaration
[DataMember(Name = "stopwords_path")]
string StopWordsPath { get; set; }
Property Value
Type | Description |
---|---|
string |