Interface ITermVectorFilter
Filter terms returned based on their TF-IDF scores. This can be useful in order find out a good characteristic vector of a document.
Namespace: OpenSearch.Client
Assembly: OpenSearch.Client.dll
Syntax
public interface ITermVectorFilter
Properties
| Edit this page View SourceMaximumDocumentFrequency
Ignore words which occur in more than this many docs. Defaults to unbounded.
Declaration
[DataMember(Name = "max_doc_freq")]
int? MaximumDocumentFrequency { get; set; }
Property Value
Type | Description |
---|---|
int? |
MaximumNumberOfTerms
Maximum number of terms that must be returned per field. Defaults to 25.
Declaration
[DataMember(Name = "max_num_terms")]
int? MaximumNumberOfTerms { get; set; }
Property Value
Type | Description |
---|---|
int? |
MaximumTermFrequency
Ignore words with more than this frequency in the source doc. Defaults to unbounded.
Declaration
[DataMember(Name = "max_term_freq")]
int? MaximumTermFrequency { get; set; }
Property Value
Type | Description |
---|---|
int? |
MaximumWordLength
The maximum word length above which words will be ignored. Defaults to unbounded.
Declaration
[DataMember(Name = "max_word_length")]
int? MaximumWordLength { get; set; }
Property Value
Type | Description |
---|---|
int? |
MinimumDocumentFrequency
Ignore terms which do not occur in at least this many docs. Defaults to 1.
Declaration
[DataMember(Name = "min_doc_freq")]
int? MinimumDocumentFrequency { get; set; }
Property Value
Type | Description |
---|---|
int? |
MinimumTermFrequency
Ignore words with less than this frequency in the source doc. Defaults to 1.
Declaration
[DataMember(Name = "min_term_freq")]
int? MinimumTermFrequency { get; set; }
Property Value
Type | Description |
---|---|
int? |
MinimumWordLength
The minimum word length below which words will be ignored. Defaults to 0.
Declaration
[DataMember(Name = "min_word_length")]
int? MinimumWordLength { get; set; }
Property Value
Type | Description |
---|---|
int? |