You are viewing documentation for version 2 of the AWS SDK for Ruby. Version 3 documentation can be found here.
Class: Aws::CloudSearch::Types::AnalysisOptions
- Inherits: Struct
- Hierarchy: Object < Struct < Aws::CloudSearch::Types::AnalysisOptions
Overview
When passing AnalysisOptions as input to an Aws::Client method, you can use a vanilla Hash:
{
  synonyms: "String",
  stopwords: "String",
  stemming_dictionary: "String",
  japanese_tokenization_dictionary: "String",
  algorithmic_stemming: "none", # accepts none, minimal, light, full
}
Synonyms, stopwords, and stemming options for an analysis scheme. Includes tokenization dictionary for Japanese.
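Because every attribute is a String, the dictionaries have to be serialized to JSON text before they go into the Hash. The following is a minimal sketch of one way to do that. The helper name build_analysis_options and the sample terms are hypothetical, and the commented-out define_analysis_scheme call (including the domain and scheme names) is an assumption about how the Hash would then be passed to a client.

```ruby
require 'json'

# Hypothetical helper: every AnalysisOptions attribute is a String, so the
# Ruby collections are serialized to JSON before being placed in the Hash.
def build_analysis_options(stopwords:, stemming_dictionary:, algorithmic_stemming: 'none')
  {
    stopwords: JSON.generate(stopwords),
    stemming_dictionary: JSON.generate(stemming_dictionary),
    algorithmic_stemming: algorithmic_stemming
  }
end

options = build_analysis_options(
  stopwords: %w[a an the of],
  stemming_dictionary: { 'running' => 'run' },
  algorithmic_stemming: 'light'
)

puts options[:stopwords]            # JSON array text
puts options[:stemming_dictionary]  # JSON object text

# Assumed usage (needs the aws-sdk gem, credentials, and an existing domain):
# client = Aws::CloudSearch::Client.new(region: 'us-east-1')
# client.define_analysis_scheme(
#   domain_name: 'my-domain',              # hypothetical domain name
#   analysis_scheme: {
#     analysis_scheme_name: 'my_scheme',   # hypothetical scheme name
#     analysis_scheme_language: 'en',
#     analysis_options: options
#   }
# )
```

Attributes that are not needed can simply be left out of the Hash.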
Instance Attribute Summary
- #algorithmic_stemming ⇒ String
  The level of algorithmic stemming to perform: none, minimal, light, or full.
- #japanese_tokenization_dictionary ⇒ String
  A JSON array that contains a collection of terms, tokens, readings, and parts of speech for Japanese tokenization.
- #stemming_dictionary ⇒ String
  A JSON object that contains a collection of string:value pairs that each map a term to its stem.
- #stopwords ⇒ String
  A JSON array of terms to ignore during indexing and searching.
- #synonyms ⇒ String
  A JSON object that defines synonym groups and aliases.
Instance Attribute Details
#algorithmic_stemming ⇒ String
The level of algorithmic stemming to perform: none, minimal, light, or full. The available levels vary depending on the language. For more information, see Language Specific Text Processing Settings in the Amazon CloudSearch Developer Guide.
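As a small illustration, a caller might check a level against the four names listed above before building the options Hash. This is only a sketch: the constant and helper names are hypothetical, and the check does not know which levels a particular language supports.

```ruby
# The four level names listed in this section.
STEMMING_LEVELS = %w[none minimal light full].freeze

# Hypothetical helper: returns the level as a String, or raises if the
# name is not one of the four documented levels.
def validate_stemming_level(level)
  value = level.to_s
  unless STEMMING_LEVELS.include?(value)
    raise ArgumentError, "unknown stemming level: #{value.inspect}"
  end
  value
end

puts validate_stemming_level(:light)
```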
#japanese_tokenization_dictionary ⇒ String
A JSON array that contains a collection of terms, tokens, readings, and parts of speech for Japanese tokenization. The Japanese tokenization dictionary enables you to override the default tokenization for selected terms. This is only valid for Japanese language fields.
#stemming_dictionary ⇒ String
A JSON object that contains a collection of string:value pairs that each map a term to its stem. For example, {"term1": "stem1", "term2": "stem2", "term3": "stem3"}. The stemming dictionary is applied in addition to any algorithmic stemming. This enables you to override the results of the algorithmic stemming to correct specific cases of overstemming or understemming. The maximum size of a stemming dictionary is 500 KB.
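A stemming dictionary can be produced from an ordinary Ruby Hash and checked against the size limit before it is sent. The sketch below assumes that "500 KB" means 500 * 1024 bytes of JSON text; the constant and helper names are hypothetical.

```ruby
require 'json'

# Assumption: "500 KB" is read as 500 * 1024 bytes of JSON text.
MAX_STEMMING_DICTIONARY_BYTES = 500 * 1024

# Hypothetical helper: serializes a term => stem Hash and rejects results
# that exceed the assumed byte limit.
def stemming_dictionary_json(term_to_stem)
  json = JSON.generate(term_to_stem)
  if json.bytesize > MAX_STEMMING_DICTIONARY_BYTES
    raise ArgumentError, "stemming dictionary is #{json.bytesize} bytes, over the limit"
  end
  json
end

puts stemming_dictionary_json({ 'mice' => 'mouse', 'geese' => 'goose' })
```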
#stopwords ⇒ String
A JSON array of terms to ignore during indexing and searching. For example, ["a", "an", "the", "of"]. The stopwords dictionary must explicitly list each word you want to ignore. Wildcards and regular expressions are not supported.
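Since each stopword must be listed literally, it can help to reject entries that look like patterns before serializing the list. The following is a sketch only: the helper name is hypothetical, and the set of characters treated as pattern syntax is a heuristic chosen for this example, not a rule taken from the service.

```ruby
require 'json'

# Heuristic (an assumption for this example): characters that suggest the
# entry was meant as a wildcard or regular expression.
PATTERN_CHARACTERS = /[*?\[\]()|\\]/.freeze

# Hypothetical helper: returns the stopwords as JSON array text, with
# duplicates removed, and raises if any entry looks like a pattern.
def stopwords_json(words)
  words.each do |word|
    if word.match?(PATTERN_CHARACTERS)
      raise ArgumentError, "stopword looks like a pattern: #{word.inspect}"
    end
  end
  JSON.generate(words.uniq)
end

puts stopwords_json(%w[a an the of])
```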
#synonyms ⇒ String
A JSON object that defines synonym groups and aliases. The synonym groups value is an array of arrays, where each sub-array is a group of terms in which every term is considered a synonym of every other term in the group. The aliases value is an object that contains a collection of string:value pairs, where the string specifies a term and the value is an array listing each of the aliases for that term. An alias is considered a synonym of the specified term, but the term is not considered a synonym of the alias. For more information about specifying synonyms, see Synonyms in the Amazon CloudSearch Developer Guide.
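The two parts of the synonyms object can be sketched as follows. The top-level key names "groups" and "aliases" are an assumption based on the description above and should be checked against the Developer Guide; the helper name and sample terms are hypothetical.

```ruby
require 'json'

# Hypothetical helper. Assumption: the top-level keys are "groups" and
# "aliases".
#   groups:  array of arrays; terms within a sub-array are mutual synonyms.
#   aliases: term => array of aliases; the relationship is one-directional.
def synonyms_json(groups: [], aliases: {})
  JSON.generate({ 'groups' => groups, 'aliases' => aliases })
end

puts synonyms_json(
  groups: [%w[1st first one], %w[2nd second two]],
  aliases: { 'youth' => %w[child kid] }
)
```

In this example, "1st", "first", and "one" are all synonyms of each other, while "child" and "kid" are treated as synonyms of "youth" without "youth" being treated as a synonym of either alias.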