Class: Aws::CloudSearchDomain::Client
- Inherits:
-
Seahorse::Client::Base
- Object
- Seahorse::Client::Base
- Aws::CloudSearchDomain::Client
- Includes:
- Aws::ClientStubs
- Defined in:
- gems/aws-sdk-cloudsearchdomain/lib/aws-sdk-cloudsearchdomain/client.rb
Overview
An API client for CloudSearchDomain. To construct a client, you need to configure a :region
and :credentials
.
client = Aws::CloudSearchDomain::Client.new(
region: region_name,
credentials: credentials,
# ...
)
For details on configuring region and credentials see the developer guide.
See #initialize for a full list of supported configuration options.
Instance Attribute Summary
Attributes inherited from Seahorse::Client::Base
API Operations collapse
-
#search(params = {}) ⇒ Types::SearchResponse
Retrieves a list of documents that match the specified search criteria.
-
#suggest(params = {}) ⇒ Types::SuggestResponse
Retrieves autocomplete suggestions for a partial query string.
-
#upload_documents(params = {}) ⇒ Types::UploadDocumentsResponse
Posts a batch of documents to a search domain for indexing.
Instance Method Summary collapse
-
#initialize(options) ⇒ Client
constructor
A new instance of Client.
Methods included from Aws::ClientStubs
#api_requests, #stub_data, #stub_responses
Methods inherited from Seahorse::Client::Base
add_plugin, api, clear_plugins, define, new, #operation_names, plugins, remove_plugin, set_api, set_plugins
Methods included from Seahorse::Client::HandlerBuilder
#handle, #handle_request, #handle_response
Constructor Details
#initialize(options) ⇒ Client
Returns a new instance of Client.
Parameters:
- options (Hash)
Options Hash (options):
-
:plugins
(Array<Seahorse::Client::Plugin>)
— default:
[]]
—
A list of plugins to apply to the client. Each plugin is either a class name or an instance of a plugin class.
-
:credentials
(required, Aws::CredentialProvider)
—
Your AWS credentials. This can be an instance of any one of the following classes:
Aws::Credentials
- Used for configuring static, non-refreshing credentials.Aws::SharedCredentials
- Used for loading static credentials from a shared file, such as~/.aws/config
.Aws::AssumeRoleCredentials
- Used when you need to assume a role.Aws::AssumeRoleWebIdentityCredentials
- Used when you need to assume a role after providing credentials via the web.Aws::SSOCredentials
- Used for loading credentials from AWS SSO using an access token generated fromaws login
.Aws::ProcessCredentials
- Used for loading credentials from a process that outputs to stdout.Aws::InstanceProfileCredentials
- Used for loading credentials from an EC2 IMDS on an EC2 instance.Aws::ECSCredentials
- Used for loading credentials from instances running in ECS.Aws::CognitoIdentityCredentials
- Used for loading credentials from the Cognito Identity service.
When
:credentials
are not configured directly, the following locations will be searched for credentials:Aws.config[:credentials]
- The
:access_key_id
,:secret_access_key
,:session_token
, and:account_id
options. - ENV['AWS_ACCESS_KEY_ID'], ENV['AWS_SECRET_ACCESS_KEY'], ENV['AWS_SESSION_TOKEN'], and ENV['AWS_ACCOUNT_ID']
~/.aws/credentials
~/.aws/config
- EC2/ECS IMDS instance profile - When used by default, the timeouts
are very aggressive. Construct and pass an instance of
Aws::InstanceProfileCredentials
orAws::ECSCredentials
to enable retries and extended timeouts. Instance profile credential fetching can be disabled by setting ENV['AWS_EC2_METADATA_DISABLED'] to true.
- :access_key_id (String)
- :account_id (String)
-
:adaptive_retry_wait_to_fill
(Boolean)
— default:
true
—
Used only in
adaptive
retry mode. When true, the request will sleep until there is sufficent client side capacity to retry the request. When false, the request will raise aRetryCapacityNotAvailableError
and will not retry instead of sleeping. -
:client_side_monitoring
(Boolean)
— default:
false
—
When
true
, client-side metrics will be collected for all API requests from this client. -
:client_side_monitoring_client_id
(String)
— default:
""
—
Allows you to provide an identifier for this client which will be attached to all generated client side metrics. Defaults to an empty string.
-
:client_side_monitoring_host
(String)
— default:
"127.0.0.1"
—
Allows you to specify the DNS hostname or IPv4 or IPv6 address that the client side monitoring agent is running on, where client metrics will be published via UDP.
-
:client_side_monitoring_port
(Integer)
— default:
31000
—
Required for publishing client metrics. The port that the client side monitoring agent is running on, where client metrics will be published via UDP.
-
:client_side_monitoring_publisher
(Aws::ClientSideMonitoring::Publisher)
— default:
Aws::ClientSideMonitoring::Publisher
—
Allows you to provide a custom client-side monitoring publisher class. By default, will use the Client Side Monitoring Agent Publisher.
-
:convert_params
(Boolean)
— default:
true
—
When
true
, an attempt is made to coerce request parameters into the required types. -
:correct_clock_skew
(Boolean)
— default:
true
—
Used only in
standard
and adaptive retry modes. Specifies whether to apply a clock skew correction and retry requests with skewed client clocks. -
:defaults_mode
(String)
— default:
"legacy"
—
See DefaultsModeConfiguration for a list of the accepted modes and the configuration defaults that are included.
-
:disable_request_compression
(Boolean)
— default:
false
—
When set to 'true' the request body will not be compressed for supported operations.
-
:log_formatter
(Aws::Log::Formatter)
— default:
Aws::Log::Formatter.default
—
The log formatter.
-
:log_level
(Symbol)
— default:
:info
—
The log level to send messages to the
:logger
at. -
:logger
(Logger)
—
The Logger instance to send log messages to. If this option is not set, logging will be disabled.
-
:max_attempts
(Integer)
— default:
3
—
An integer representing the maximum number attempts that will be made for a single request, including the initial attempt. For example, setting this value to 5 will result in a request being retried up to 4 times. Used in
standard
andadaptive
retry modes. -
:profile
(String)
— default:
"default"
—
Used when loading credentials from the shared credentials file at HOME/.aws/credentials. When not specified, 'default' is used.
-
:request_checksum_calculation
(String)
— default:
"when_supported"
—
Determines when a checksum will be calculated for request payloads. Values are:
when_supported
- (default) When set, a checksum will be calculated for all request payloads of operations modeled with thehttpChecksum
trait whererequestChecksumRequired
istrue
and/or arequestAlgorithmMember
is modeled.when_required
- When set, a checksum will only be calculated for request payloads of operations modeled with thehttpChecksum
trait whererequestChecksumRequired
istrue
or where arequestAlgorithmMember
is modeled and supplied.
-
:request_min_compression_size_bytes
(Integer)
— default:
10240
—
The minimum size in bytes that triggers compression for request bodies. The value must be non-negative integer value between 0 and 10485780 bytes inclusive.
-
:response_checksum_validation
(String)
— default:
"when_supported"
—
Determines when checksum validation will be performed on response payloads. Values are:
when_supported
- (default) When set, checksum validation is performed on all response payloads of operations modeled with thehttpChecksum
trait whereresponseAlgorithms
is modeled, except when no modeled checksum algorithms are supported.when_required
- When set, checksum validation is not performed on response payloads of operations unless the checksum algorithm is supported and therequestValidationModeMember
member is set toENABLED
.
-
:retry_backoff
(Proc)
—
A proc or lambda used for backoff. Defaults to 2**retries * retry_base_delay. This option is only used in the
legacy
retry mode. -
:retry_base_delay
(Float)
— default:
0.3
—
The base delay in seconds used by the default backoff function. This option is only used in the
legacy
retry mode. -
:retry_jitter
(Symbol)
— default:
:none
—
A delay randomiser function used by the default backoff function. Some predefined functions can be referenced by name - :none, :equal, :full, otherwise a Proc that takes and returns a number. This option is only used in the
legacy
retry mode.@see https://www.awsarchitectureblog.com/2015/03/backoff.html
-
:retry_limit
(Integer)
— default:
3
—
The maximum number of times to retry failed requests. Only ~ 500 level server errors and certain ~ 400 level client errors are retried. Generally, these are throttling errors, data checksum errors, networking errors, timeout errors, auth errors, endpoint discovery, and errors from expired credentials. This option is only used in the
legacy
retry mode. -
:retry_max_delay
(Integer)
— default:
0
—
The maximum number of seconds to delay between retries (0 for no limit) used by the default backoff function. This option is only used in the
legacy
retry mode. -
:retry_mode
(String)
— default:
"legacy"
—
Specifies which retry algorithm to use. Values are:
legacy
- The pre-existing retry behavior. This is default value if no retry mode is provided.standard
- A standardized set of retry rules across the AWS SDKs. This includes support for retry quotas, which limit the number of unsuccessful retries a client can make.adaptive
- An experimental retry mode that includes all the functionality ofstandard
mode along with automatic client side throttling. This is a provisional mode that may change behavior in the future.
-
:sdk_ua_app_id
(String)
—
A unique and opaque application ID that is appended to the User-Agent header as app/sdk_ua_app_id. It should have a maximum length of 50. This variable is sourced from environment variable AWS_SDK_UA_APP_ID or the shared config profile attribute sdk_ua_app_id.
- :secret_access_key (String)
- :session_token (String)
-
:sigv4_region
(String)
—
Only needed when sending authenticated/signed requests to a Cloud Search domain and the endpoint does not contain the region name.
-
:stub_responses
(Boolean)
— default:
false
—
Causes the client to return stubbed responses. By default fake responses are generated and returned. You can specify the response data to return or errors to raise by calling Aws::ClientStubs#stub_responses. See Aws::ClientStubs for more information.
Please note When response stubbing is enabled, no HTTP requests are made, and retries are disabled.
-
:telemetry_provider
(Aws::Telemetry::TelemetryProviderBase)
— default:
Aws::Telemetry::NoOpTelemetryProvider
—
Allows you to provide a telemetry provider, which is used to emit telemetry data. By default, uses
NoOpTelemetryProvider
which will not record or emit any telemetry data. The SDK supports the following telemetry providers:- OpenTelemetry (OTel) - To use the OTel provider, install and require the
opentelemetry-sdk
gem and then, pass in an instance of aAws::Telemetry::OTelProvider
for telemetry provider.
- OpenTelemetry (OTel) - To use the OTel provider, install and require the
-
:token_provider
(Aws::TokenProvider)
—
A Bearer Token Provider. This can be an instance of any one of the following classes:
Aws::StaticTokenProvider
- Used for configuring static, non-refreshing tokens.Aws::SSOTokenProvider
- Used for loading tokens from AWS SSO using an access token generated fromaws login
.
When
:token_provider
is not configured directly, theAws::TokenProviderChain
will be used to search for tokens configured for your profile in shared configuration files. -
:validate_params
(Boolean)
— default:
true
—
When
true
, request parameters are validated before sending the request. -
:endpoint
(String, URI::HTTPS, URI::HTTP)
—
Normally you should not configure the
:endpoint
option directly. This is normally constructed from the:region
option. Configuring:endpoint
is normally reserved for connecting to test or custom endpoints. The endpoint should be a URI formatted like:'http://example.com' 'https://example.com' 'http://example.com:123'
-
:http_continue_timeout
(Float)
— default:
1
—
The number of seconds to wait for a 100-continue response before sending the request body. This option has no effect unless the request has "Expect" header set to "100-continue". Defaults to
nil
which disables this behaviour. This value can safely be set per request on the session. -
:http_idle_timeout
(Float)
— default:
5
—
The number of seconds a connection is allowed to sit idle before it is considered stale. Stale connections are closed and removed from the pool before making a request.
-
:http_open_timeout
(Float)
— default:
15
—
The default number of seconds to wait for response data. This value can safely be set per-request on the session.
-
:http_proxy
(URI::HTTP, String)
—
A proxy to send requests through. Formatted like 'http://proxy.com:123'.
-
:http_read_timeout
(Float)
— default:
60
—
The default number of seconds to wait for response data. This value can safely be set per-request on the session.
-
:http_wire_trace
(Boolean)
— default:
false
—
When
true
, HTTP debug output will be sent to the:logger
. -
:on_chunk_received
(Proc)
—
When a Proc object is provided, it will be used as callback when each chunk of the response body is received. It provides three arguments: the chunk, the number of bytes received, and the total number of bytes in the response (or nil if the server did not send a
content-length
). -
:on_chunk_sent
(Proc)
—
When a Proc object is provided, it will be used as callback when each chunk of the request body is sent. It provides three arguments: the chunk, the number of bytes read from the body, and the total number of bytes in the body.
-
:raise_response_errors
(Boolean)
— default:
true
—
When
true
, response errors are raised. -
:ssl_ca_bundle
(String)
—
Full path to the SSL certificate authority bundle file that should be used when verifying peer certificates. If you do not pass
:ssl_ca_bundle
or:ssl_ca_directory
the the system default will be used if available. -
:ssl_ca_directory
(String)
—
Full path of the directory that contains the unbundled SSL certificate authority files for verifying peer certificates. If you do not pass
:ssl_ca_bundle
or:ssl_ca_directory
the the system default will be used if available. -
:ssl_ca_store
(String)
—
Sets the X509::Store to verify peer certificate.
-
:ssl_cert
(OpenSSL::X509::Certificate)
—
Sets a client certificate when creating http connections.
-
:ssl_key
(OpenSSL::PKey)
—
Sets a client key when creating http connections.
-
:ssl_timeout
(Float)
—
Sets the SSL timeout in seconds
-
:ssl_verify_peer
(Boolean)
— default:
true
—
When
true
, SSL peer certificates are verified when establishing a connection.
406 407 408 |
# File 'gems/aws-sdk-cloudsearchdomain/lib/aws-sdk-cloudsearchdomain/client.rb', line 406 def initialize(*args) super end |
Instance Method Details
#search(params = {}) ⇒ Types::SearchResponse
Retrieves a list of documents that match the specified search criteria. How you specify the search criteria depends on which query parser you use. Amazon CloudSearch supports four query parsers:
simple
: search alltext
andtext-array
fields for the specified string. Search for phrases, individual terms, and prefixes.structured
: search specific fields, construct compound queries using Boolean operators, and use advanced features such as term boosting and proximity searching.lucene
: specify search criteria using the Apache Lucene query parser syntax.dismax
: specify search criteria using the simplified subset of the Apache Lucene query parser syntax defined by the DisMax query parser.
For more information, see Searching Your Data in the Amazon CloudSearch Developer Guide.
The endpoint for submitting Search
requests is domain-specific. You
submit search requests to a domain's search endpoint. To get the
search endpoint for your domain, use the Amazon CloudSearch
configuration service DescribeDomains
action. A domain's endpoints
are also displayed on the domain dashboard in the Amazon CloudSearch
console.
Examples:
Request syntax with placeholder values
Request syntax with placeholder values
resp = client.search({
cursor: "Cursor",
expr: "Expr",
facet: "Facet",
filter_query: "FilterQuery",
highlight: "Highlight",
partial: false,
query: "Query", # required
query_options: "QueryOptions",
query_parser: "simple", # accepts simple, structured, lucene, dismax
return: "Return",
size: 1,
sort: "Sort",
start: 1,
stats: "Stat",
})
Response structure
Response structure
resp.status.timems #=> Integer
resp.status.rid #=> String
resp.hits.found #=> Integer
resp.hits.start #=> Integer
resp.hits.cursor #=> String
resp.hits.hit #=> Array
resp.hits.hit[0].id #=> String
resp.hits.hit[0].fields #=> Hash
resp.hits.hit[0].fields["String"] #=> Array
resp.hits.hit[0].fields["String"][0] #=> String
resp.hits.hit[0].exprs #=> Hash
resp.hits.hit[0].exprs["String"] #=> String
resp.hits.hit[0].highlights #=> Hash
resp.hits.hit[0].highlights["String"] #=> String
resp.facets #=> Hash
resp.facets["String"].buckets #=> Array
resp.facets["String"].buckets[0].value #=> String
resp.facets["String"].buckets[0].count #=> Integer
resp.stats #=> Hash
resp.stats["String"].min #=> String
resp.stats["String"].max #=> String
resp.stats["String"].count #=> Integer
resp.stats["String"].missing #=> Integer
resp.stats["String"].sum #=> Float
resp.stats["String"].sum_of_squares #=> Float
resp.stats["String"].mean #=> String
resp.stats["String"].stddev #=> Float
Parameters:
-
params
(Hash)
(defaults to: {})
—
({})
Options Hash (params):
-
:cursor
(String)
—
Retrieves a cursor value you can use to page through large result sets. Use the
size
parameter to control the number of hits to include in each response. You can specify either thecursor
orstart
parameter in a request; they are mutually exclusive. To get the first cursor, set the cursor value toinitial
. In subsequent requests, specify the cursor value returned in the hits section of the response.For more information, see Paginating Results in the Amazon CloudSearch Developer Guide.
-
:expr
(String)
—
Defines one or more numeric expressions that can be used to sort results or specify search or filter criteria. You can also specify expressions as return fields.
You specify the expressions in JSON using the form
{"EXPRESSIONNAME":"EXPRESSION"}
. You can define and use multiple expressions in a search request. For example:{"expression1":"_score*rating", "expression2":"(1/rank)*year"}
For information about the variables, operators, and functions you can use in expressions, see Writing Expressions in the Amazon CloudSearch Developer Guide.
-
:facet
(String)
—
Specifies one or more fields for which to get facet information, and options that control how the facet information is returned. Each specified field must be facet-enabled in the domain configuration. The fields and options are specified in JSON using the form
{"FIELD":{"OPTION":VALUE,"OPTION:"STRING"},"FIELD":{"OPTION":VALUE,"OPTION":"STRING"}}
.You can specify the following faceting options:
buckets
specifies an array of the facet values or ranges to count. Ranges are specified using the same syntax that you use to search for a range of values. For more information, see Searching for a Range of Values in the Amazon CloudSearch Developer Guide. Buckets are returned in the order they are specified in the request. Thesort
andsize
options are not valid if you specifybuckets
.size
specifies the maximum number of facets to include in the results. By default, Amazon CloudSearch returns counts for the top- The
size
parameter is only valid when you specify thesort
option; it cannot be used in conjunction withbuckets
.
- The
sort
specifies how you want to sort the facets in the results:bucket
orcount
. Specifybucket
to sort alphabetically or numerically by facet value (in ascending order). Specifycount
to sort by the facet counts computed for each facet value (in descending order). To retrieve facet counts for particular values or ranges of values, use thebuckets
option instead ofsort
.
If no facet options are specified, facet counts are computed for all field values, the facets are sorted by facet count, and the top 10 facets are returned in the results.
To count particular buckets of values, use the
buckets
option. For example, the following request uses thebuckets
option to calculate and return facet counts by decade.{"year":{"buckets":["[1970,1979]","[1980,1989]","[1990,1999]","[2000,2009]","[2010,}"]}}
To sort facets by facet count, use the
count
option. For example, the following request sets thesort
option tocount
to sort the facet values by facet count, with the facet values that have the most matching documents listed first. Setting thesize
option to 3 returns only the top three facet values.{"year":{"sort":"count","size":3}}
To sort the facets by value, use the
bucket
option. For example, the following request sets thesort
option tobucket
to sort the facet values numerically by year, with earliest year listed first.{"year":{"sort":"bucket"}}
For more information, see Getting and Using Facet Information in the Amazon CloudSearch Developer Guide.
-
:filter_query
(String)
—
Specifies a structured query that filters the results of a search without affecting how the results are scored and sorted. You use
filterQuery
in conjunction with thequery
parameter to filter the documents that match the constraints specified in thequery
parameter. Specifying a filter controls only which matching documents are included in the results, it has no effect on how they are scored and sorted. ThefilterQuery
parameter supports the full structured query syntax.For more information about using filters, see Filtering Matching Documents in the Amazon CloudSearch Developer Guide.
-
:highlight
(String)
—
Retrieves highlights for matches in the specified
text
ortext-array
fields. Each specified field must be highlight enabled in the domain configuration. The fields and options are specified in JSON using the form{"FIELD":{"OPTION":VALUE,"OPTION:"STRING"},"FIELD":{"OPTION":VALUE,"OPTION":"STRING"}}
.You can specify the following highlight options:
format
: specifies the format of the data in the text field:text
orhtml
. When data is returned as HTML, all non-alphanumeric characters are encoded. The default ishtml
.max_phrases
: specifies the maximum number of occurrences of the search term(s) you want to highlight. By default, the first occurrence is highlighted.pre_tag
: specifies the string to prepend to an occurrence of a search term. The default for HTML highlights is<em>
. The default for text highlights is*
.post_tag
: specifies the string to append to an occurrence of a search term. The default for HTML highlights is</em>
. The default for text highlights is*
.
If no highlight options are specified for a field, the returned field text is treated as HTML and the first match is highlighted with emphasis tags:
<em>search-term</em>
.For example, the following request retrieves highlights for the
actors
andtitle
fields.{ "actors": {}, "title": {"format": "text","max_phrases": 2,"pre_tag": "","post_tag": ""} }
-
:partial
(Boolean)
—
Enables partial results to be returned if one or more index partitions are unavailable. When your search index is partitioned across multiple search instances, by default Amazon CloudSearch only returns results if every partition can be queried. This means that the failure of a single search instance can result in 5xx (internal server) errors. When you enable partial results, Amazon CloudSearch returns whatever results are available and includes the percentage of documents searched in the search results (percent-searched). This enables you to more gracefully degrade your users' search experience. For example, rather than displaying no results, you could display the partial results and a message indicating that the results might be incomplete due to a temporary system outage.
-
:query
(required, String)
—
Specifies the search criteria for the request. How you specify the search criteria depends on the query parser used for the request and the parser options specified in the
queryOptions
parameter. By default, thesimple
query parser is used to process requests. To use thestructured
,lucene
, ordismax
query parser, you must also specify thequeryParser
parameter.For more information about specifying search criteria, see Searching Your Data in the Amazon CloudSearch Developer Guide.
-
:query_options
(String)
—
Configures options for the query parser specified in the
queryParser
parameter. You specify the options in JSON using the following form{"OPTION1":"VALUE1","OPTION2":VALUE2"..."OPTIONN":"VALUEN"}.
The options you can configure vary according to which parser you use:
defaultOperator
: The default operator used to combine individual terms in the search string. For example:defaultOperator: 'or'
. For thedismax
parser, you specify a percentage that represents the percentage of terms in the search string (rounded down) that must match, rather than a default operator. A value of0%
is the equivalent to OR, and a value of100%
is equivalent to AND. The percentage must be specified as a value in the range 0-100 followed by the percent (%) symbol. For example,defaultOperator: 50%
. Valid values:and
,or
, a percentage in the range 0%-100% (dismax
). Default:and
(simple
,structured
,lucene
) or100
(dismax
). Valid for:simple
,structured
,lucene
, anddismax
.fields
: An array of the fields to search when no fields are specified in a search. If no fields are specified in a search and this option is not specified, all text and text-array fields are searched. You can specify a weight for each field to control the relative importance of each field when Amazon CloudSearch calculates relevance scores. To specify a field weight, append a caret (^
) symbol and the weight to the field name. For example, to boost the importance of thetitle
field over thedescription
field you could specify:"fields":["title^5","description"]
. Valid values: The name of any configured field and an optional numeric value greater than zero. Default: Alltext
andtext-array
fields. Valid for:simple
,structured
,lucene
, anddismax
.operators
: An array of the operators or special characters you want to disable for the simple query parser. If you disable theand
,or
, ornot
operators, the corresponding operators (+
,|
,-
) have no special meaning and are dropped from the search string. Similarly, disablingprefix
disables the wildcard operator (*
) and disablingphrase
disables the ability to search for phrases by enclosing phrases in double quotes. Disabling precedence disables the ability to control order of precedence using parentheses. Disablingnear
disables the ability to use the ~ operator to perform a sloppy phrase search. Disabling thefuzzy
operator disables the ability to use the ~ operator to perform a fuzzy search.escape
disables the ability to use a backslash (`) to escape special characters within the search string. Disabling whitespace is an advanced option that prevents the parser from tokenizing on whitespace, which can be useful for Vietnamese. (It prevents Vietnamese words from being split incorrectly.) For example, you could disable all operators other than the phrase operator to support just simple term and phrase queries:
"operators":["and","not","or", "prefix"]. Valid values:
and,
escape,
fuzzy,
near,
not,
or,
phrase,
precedence,
prefix,
whitespace. Default: All operators and special characters are enabled. Valid for:
simple`.phraseFields
: An array of thetext
ortext-array
fields you want to use for phrase searches. When the terms in the search string appear in close proximity within a field, the field scores higher. You can specify a weight for each field to boost that score. ThephraseSlop
option controls how much the matches can deviate from the search string and still be boosted. To specify a field weight, append a caret (^
) symbol and the weight to the field name. For example, to boost phrase matches in thetitle
field over theabstract
field, you could specify:"phraseFields":["title^3", "plot"]
Valid values: The name of anytext
ortext-array
field and an optional numeric value greater than zero. Default: No fields. If you don't specify any fields withphraseFields
, proximity scoring is disabled even ifphraseSlop
is specified. Valid for:dismax
.phraseSlop
: An integer value that specifies how much matches can deviate from the search phrase and still be boosted according to the weights specified in thephraseFields
option; for example,phraseSlop: 2
. You must also specifyphraseFields
to enable proximity scoring. Valid values: positive integers. Default: 0. Valid for:dismax
.explicitPhraseSlop
: An integer value that specifies how much a match can deviate from the search phrase when the phrase is enclosed in double quotes in the search string. (Phrases that exceed this proximity distance are not considered a match.) For example, to specify a slop of three for dismax phrase queries, you would specify"explicitPhraseSlop":3
. Valid values: positive integers. Default:- Valid for:
dismax
.
- Valid for:
tieBreaker
: When a term in the search string is found in a document's field, a score is calculated for that field based on how common the word is in that field compared to other documents. If the term occurs in multiple fields within a document, by default only the highest scoring field contributes to the document's overall score. You can specify atieBreaker
value to enable the matches in lower-scoring fields to contribute to the document's score. That way, if two documents have the same max field score for a particular term, the score for the document that has matches in more fields will be higher. The formula for calculating the score with a tieBreaker is(max field score) + (tieBreaker) * (sum of the scores for the rest of the matching fields)
. SettieBreaker
to 0 to disregard all but the highest scoring field (pure max):"tieBreaker":0
. Set to 1 to sum the scores from all fields (pure sum):"tieBreaker":1
. Valid values: 0.0 to 1.0. Default: 0.0. Valid for:dismax
.
-
:query_parser
(String)
—
Specifies which query parser to use to process the request. If
queryParser
is not specified, Amazon CloudSearch uses thesimple
query parser.Amazon CloudSearch supports four query parsers:
simple
: perform simple searches oftext
andtext-array
fields. By default, thesimple
query parser searches alltext
andtext-array
fields. You can specify which fields to search by with thequeryOptions
parameter. If you prefix a search term with a plus sign (+) documents must contain the term to be considered a match. (This is the default, unless you configure the default operator with thequeryOptions
parameter.) You can use the-
(NOT),|
(OR), and*
(wildcard) operators to exclude particular terms, find results that match any of the specified terms, or search for a prefix. To search for a phrase rather than individual terms, enclose the phrase in double quotes. For more information, see Searching for Text in the Amazon CloudSearch Developer Guide.structured
: perform advanced searches by combining multiple expressions to define the search criteria. You can also search within particular fields, search for values and ranges of values, and use advanced options such as term boosting,matchall
, andnear
. For more information, see Constructing Compound Queries in the Amazon CloudSearch Developer Guide.lucene
: search using the Apache Lucene query parser syntax. For more information, see Apache Lucene Query Parser Syntax.dismax
: search using the simplified subset of the Apache Lucene query parser syntax defined by the DisMax query parser. For more information, see DisMax Query Parser Syntax.
-
:return
(String)
—
Specifies the field and expression values to include in the response. Multiple fields or expressions are specified as a comma-separated list. By default, a search response includes all return enabled fields (
_all_fields
). To return only the document IDs for the matching documents, specify_no_fields
. To retrieve the relevance score calculated for each document, specify_score
. -
:size
(Integer)
—
Specifies the maximum number of search hits to include in the response.
-
:sort
(String)
—
Specifies the fields or custom expressions to use to sort the search results. Multiple fields or expressions are specified as a comma-separated list. You must specify the sort direction (
asc
ordesc
) for each field; for example,year desc,title asc
. To use a field to sort results, the field must be sort-enabled in the domain configuration. Array type fields cannot be used for sorting. If nosort
parameter is specified, results are sorted by their default relevance scores in descending order:_score desc
. You can also sort by document ID (_id asc
) and version (_version desc
).For more information, see Sorting Results in the Amazon CloudSearch Developer Guide.
-
:start
(Integer)
—
Specifies the offset of the first search hit you want to return. Note that the result set is zero-based; the first result is at index 0. You can specify either the
start
orcursor
parameter in a request, they are mutually exclusive.For more information, see Paginating Results in the Amazon CloudSearch Developer Guide.
-
:stats
(String)
—
Specifies one or more fields for which to get statistics information. Each specified field must be facet-enabled in the domain configuration. The fields are specified in JSON using the form:
{"FIELD-A":{},"FIELD-B":{}}
There are currently no options supported for statistics.
Returns:
861 862 863 864 |
# File 'gems/aws-sdk-cloudsearchdomain/lib/aws-sdk-cloudsearchdomain/client.rb', line 861 def search(params = {}, options = {}) req = build_request(:search, params) req.send_request(options) end |
#suggest(params = {}) ⇒ Types::SuggestResponse
Retrieves autocomplete suggestions for a partial query string. You can use suggestions enable you to display likely matches before users finish typing. In Amazon CloudSearch, suggestions are based on the contents of a particular text field. When you request suggestions, Amazon CloudSearch finds all of the documents whose values in the suggester field start with the specified query string. The beginning of the field must match the query string to be considered a match.
For more information about configuring suggesters and retrieving suggestions, see Getting Suggestions in the Amazon CloudSearch Developer Guide.
The endpoint for submitting Suggest
requests is domain-specific. You
submit suggest requests to a domain's search endpoint. To get the
search endpoint for your domain, use the Amazon CloudSearch
configuration service DescribeDomains
action. A domain's endpoints
are also displayed on the domain dashboard in the Amazon CloudSearch
console.
Examples:
Request syntax with placeholder values
Request syntax with placeholder values
resp = client.suggest({
query: "Query", # required
suggester: "Suggester", # required
size: 1,
})
Response structure
Response structure
resp.status.timems #=> Integer
resp.status.rid #=> String
resp.suggest.query #=> String
resp.suggest.found #=> Integer
resp.suggest.suggestions #=> Array
resp.suggest.suggestions[0].suggestion #=> String
resp.suggest.suggestions[0].score #=> Integer
resp.suggest.suggestions[0].id #=> String
Parameters:
-
params
(Hash)
(defaults to: {})
—
({})
Options Hash (params):
-
:query
(required, String)
—
Specifies the string for which you want to get suggestions.
-
:suggester
(required, String)
—
Specifies the name of the suggester to use to find suggested matches.
-
:size
(Integer)
—
Specifies the maximum number of suggestions to return.
Returns:
924 925 926 927 |
# File 'gems/aws-sdk-cloudsearchdomain/lib/aws-sdk-cloudsearchdomain/client.rb', line 924 def suggest(params = {}, options = {}) req = build_request(:suggest, params) req.send_request(options) end |
#upload_documents(params = {}) ⇒ Types::UploadDocumentsResponse
Posts a batch of documents to a search domain for indexing. A document batch is a collection of add and delete operations that represent the documents you want to add, update, or delete from your domain. Batches can be described in either JSON or XML. Each item that you want Amazon CloudSearch to return as a search result (such as a product) is represented as a document. Every document has a unique ID and one or more fields that contain the data that you want to search and return in results. Individual documents cannot contain more than 1 MB of data. The entire batch cannot exceed 5 MB. To get the best possible upload performance, group add and delete operations in batches that are close the 5 MB limit. Submitting a large volume of single-document batches can overload a domain's document service.
The endpoint for submitting UploadDocuments
requests is
domain-specific. To get the document endpoint for your domain, use the
Amazon CloudSearch configuration service DescribeDomains
action. A
domain's endpoints are also displayed on the domain dashboard in the
Amazon CloudSearch console.
For more information about formatting your data for Amazon CloudSearch, see Preparing Your Data in the Amazon CloudSearch Developer Guide. For more information about uploading data for indexing, see Uploading Data in the Amazon CloudSearch Developer Guide.
Examples:
Request syntax with placeholder values
Request syntax with placeholder values
resp = client.upload_documents({
documents: "data", # required
content_type: "application/json", # required, accepts application/json, application/xml
})
Response structure
Response structure
resp.status #=> String
resp.adds #=> Integer
resp.deletes #=> Integer
resp.warnings #=> Array
resp.warnings[0].message #=> String
Parameters:
-
params
(Hash)
(defaults to: {})
—
({})
Options Hash (params):
-
:documents
(required, String, StringIO, File)
—
A batch of documents formatted in JSON or HTML.
-
:content_type
(required, String)
—
The format of the batch you are uploading. Amazon CloudSearch supports two document batch formats:
- application/json
- application/xml
Returns:
993 994 995 996 |
# File 'gems/aws-sdk-cloudsearchdomain/lib/aws-sdk-cloudsearchdomain/client.rb', line 993 def upload_documents(params = {}, options = {}) req = build_request(:upload_documents, params) req.send_request(options) end |