Class InferenceConfiguration
- java.lang.Object
  - software.amazon.awssdk.services.bedrockagentruntime.model.InferenceConfiguration
- All Implemented Interfaces:
Serializable, SdkPojo, ToCopyableBuilder<InferenceConfiguration.Builder,InferenceConfiguration>
@Generated("software.amazon.awssdk:codegen") public final class InferenceConfiguration extends Object implements SdkPojo, Serializable, ToCopyableBuilder<InferenceConfiguration.Builder,InferenceConfiguration>
Specifications about the inference parameters that were provided alongside the prompt. These are specified in the PromptOverrideConfiguration object that was set when the agent was created or updated. For more information, see Inference parameters for foundation models.
- See Also:
- Serialized Form
-
Nested Class Summary
Nested Classes (modifier and type, class):
- static interface InferenceConfiguration.Builder
-
Method Summary
Methods (modifier and type, method, description):
- static InferenceConfiguration.Builder builder()
- boolean equals(Object obj)
- boolean equalsBySdkFields(Object obj)
- <T> Optional<T> getValueForField(String fieldName, Class<T> clazz)
- int hashCode()
- boolean hasStopSequences(): For responses, this returns true if the service returned a value for the StopSequences property.
- Integer maximumLength(): The maximum number of tokens allowed in the generated response.
- Map<String,SdkField<?>> sdkFieldNameToField()
- List<SdkField<?>> sdkFields()
- static Class<? extends InferenceConfiguration.Builder> serializableBuilderClass()
- List<String> stopSequences(): A list of stop sequences.
- Float temperature(): The likelihood of the model selecting higher-probability options while generating a response.
- InferenceConfiguration.Builder toBuilder()
- Integer topK(): While generating a response, the model determines the probability of the following token at each point of generation.
- Float topP(): While generating a response, the model determines the probability of the following token at each point of generation.
- String toString(): Returns a string representation of this object.
-
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
-
Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Detail
-
maximumLength
public final Integer maximumLength()
The maximum number of tokens allowed in the generated response.
- Returns:
- The maximum number of tokens allowed in the generated response.
-
hasStopSequences
public final boolean hasStopSequences()
For responses, this returns true if the service returned a value for the StopSequences property. This DOES NOT check that the value is non-empty (for which, you should check the isEmpty() method on the property). This is useful because the SDK will never return a null collection or map, but you may need to differentiate between the service returning nothing (or null) and the service returning an empty collection or map. For requests, this returns true if a value for the property was specified in the request builder, and false if a value was not specified.
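The absent-versus-empty distinction described above can be modeled without the SDK. The following is a minimal sketch, assuming a hypothetical stand-in class named StopSequencesHolder (it is not part of the SDK) that mimics the documented contract: the accessor never returns null, and a separate flag reports whether a value was present at all.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in (NOT an SDK class) that mimics the documented contract.
final class StopSequencesHolder {
    // null means "no value was set or returned"
    private final List<String> stopSequences;

    StopSequencesHolder(List<String> stopSequences) {
        this.stopSequences = stopSequences == null
                ? null
                : Collections.unmodifiableList(new ArrayList<>(stopSequences));
    }

    // True when a value was supplied, even if that value is an empty list.
    boolean hasStopSequences() {
        return stopSequences != null;
    }

    // Never returns null; an absent value is surfaced as an empty list.
    List<String> stopSequences() {
        return stopSequences == null ? Collections.<String>emptyList() : stopSequences;
    }

    // Classifies the three cases a caller may need to tell apart.
    static String describe(StopSequencesHolder holder) {
        if (!holder.hasStopSequences()) {
            return "absent";
        }
        return holder.stopSequences().isEmpty() ? "empty" : "populated";
    }
}
```

With a real InferenceConfiguration instance, the same two-step check would presumably call hasStopSequences() first and stopSequences().isEmpty() second.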
-
stopSequences
public final List<String> stopSequences()
A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.
Attempts to modify the collection returned by this method will result in an UnsupportedOperationException.
This method will never return null. If you would like to know whether the service returned this field (so that you can differentiate between null and empty), you can use the hasStopSequences() method.
- Returns:
- A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.
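To illustrate what a stop sequence does, here is a small sketch that cuts a piece of text at the earliest occurrence of any stop sequence. The class and method names are hypothetical. A real model stops during generation rather than trimming afterwards, and whether the stop sequence itself appears in the output depends on the model; this sketch assumes it is excluded.

```java
import java.util.Arrays;
import java.util.List;

// Illustrative only: post-hoc truncation standing in for generation-time stopping.
final class StopSequenceSketch {
    // Returns the text cut at the earliest occurrence of any stop sequence.
    static String truncateAtStop(String text, List<String> stopSequences) {
        int cut = text.length();
        for (String stop : stopSequences) {
            if (stop.isEmpty()) {
                continue; // an empty stop sequence would match at index 0
            }
            int index = text.indexOf(stop);
            if (index >= 0 && index < cut) {
                cut = index;
            }
        }
        return text.substring(0, cut);
    }

    public static void main(String[] args) {
        List<String> stops = Arrays.asList("\n\nHuman:", "</answer>");
        System.out.println(truncateAtStop("Paris.</answer> trailing text", stops));
    }
}
```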
-
temperature
public final Float temperature()
The likelihood of the model selecting higher-probability options while generating a response. A lower value makes the model more likely to choose higher-probability options, while a higher value makes the model more likely to choose lower-probability options.
- Returns:
- The likelihood of the model selecting higher-probability options while generating a response. A lower value makes the model more likely to choose higher-probability options, while a higher value makes the model more likely to choose lower-probability options.
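The effect of temperature can be sketched with the common formulation in which token scores (logits) are divided by the temperature before a softmax. This page does not state how the service applies the value, so treat the formulation, the class name, and the sample logits as assumptions made for illustration.

```java
import java.util.Arrays;

// Illustrative only: temperature-scaled softmax over made-up logits.
final class TemperatureSketch {
    // Converts logits to probabilities after dividing each by the temperature.
    static double[] softmax(double[] logits, double temperature) {
        if (temperature <= 0) {
            throw new IllegalArgumentException("temperature must be positive");
        }
        double max = Double.NEGATIVE_INFINITY;
        for (double logit : logits) {
            max = Math.max(max, logit / temperature);
        }
        double[] probs = new double[logits.length];
        double sum = 0.0;
        for (int i = 0; i < logits.length; i++) {
            // Subtracting the maximum keeps Math.exp numerically stable.
            probs[i] = Math.exp(logits[i] / temperature - max);
            sum += probs[i];
        }
        for (int i = 0; i < probs.length; i++) {
            probs[i] /= sum;
        }
        return probs;
    }

    public static void main(String[] args) {
        double[] logits = {2.0, 1.0, 0.0};
        System.out.println("T=0.5: " + Arrays.toString(softmax(logits, 0.5)));
        System.out.println("T=2.0: " + Arrays.toString(softmax(logits, 2.0)));
    }
}
```

Under this formulation, the lower temperature concentrates probability on the highest-scoring token, while the higher temperature spreads it more evenly across the alternatives.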
-
topK
public final Integer topK()
While generating a response, the model determines the probability of the following token at each point of generation. The value that you set for topK is the number of most-likely candidates from which the model chooses the next token in the sequence. For example, if you set topK to 50, the model selects the next token from among the top 50 most likely choices.
- Returns:
- While generating a response, the model determines the probability of the following token at each point of generation. The value that you set for topK is the number of most-likely candidates from which the model chooses the next token in the sequence. For example, if you set topK to 50, the model selects the next token from among the top 50 most likely choices.
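The candidate selection described for topK can be sketched as follows. The class name, method name, and probabilities are hypothetical; the sketch only shows which token indices remain eligible, not how the service samples among them.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Illustrative only: selects the k most likely token indices.
final class TopKSketch {
    // Returns the indices of the k highest-probability tokens, most likely first.
    static List<Integer> topKIndices(double[] probs, int k) {
        List<Integer> order = new ArrayList<>();
        for (int i = 0; i < probs.length; i++) {
            order.add(i);
        }
        // Sort token indices by descending probability.
        order.sort(Comparator.comparingDouble((Integer i) -> probs[i]).reversed());
        int limit = Math.max(0, Math.min(k, order.size()));
        return new ArrayList<>(order.subList(0, limit));
    }

    public static void main(String[] args) {
        double[] probs = {0.10, 0.50, 0.05, 0.35};
        System.out.println(topKIndices(probs, 2));
    }
}
```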
-
topP
public final Float topP()
While generating a response, the model determines the probability of the following token at each point of generation. The value that you set for Top P determines the number of most-likely candidates from which the model chooses the next token in the sequence. For example, if you set topP to 0.8, the model only selects the next token from the top 80% of the probability distribution of next tokens.
- Returns:
- While generating a response, the model determines the probability of the following token at each point of generation. The value that you set for Top P determines the number of most-likely candidates from which the model chooses the next token in the sequence. For example, if you set topP to 0.8, the model only selects the next token from the top 80% of the probability distribution of next tokens.
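One common reading of topP is nucleus filtering: keep the most likely tokens until their cumulative probability reaches the threshold. This page does not specify the service's exact rule, so the sketch below is an assumption, and its class name, method name, and probabilities are hypothetical.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Illustrative only: nucleus (top-p) filtering over made-up probabilities.
final class TopPSketch {
    // Keeps the most likely tokens until their cumulative probability reaches topP.
    static List<Integer> topPIndices(double[] probs, double topP) {
        List<Integer> order = new ArrayList<>();
        for (int i = 0; i < probs.length; i++) {
            order.add(i);
        }
        // Sort token indices by descending probability.
        order.sort(Comparator.comparingDouble((Integer i) -> probs[i]).reversed());

        List<Integer> kept = new ArrayList<>();
        double cumulative = 0.0;
        for (int index : order) {
            kept.add(index);
            cumulative += probs[index];
            if (cumulative >= topP) {
                break; // threshold reached; the remaining tail is excluded
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        double[] probs = {0.15, 0.50, 0.10, 0.25};
        System.out.println(topPIndices(probs, 0.8));
    }
}
```

Unlike topK, the number of candidates kept here varies with the shape of the distribution: a sharply peaked distribution may keep a single token, while a flat one keeps many.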
-
toBuilder
public InferenceConfiguration.Builder toBuilder()
- Specified by:
toBuilder in interface ToCopyableBuilder<InferenceConfiguration.Builder,InferenceConfiguration>
-
builder
public static InferenceConfiguration.Builder builder()
-
serializableBuilderClass
public static Class<? extends InferenceConfiguration.Builder> serializableBuilderClass()
-
equalsBySdkFields
public final boolean equalsBySdkFields(Object obj)
- Specified by:
equalsBySdkFields in interface SdkPojo
-
toString
public final String toString()
Returns a string representation of this object. This is useful for testing and debugging. Sensitive data will be redacted from this string using a placeholder value.
-
sdkFieldNameToField
public final Map<String,SdkField<?>> sdkFieldNameToField()
- Specified by:
sdkFieldNameToField in interface SdkPojo
-