public static interface ProcessOptions.LayoutConfig.ChunkingConfigOrBuilder
extends com.google.protobuf.MessageOrBuilder
| Modifier and Type | Method and Description |
|---|---|
| int | getBreakpointPercentileThreshold()<br>Optional. |
| int | getChunkSize()<br>Optional. |
| boolean | getIncludeAncestorHeadings()<br>Optional. |
| boolean | getSemanticChunkingGroupSize()<br>Optional. |
Methods inherited from interface com.google.protobuf.MessageOrBuilder:
findInitializationErrors, getAllFields, getDefaultInstanceForType, getDescriptorForType, getField, getInitializationErrorString, getOneofFieldDescriptor, getRepeatedField, getRepeatedFieldCount, getUnknownFields, hasField, hasOneof
int getChunkSize()
Optional. The chunk sizes to use when splitting documents, in order of level.
int32 chunk_size = 1 [(.google.api.field_behavior) = OPTIONAL];
boolean getIncludeAncestorHeadings()
Optional. Whether or not to include ancestor headings when splitting.
bool include_ancestor_headings = 2 [(.google.api.field_behavior) = OPTIONAL];
boolean getSemanticChunkingGroupSize()
Optional. The number of tokens to group together when evaluating semantic similarity.
bool semantic_chunking_group_size = 3 [(.google.api.field_behavior) = OPTIONAL];
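As a rough picture of what chunk_size and include_ancestor_headings control, the sketch below splits text into chunks of at most a given number of tokens and optionally prefixes each chunk with its heading path. It is not the service's implementation: the class ChunkingSketch, the whitespace tokenization, and the " > " heading separator are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Illustrative only; hypothetical helper, not part of the Document AI client. */
final class ChunkingSketch {

    /**
     * Splits whitespace-separated tokens into chunks of at most chunkSize tokens.
     * When includeAncestorHeadings is true, every chunk is prefixed with the
     * heading path joined by " > " (a separator chosen for this sketch).
     */
    static List<String> chunk(String text, List<String> ancestorHeadings,
                              int chunkSize, boolean includeAncestorHeadings) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        List<String> chunks = new ArrayList<>();
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return chunks;
        }
        String[] tokens = trimmed.split("\\s+");
        String prefix = (includeAncestorHeadings && !ancestorHeadings.isEmpty())
                ? String.join(" > ", ancestorHeadings) + "\n"
                : "";
        for (int start = 0; start < tokens.length; start += chunkSize) {
            int end = Math.min(start + chunkSize, tokens.length);
            String body = String.join(" ", Arrays.copyOfRange(tokens, start, end));
            chunks.add(prefix + body);
        }
        return chunks;
    }

    public static void main(String[] args) {
        List<String> headings = Arrays.asList("Guide", "Setup");
        // Seven tokens with a chunk size of 3 produce three chunks.
        for (String c : chunk("install the client then run the sample", headings, 3, true)) {
            System.out.println(c);
            System.out.println("---");
        }
    }
}
```

These getters expose proto3 scalar fields without presence tracking (there is no hasChunkSize()), so an unset chunk_size reads as 0. The sketch rejects non-positive sizes rather than guessing at a default.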
int getBreakpointPercentileThreshold()
Optional. The percentile of cosine dissimilarity that must be exceeded between a group of tokens and the next. The smaller this number is, the more chunks will be generated.
int32 breakpoint_percentile_threshold = 4 [(.google.api.field_behavior) = OPTIONAL];
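The effect of the percentile threshold can be illustrated with a small sketch. It assumes cosine dissimilarity means 1 minus cosine similarity, and it uses a nearest-rank percentile; the service's actual percentile method is not stated here. The class BreakpointSketch and its methods are hypothetical names, not part of the client library.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Illustrative only; hypothetical helper, not part of the Document AI client. */
final class BreakpointSketch {

    /** Cosine dissimilarity, taken here as 1 - cosine similarity (an assumption). */
    static double cosineDissimilarity(double[] a, double[] b) {
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        return 1.0 - dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    /** Nearest-rank percentile (0..100) of the given values. */
    static double percentile(double[] values, int percentile) {
        if (values.length == 0) {
            throw new IllegalArgumentException("no values");
        }
        if (percentile < 0 || percentile > 100) {
            throw new IllegalArgumentException("percentile must be in 0..100");
        }
        double[] sorted = values.clone();
        Arrays.sort(sorted);
        int rank = (int) Math.ceil(percentile / 100.0 * sorted.length);
        return sorted[Math.max(rank, 1) - 1];
    }

    /**
     * Returns each index i where dissimilarity[i] (between group i and group i + 1)
     * exceeds the cutoff at the given percentile, i.e. where a new chunk would start.
     */
    static List<Integer> breakpoints(double[] dissimilarity, int percentileThreshold) {
        double cutoff = percentile(dissimilarity, percentileThreshold);
        List<Integer> result = new ArrayList<>();
        for (int i = 0; i < dissimilarity.length; i++) {
            if (dissimilarity[i] > cutoff) {
                result.add(i);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up dissimilarities between nine consecutive token groups.
        double[] d = {0.10, 0.80, 0.15, 0.60, 0.20, 0.90, 0.05, 0.30};
        System.out.println(breakpoints(d, 75)); // prints [1, 5]
        System.out.println(breakpoints(d, 50)); // prints [1, 3, 5, 7]
    }
}
```

With the made-up values above, lowering the threshold from 75 to 50 raises the number of breakpoints from two to four, which matches the note that a smaller number generates more chunks.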
Copyright © 2024 Google LLC. All rights reserved.