| Package | Description |
|---|---|
| com.amazonaws.services.kendra.model | |

| Modifier and Type | Method and Description |
|---|---|
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.clone()` |
| `WebCrawlerConfiguration` | `DataSourceConfiguration.getWebCrawlerConfiguration()` |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withAuthenticationConfiguration(AuthenticationConfiguration authenticationConfiguration)` Configuration information required to connect to websites using authentication. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withCrawlDepth(Integer crawlDepth)` Specifies the number of levels in a website that you want to crawl. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withMaxContentSizePerPageInMegaBytes(Float maxContentSizePerPageInMegaBytes)` The maximum size (in MB) of a web page or attachment to crawl. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withMaxLinksPerPage(Integer maxLinksPerPage)` The maximum number of URLs on a web page to include when crawling a website. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withMaxUrlsPerMinuteCrawlRate(Integer maxUrlsPerMinuteCrawlRate)` The maximum number of URLs crawled per website host per minute. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withProxyConfiguration(ProxyConfiguration proxyConfiguration)` Configuration information required to connect to your internal websites via a web proxy. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withUrlExclusionPatterns(Collection<String> urlExclusionPatterns)` A list of regular expression patterns for URLs to exclude from the crawl. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withUrlExclusionPatterns(String... urlExclusionPatterns)` A list of regular expression patterns for URLs to exclude from the crawl. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withUrlInclusionPatterns(Collection<String> urlInclusionPatterns)` A list of regular expression patterns for URLs to include in the crawl. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withUrlInclusionPatterns(String... urlInclusionPatterns)` A list of regular expression patterns for URLs to include in the crawl. |
| `WebCrawlerConfiguration` | `WebCrawlerConfiguration.withUrls(Urls urls)` Specifies the seed (starting point) URLs or the sitemap URLs of the websites you want to crawl. |
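
The URL inclusion and exclusion setters listed above take regular expressions. The following is a minimal local sketch of how such pattern lists could filter candidate URLs, using only `java.util.regex`. It is not the crawler's actual logic: the class `UrlPatternFilterSketch`, its helper methods, the example URLs, whole-URL matching, and the rule that an exclusion match takes precedence over an inclusion match are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Local illustration only: this is NOT Amazon Kendra's crawler logic.
final class UrlPatternFilterSketch {

    // Compile each regular expression string once, up front.
    static List<Pattern> compileAll(String... regexes) {
        List<Pattern> compiled = new ArrayList<>();
        for (String regex : regexes) {
            compiled.add(Pattern.compile(regex));
        }
        return compiled;
    }

    // True if at least one pattern matches the entire URL (assumed matching rule).
    static boolean matchesAny(List<Pattern> patterns, String url) {
        for (Pattern pattern : patterns) {
            if (pattern.matcher(url).matches()) {
                return true;
            }
        }
        return false;
    }

    // Assumed semantics: an exclusion match always wins, and an empty
    // inclusion list admits every URL that is not excluded.
    static boolean shouldCrawl(String url, List<Pattern> include, List<Pattern> exclude) {
        if (matchesAny(exclude, url)) {
            return false;
        }
        return include.isEmpty() || matchesAny(include, url);
    }

    public static void main(String[] args) {
        List<Pattern> include = compileAll(".*/docs/.*");
        List<Pattern> exclude = compileAll(".*\\.pdf");
        String[] candidates = {
            "https://example.com/docs/intro.html",
            "https://example.com/docs/manual.pdf",
            "https://example.com/blog/post.html"
        };
        for (String url : candidates) {
            System.out.println(url + " -> " + shouldCrawl(url, include, exclude));
        }
    }
}
```

Under these assumptions, only the first candidate URL is admitted: the second matches an exclusion pattern and the third matches no inclusion pattern. Testing patterns locally like this can catch escaping mistakes (for example, an unescaped `.`) before the same strings are passed to `withUrlInclusionPatterns` or `withUrlExclusionPatterns`.
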

| Modifier and Type | Method and Description |
|---|---|
| `void` | `DataSourceConfiguration.setWebCrawlerConfiguration(WebCrawlerConfiguration webCrawlerConfiguration)` |
| `DataSourceConfiguration` | `DataSourceConfiguration.withWebCrawlerConfiguration(WebCrawlerConfiguration webCrawlerConfiguration)` |
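
The two methods listed above differ only in return type: the `set` form returns `void`, while the `with` form returns the `DataSourceConfiguration` so that calls can be chained. The sketch below illustrates that convention with small stand-in classes so it runs without the SDK on the classpath. `SetterVsFluentSketch`, `CrawlerConfig`, and `DataSourceConfig` are hypothetical names, and their method bodies are an assumption about how such accessors are typically written, not the SDK's source.

```java
// Hypothetical stand-ins that copy only the accessor shapes listed above;
// they are NOT the SDK classes.
final class SetterVsFluentSketch {

    static final class CrawlerConfig {
        private Integer crawlDepth;

        Integer getCrawlDepth() {
            return crawlDepth;
        }

        // Fluent form: stores the value and returns this object for chaining.
        CrawlerConfig withCrawlDepth(Integer crawlDepth) {
            this.crawlDepth = crawlDepth;
            return this;
        }
    }

    static final class DataSourceConfig {
        private CrawlerConfig webCrawlerConfiguration;

        // Setter form: returns void, so it cannot be chained.
        void setWebCrawlerConfiguration(CrawlerConfig webCrawlerConfiguration) {
            this.webCrawlerConfiguration = webCrawlerConfiguration;
        }

        CrawlerConfig getWebCrawlerConfiguration() {
            return webCrawlerConfiguration;
        }

        // Fluent form: same effect as the setter, but returns this object.
        DataSourceConfig withWebCrawlerConfiguration(CrawlerConfig webCrawlerConfiguration) {
            setWebCrawlerConfiguration(webCrawlerConfiguration);
            return this;
        }
    }

    public static void main(String[] args) {
        // Setter style needs a separate statement after construction.
        DataSourceConfig viaSetter = new DataSourceConfig();
        viaSetter.setWebCrawlerConfiguration(new CrawlerConfig().withCrawlDepth(2));

        // Fluent style builds the whole object in one expression.
        DataSourceConfig viaFluent = new DataSourceConfig()
                .withWebCrawlerConfiguration(new CrawlerConfig().withCrawlDepth(2));

        System.out.println(viaSetter.getWebCrawlerConfiguration().getCrawlDepth());
        System.out.println(viaFluent.getWebCrawlerConfiguration().getCrawlDepth());
    }
}
```

Both styles leave the object in the same state; the fluent form is mainly a convenience when a configuration is assembled inline.
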
Copyright © 2023. All rights reserved.