Interface DatasetEntityRecognizerDocuments.Builder

    • Method Detail

      • s3Uri

        DatasetEntityRecognizerDocuments.Builder s3Uri​(String s3Uri)

        Specifies the Amazon S3 location where the documents for the dataset are located.

        Parameters:
        s3Uri - Specifies the Amazon S3 location where the documents for the dataset are located.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • inputFormat

        DatasetEntityRecognizerDocuments.Builder inputFormat​(String inputFormat)

        Specifies how the text in an input file should be processed. This is optional, and the default is ONE_DOC_PER_LINE. ONE_DOC_PER_FILE - Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers. ONE_DOC_PER_LINE - Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.

        Parameters:
        inputFormat - Specifies how the text in an input file should be processed. This is optional, and the default is ONE_DOC_PER_LINE. ONE_DOC_PER_FILE - Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers. ONE_DOC_PER_LINE - Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
        See Also:
        InputFormat, InputFormat
      • inputFormat

        DatasetEntityRecognizerDocuments.Builder inputFormat​(InputFormat inputFormat)

        Specifies how the text in an input file should be processed. This is optional, and the default is ONE_DOC_PER_LINE. ONE_DOC_PER_FILE - Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers. ONE_DOC_PER_LINE - Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.

        Parameters:
        inputFormat - Specifies how the text in an input file should be processed. This is optional, and the default is ONE_DOC_PER_LINE. ONE_DOC_PER_FILE - Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers. ONE_DOC_PER_LINE - Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
        See Also:
        InputFormat, InputFormat