combineFilesForSequenceFile(JavaSparkContext, String, String, PathToKeyConverter, PathToKeyConverter)
but with the PathToKeyConverter used for both file sourcesJavaPairRDD.saveAsNewAPIHadoopFile(String, Class, Class, Class))Tuple2<Collection<Collection<<Writable>>,Collection<Collection<Writable>>) using two SequenceRecordReaders.PathToKeyConverter),
second is an index, and third is the original data streamCollection<Writable>) using a RecordReaderCollection<Collection<<Writable>>) using a SequenceRecordReaderCopyright © 2016. All rights reserved.