Class TfIdf.ComputeTfIdf
- java.lang.Object
-
- org.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.net.URI,java.lang.String>>,org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.lang.String,org.apache.beam.sdk.values.KV<java.net.URI,java.lang.Double>>>>
-
- org.apache.beam.examples.complete.TfIdf.ComputeTfIdf
-
- All Implemented Interfaces:
java.io.Serializable,org.apache.beam.sdk.transforms.display.HasDisplayData
- Enclosing class:
- TfIdf
public static class TfIdf.ComputeTfIdf extends org.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.net.URI,java.lang.String>>,org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.lang.String,org.apache.beam.sdk.values.KV<java.net.URI,java.lang.Double>>>>A transform containing a basic TF-IDF pipeline. The input consists of KV objects where the key is the document's URI and the value is a piece of the document's content. The output is mapping from terms to scores for each document URI.- See Also:
- Serialized Form
-
-
Constructor Summary
Constructors Constructor Description ComputeTfIdf()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.lang.String,org.apache.beam.sdk.values.KV<java.net.URI,java.lang.Double>>>expand(org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.net.URI,java.lang.String>> uriToContent)-
Methods inherited from class org.apache.beam.sdk.transforms.PTransform
addAnnotation, compose, compose, getAdditionalInputs, getAnnotations, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, populateDisplayData, setDisplayData, setResourceHints, toString, validate, validate
-
-
-
-
Method Detail
-
expand
public org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.lang.String,org.apache.beam.sdk.values.KV<java.net.URI,java.lang.Double>>> expand(org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.net.URI,java.lang.String>> uriToContent)
- Specified by:
expandin classorg.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.net.URI,java.lang.String>>,org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.lang.String,org.apache.beam.sdk.values.KV<java.net.URI,java.lang.Double>>>>
-
-