- All Implemented Interfaces:
- java.io.Serializable, org.apache.beam.sdk.transforms.display.HasDisplayData
- Enclosing class:
- GameStats
public static class GameStats.CalculateSpammyUsers
extends org.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.lang.String,java.lang.Integer>>,org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<java.lang.String,java.lang.Integer>>>
Filter out all users but those with a high clickrate, which we will consider as 'spammy' users.
We do this by finding the mean total score per user, then using that information as a side
input to filter out all but those user scores that are larger than (mean *
SCORE_WEIGHT).
- See Also:
- Serialized Form