Class SimplifiedElkanKMeans<V extends elki.data.NumberVector>

  • Type Parameters:
    V - vector datatype
    All Implemented Interfaces:
    elki.Algorithm, ClusteringAlgorithm<Clustering<KMeansModel>>, KMeans<V,​KMeansModel>
    Direct Known Subclasses:
    ElkanKMeans

    @Reference(authors="J. Newling",
               title="Fast k-means with accurate bounds",
               booktitle="Proc. 33nd Int. Conf. on Machine Learning, ICML 2016",
               url="http://jmlr.org/proceedings/papers/v48/newling16.html",
               bibkey="DBLP:conf/icml/NewlingF16")
    public class SimplifiedElkanKMeans<V extends elki.data.NumberVector>
    extends AbstractKMeans<V,​KMeansModel>
    Simplified version of Elkan's k-means by exploiting the triangle inequality.

    Compared to ElkanKMeans, this uses less pruning, but also does not need to maintain a matrix of pairwise centroid separation.

    Reference:

    J. Newling
    Fast k-means with accurate bounds
    Proc. 33nd Int. Conf. on Machine Learning, ICML 2016

    Since:
    0.7.0
    Author:
    Erich Schubert
    • Field Detail

      • LOG

        private static final elki.logging.Logging LOG
        The logger for this class.
      • varstat

        protected boolean varstat
        Flag whether to compute the final variance statistic.
    • Constructor Detail

      • SimplifiedElkanKMeans

        public SimplifiedElkanKMeans​(elki.distance.NumberVectorDistance<? super V> distance,
                                     int k,
                                     int maxiter,
                                     KMeansInitialization initializer,
                                     boolean varstat)
        Constructor.
        Parameters:
        distance - distance function
        k - k parameter
        maxiter - Maxiter parameter
        initializer - Initialization method
        varstat - Compute the variance statistic
    • Method Detail

      • run

        public Clustering<KMeansModel> run​(elki.database.relation.Relation<V> relation)
        Description copied from interface: KMeans
        Run the clustering algorithm.
        Parameters:
        relation - Relation to process.
        Returns:
        Clustering result
      • getLogger

        protected elki.logging.Logging getLogger()
        Description copied from class: AbstractKMeans
        Get the (STATIC) logger for this class.
        Specified by:
        getLogger in class AbstractKMeans<V extends elki.data.NumberVector,​KMeansModel>
        Returns:
        the static logger