Given a list of metrics, provides a builder that it turns computes metrics from a column.
Given a list of metrics, provides a builder that it turns computes metrics from a column.
See the documentation of Summarizer for an example.
The following metrics are accepted (case sensitive):
metrics that can be provided.
a builder.
IllegalArgumentException
if one of the metric names is not understood.
Note: Currently, the performance of this interface is about 2x~3x slower then using the RDD
interface.
Tools for vectorized statistics on MLlib Vectors.
The methods in this package provide various statistics for Vectors contained inside DataFrames.
This class lets users pick the statistics they would like to extract for a given column. Here is an example in Scala:
If one wants to get a single metric, shortcuts are also available:
Note: Currently, the performance of this interface is about 2x~3x slower then using the RDD interface.