Interface LibMatrixCountDistinct
-
public interface LibMatrixCountDistinct
This class contains various methods for counting the number of distinct values inside a MatrixBlock.
-
-
Field Summary
Fields
Modifier and Type | Field | Description
static org.apache.commons.logging.Log | LOG |
static int | minimumSize | The minimum number of nonzero cells in the input before approximate techniques are used for counting the number of distinct values.
-
Method Summary
Static Methods
Modifier and Type | Method | Description
static MatrixBlock | countDistinctValuesFromSketch(CorrMatrixBlock arg0, CountDistinctOperator op) |
static CorrMatrixBlock | createSketch(MatrixBlock blkIn, CountDistinctOperator op) |
static int | estimateDistinctValues(MatrixBlock in, CountDistinctOperator op) | Public method to count the number of distinct values inside a matrix.
static CorrMatrixBlock | unionSketch(CorrMatrixBlock arg0, CorrMatrixBlock arg1, CountDistinctOperator op) |
-
-
-
Field Detail
-
LOG
static final org.apache.commons.logging.Log LOG
-
minimumSize
static final int minimumSize
The minimum number of nonzero cells in the input before approximate techniques are used for counting the number of distinct values.
See Also:
- Constant Field Values
-
-
Method Detail
-
estimateDistinctValues
static int estimateDistinctValues(MatrixBlock in, CountDistinctOperator op)
Public method to count the number of distinct values inside a matrix. Depending on which CountDistinctOperator is selected, it returns either the exact count or an estimated value.
TODO: Support counting the number of distinct values along the row or column axis.
TODO: Add support for distributed Spark operations.
TODO: If the MatrixBlock type is CompressedMatrix, simply read the values from the ColGroups.
Parameters:
in - the input matrix in which to count distinct values
op - the selected operator to use
Returns:
- the distinct count
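The choice between an exact count and an estimate described for estimateDistinctValues can be illustrated with a minimal, self-contained sketch. It does not use the library's types: the Mode enum, the MINIMUM_SIZE value, the plain double[] input, and the pluggable approx estimator are all hypothetical stand-ins for CountDistinctOperator, minimumSize, MatrixBlock, and the approximate technique, and the real dispatch logic may differ.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.function.ToIntFunction;

public class DistinctDispatchSketch {
    /** Hypothetical threshold standing in for minimumSize; the real value is not stated on this page. */
    public static final int MINIMUM_SIZE = 1024;

    /** Hypothetical stand-in for the mode selected through a CountDistinctOperator. */
    public enum Mode { EXACT, APPROXIMATE }

    /** Exact count: every cell value is placed in a hash set. */
    public static int countExact(double[] cells) {
        Set<Double> seen = new HashSet<>();
        for (double v : cells)
            seen.add(v);
        return seen.size();
    }

    /** Number of cells whose value is not zero. */
    public static int countNonZeros(double[] cells) {
        int nnz = 0;
        for (double v : cells)
            if (v != 0)
                nnz++;
        return nnz;
    }

    /**
     * Uses the exact count when it is requested or when the input has fewer
     * nonzero cells than the threshold; otherwise defers to the estimator.
     */
    public static int estimate(double[] cells, Mode mode, ToIntFunction<double[]> approx) {
        if (mode == Mode.EXACT || countNonZeros(cells) < MINIMUM_SIZE)
            return countExact(cells);
        return approx.applyAsInt(cells);
    }

    public static void main(String[] args) {
        double[] small = {1.0, 2.0, 2.0, 0.0, 3.0};
        // Only 4 nonzero cells, so the exact path runs even in APPROXIMATE mode.
        System.out.println(estimate(small, Mode.APPROXIMATE, c -> -1)); // prints 4
    }
}
```

In this sketch zero counts as a value like any other, so the example input has four distinct values; whether the library treats zero the same way is not stated here.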
-
countDistinctValuesFromSketch
static MatrixBlock countDistinctValuesFromSketch(CorrMatrixBlock arg0, CountDistinctOperator op)
-
createSketch
static CorrMatrixBlock createSketch(MatrixBlock blkIn, CountDistinctOperator op)
-
unionSketch
static CorrMatrixBlock unionSketch(CorrMatrixBlock arg0, CorrMatrixBlock arg1, CountDistinctOperator op)
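The three undocumented methods createSketch, unionSketch, and countDistinctValuesFromSketch suggest a create, merge, then read-out pattern over mergeable summaries. The following is a minimal sketch of that pattern using a K-Minimum-Values (KMV) summary over plain double[] inputs. KMV is chosen here only for illustration; this page does not state which sketch the library builds, and the Sketch class, the method names create, union, and estimate, and the parameter k are all hypothetical.

```java
import java.util.TreeSet;

public class KmvSketchDemo {
    /** Sketch state: the k smallest distinct hash values seen so far (unsigned order). */
    public static final class Sketch {
        final int k;
        final TreeSet<Long> smallest = new TreeSet<>(Long::compareUnsigned);

        Sketch(int k) { this.k = k; }

        void add(long hash) {
            smallest.add(hash);
            if (smallest.size() > k)
                smallest.pollLast(); // drop the largest, keep the k smallest
        }
    }

    /** SplitMix64 finalizer applied to the bit pattern of the value. */
    static long hash(double v) {
        if (v == 0.0)
            v = 0.0; // fold -0.0 into +0.0 so both hash alike
        long z = Double.doubleToLongBits(v) + 0x9E3779B97F4A7C15L;
        z = (z ^ (z >>> 30)) * 0xBF58476D1CE4E5B9L;
        z = (z ^ (z >>> 27)) * 0x94D049BB133111EBL;
        return z ^ (z >>> 31);
    }

    /** Builds a sketch from one partition of the data. */
    public static Sketch create(double[] cells, int k) {
        Sketch s = new Sketch(k);
        for (double v : cells)
            s.add(hash(v));
        return s;
    }

    /** Merges two sketches; the result summarizes the union of both inputs. */
    public static Sketch union(Sketch a, Sketch b) {
        Sketch out = new Sketch(Math.min(a.k, b.k));
        for (long h : a.smallest)
            out.add(h);
        for (long h : b.smallest)
            out.add(h);
        return out;
    }

    /** Reads the distinct-count estimate out of a sketch. */
    public static long estimate(Sketch s) {
        if (s.smallest.size() < s.k)
            return s.smallest.size(); // fewer than k hashes: the count is exact
        long kthHash = s.smallest.last();
        // Map the k-th smallest hash into (0, 1] using its top 53 bits.
        double kth = ((kthHash >>> 11) + 1) * 0x1.0p-53;
        return Math.round((s.k - 1) / kth);
    }

    public static void main(String[] args) {
        Sketch a = create(new double[] {1, 2, 3}, 64);
        Sketch b = create(new double[] {3, 4, 5}, 64);
        System.out.println(estimate(union(a, b))); // prints 5
    }
}
```

The property that makes this pattern suitable for partitioned data is that merging two sketches gives the same state as sketching the combined input, so partitions can be summarized independently and combined in any order.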
-
-