org.checkita.dqf.core.metrics.rdd.regular.BasicStringRDDMetrics
DistinctValuesRDDMetricCalculator
case class DistinctValuesRDDMetricCalculator(uniqueValues: Set[String] = Set.empty[String], failCount: Long = 0, status: CalculatorStatus = CalculatorStatus.Success, failMsg: String = "OK") extends RDDMetricCalculator with Product with Serializable
Calculates count of distinct values in processed elements WARNING: Uses set without any kind of trimming and hashing. Returns the exact count. So if a big diversion of elements needs to be processed and exact result is not mandatory, then it's better to use HyperLogLog version called "APPROXIMATE_DISTINCT_VALUES".
- uniqueValues
Set of processed values
- returns
result map with keys: "DISTINCT_VALUES"
- Alphabetic
- By Inheritance
- DistinctValuesRDDMetricCalculator
- Serializable
- Serializable
- Product
- Equals
- RDDMetricCalculator
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
- new DistinctValuesRDDMetricCalculator()
-
new
DistinctValuesRDDMetricCalculator(uniqueValues: Set[String] = Set.empty[String], failCount: Long = 0, status: CalculatorStatus = CalculatorStatus.Success, failMsg: String = "OK")
- uniqueValues
Set of processed values
Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
-
def
copyWithError(status: CalculatorStatus, msg: String, failInc: Long = 1): RDDMetricCalculator
Copy calculator with error status and corresponding message.
Copy calculator with error status and corresponding message.
- status
Calculator status to copy with
- msg
Failure message
- failInc
Failure increment
- returns
Copy of this calculator with error status
- Attributes
- protected
- Definition Classes
- DistinctValuesRDDMetricCalculator → RDDMetricCalculator
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
val
failCount: Long
- Attributes
- protected
- Definition Classes
- DistinctValuesRDDMetricCalculator → RDDMetricCalculator
-
val
failMsg: String
- Attributes
- protected
- Definition Classes
- DistinctValuesRDDMetricCalculator → RDDMetricCalculator
-
def
finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( classOf[java.lang.Throwable] )
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
def
getFailCounter: Long
Gets current metric failure counts
-
def
getFailMessage: String
Gets current failure or error message
-
def
getStatus: CalculatorStatus
Gets current metric calculator status
-
def
increment(values: Seq[Any]): RDDMetricCalculator
Safely updates metric calculator
Safely updates metric calculator
- values
values to process
- returns
updated calculator
- Definition Classes
- RDDMetricCalculator
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
-
def
merge(m2: RDDMetricCalculator): RDDMetricCalculator
Merges two metric calculators together
Merges two metric calculators together
- m2
second metric calculator
- returns
merged metric calculator
- Definition Classes
- DistinctValuesRDDMetricCalculator → RDDMetricCalculator
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
def
result(): Map[String, (Double, Option[String])]
Gets results of calculator in the current state
Gets results of calculator in the current state
- returns
Map of (result_name -> (result, additionalResults))
- Definition Classes
- DistinctValuesRDDMetricCalculator → RDDMetricCalculator
-
val
status: CalculatorStatus
- Attributes
- protected
- Definition Classes
- DistinctValuesRDDMetricCalculator → RDDMetricCalculator
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
def
tryToIncrement(values: Seq[Any]): RDDMetricCalculator
Increment metric calculator.
Increment metric calculator. May throw an exception.
- values
values to process
- returns
updated calculator or throws an exception
- Attributes
- protected
- Definition Classes
- DistinctValuesRDDMetricCalculator → RDDMetricCalculator
- val uniqueValues: Set[String]
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()