Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Main Code: 8,219 LOC (95 files) = SCALA (99%) + JAVA (<1%) | |||
Duplication: 6% | |||
File Size: 0% long (>1000 LOC), 63% short (<= 200 LOC) | |||
Unit Size: 1% long (>100 LOC), 65% short (<= 10 LOC) | |||
Conditional Complexity: 1% complex (McCabe index > 50), 80% simple (McCabe index <= 5) | |||
|
Logical Component Decomposition: primary (11 components) | ||
|
3 years, 5 months old
|
|
|
|
0% of code updated more than 50 times Also see temporal dependencies for files frequently changed in same commits. |
|
|
|
Goals: Keep the system simple and easy to change (4) |
|
|
Features of interest:
TODOs
11 files |
|
Latest commit date: 2021-09-07
0
commits
(30 days)
0
contributors
(30 days) |
|
generated by sokrates.dev (configuration) on 2022-01-31