Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
| Main Code: 8,219 LOC (95 files) = SCALA (99%) + JAVA (<1%) | |||
| Duplication: 6% | |||
| File Size: 0% long (>1000 LOC), 63% short (<= 200 LOC) | |||
| Unit Size: 1% long (>100 LOC), 65% short (<= 10 LOC) | |||
| Conditional Complexity: 1% complex (McCabe index > 50), 80% simple (McCabe index <= 5) | |||
|
|
Logical Component Decomposition: primary (11 components) | ||
|
|
3 years, 5 months old
|
|
|
|
|
0% of code updated more than 50 times Also see temporal dependencies for files frequently changed in same commits. |
|
|
|
|
Goals: Keep the system simple and easy to change (4) |
|
|
|
Features of interest:
TODOs
11 files |
|
||
Latest commit date: 2021-09-07
|
0
commits
(30 days)
0
contributors
(30 days) |
|
generated by sokrates.dev (configuration) on 2022-01-31