GoogleCloudPlatform / df-ml-anomaly-detection
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 0% | 32% | 25% | 41%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
java0% | 0% | 24% | 26% | 48%
ipynb0% | 0% | 100% | 0% | 0%
sql0% | 0% | 0% | 71% | 28%
yaml0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
src0% | 0% | 23% | 29% | 46%
fraud-detection-notebook0% | 0% | 100% | 0% | 0%
scripts0% | 0% | 0% | 0% | 100%
Longest Files (Top 29)
File# lines# units
credit-card-fraud-detection-v1.ipynb
in fraud-detection-notebook
313 -
PredictTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection
247 10
DLPTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
240 12
Util.java
in src/main/java/com/google/solutions/df/log/aggregations/common
172 10
PredictTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
133 6
SecureLogAggregationPipeline.java
in src/main/java/com/google/solutions/df/log/aggregations
117 2
StreamingBenchmark.java
in src/main/java/com/google/solutions/df/log/aggregations
103 4
normalized_cluster_data.sql
in src/main/resources
103 -
ReadTransactionTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection
84 4
JsonToRowValidationTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
78 3
BQWriteTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
74 2
IpToGeoDoFn.java
in src/main/java/com/google/solutions/df/log/aggregations/common
73 4
BQWriteTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection
68 2
FraudDetectionFinServTranPipeline.java
in src/main/java/com/google/solutions/df/log/aggregations
60 2
LogRowTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
58 1
SecureLogAggregationPipelineOptions.java
in src/main/java/com/google/solutions/df/log/aggregations/common
58 -
AvgCombineFn.java
in src/main/java/com/google/solutions/df/log/aggregations/common
54 6
FraudDetectionFinServTranPipelineOptions.java
in src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection
52 -
Util.java
in src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection
49 -
ReadFlowLogTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
49 2
cluster_details.sql
in src/main/resources
41 -
RawLogDataTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
36 2
MergeLogAggrMap.java
in src/main/java/com/google/solutions/df/log/aggregations/common
33 1
MergeOutlierMap.java
in src/main/java/com/google/solutions/df/log/aggregations/common
31 1
LogAggrMapElement.java
in src/main/java/com/google/solutions/df/log/aggregations/common
31 1
ClusterDataMapElement.java
in src/main/java/com/google/solutions/df/log/aggregations/common
29 1
28 -
CentroidVector.java
in src/main/java/com/google/solutions/df/log/aggregations/common
24 1
3 -
Files With Most Units (Top 21)
File# lines# units
DLPTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
240 12
PredictTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection
247 10
Util.java
in src/main/java/com/google/solutions/df/log/aggregations/common
172 10
AvgCombineFn.java
in src/main/java/com/google/solutions/df/log/aggregations/common
54 6
PredictTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
133 6
ReadTransactionTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection
84 4
IpToGeoDoFn.java
in src/main/java/com/google/solutions/df/log/aggregations/common
73 4
StreamingBenchmark.java
in src/main/java/com/google/solutions/df/log/aggregations
103 4
JsonToRowValidationTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
78 3
BQWriteTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection
68 2
ReadFlowLogTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
49 2
RawLogDataTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
36 2
BQWriteTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
74 2
FraudDetectionFinServTranPipeline.java
in src/main/java/com/google/solutions/df/log/aggregations
60 2
SecureLogAggregationPipeline.java
in src/main/java/com/google/solutions/df/log/aggregations
117 2
MergeOutlierMap.java
in src/main/java/com/google/solutions/df/log/aggregations/common
31 1
LogRowTransform.java
in src/main/java/com/google/solutions/df/log/aggregations/common
58 1
ClusterDataMapElement.java
in src/main/java/com/google/solutions/df/log/aggregations/common
29 1
LogAggrMapElement.java
in src/main/java/com/google/solutions/df/log/aggregations/common
31 1
CentroidVector.java
in src/main/java/com/google/solutions/df/log/aggregations/common
24 1
MergeLogAggrMap.java
in src/main/java/com/google/solutions/df/log/aggregations/common
33 1
Files With Long Lines (Top 4)

There are 4 files with lines longer than 120 characters. In total, there are 25 long lines.

File# lines# units# long lines
normalized_cluster_data.sql
in src/main/resources
103 - 17
28 - 6
3 - 1
credit-card-fraud-detection-v1.ipynb
in fraud-detection-notebook
313 - 1
Correlations

File Size vs. Commits (all time): 29 points

fraud-detection-notebook/credit-card-fraud-detection-v1.ipynb x: 6 commits (all time) y: 313 lines of code src/main/java/com/google/solutions/df/log/aggregations/SecureLogAggregationPipeline.java x: 30 commits (all time) y: 117 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/BQWriteTransform.java x: 12 commits (all time) y: 74 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/IpToGeoDoFn.java x: 6 commits (all time) y: 73 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/JsonToRowValidationTransform.java x: 14 commits (all time) y: 78 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/RawLogDataTransform.java x: 4 commits (all time) y: 36 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/SecureLogAggregationPipelineOptions.java x: 17 commits (all time) y: 58 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/Util.java x: 27 commits (all time) y: 172 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection/PredictTransform.java x: 16 commits (all time) y: 247 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection/ReadTransactionTransform.java x: 11 commits (all time) y: 84 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/LogAggrMapElement.java x: 8 commits (all time) y: 31 lines of code src/main/java/com/google/solutions/df/log/aggregations/FraudDetectionFinServTranPipeline.java x: 16 commits (all time) y: 60 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection/BQWriteTransform.java x: 7 commits (all time) y: 68 lines of code src/main/java/com/google/solutions/df/log/aggregations/StreamingBenchmark.java x: 6 commits (all time) y: 103 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/AvgCombineFn.java x: 5 commits (all time) y: 54 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/CentroidVector.java x: 5 commits (all time) y: 24 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/ClusterDataMapElement.java x: 5 commits (all time) y: 29 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/DLPTransform.java x: 20 commits (all time) y: 240 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/MergeLogAggrMap.java x: 10 commits (all time) y: 33 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/MergeOutlierMap.java x: 4 commits (all time) y: 31 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/PredictTransform.java x: 8 commits (all time) y: 133 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/ReadFlowLogTransform.java x: 9 commits (all time) y: 49 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection/Util.java x: 7 commits (all time) y: 49 lines of code scripts/cloud-build-data-generator.yaml x: 4 commits (all time) y: 3 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection/FraudDetectionFinServTranPipelineOptions.java x: 5 commits (all time) y: 52 lines of code scripts/cloud-build-demo.yaml x: 9 commits (all time) y: 28 lines of code src/main/resources/cluster_details.sql x: 2 commits (all time) y: 41 lines of code src/main/resources/normalized_cluster_data.sql x: 4 commits (all time) y: 103 lines of code
313.0
lines of code
  min: 3.0
  average: 84.17
  25th percentile: 34.5
  median: 58.0
  75th percentile: 103.0
  max: 313.0
0 30.0
commits (all time)
min: 2.0 | average: 10.14 | 25th percentile: 5.0 | median: 8.0 | 75th percentile: 15.0 | max: 30.0

File Size vs. Contributors (all time): 29 points

fraud-detection-notebook/credit-card-fraud-detection-v1.ipynb x: 2 contributors (all time) y: 313 lines of code src/main/java/com/google/solutions/df/log/aggregations/SecureLogAggregationPipeline.java x: 1 contributors (all time) y: 117 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/BQWriteTransform.java x: 1 contributors (all time) y: 74 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/IpToGeoDoFn.java x: 1 contributors (all time) y: 73 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/JsonToRowValidationTransform.java x: 1 contributors (all time) y: 78 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/RawLogDataTransform.java x: 1 contributors (all time) y: 36 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/SecureLogAggregationPipelineOptions.java x: 1 contributors (all time) y: 58 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/Util.java x: 1 contributors (all time) y: 172 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection/PredictTransform.java x: 1 contributors (all time) y: 247 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection/ReadTransactionTransform.java x: 1 contributors (all time) y: 84 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/LogAggrMapElement.java x: 1 contributors (all time) y: 31 lines of code src/main/java/com/google/solutions/df/log/aggregations/FraudDetectionFinServTranPipeline.java x: 1 contributors (all time) y: 60 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection/BQWriteTransform.java x: 1 contributors (all time) y: 68 lines of code src/main/java/com/google/solutions/df/log/aggregations/StreamingBenchmark.java x: 1 contributors (all time) y: 103 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/AvgCombineFn.java x: 1 contributors (all time) y: 54 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/CentroidVector.java x: 1 contributors (all time) y: 24 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/ClusterDataMapElement.java x: 1 contributors (all time) y: 29 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/DLPTransform.java x: 1 contributors (all time) y: 240 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/MergeLogAggrMap.java x: 1 contributors (all time) y: 33 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/PredictTransform.java x: 1 contributors (all time) y: 133 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/ReadFlowLogTransform.java x: 1 contributors (all time) y: 49 lines of code scripts/cloud-build-data-generator.yaml x: 2 contributors (all time) y: 3 lines of code src/main/java/com/google/solutions/df/log/aggregations/common/fraud/detection/FraudDetectionFinServTranPipelineOptions.java x: 1 contributors (all time) y: 52 lines of code scripts/cloud-build-demo.yaml x: 2 contributors (all time) y: 28 lines of code src/main/resources/cluster_details.sql x: 1 contributors (all time) y: 41 lines of code
313.0
lines of code
  min: 3.0
  average: 84.17
  25th percentile: 34.5
  median: 58.0
  75th percentile: 103.0
  max: 313.0
0 2.0
contributors (all time)
min: 1.0 | average: 1.1 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 2.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 1 points

fraud-detection-notebook/credit-card-fraud-detection-v1.ipynb x: 1 commits (90d) y: 313 lines of code
313.0
lines of code
  min: 313.0
  average: 313.0
  25th percentile: 313.0
  median: 313.0
  75th percentile: 313.0
  max: 313.0
0 1.0
commits (90d)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

File Size vs. Contributors (90 days): 1 points

fraud-detection-notebook/credit-card-fraud-detection-v1.ipynb x: 1 contributors (90d) y: 313 lines of code
313.0
lines of code
  min: 313.0
  average: 313.0
  25th percentile: 313.0
  median: 313.0
  75th percentile: 313.0
  max: 313.0
0 1.0
contributors (90d)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0