aws / sagemaker-feature-store-spark
File Size

The distribution of file sizes (measured in lines of code).

File Size Overall
Lines per file | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
Share of lines of code | 0% | 0% | 21% | 37% | 41%


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
scala | 0% | 0% | 34% | 22% | 43%
py | 0% | 0% | 0% | 53% | 46%
sbt | 0% | 0% | 0% | 95% | 4%
cfg | 0% | 0% | 0% | 0% | 100%
in | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
Logical decomposition: primary

Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
scala-spark-sdk | 0% | 0% | 30% | 31% | 38%
pyspark-sdk | 0% | 0% | 0% | 51% | 48%
Longest Files (Top 21)
File | Folder | # lines | # units
FeatureStoreManager.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk | 251 | 8
BatchIngestionTest.py | pyspark-sdk/integration_test | 165 | 4
InputDataSchemaValidator.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/validators | 160 | 5
build.sbt | scala-spark-sdk | 102 | -
ClientFactory.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 99 | 1
setup.py | pyspark-sdk | 90 | 3
FeatureGroupHelper.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 89 | 8
SparkSessionInitializer.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 71 | 2
DataFrameRepartitioner.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 26 | 2
wrapper.py | pyspark-sdk/src/feature_store_pyspark | 23 | 4
FeatureGroupArnResolver.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 21 | 3
FeatureStoreManager.py | pyspark-sdk/src/feature_store_pyspark | 19 | 4
__init__.py | pyspark-sdk/src/feature_store_pyspark | 10 | 1
setup.cfg | pyspark-sdk | 7 | -
plugins.sbt | scala-spark-sdk/project | 5 | -
MANIFEST.in | pyspark-sdk | 4 | -
BaseException.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/exceptions | 2 | -
ValidationError.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/exceptions | 2 | -
StreamIngestionFailureException.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/exceptions | 2 | -
__init__.py | pyspark-sdk | 1 | -
__init__.py | pyspark-sdk/src | 1 | -
Files With Most Units (Top 12)
File | Folder | # lines | # units
FeatureStoreManager.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk | 251 | 8
FeatureGroupHelper.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 89 | 8
InputDataSchemaValidator.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/validators | 160 | 5
wrapper.py | pyspark-sdk/src/feature_store_pyspark | 23 | 4
FeatureStoreManager.py | pyspark-sdk/src/feature_store_pyspark | 19 | 4
BatchIngestionTest.py | pyspark-sdk/integration_test | 165 | 4
setup.py | pyspark-sdk | 90 | 3
FeatureGroupArnResolver.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 21 | 3
DataFrameRepartitioner.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 26 | 2
SparkSessionInitializer.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 71 | 2
__init__.py | pyspark-sdk/src/feature_store_pyspark | 10 | 1
ClientFactory.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 99 | 1
Files With Long Lines (Top 3)

There are 3 files with lines longer than 120 characters. In total, there are 8 long lines.

File | Folder | # lines | # units | # long lines
BatchIngestionTest.py | pyspark-sdk/integration_test | 165 | 4 | 5
FeatureStoreManager.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk | 251 | 8 | 2
FeatureGroupHelper.scala | scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers | 89 | 8 | 1
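
A long-line count like the one above can be approximated with a few lines of Python. This is a sketch under stated assumptions: the helper names `count_long_lines` and `count_long_lines_in_file` are hypothetical, and the report does not say whether it measures characters or bytes or how it treats trailing whitespace, so results may differ slightly from the figures in the table.

```python
from pathlib import Path


def count_long_lines(text: str, limit: int = 120) -> int:
    """Return how many lines in `text` are longer than `limit` characters."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def count_long_lines_in_file(path: str, limit: int = 120) -> int:
    """Same check for a file on disk (assumes UTF-8 encoded source)."""
    return count_long_lines(Path(path).read_text(encoding="utf-8"), limit)


# A line of exactly 120 characters is not long; 121 characters is.
sample = "short line\n" + "x" * 121 + "\n" + "y" * 120
print(count_long_lines(sample))  # prints 1
```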
Correlations

File Size vs. Commits (all time): 21 points

File | commits (all time) | lines of code
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/FeatureStoreManager.scala | 6 | 251
pyspark-sdk/src/feature_store_pyspark/FeatureStoreManager.py | 7 | 19
scala-spark-sdk/build.sbt | 10 | 102
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/exceptions/StreamIngestionFailureException.scala | 2 | 2
pyspark-sdk/integration_test/BatchIngestionTest.py | 4 | 165
pyspark-sdk/setup.py | 8 | 90
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/validators/InputDataSchemaValidator.scala | 5 | 160
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/ClientFactory.scala | 4 | 99
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/exceptions/BaseException.scala | 1 | 2
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/exceptions/ValidationError.scala | 3 | 2
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/DataFrameRepartitioner.scala | 3 | 26
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/FeatureGroupHelper.scala | 3 | 89
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/SparkSessionInitializer.scala | 3 | 71
pyspark-sdk/MANIFEST.in | 3 | 4
pyspark-sdk/__init__.py | 2 | 1
pyspark-sdk/setup.cfg | 2 | 7
pyspark-sdk/src/feature_store_pyspark/__init__.py | 4 | 10
pyspark-sdk/src/feature_store_pyspark/wrapper.py | 2 | 23
scala-spark-sdk/project/plugins.sbt | 3 | 5
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/FeatureGroupArnResolver.scala | 2 | 21

lines of code: min: 1.0 | average: 54.76 | 25th percentile: 3.0 | median: 21.0 | 75th percentile: 94.5 | max: 251.0
commits (all time): min: 1.0 | average: 3.76 | 25th percentile: 2.0 | median: 3.0 | 75th percentile: 4.5 | max: 10.0
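
The lines-of-code summary statistics can be recomputed from the 21 line counts in the Longest Files list. The sketch below is an illustration, not the report tool's code: `summarize` is a hypothetical helper, and the choice of the "exclusive" quantile method is an assumption made because it reproduces the published 25th and 75th percentiles.

```python
from statistics import mean, quantiles

# Line counts copied from the Longest Files list (21 files).
LINE_COUNTS = [251, 165, 160, 102, 99, 90, 89, 71, 26, 23, 21,
               19, 10, 7, 5, 4, 2, 2, 2, 1, 1]


def summarize(values):
    """Return min, average, quartiles and max for a list of numbers."""
    # Assumption: quartiles use the "exclusive" method, i.e. position p * (n + 1)
    # with linear interpolation between neighbouring sorted values.
    q1, median, q3 = quantiles(values, n=4, method="exclusive")
    return {
        "min": min(values),
        "average": round(mean(values), 2),
        "25th percentile": q1,
        "median": median,
        "75th percentile": q3,
        "max": max(values),
    }


print(summarize(LINE_COUNTS))
```

With 21 values the 25th percentile sits at position 5.5 in the sorted list, halfway between 2 and 4, which gives 3.0; the 75th percentile sits at position 16.5, halfway between 90 and 99, which gives 94.5.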

File Size vs. Contributors (all time): 21 points

File | contributors (all time) | lines of code
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/FeatureStoreManager.scala | 3 | 251
pyspark-sdk/src/feature_store_pyspark/FeatureStoreManager.py | 2 | 19
scala-spark-sdk/build.sbt | 2 | 102
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/exceptions/StreamIngestionFailureException.scala | 2 | 2
pyspark-sdk/integration_test/BatchIngestionTest.py | 2 | 165
pyspark-sdk/setup.py | 2 | 90
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/validators/InputDataSchemaValidator.scala | 2 | 160
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/ClientFactory.scala | 2 | 99
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/exceptions/BaseException.scala | 1 | 2
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/DataFrameRepartitioner.scala | 2 | 26
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/FeatureGroupHelper.scala | 2 | 89
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/SparkSessionInitializer.scala | 2 | 71
pyspark-sdk/MANIFEST.in | 2 | 4
pyspark-sdk/__init__.py | 2 | 1
pyspark-sdk/setup.cfg | 2 | 7
pyspark-sdk/src/feature_store_pyspark/__init__.py | 2 | 10
pyspark-sdk/src/feature_store_pyspark/wrapper.py | 2 | 23
scala-spark-sdk/project/plugins.sbt | 2 | 5
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/helpers/FeatureGroupArnResolver.scala | 2 | 21

lines of code: min: 1.0 | average: 54.76 | 25th percentile: 3.0 | median: 21.0 | 75th percentile: 94.5 | max: 251.0
contributors (all time): min: 1.0 | average: 2.0 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.0 | max: 3.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 1 point

File | commits (90d) | lines of code
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/FeatureStoreManager.scala | 1 | 251

lines of code: min: 251.0 | average: 251.0 | 25th percentile: 251.0 | median: 251.0 | 75th percentile: 251.0 | max: 251.0
commits (90d): min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

File Size vs. Contributors (90 days): 1 point

File | contributors (90d) | lines of code
scala-spark-sdk/src/main/scala/software/amazon/sagemaker/featurestore/sparksdk/FeatureStoreManager.scala | 1 | 251

lines of code: min: 251.0 | average: 251.0 | 25th percentile: 251.0 | median: 251.0 | 75th percentile: 251.0 | max: 251.0
contributors (90d): min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0