aws-samples / aws-glue-samples
Unit Size

The distribution of size of units (measured in lines of code).

Intro
  • Unit size measurements show the distribution of size of units of code (methods, functions...).
  • Units are classified in four categories based on their size (lines of code): 1-20 (small units), 20-50 (medium size units), 51-100 (long units), 101+ (very long units).
  • You should aim at keeping units small (< 20 lines). Long units may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
Unit Size Overall
  • There are 243 units with 2,318 lines of code in units (54.3% of code).
    • 0 very long units (0 lines of code)
    • 2 long units (105 lines of code)
    • 29 medium size units (794 lines of code)
    • 43 small units (622 lines of code)
    • 169 very small units (797 lines of code)
0% | 4% | 34% | 26% | 34%
Legend:
101+
51-100
21-50
11-20
1-10
Unit Size per Extension
101+
51-100
21-50
11-20
1-10
py0% | 8% | 31% | 26% | 33%
scala0% | 0% | 38% | 30% | 31%
java0% | 0% | 35% | 21% | 43%
Unit Size per Logical Component
primary logical decomposition
101+
51-100
21-50
11-20
1-10
utilities/Hive_metastore_migration/src0% | 9% | 31% | 25% | 33%
GlueCustomConnectors/development/Spark0% | 0% | 29% | 30% | 39%
GlueCustomConnectors/gluescripts/withoutConnection0% | 0% | 73% | 26% | 0%
GlueCustomConnectors/gluescripts/withConnection0% | 0% | 72% | 27% | 0%
utilities/Crawler_undo_redo/src0% | 0% | 31% | 31% | 37%
GlueCustomConnectors/development/Athena0% | 0% | 21% | 28% | 50%
GlueCustomConnectors/localValidation0% | 0% | 0% | 10% | 89%
Alternative Visuals
Longest Units
Top 20 longest units
Unit# linesMcCabe index# params
def main()
in utilities/Hive_metastore_migration/src/export_from_datacatalog.py
53 6 0
def main()
in utilities/Hive_metastore_migration/src/import_into_datacatalog.py
52 9 0
def main()
in GlueCustomConnectors/development/Spark/glue-3.0/tpcds-custom-connector-for-glue3.0/jobvalidation/scala/GlueJobValidationDataSchemaTest.scala
50 1 1
def main()
in GlueCustomConnectors/development/Spark/glue-3.0/tpcds-custom-connector-for-glue3.0/jobvalidation/scala/GlueJobValidationDataPartitioningTest.scala
45 1 1
def main()
in GlueCustomConnectors/gluescripts/withoutConnection/JDBCSalesforce.scala
35 1 1
def transform_storage_descriptors()
in utilities/Hive_metastore_migration/src/hive_metastore_migration.py
34 1 11
def crawler_undo_options()
in utilities/Crawler_undo_redo/src/crawler_undo.py
32 5 1
def transform()
in utilities/Hive_metastore_migration/src/hive_metastore_migration.py
31 1 2
def transform_tables()
in utilities/Hive_metastore_migration/src/hive_metastore_migration.py
30 1 5
override def planInputPartitions()
in GlueCustomConnectors/development/Spark/glue-3.0/tpcds-custom-connector-for-glue3.0/src/main/scala/com/amazonaws/services/glue/marketplace/connector/tpcds/TPCDSScan.scala
30 7 0
def main()
in GlueCustomConnectors/gluescripts/withConnection/JDBCSalesforce.scala
30 1 1
def validate_aws_regions()
in utilities/Hive_metastore_migration/src/hive_metastore_migration.py
29 4 1
def transform_partitions()
in utilities/Hive_metastore_migration/src/hive_metastore_migration.py
28 1 6
def parse_arguments()
in utilities/Hive_metastore_migration/src/hive_metastore_migration.py
26 3 1
def main()
in GlueCustomConnectors/gluescripts/withoutConnection/SparkSnowflake.scala
26 1 1
def extract_from_sds_skewed_info()
in utilities/Hive_metastore_migration/src/hive_metastore_migration.py
25 1 3
def main()
in GlueCustomConnectors/gluescripts/withConnection/SparkSnowflake.scala
25 1 1
def main()
in GlueCustomConnectors/gluescripts/withoutConnection/JDBCSalesforce.java
25 1 1
def transform_skewed_values_and_loc_map()
in utilities/Hive_metastore_migration/src/hive_metastore_migration.py
24 1 3
def extract_sds()
in utilities/Hive_metastore_migration/src/hive_metastore_migration.py
24 1 3