apache / kudu
Source Code Overview

Analysis scope, overview of main, test, generated, deployment, build, and other code.

Source Code Analysis Scope
Files includes and excluded from analyses
txt
patch
proto
properties
in
pxd
xsl
cfg
tex
clang-format
clang-tidy
ini
tpl
bib
Overview of Analyzed Files
Basic stats on analyzed files
Intro
For analysis purposes we separate files in scope into several categories: main, test, generated, deployment and build, and other.

  • The main category contains all manually created source code files that are being used in the production.
  • Files in the main category are used as input for other analyses: logical decomposition, concerns, duplication, file size, unit size, and conditional complexity.
  • Test source code files are used only for testing of the product. These files are normally not deployed to production.
  • Build and deployment source code files are used to configure or support build and deployment process.
  • Generated source code files are automatically generated files that have not been manually changed after generation.
  • While a source code folder may contain a number of files, we are primarily interested in the source code files that are being written and maintained by developers.
  • Files containing binaries, documentation, or third-party libraries, for instance, are excluded from analysis. The exception are third-party libraries that have been changed by developers.

main373794 LOC (78%) 1,669 files
test83530 LOC (17%) 294 files
generated0 LOC (0%) 0 files
build and deployment3916 LOC (<1%) 56 files
other14763 LOC (3%) 123 files
Main Code
All manually created or maintained source code that defines logic of the product that is run in a production environment.
proto
pxd
tpl
in
cfg
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*".
  • 1669 files match defined criteria (373,794 LOC, 100.0% vs. main code):
    • 713 *.cc files (257,588 LOC)
    • 580 *.h files (68,006 LOC)
    • 180 *.java files (23,045 LOC)
    • 41 *.proto files (6,485 LOC)
    • 32 *.py files (4,692 LOC)
    • 25 *.scala files (3,729 LOC)
    • 3 *.pyx files (2,044 LOC)
    • 33 *.cmake files (1,775 LOC)
    • 1 *.thrift files (1,664 LOC)
    • 17 *.mustache files (1,132 LOC)
    • 8 *.clj files (820 LOC)
    • 5 *.pxd files (627 LOC)
    • 5 *.pl files (528 LOC)
    • 8 *.yaml files (512 LOC)
    • 3 *.c files (371 LOC)
    • 5 *.xml files (309 LOC)
    • 2 *.js files (288 LOC)
    • 1 *.tpl files (79 LOC)
    • 3 *.in files (40 LOC)
    • 1 *.css files (37 LOC)
    • 2 *.html files (21 LOC)
    • 1 *.cfg files (2 LOC)
  • " *.cc" is biggest, containing 68.91% of LOC.
  • " *.cfg" is smallest, containing 0% of LOC.


*.cc257588 LOC (68%) 713 files
*.h68006 LOC (18%) 580 files
*.java23045 LOC (6%) 180 files
*.proto6485 LOC (1%) 41 files
*.py4692 LOC (1%) 32 files
*.scala3729 LOC (<1%) 25 files
*.pyx2044 LOC (<1%) 3 files
*.cmake1775 LOC (<1%) 33 files
*.thrift1664 LOC (<1%) 1 file
*.mustache1132 LOC (<1%) 17 files
*.clj820 LOC (<1%) 8 files
*.pxd627 LOC (<1%) 5 files
*.pl528 LOC (<1%) 5 files
*.yaml512 LOC (<1%) 8 files
*.c371 LOC (<1%) 3 files
*.xml309 LOC (<1%) 5 files
*.js288 LOC (<1%) 2 files
*.tpl79 LOC (<1%) 1 file
*.in40 LOC (<1%) 3 files
*.css37 LOC (<1%) 1 file
*.html21 LOC (<1%) 2 files
*.cfg2 LOC (<1%) 1 file
Test Code
Used only for testing of the product. Normally not deployed in a production environment.
proto
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*/[Tt]est/.*".
    • files with paths like ".*_test[.].*".
    • files with paths like ".*[-]test[-].*".
    • files with paths like ".*[-]tests/.*".
    • files with paths like ".*/test_.*".
    • files with paths like ".*/[Tt]ests/.*".
  • 294 files match defined criteria (83,530 LOC, 22.3% vs. main code):
    • 114 *.cc files (48,172 LOC)
    • 98 *.java files (22,286 LOC)
    • 41 *.h files (6,220 LOC)
    • 16 *.scala files (3,367 LOC)
    • 12 *.py files (3,126 LOC)
    • 3 *.proto files (123 LOC)
    • 1 *.sh files (77 LOC)
    • 1 *.clj files (62 LOC)
    • 1 *.gradle files (48 LOC)
    • 1 *.yaml files (27 LOC)
    • 6 *.xml files (22 LOC)
  • " *.cc" is biggest, containing 57.67% of LOC.
  • " *.xml" is smallest, containing 0.03% of LOC.


*.cc48172 LOC (57%) 114 files
*.java22286 LOC (26%) 98 files
*.h6220 LOC (7%) 41 files
*.scala3367 LOC (4%) 16 files
*.py3126 LOC (3%) 12 files
*.proto123 LOC (<1%) 3 files
*.sh77 LOC (<1%) 1 file
*.clj62 LOC (<1%) 1 file
*.gradle48 LOC (<1%) 1 file
*.yaml27 LOC (<1%) 1 file
*.xml22 LOC (<1%) 6 files
Build and Deployment Code
Source code used to configure or support build and deployment process.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]gradle".
    • files with paths like ".*[.]git[a-z]+".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*/pom[.]xml".
    • files with paths like ".*[.]sh".
  • 56 files match defined criteria (3,916 LOC, 1.0% vs. main code):
    • 31 *.sh files (2,815 LOC)
    • 25 *.gradle files (1,101 LOC)
  • " *.sh" is biggest, containing 71.88% of LOC.
  • " *.gradle" is smallest, containing 28.12% of LOC.


*.sh2815 LOC (71%) 31 files
*.gradle1101 LOC (28%) 25 files
Other Code
txt
properties
ini
cfg
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]properties".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*[.]adoc".
    • files with paths like ".*/README[.][a-z0-9]+".
    • files with paths like ".*/config/checkstyle/.*".
    • files with paths like ".*/checkstyle[.]xml".
    • files with paths like ".*/checkstyle.*".
    • files with paths like ".*/[Ee]xamples/.*".
    • files with paths like ".*[.]md".
    • files with paths like ".*[.]txt".
    • files with paths like ".*/LICENSE[.][a-z0-9]+".
    • files with paths like ".*[.]ini".
    • files with paths like ".*[.]patch".
    • files with paths like ".*[.]json".
    • files with paths like ".*/[.]dockerignore".
  • 123 files match defined criteria (14,763 LOC, 3.9% vs. main code):
    • 59 *.txt files (8,600 LOC)
    • 20 *.adoc files (2,539 LOC)
    • 8 *.xml files (1,531 LOC)
    • 2 *.cc files (530 LOC)
    • 6 *.py files (491 LOC)
    • 4 *.java files (425 LOC)
    • 13 *.properties files (386 LOC)
    • 4 *.md files (124 LOC)
    • 3 *.scala files (111 LOC)
    • 2 *.sbt files (20 LOC)
    • 1 *.ini files (4 LOC)
    • 1 *.cfg files (2 LOC)
  • " *.txt" is biggest, containing 58.25% of LOC.
  • " *.cfg" is smallest, containing 0.01% of LOC.


*.txt8600 LOC (58%) 59 files
*.adoc2539 LOC (17%) 20 files
*.xml1531 LOC (10%) 8 files
*.cc530 LOC (3%) 2 files
*.py491 LOC (3%) 6 files
*.java425 LOC (2%) 4 files
*.properties386 LOC (2%) 13 files
*.md124 LOC (<1%) 4 files
*.scala111 LOC (<1%) 3 files
*.sbt20 LOC (<1%) 2 files
*.ini4 LOC (<1%) 1 file
*.cfg2 LOC (<1%) 1 file
Analyzers
Info about analyzers used for source code examinations.


2025-05-07 15:58