apache / incubator-pegasus
Source Code Overview

Analysis scope, overview of main, test, generated, deployment, build, and other code.

Source Code Analysis Scope
Files includes and excluded from analyses
ini
txt
patch
properties
mod
clang-format
Overview of Analyzed Files
Basic stats on analyzed files
Intro
For analysis purposes we separate files in scope into several categories: main, test, generated, deployment and build, and other.

  • The main category contains all manually created source code files that are being used in the production.
  • Files in the main category are used as input for other analyses: logical decomposition, concerns, duplication, file size, unit size, and conditional complexity.
  • Test source code files are used only for testing of the product. These files are normally not deployed to production.
  • Build and deployment source code files are used to configure or support build and deployment process.
  • Generated source code files are automatically generated files that have not been manually changed after generation.
  • While a source code folder may contain a number of files, we are primarily interested in the source code files that are being written and maintained by developers.
  • Files containing binaries, documentation, or third-party libraries, for instance, are excluded from analysis. The exception are third-party libraries that have been changed by developers.

main138590 LOC (61%) 931 files
test63124 LOC (28%) 357 files
generated0 LOC (0%) 0 files
build and deployment5660 LOC (2%) 55 files
other17594 LOC (7%) 211 files
Main Code
All manually created or maintained source code that defines logic of the product that is run in a production environment.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*".
  • 931 files match defined criteria (138,590 LOC, 100.0% vs. main code):
    • 254 *.cpp files (75,423 LOC)
    • 346 *.h files (32,171 LOC)
    • 153 *.go files (11,422 LOC)
    • 109 *.java files (9,862 LOC)
    • 20 *.py files (2,820 LOC)
    • 10 *.js files (1,877 LOC)
    • 2 *.c files (1,624 LOC)
    • 18 *.thrift files (1,595 LOC)
    • 8 *.scala files (823 LOC)
    • 6 *.cmake files (693 LOC)
    • 2 *.xml files (249 LOC)
    • 2 *.sbt files (26 LOC)
    • 1 *.yml files (5 LOC)
  • " *.cpp" is biggest, containing 54.42% of LOC.
  • " *.yml" is smallest, containing 0% of LOC.


*.cpp75423 LOC (54%) 254 files
*.h32171 LOC (23%) 346 files
*.go11422 LOC (8%) 153 files
*.java9862 LOC (7%) 109 files
*.py2820 LOC (2%) 20 files
*.js1877 LOC (1%) 10 files
*.c1624 LOC (1%) 2 files
*.thrift1595 LOC (1%) 18 files
*.scala823 LOC (<1%) 8 files
*.cmake693 LOC (<1%) 6 files
*.xml249 LOC (<1%) 2 files
*.sbt26 LOC (<1%) 2 files
*.yml5 LOC (<1%) 1 file
Test Code
Used only for testing of the product. Normally not deployed in a production environment.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*_test[.].*".
    • files with paths like ".*/[Tt]est/.*".
    • files with paths like ".*/[Tt]ests/.*".
    • files with paths like ".*/test_.*".
    • files with paths like ".*[.][Tt]est[.].*".
    • files with paths like ".*[.]test[.].*".
    • files with paths like ".*[-]test[-].*".
  • 357 files match defined criteria (63,124 LOC, 45.5% vs. main code):
    • 220 *.cpp files (47,163 LOC)
    • 28 *.java files (8,530 LOC)
    • 41 *.h files (2,923 LOC)
    • 25 *.go files (2,517 LOC)
    • 4 *.py files (801 LOC)
    • 37 *.sh files (667 LOC)
    • 1 *.js files (291 LOC)
    • 1 *.scala files (232 LOC)
  • " *.cpp" is biggest, containing 74.71% of LOC.
  • " *.scala" is smallest, containing 0.37% of LOC.


*.cpp47163 LOC (74%) 220 files
*.java8530 LOC (13%) 28 files
*.h2923 LOC (4%) 41 files
*.go2517 LOC (3%) 25 files
*.py801 LOC (1%) 4 files
*.sh667 LOC (1%) 37 files
*.js291 LOC (<1%) 1 file
*.scala232 LOC (<1%) 1 file
Build and Deployment Code
Source code used to configure or support build and deployment process.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]sh".
    • files with paths like ".*/docker[-]compose[.]yml".
    • files with paths like ".*/package[.]json".
    • files with paths like ".*/pom[.]xml".
    • files with paths like ".*[.]git[a-z]+".
    • files with paths like ".*/[.]gitignore".
  • 55 files match defined criteria (5,660 LOC, 4.1% vs. main code):
    • 53 *.sh files (5,180 LOC)
    • 1 *.xml files (461 LOC)
    • 1 *.yml files (19 LOC)
  • " *.sh" is biggest, containing 91.52% of LOC.
  • " *.yml" is smallest, containing 0.34% of LOC.


*.sh5180 LOC (91%) 53 files
*.xml461 LOC (8%) 1 file
*.yml19 LOC (<1%) 1 file
Other Code
ini
txt
properties
mod
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*/go[.]mod".
    • files with paths like ".*[.]json".
    • files with paths like ".*[.]md".
    • files with paths like ".*/README[.][a-z0-9]+".
    • files with paths like ".*[.]ini".
    • files with paths like ".*[.]txt".
    • files with paths like ".*[.]properties".
    • files with paths like ".*/checkstyle.*".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*[.]patch".
  • 211 files match defined criteria (17,594 LOC, 12.7% vs. main code):
    • 97 *.ini files (12,673 LOC)
    • 83 *.txt files (3,105 LOC)
    • 20 *.md files (1,501 LOC)
    • 5 *.properties files (132 LOC)
    • 3 *.mod files (109 LOC)
    • 2 *.json files (45 LOC)
    • 1 *.xml files (29 LOC)
  • " *.ini" is biggest, containing 72.03% of LOC.
  • " *.xml" is smallest, containing 0.16% of LOC.


*.ini12673 LOC (72%) 97 files
*.txt3105 LOC (17%) 83 files
*.md1501 LOC (8%) 20 files
*.properties132 LOC (<1%) 5 files
*.mod109 LOC (<1%) 3 files
*.json45 LOC (<1%) 2 files
*.xml29 LOC (<1%) 1 file
Analyzers
Info about analyzers used for source code examinations.


2023-08-10 15:57