apache / arrow-java
Source Code Overview

Analysis scope, overview of main, test, generated, deployment, build, and other code.

Source Code Analysis Scope
Files includes and excluded from analyses
txt
proto
gitattributes
editorconfig
gitmodules
properties
Overview of Analyzed Files
Basic stats on analyzed files
Intro
For analysis purposes we separate files in scope into several categories: main, test, generated, deployment and build, and other.

  • The main category contains all manually created source code files that are being used in the production.
  • Files in the main category are used as input for other analyses: logical decomposition, concerns, duplication, file size, unit size, and conditional complexity.
  • Test source code files are used only for testing of the product. These files are normally not deployed to production.
  • Build and deployment source code files are used to configure or support build and deployment process.
  • Generated source code files are automatically generated files that have not been manually changed after generation.
  • While a source code folder may contain a number of files, we are primarily interested in the source code files that are being written and maintained by developers.
  • Files containing binaries, documentation, or third-party libraries, for instance, are excluded from analysis. The exception are third-party libraries that have been changed by developers.

main86055 LOC (45%) 889 files
test91627 LOC (48%) 438 files
generated0 LOC (0%) 0 files
build and deployment4711 LOC (2%) 39 files
other4886 LOC (2%) 33 files
Main Code
All manually created or maintained source code that defines logic of the product that is run in a production environment.
proto
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*".
  • 889 files match defined criteria (86,055 LOC, 100.0% vs. main code):
    • 865 *.java files (80,269 LOC)
    • 7 *.cc files (2,518 LOC)
    • 3 *.proto files (2,504 LOC)
    • 7 *.h files (286 LOC)
    • 1 *.cpp files (230 LOC)
    • 2 *.yaml files (122 LOC)
    • 2 *.ftl files (66 LOC)
    • 1 *.py files (33 LOC)
    • 1 *.xml files (27 LOC)
  • " *.java" is biggest, containing 93.28% of LOC.
  • " *.xml" is smallest, containing 0.03% of LOC.


*.java80269 LOC (93%) 865 files
*.cc2518 LOC (2%) 7 files
*.proto2504 LOC (2%) 3 files
*.h286 LOC (<1%) 7 files
*.cpp230 LOC (<1%) 1 file
*.yaml122 LOC (<1%) 2 files
*.ftl66 LOC (<1%) 2 files
*.py33 LOC (<1%) 1 file
*.xml27 LOC (<1%) 1 file
Test Code
Used only for testing of the product. Normally not deployed in a production environment.
proto
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*/[Tt]est/.*".
    • files with paths like ".*_test[.].*".
    • files with paths like ".*_tests[.].*".
    • files with paths like ".*/test[.].*".
    • files with paths like ".*[-]tests/.*".
    • files with paths like ".*/[Tt]ests/.*".
    • files with paths like ".*/test_.*".
  • 438 files match defined criteria (91,627 LOC, 106.5% vs. main code):
    • 349 *.java files (89,111 LOC)
    • 68 *.avsc files (1,749 LOC)
    • 1 *.py files (298 LOC)
    • 15 *.xml files (282 LOC)
    • 1 *.cc files (86 LOC)
    • 2 *.proto files (58 LOC)
    • 1 *.sh files (33 LOC)
    • 1 *.sql files (10 LOC)
  • " *.java" is biggest, containing 97.25% of LOC.
  • " *.sql" is smallest, containing 0.01% of LOC.


*.java89111 LOC (97%) 349 files
*.avsc1749 LOC (1%) 68 files
*.py298 LOC (<1%) 1 file
*.xml282 LOC (<1%) 15 files
*.cc86 LOC (<1%) 1 file
*.proto58 LOC (<1%) 2 files
*.sh33 LOC (<1%) 1 file
*.sql10 LOC (<1%) 1 file
Build and Deployment Code
Source code used to configure or support build and deployment process.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*/pom[.]xml".
    • files with paths like ".*[.]sh".
    • files with paths like ".*[.]dockerfile".
    • files with paths like ".*[.]git[a-z]+".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*/[.]gitmodules".
    • files with paths like ".*/[.]gitattributes".
  • 39 files match defined criteria (4,711 LOC, 5.5% vs. main code):
    • 24 *.xml files (3,592 LOC)
    • 13 *.sh files (1,051 LOC)
    • 2 *.dockerfile files (68 LOC)
  • " *.xml" is biggest, containing 76.25% of LOC.
  • " *.dockerfile" is smallest, containing 1.44% of LOC.


*.xml3592 LOC (76%) 24 files
*.sh1051 LOC (22%) 13 files
*.dockerfile68 LOC (1%) 2 files
Other Code
txt
properties
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]txt".
    • files with paths like ".*/LICENSE[.][a-z0-9]+".
    • files with paths like ".*[.]json".
    • files with paths like ".*[.]md".
    • files with paths like ".*/README[.][a-z0-9]+".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*/checkstyle[.]xml".
    • files with paths like ".*/checkstyle.*".
    • files with paths like ".*[.](rst|rest|resttxt|rsttxt)".
    • files with paths like ".*[.]editorconfig".
    • files with paths like ".*[.]properties".
  • 33 files match defined criteria (4,886 LOC, 5.7% vs. main code):
    • 16 *.txt files (3,498 LOC)
    • 5 *.json files (645 LOC)
    • 8 *.md files (509 LOC)
    • 2 *.xml files (200 LOC)
    • 1 *.properties files (18 LOC)
    • 1 *.rst files (16 LOC)
  • " *.txt" is biggest, containing 71.59% of LOC.
  • " *.rst" is smallest, containing 0.33% of LOC.


*.txt3498 LOC (71%) 16 files
*.json645 LOC (13%) 5 files
*.md509 LOC (10%) 8 files
*.xml200 LOC (4%) 2 files
*.properties18 LOC (<1%) 1 file
*.rst16 LOC (<1%) 1 file
Analyzers
Info about analyzers used for source code examinations.


2025-05-07 21:05