apache / doris
Source Code Overview

Analysis scope, overview of main, test, generated, deployment, build, and other code.

Source Code Analysis Scope
Files includes and excluded from analyses
txt
patch
tpl
properties
orc
proto
ini
mod
s
in
editorconfig
cfg
l
y
clang-format
gitmodules
clang-tidy
gitattributes
flex
Overview of Analyzed Files
Basic stats on analyzed files
Intro
For analysis purposes we separate files in scope into several categories: main, test, generated, deployment and build, and other.

  • The main category contains all manually created source code files that are being used in the production.
  • Files in the main category are used as input for other analyses: logical decomposition, concerns, duplication, file size, unit size, and conditional complexity.
  • Test source code files are used only for testing of the product. These files are normally not deployed to production.
  • Build and deployment source code files are used to configure or support build and deployment process.
  • Generated source code files are automatically generated files that have not been manually changed after generation.
  • While a source code folder may contain a number of files, we are primarily interested in the source code files that are being written and maintained by developers.
  • Files containing binaries, documentation, or third-party libraries, for instance, are excluded from analysis. The exception are third-party libraries that have been changed by developers.

main1480145 LOC (64%) 14,440 files
test695275 LOC (30%) 4,624 files
generated81 LOC (<1%) 1 file
build and deployment19866 LOC (<1%) 249 files
other95206 LOC (4%) 657 files
Main Code
All manually created or maintained source code that defines logic of the product that is run in a production environment.
orc
proto
tpl
flex
y
s
l
in
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*".
  • 14440 files match defined criteria (1,480,145 LOC, 100.0% vs. main code):
    • 5,024 *.java files (633,756 LOC)
    • 1,034 *.cpp files (264,712 LOC)
    • 2,280 *.groovy files (241,572 LOC)
    • 1,328 *.h files (156,904 LOC)
    • 4,166 *.sql files (114,002 LOC)
    • 121 *.py files (19,424 LOC)
    • 26 *.thrift files (7,391 LOC)
    • 147 *.hql files (5,676 LOC)
    • 27 *.hpp files (5,361 LOC)
    • 17 *.orc files (4,513 LOC)
    • 8 *.g4 files (4,493 LOC)
    • 63 *.c files (3,952 LOC)
    • 11 *.proto files (3,793 LOC)
    • 39 *.tsx files (2,648 LOC)
    • 19 *.cc files (2,424 LOC)
    • 32 *.tpl files (2,060 LOC)
    • 2 *.pl files (1,304 LOC)
    • 24 *.xml files (1,215 LOC)
    • 1 *.r files (1,149 LOC)
    • 1 *.flex files (782 LOC)
    • 10 *.go files (498 LOC)
    • 14 *.less files (478 LOC)
    • 3 *.rb files (318 LOC)
    • 9 *.ts files (318 LOC)
    • 3 *.cmake files (299 LOC)
    • 2 *.avsc files (205 LOC)
    • 1 *.y files (176 LOC)
    • 5 *.js files (165 LOC)
    • 3 *.mustache files (124 LOC)
    • 4 *.css files (110 LOC)
    • 5 *.yaml files (79 LOC)
    • 4 *.s files (71 LOC)
    • 1 *.l files (58 LOC)
    • 1 *.gemspec files (35 LOC)
    • 1 *.rdf files (29 LOC)
    • 2 *.html files (28 LOC)
    • 2 *.in files (23 LOC)
  • " *.java" is biggest, containing 42.82% of LOC.
  • " *.in" is smallest, containing 0% of LOC.


*.java633756 LOC (42%) 5,024 files
*.cpp264712 LOC (17%) 1,034 files
*.groovy241572 LOC (16%) 2,280 files
*.h156904 LOC (10%) 1,328 files
*.sql114002 LOC (7%) 4,166 files
*.py19424 LOC (1%) 121 files
*.thrift7391 LOC (<1%) 26 files
*.hql5676 LOC (<1%) 147 files
*.hpp5361 LOC (<1%) 27 files
*.orc4513 LOC (<1%) 17 files
*.g44493 LOC (<1%) 8 files
*.c3952 LOC (<1%) 63 files
*.proto3793 LOC (<1%) 11 files
*.tsx2648 LOC (<1%) 39 files
*.cc2424 LOC (<1%) 19 files
*.tpl2060 LOC (<1%) 32 files
*.pl1304 LOC (<1%) 2 files
*.xml1215 LOC (<1%) 24 files
*.r1149 LOC (<1%) 1 file
*.flex782 LOC (<1%) 1 file
*.go498 LOC (<1%) 10 files
*.less478 LOC (<1%) 14 files
*.rb318 LOC (<1%) 3 files
*.ts318 LOC (<1%) 9 files
*.cmake299 LOC (<1%) 3 files
*.avsc205 LOC (<1%) 2 files
*.y176 LOC (<1%) 1 file
*.js165 LOC (<1%) 5 files
*.mustache124 LOC (<1%) 3 files
*.css110 LOC (<1%) 4 files
*.yaml79 LOC (<1%) 5 files
*.s71 LOC (<1%) 4 files
*.l58 LOC (<1%) 1 file
*.gemspec35 LOC (<1%) 1 file
*.rdf29 LOC (<1%) 1 file
*.html28 LOC (<1%) 2 files
*.in23 LOC (<1%) 2 files
Test Code
Used only for testing of the product. Normally not deployed in a production environment.
orc
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*/[Tt]est/.*".
    • files with paths like ".*/test_.*".
    • files with paths like ".*/test[.].*".
    • files with paths like ".*_test[.].*".
    • files with paths like ".*_tests[.].*".
  • 4624 files match defined criteria (695,275 LOC, 47.0% vs. main code):
    • 2,775 *.groovy files (329,019 LOC)
    • 1,008 *.java files (154,205 LOC)
    • 519 *.cpp files (139,505 LOC)
    • 166 *.py files (60,713 LOC)
    • 51 *.h files (6,800 LOC)
    • 8 *.orc files (1,872 LOC)
    • 21 *.hql files (1,296 LOC)
    • 45 *.sql files (1,242 LOC)
    • 24 *.sh files (295 LOC)
    • 1 *.go files (251 LOC)
    • 5 *.xml files (75 LOC)
    • 1 *.cc files (2 LOC)
  • " *.groovy" is biggest, containing 47.32% of LOC.
  • " *.cc" is smallest, containing 0% of LOC.


*.groovy329019 LOC (47%) 2,775 files
*.java154205 LOC (22%) 1,008 files
*.cpp139505 LOC (20%) 519 files
*.py60713 LOC (8%) 166 files
*.h6800 LOC (<1%) 51 files
*.orc1872 LOC (<1%) 8 files
*.hql1296 LOC (<1%) 21 files
*.sql1242 LOC (<1%) 45 files
*.sh295 LOC (<1%) 24 files
*.go251 LOC (<1%) 1 file
*.xml75 LOC (<1%) 5 files
*.cc2 LOC (<1%) 1 file
Generated Code
Automatically generated files, not manually changed after generation.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*/package[-]lock[.]json".
  • 1 file matches defined criteria (81 LOC, 0.0% vs. main code). All matches are in *.json files.


*.json81 LOC (100%) 1 file
Build and Deployment Code
Source code used to configure or support build and deployment process.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*/pom[.]xml".
    • files with paths like ".*[.]sh".
    • files with paths like ".*[.]git[a-z]+".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*/package[.]json".
    • files with paths like ".*([.]|/)webpack([.]|/).*".
    • files with paths like ".*/[.]gitmodules".
    • files with paths like ".*/assembly[.]xml".
    • files with paths like ".*/[.]gitattributes".
    • files with paths like ".*/package[-]lock[.]json".
    • files with paths like ".*/docker[-]compose[.]yaml".
  • 249 files match defined criteria (19,866 LOC, 1.3% vs. main code):
    • 211 *.sh files (13,087 LOC)
    • 31 *.xml files (6,249 LOC)
    • 3 *.yaml files (266 LOC)
    • 4 *.js files (264 LOC)
  • " *.sh" is biggest, containing 65.88% of LOC.
  • " *.js" is smallest, containing 1.33% of LOC.


*.sh13087 LOC (65%) 211 files
*.xml6249 LOC (31%) 31 files
*.yaml266 LOC (1%) 3 files
*.js264 LOC (1%) 4 files
Other Code
txt
proto
properties
mod
ini
cfg
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]properties".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*[.]ini".
    • files with paths like ".*[.]md".
    • files with paths like ".*/README[.][a-z0-9]+".
    • files with paths like ".*[.]txt".
    • files with paths like ".*[.]json".
    • files with paths like ".*/checkstyle.*".
    • files with paths like ".*/checkstyle[.]xml".
    • files with paths like ".*[.]patch".
    • files with paths like ".*/LICENSE[.][a-z0-9]+".
    • files with paths like ".*/go[.]mod".
    • files with paths like ".*[.]svg".
    • files with paths like ".*/[Ss]amples/.*".
    • files with paths like ".*/[Dd]emos?/.*".
    • files with paths like ".*[.]editorconfig".
    • files with paths like ".*/[.]dockerignore".
  • 657 files match defined criteria (95,206 LOC, 6.4% vs. main code):
    • 244 *.json files (41,211 LOC)
    • 132 *.txt files (37,761 LOC)
    • 70 *.java files (4,251 LOC)
    • 73 *.md files (3,947 LOC)
    • 23 *.xml files (1,871 LOC)
    • 12 *.proto files (1,650 LOC)
    • 28 *.properties files (1,559 LOC)
    • 32 *.sh files (871 LOC)
    • 6 *.py files (439 LOC)
    • 12 *.sql files (325 LOC)
    • 3 *.cpp files (310 LOC)
    • 3 *.go files (250 LOC)
    • 4 *.mod files (246 LOC)
    • 7 *.ini files (218 LOC)
    • 1 *.js files (65 LOC)
    • 1 *.php files (62 LOC)
    • 1 *.scala files (56 LOC)
    • 1 *.rs files (39 LOC)
    • 1 *.toml files (30 LOC)
    • 1 *.svg files (21 LOC)
    • 1 *.h files (19 LOC)
    • 1 *.cfg files (5 LOC)
  • " *.json" is biggest, containing 43.29% of LOC.
  • " *.cfg" is smallest, containing 0.01% of LOC.


*.json41211 LOC (43%) 244 files
*.txt37761 LOC (39%) 132 files
*.java4251 LOC (4%) 70 files
*.md3947 LOC (4%) 73 files
*.xml1871 LOC (1%) 23 files
*.proto1650 LOC (1%) 12 files
*.properties1559 LOC (1%) 28 files
*.sh871 LOC (<1%) 32 files
*.py439 LOC (<1%) 6 files
*.sql325 LOC (<1%) 12 files
*.cpp310 LOC (<1%) 3 files
*.go250 LOC (<1%) 3 files
*.mod246 LOC (<1%) 4 files
*.ini218 LOC (<1%) 7 files
*.js65 LOC (<1%) 1 file
*.php62 LOC (<1%) 1 file
*.scala56 LOC (<1%) 1 file
*.rs39 LOC (<1%) 1 file
*.toml30 LOC (<1%) 1 file
*.svg21 LOC (<1%) 1 file
*.h19 LOC (<1%) 1 file
*.cfg5 LOC (<1%) 1 file
Analyzers
Info about analyzers used for source code examinations.


2025-05-07 17:05