alibaba / GraphScope
Source Code Overview

Analysis scope, overview of main, test, generated, deployment, build, and other code.

Source Code Analysis Scope
Files includes and excluded from analyses
proto
txt
properties
cfg
in
tpl
clang-format
ini
gitmodules
gitattributes
lds
Overview of Analyzed Files
Basic stats on analyzed files
Intro
For analysis purposes we separate files in scope into several categories: main, test, generated, deployment and build, and other.

  • The main category contains all manually created source code files that are being used in the production.
  • Files in the main category are used as input for other analyses: logical decomposition, concerns, duplication, file size, unit size, and conditional complexity.
  • Test source code files are used only for testing of the product. These files are normally not deployed to production.
  • Build and deployment source code files are used to configure or support build and deployment process.
  • Generated source code files are automatically generated files that have not been manually changed after generation.
  • While a source code folder may contain a number of files, we are primarily interested in the source code files that are being written and maintained by developers.
  • Files containing binaries, documentation, or third-party libraries, for instance, are excluded from analysis. The exception are third-party libraries that have been changed by developers.

main429820 LOC (78%) 3,021 files
test72571 LOC (13%) 525 files
generated3 LOC (<1%) 1 file
build and deployment7272 LOC (1%) 51 files
other34615 LOC (6%) 250 files
Main Code
All manually created or maintained source code that defines logic of the product that is run in a production environment.
proto
tpl
cfg
in
lds
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*".
  • 3021 files match defined criteria (429,820 LOC, 100.0% vs. main code):
    • 1,200 *.java files (108,160 LOC)
    • 525 *.h files (99,006 LOC)
    • 454 *.rs files (89,926 LOC)
    • 167 *.cc files (43,768 LOC)
    • 392 *.py files (42,766 LOC)
    • 56 *.yaml files (18,609 LOC)
    • 22 *.ipynb files (9,525 LOC)
    • 65 *.scala files (8,829 LOC)
    • 50 *.proto files (4,966 LOC)
    • 50 *.toml files (1,193 LOC)
    • 3 *.g4 files (1,054 LOC)
    • 13 *.cmake files (843 LOC)
    • 4 *.tpl files (623 LOC)
    • 11 *.xml files (415 LOC)
    • 5 *.cfg files (130 LOC)
    • 3 *.in files (6 LOC)
    • 1 *.lds files (1 LOC)
  • " *.java" is biggest, containing 25.16% of LOC.
  • " *.lds" is smallest, containing 0% of LOC.


*.java108160 LOC (25%) 1,200 files
*.h99006 LOC (23%) 525 files
*.rs89926 LOC (20%) 454 files
*.cc43768 LOC (10%) 167 files
*.py42766 LOC (9%) 392 files
*.yaml18609 LOC (4%) 56 files
*.ipynb9525 LOC (2%) 22 files
*.scala8829 LOC (2%) 65 files
*.proto4966 LOC (1%) 50 files
*.toml1193 LOC (<1%) 50 files
*.g41054 LOC (<1%) 3 files
*.cmake843 LOC (<1%) 13 files
*.tpl623 LOC (<1%) 4 files
*.xml415 LOC (<1%) 11 files
*.cfg130 LOC (<1%) 5 files
*.in6 LOC (<1%) 3 files
*.lds1 LOC (<1%) 1 file
Test Code
Used only for testing of the product. Normally not deployed in a production environment.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*/[Tt]est/.*".
    • files with paths like ".*/[Tt]ests/.*".
    • files with paths like ".*/test_.*".
    • files with paths like ".*_test[.].*".
    • files with paths like ".*_tests[.].*".
    • files with paths like ".*/test[.].*".
  • 525 files match defined criteria (72,571 LOC, 16.9% vs. main code):
    • 192 *.java files (21,320 LOC)
    • 51 *.rs files (18,364 LOC)
    • 174 *.py files (17,452 LOC)
    • 36 *.cc files (8,292 LOC)
    • 25 *.yaml files (2,234 LOC)
    • 5 *.h files (1,849 LOC)
    • 25 *.sh files (1,639 LOC)
    • 1 *.c files (852 LOC)
    • 10 *.scala files (295 LOC)
    • 4 *.xml files (194 LOC)
    • 2 *.toml files (80 LOC)
  • " *.java" is biggest, containing 29.38% of LOC.
  • " *.toml" is smallest, containing 0.11% of LOC.


*.java21320 LOC (29%) 192 files
*.rs18364 LOC (25%) 51 files
*.py17452 LOC (24%) 174 files
*.cc8292 LOC (11%) 36 files
*.yaml2234 LOC (3%) 25 files
*.h1849 LOC (2%) 5 files
*.sh1639 LOC (2%) 25 files
*.c852 LOC (1%) 1 file
*.scala295 LOC (<1%) 10 files
*.xml194 LOC (<1%) 4 files
*.toml80 LOC (<1%) 2 files
Generated Code
Automatically generated files, not manually changed after generation.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*/generated/.*".
  • 1 file matches defined criteria (3 LOC, 0.0% vs. main code). All matches are in *.py files.


*.py3 LOC (100%) 1 file
Build and Deployment Code
Source code used to configure or support build and deployment process.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]sh".
    • files with paths like ".*[.]git[a-z]+".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*/[.]gitmodules".
    • files with paths like ".*/[.]gitattributes".
    • files with paths like ".*/pom[.]xml".
    • files with paths like ".*/build[.]xml".
    • files with paths like ".*/assembly[.]xml".
  • 51 files match defined criteria (7,272 LOC, 1.7% vs. main code):
    • 26 *.xml files (4,921 LOC)
    • 25 *.sh files (2,351 LOC)
  • " *.xml" is biggest, containing 67.67% of LOC.
  • " *.sh" is smallest, containing 32.33% of LOC.


*.xml4921 LOC (67%) 26 files
*.sh2351 LOC (32%) 25 files
Other Code
txt
properties
ini
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]md".
    • files with paths like ".*/README[.][a-z0-9]+".
    • files with paths like ".*[.]txt".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*[.]json".
    • files with paths like ".*/[Ee]xamples/.*".
    • files with paths like ".*/[.]dockerignore".
    • files with paths like ".*[.](rst|rest|resttxt|rsttxt)".
    • files with paths like ".*[.]properties".
    • files with paths like ".*[.]ini".
  • 250 files match defined criteria (34,615 LOC, 8.1% vs. main code):
    • 113 *.json files (24,155 LOC)
    • 61 *.md files (3,600 LOC)
    • 33 *.txt files (3,180 LOC)
    • 15 *.rs files (1,460 LOC)
    • 11 *.yaml files (1,144 LOC)
    • 6 *.properties files (297 LOC)
    • 2 *.java files (285 LOC)
    • 1 *.py files (278 LOC)
    • 5 *.toml files (171 LOC)
    • 1 *.xml files (29 LOC)
    • 2 *.ini files (16 LOC)
  • " *.json" is biggest, containing 69.78% of LOC.
  • " *.ini" is smallest, containing 0.05% of LOC.


*.json24155 LOC (69%) 113 files
*.md3600 LOC (10%) 61 files
*.txt3180 LOC (9%) 33 files
*.rs1460 LOC (4%) 15 files
*.yaml1144 LOC (3%) 11 files
*.properties297 LOC (<1%) 6 files
*.java285 LOC (<1%) 2 files
*.py278 LOC (<1%) 1 file
*.toml171 LOC (<1%) 5 files
*.xml29 LOC (<1%) 1 file
*.ini16 LOC (<1%) 2 files
Analyzers
Info about analyzers used for source code examinations.


2025-05-14 18:54