facebook / mysql-5.6
Source Code Overview

Analysis scope, overview of main, test, generated, deployment, build, and other code.

Source Code Analysis Scope
Files includes and excluded from analyses
txt
yy
cmake
t
properties
cfg
in
patch
ctl
frm
ini
dsp
m4
awk
g
l
spec
tst
gdb
plist
y
watchmanconfig
dfm
arcconfig
i
bzrignore
mysql
clang-format
wxs
gitmodules
gitattributes
  • 56 extensions are included in analyses: inc, h, c, cpp, java, hpp, cc, txt, sh, pm, py, yy, pl, rst, cmake, t, xml, sql, properties, cfg, in, patch, ctl, frm, ini, bat, d, php, dsp, pp, html, js, m4, awk, g, l, spec, tst, gdb, plist, y, css, watchmanconfig, dfm, gitignore, arcconfig, i, r, bzrignore, mysql, yaml, clang-format, wxs, gitmodules, gitattributes, json
  • 6 criteria are used to exclude files from analysis:
    • exclude files with path like ".*/[.][a-zA-Z0-9_]+.*" (Hidden files and folders) (9 files).
    • exclude files with path like ".*[.]m4" (stuff autogenerated by autoconf - still C deps) (4 files).
    • exclude files with path like ".*/docs/.*" (Documentation) (13 files).
    • exclude files with path like "(?i).*/jquery.*[.]js" (jQuery files) (1 file).
    • exclude files with path like ".*[.]min[.]js" (Minimized JS library) (0 files).
    • exclude files with path like ".*jquery[.].*[.]js" (jQuery library) (0 files).
Overview of Analyzed Files
Basic stats on analyzed files
Intro
For analysis purposes we separate files in scope into several categories: main, test, generated, deployment and build, and other.

  • The main category contains all manually created source code files that are being used in the production.
  • Files in the main category are used as input for other analyses: logical decomposition, concerns, duplication, file size, unit size, and conditional complexity.
  • Test source code files are used only for testing of the product. These files are normally not deployed to production.
  • Build and deployment source code files are used to configure or support build and deployment process.
  • Generated source code files are automatically generated files that have not been manually changed after generation.
  • While a source code folder may contain a number of files, we are primarily interested in the source code files that are being written and maintained by developers.
  • Files containing binaries, documentation, or third-party libraries, for instance, are excluded from analysis. The exception are third-party libraries that have been changed by developers.

main1313215 LOC (77%) 4918 files
test276483 LOC (16%) 1381 files
generated1155 LOC (<1%) 10 files
build and deployment34457 LOC (2%) 134 files
other70851 LOC (4%) 473 files
Main Code
All manually created or maintained source code that defines logic of the product that is run in a production environment.
cfg
in
cmake
t
g
awk
y
l
i
spec
ctl
wxs
frm
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*".
  • 4918 files match defined criteria (1,313,215 lines of code, 100.0% vs. main code):
    • 562 *.cc files (435,074 lines of code)
    • 442 *.cpp files (230,747 lines of code)
    • 543 *.c files (218,092 lines of code)
    • 892 *.h files (132,239 lines of code)
    • 1,150 *.inc files (108,388 lines of code)
    • 534 *.hpp files (72,903 lines of code)
    • 411 *.java files (40,702 lines of code)
    • 25 *.cfg files (13,449 lines of code)
    • 56 *.pl files (11,218 lines of code)
    • 33 *.in files (10,152 lines of code)
    • 63 *.cmake files (7,074 lines of code)
    • 17 *.py files (6,064 lines of code)
    • 22 *.pm files (4,659 lines of code)
    • 28 *.xml files (4,098 lines of code)
    • 55 *.t files (3,091 lines of code)
    • 9 *.pp files (2,983 lines of code)
    • 3 *.g files (2,186 lines of code)
    • 18 *.sql files (2,036 lines of code)
    • 1 *.css files (1,366 lines of code)
    • 1 *.r files (1,153 lines of code)
    • 4 *.awk files (898 lines of code)
    • 2 *.y files (874 lines of code)
    • 3 *.l files (626 lines of code)
    • 1 *.i files (561 lines of code)
    • 10 *.php files (556 lines of code)
    • 5 *.html files (524 lines of code)
    • 10 *.d files (412 lines of code)
    • 3 *.js files (411 lines of code)
    • 3 *.spec files (339 lines of code)
    • 9 *.ctl files (291 lines of code)
    • 1 *.wxs files (21 lines of code)
    • 1 *.yaml files (16 lines of code)
    • 1 *.frm files (12 lines of code)
  • " *.cc" is biggest, containing 33.13% of code.
  • " *.frm" is smallest, containing 0% of code.


*.cc435074 LOC (33%) 562 files
*.cpp230747 LOC (17%) 442 files
*.c218092 LOC (16%) 543 files
*.h132239 LOC (10%) 892 files
*.inc108388 LOC (8%) 1150 files
*.hpp72903 LOC (5%) 534 files
*.java40702 LOC (3%) 411 files
*.cfg13449 LOC (1%) 25 files
*.pl11218 LOC (<1%) 56 files
*.in10152 LOC (<1%) 33 files
*.cmake7074 LOC (<1%) 63 files
*.py6064 LOC (<1%) 17 files
*.pm4659 LOC (<1%) 22 files
*.xml4098 LOC (<1%) 28 files
*.t3091 LOC (<1%) 55 files
*.pp2983 LOC (<1%) 9 files
*.g2186 LOC (<1%) 3 files
*.sql2036 LOC (<1%) 18 files
*.css1366 LOC (<1%) 1 files
*.r1153 LOC (<1%) 1 files
*.awk898 LOC (<1%) 4 files
*.y874 LOC (<1%) 2 files
*.l626 LOC (<1%) 3 files
*.i561 LOC (<1%) 1 files
*.php556 LOC (<1%) 10 files
*.html524 LOC (<1%) 5 files
*.d412 LOC (<1%) 10 files
*.js411 LOC (<1%) 3 files
*.spec339 LOC (<1%) 3 files
*.ctl291 LOC (<1%) 9 files
*.wxs21 LOC (<1%) 1 files
*.yaml16 LOC (<1%) 1 files
*.frm12 LOC (<1%) 1 files
Test Code
Used only for testing of the product. Normally not deployed in a production environment.
yy
t
tst
cfg
gdb
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*_test[.].*".
    • files with paths like ".*/test_.*".
    • files with paths like ".*/[Tt]est/.*".
    • files with paths like ".*/[Tt]ests/.*".
    • files with paths like ".*_tests[.].*".
    • files with paths like ".*/tests_.*".
    • files with paths like ".*[-]test[-].*".
  • 1381 files match defined criteria (276,483 lines of code, 21.1% vs. main code):
    • 195 *.cpp files (121,061 lines of code)
    • 137 *.yy files (34,816 lines of code)
    • 191 *.c files (27,568 lines of code)
    • 142 *.py files (18,593 lines of code)
    • 53 *.pl files (17,743 lines of code)
    • 187 *.pm files (17,202 lines of code)
    • 200 *.java files (13,096 lines of code)
    • 61 *.hpp files (13,040 lines of code)
    • 124 *.sh files (5,033 lines of code)
    • 9 *.xml files (1,830 lines of code)
    • 11 *.inc files (1,817 lines of code)
    • 20 *.h files (1,732 lines of code)
    • 17 *.cc files (1,651 lines of code)
    • 16 *.sql files (737 lines of code)
    • 7 *.t files (195 lines of code)
    • 3 *.tst files (163 lines of code)
    • 4 *.cfg files (127 lines of code)
    • 2 *.bat files (41 lines of code)
    • 2 *.gdb files (38 lines of code)
  • " *.cpp" is biggest, containing 43.79% of code.
  • " *.gdb" is smallest, containing 0.01% of code.


*.cpp121061 LOC (43%) 195 files
*.yy34816 LOC (12%) 137 files
*.c27568 LOC (9%) 191 files
*.py18593 LOC (6%) 142 files
*.pl17743 LOC (6%) 53 files
*.pm17202 LOC (6%) 187 files
*.java13096 LOC (4%) 200 files
*.hpp13040 LOC (4%) 61 files
*.sh5033 LOC (1%) 124 files
*.xml1830 LOC (<1%) 9 files
*.inc1817 LOC (<1%) 11 files
*.h1732 LOC (<1%) 20 files
*.cc1651 LOC (<1%) 17 files
*.sql737 LOC (<1%) 16 files
*.t195 LOC (<1%) 7 files
*.tst163 LOC (<1%) 3 files
*.cfg127 LOC (<1%) 4 files
*.bat41 LOC (<1%) 2 files
*.gdb38 LOC (<1%) 2 files
Generated Code
Automatically generated files, not manually changed after generation.
dsp
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]dsp" AND any line of content like ".*[#] Microsoft Developer Studio Generated Build File.*".
  • 10 files match defined criteria (1,155 lines of code, 0.1% vs. main code). All matches are in *.dsp files.


*.dsp1155 LOC (100%) 10 files
Build and Deployment Code
Source code used to configure or support build and deployment process.
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]sh".
    • files with paths like ".*[.]bat".
    • files with paths like ".*[.]git[a-z]+".
    • files with paths like ".*/[.]gitmodules".
    • files with paths like ".*/build[.]xml".
    • files with paths like ".*/pom[.]xml".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*/[.]gitattributes".
  • 134 files match defined criteria (34,457 lines of code, 2.6% vs. main code):
    • 117 *.sh files (33,253 lines of code)
    • 9 *.xml files (927 lines of code)
    • 8 *.bat files (277 lines of code)
  • " *.sh" is biggest, containing 96.51% of code.
  • " *.bat" is smallest, containing 0.8% of code.


*.sh33253 LOC (96%) 117 files
*.xml927 LOC (2%) 9 files
*.bat277 LOC (<1%) 8 files
Other Code
txt
properties
dsp
ini
patch
yy
plist
cfg
Explore:   circles  |  sunburst
  • The following criteria are used to filter files:
    • files with paths like ".*[.]txt".
    • files with paths like ".*/[Mm]an/.*".
    • files with paths like ".*[.]patch".
    • files with paths like ".*/[Ee]xamples/.*".
    • files with paths like ".*[.]json".
    • files with paths like ".*[.]plist".
    • files with paths like ".*[.](rst|rest|resttxt|rsttxt)".
    • files with paths like ".*/[.]bzrignore".
    • files with paths like ".*[.]ini".
    • files with paths like ".*[.]properties".
    • files with paths like ".*/INSTALL[.][a-z0-9]+".
    • files with paths like ".*/README[.][a-z0-9]+".
    • files with paths like ".*/[Dd]emos?/.*".
    • files with paths like ".*/[.]gitignore".
    • files with paths like ".*[.]dsp".
  • 473 files match defined criteria (70,851 lines of code, 5.4% vs. main code):
    • 260 *.txt files (61,972 lines of code)
    • 81 *.rst files (2,528 lines of code)
    • 40 *.properties files (1,481 lines of code)
    • 10 *.dsp files (1,155 lines of code)
    • 10 *.c files (936 lines of code)
    • 17 *.ini files (924 lines of code)
    • 6 *.cpp files (667 lines of code)
    • 27 *.patch files (605 lines of code)
    • 2 *.yy files (233 lines of code)
    • 5 *.sh files (138 lines of code)
    • 1 *.json files (72 lines of code)
    • 2 *.h files (67 lines of code)
    • 2 *.plist files (53 lines of code)
    • 10 *.cfg files (20 lines of code)
  • " *.txt" is biggest, containing 87.47% of code.
  • " *.cfg" is smallest, containing 0.03% of code.


*.txt61972 LOC (87%) 260 files
*.rst2528 LOC (3%) 81 files
*.properties1481 LOC (2%) 40 files
*.dsp1155 LOC (1%) 10 files
*.c936 LOC (1%) 10 files
*.ini924 LOC (1%) 17 files
*.cpp667 LOC (<1%) 6 files
*.patch605 LOC (<1%) 27 files
*.yy233 LOC (<1%) 2 files
*.sh138 LOC (<1%) 5 files
*.json72 LOC (<1%) 1 files
*.h67 LOC (<1%) 2 files
*.plist53 LOC (<1%) 2 files
*.cfg20 LOC (<1%) 10 files
Analyzers
Info about analyzers used for source code examinations.
  • *.cc files are analyzed with CppAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Advanced heuristic dependency analysis
  • *.cpp files are analyzed with CppAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Advanced heuristic dependency analysis
  • *.c files are analyzed with CStyleAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • No dependency analysis
  • *.h files are analyzed with CppAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Advanced heuristic dependency analysis
  • *.inc files are analyzed with PhpAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Basic heuristic dependency analysis
  • *.hpp files are analyzed with CppAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Advanced heuristic dependency analysis
  • *.java files are analyzed with JavaAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Advanced heuristic dependency analysis (based on package names)
  • *.cfg files are analyzed with CfgAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.pl files are analyzed with PerlAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Basic heuristic dependency analysis
  • *.in files are analyzed with RustAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • No dependency analysis
  • *.cmake files are analyzed with DefaultLanguageAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Basic code cleaning (empty lines removed for LOC calculations and duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.py files are analyzed with PythonAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Basic heuristic dependency analysis
  • *.pm files are analyzed with PerlAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Basic heuristic dependency analysis
  • *.xml files are analyzed with XmlAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.t files are analyzed with PerlAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Basic heuristic dependency analysis
  • *.pp files are analyzed with PuppetAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.g files are analyzed with DefaultLanguageAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Basic code cleaning (empty lines removed for LOC calculations and duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.sql files are analyzed with SqlAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.css files are analyzed with CssAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.r files are analyzed with RAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • No dependency analysis
  • *.awk files are analyzed with DefaultLanguageAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Basic code cleaning (empty lines removed for LOC calculations and duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.y files are analyzed with DefaultLanguageAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Basic code cleaning (empty lines removed for LOC calculations and duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.l files are analyzed with DefaultLanguageAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Basic code cleaning (empty lines removed for LOC calculations and duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.i files are analyzed with DefaultLanguageAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Basic code cleaning (empty lines removed for LOC calculations and duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.php files are analyzed with PhpAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Basic heuristic dependency analysis
  • *.html files are analyzed with HtmlAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • Advanced heuristic dependency analysis
  • *.d files are analyzed with DAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • No dependency analysis
  • *.js files are analyzed with JavaScriptAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • No dependency analysis
  • *.spec files are analyzed with DefaultLanguageAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Basic code cleaning (empty lines removed for LOC calculations and duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.ctl files are analyzed with VisualBasicAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • No dependency analysis
  • *.wxs files are analyzed with XmlAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.yaml files are analyzed with YamlAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • No unit size analysis
    • No conditional complexity analysis
    • No dependency analysis
  • *.frm files are analyzed with VisualBasicAnalyzer:
    • All basic standard analyses supported (source code overview, duplication, file size, concerns, findings, metrics, controls)
    • Advanced code cleaning (empty lines and comments removed for LOC calculations, additional cleaning for duplication calculations)
    • Unit size analysis
    • Conditional complexity analysis
    • No dependency analysis


2022-04-14 22:46