apache / arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
GitHub Repo
162K
lines of main code
458 files
28K
lines of test code
176 files
13K
lines of other code
107 files
99%
main code touched
1 year (161K LOC)
49%
new main code
1 year (81K LOC)
42
recent contributors
past 30 days
7y
age
2,733 days
159K
rs
PROTO
1.3K
proto
883
toml
646
sql
552
py

github actions
dependabot
make
docker


Main Code: 162,237 LOC (458 files) = RS (97%) + PROTO (<1%) + TOML (<1%) + SQL (<1%) + PY (<1%)
Secondary code: Test: 28,003 LOC (176); Generated: 3,128 LOC (3); Build & Deploy: 718 LOC (16); Other: 9,446 LOC (88);
Duplication: 20%
File Size: 45% long (>1000 LOC), 11% short (<= 200 LOC)
Unit Size: 22% long (>100 LOC), 33% short (<= 10 LOC)
Conditional Complexity: 3% complex (McCabe index > 50), 82% simple (McCabe index <= 5)
Logical Component Decomposition: primary (8 components)

7 years, 5 months old

  • 50% of code older than 365 days
  • <1% of code not updated in the past 365 days

15% of code updated more than 50 times

Also see temporal dependencies for files frequently changed in same commits.

Goals: Keep the system simple and easy to change (4)
Straight_Line
Features of interest:
TODOs
82 files

generated by sokrates.dev (configuration) on 2023-08-11; reference date: 2023-08-08