apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
GitHub Repo 
637K
lines of main code
4.1K files
802K
lines of test code
5.5K files
301K
lines of other code
2.6K files
15y
age
5,513 days
77%
main code touched
1 year (493K LOC)
11%
new main code
1 year (71K LOC)
453K
scala
96K
py
60K
java
PYI
15K
pyi
PROTO
4.5K
proto
4.3K
js
2.7K
g4
0.9K
css
0.7K
html
0.4K
xml
0.1K
bash
0.05K
yaml
0.04K
ps1
0.03K
toml
IN
0.03K
in
0.02K
c

1201

3748

4100

3218

3082

2861

2618

2330

2508

4312

4982

3526

3899

1085

320

299

166

301

346

312

342

322

368

389

426

482

661

411

153

47

14

10

2025 2024 2023 2022 2021 2020 2019 2018 2017 2016 2015 2014 2013 2012 2011 2010

generated by sokrates.dev (configuration) on 2025-05-07