apache / spark
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
21% | 17% | 27% | 17% | 15%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
scala18% | 17% | 27% | 19% | 16%
py35% | 21% | 26% | 10% | 5%
pyi75% | 17% | 2% | 1% | 2%
java5% | 9% | 36% | 19% | 29%
g476% | 23% | 0% | 0% | 0%
js26% | 17% | 29% | 22% | 4%
proto0% | 66% | 26% | 4% | 2%
css0% | 0% | 69% | 13% | 16%
html0% | 0% | 53% | 37% | 9%
xml0% | 0% | 100% | 0% | 0%
yml0% | 0% | 85% | 0% | 14%
bash0% | 0% | 0% | 100% | 0%
ps10% | 0% | 0% | 73% | 26%
toml0% | 0% | 0% | 0% | 100%
yaml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
c0% | 0% | 0% | 0% | 100%
sbt0% | 0% | 0% | 0% | 100%
cfg0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
sql21% | 18% | 27% | 16% | 15%
python40% | 21% | 23% | 9% | 4%
core20% | 18% | 24% | 20% | 17%
connector16% | 12% | 35% | 19% | 16%
common13% | 0% | 33% | 20% | 32%
mllib4% | 20% | 34% | 24% | 16%
project85% | 0% | 0% | 10% | 4%
resource-managers8% | 23% | 25% | 20% | 20%
dev34% | 0% | 35% | 20% | 10%
mllib-local0% | 94% | 0% | 0% | 5%
streaming0% | 5% | 35% | 29% | 29%
launcher0% | 0% | 56% | 20% | 22%
graphx0% | 0% | 26% | 33% | 39%
licenses-binary0% | 0% | 100% | 0% | 0%
ROOT0% | 0% | 88% | 0% | 11%
R0% | 0% | 86% | 0% | 13%
repl0% | 0% | 0% | 60% | 39%
build0% | 0% | 0% | 78% | 21%
hadoop-cloud0% | 0% | 0% | 0% | 100%
tools0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
SQLConf.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/internal
4424 42
collectionOperations.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
4395 173
AstBuilder.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser
3431 244
feature.py
in python/pyspark/ml
3363 447
QueryCompilationErrors.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/errors
3219 470
pyi
relations_pb2.pyi
in python/pyspark/sql/connect/proto
2915 375
SparkConnectPlanner.scala
in connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner
2873 154
Analyzer.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis
2820 128
datetimeExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
2799 145
stringExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
2504 148
QueryExecutionErrors.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/errors
2370 331
package.scala
in core/src/main/scala/org/apache/spark/internal/config
2224 -
series.py
in python/pyspark/pandas
2180 143
Utils.scala
in core/src/main/scala/org/apache/spark/util
2147 163
pyi
base_pb2.pyi
in python/pyspark/sql/connect/proto
2137 299
classification.py
in python/pyspark/ml
2099 245
functions.py
in python/pyspark/sql/connect
2070 433
DAGScheduler.scala
in core/src/main/scala/org/apache/spark/scheduler
2008 99
functions.scala
in sql/core/src/main/scala/org/apache/spark/sql
2003 511
SparkContext.scala
in core/src/main/scala/org/apache/spark
1860 119
Cast.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
1828 78
dataframe.py
in python/pyspark/sql/connect
1749 133
plan.py
in python/pyspark/sql/connect
1734 196
SqlBaseParser.g4
in sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser
1695 -
groupby.py
in python/pyspark/pandas
1638 84
Optimizer.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer
1590 91
RemoteBlockPushResolver.java
in common/network-shuffle/src/main/java/org/apache/spark/network/shuffle
1586 102
regression.py
in python/pyspark/ml
1523 213
BlockManager.scala
in core/src/main/scala/org/apache/spark/storage
1519 83
rdd.py
in python/pyspark
1514 184
pyi
commands_pb2.pyi
in python/pyspark/sql/connect/proto
1509 167
mathExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
1507 104
types.py
in python/pyspark/sql
1478 169
HiveShim.scala
in sql/hive/src/main/scala/org/apache/spark/sql/hive/client
1468 82
Dataset.scala
in sql/core/src/main/scala/org/apache/spark/sql
1462 150
namespace.py
in python/pyspark/pandas
1460 42
basicLogicalOperators.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical
1440 97
objects.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects
1431 67
dataframe.py
in python/pyspark/sql
1405 163
1369 12
JsonProtocol.scala
in core/src/main/scala/org/apache/spark/util
1350 93
functions.scala
in connector/connect/client/jvm/src/main/scala/org/apache/spark/sql
1323 60
pyi
expressions_pb2.pyi
in python/pyspark/sql/connect/proto
1268 148
SessionCatalog.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog
1204 126
indexing.py
in python/pyspark/pandas
1202 49
FsHistoryProvider.scala
in core/src/main/scala/org/apache/spark/deploy/history
1190 57
Client.scala
in resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn
1167 40
CodeGenerator.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen
1157 68
core.py
in python/pyspark/sql/connect/client
1150 92
SortMergeJoinExec.scala
in sql/core/src/main/scala/org/apache/spark/sql/execution/joins
1142 36
Files With Most Units (Top 50)
File# lines# units
functions.scala
in sql/core/src/main/scala/org/apache/spark/sql
2003 511
QueryCompilationErrors.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/errors
3219 470
feature.py
in python/pyspark/ml
3363 447
functions.py
in python/pyspark/sql/connect
2070 433
pyi
relations_pb2.pyi
in python/pyspark/sql/connect/proto
2915 375
QueryExecutionErrors.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/errors
2370 331
pyi
base_pb2.pyi
in python/pyspark/sql/connect/proto
2137 299
classification.py
in python/pyspark/ml
2099 245
AstBuilder.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser
3431 244
regression.py
in python/pyspark/ml
1523 213
plan.py
in python/pyspark/sql/connect
1734 196
rdd.py
in python/pyspark
1514 184
collectionOperations.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
4395 173
types.py
in python/pyspark/sql
1478 169
pyi
commands_pb2.pyi
in python/pyspark/sql/connect/proto
1509 167
Utils.scala
in core/src/main/scala/org/apache/spark/util
2147 163
dataframe.py
in python/pyspark/sql
1405 163
SparkConnectPlanner.scala
in connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner
2873 154
Dataset.scala
in sql/core/src/main/scala/org/apache/spark/sql
1462 150
pyi
expressions_pb2.pyi
in python/pyspark/sql/connect/proto
1268 148
stringExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
2504 148
datetimeExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
2799 145
series.py
in python/pyspark/pandas
2180 143
clustering.py
in python/pyspark/ml
958 135
dataframe.py
in python/pyspark/sql/connect
1749 133
Dataset.scala
in connector/connect/client/jvm/src/main/scala/org/apache/spark/sql
947 129
Analyzer.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis
2820 128
SessionCatalog.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog
1204 126
ColumnType.scala
in sql/core/src/main/scala/org/apache/spark/sql/execution/columnar
589 120
SparkContext.scala
in core/src/main/scala/org/apache/spark
1860 119
pyi
catalog_pb2.pyi
in python/pyspark/sql/connect/proto
910 119
__init__.py
in python/pyspark/mllib/linalg
908 109
ParquetVectorUpdaterFactory.java
in sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet
996 104
mathExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
1507 104
RemoteBlockPushResolver.java
in common/network-shuffle/src/main/java/org/apache/spark/network/shuffle
1586 102
RDD.scala
in core/src/main/scala/org/apache/spark/rdd
1072 101
pyi
types_pb2.pyi
in python/pyspark/sql/connect/proto
876 101
listener.py
in python/pyspark/sql/streaming
609 100
DAGScheduler.scala
in core/src/main/scala/org/apache/spark/scheduler
2008 99
v2Commands.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical
988 98
basicLogicalOperators.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical
1440 97
__init__.py
in python/pyspark/ml/linalg
779 94
PythonMLLibAPI.scala
in mllib/src/main/scala/org/apache/spark/mllib/api/python
1121 94
JsonProtocol.scala
in core/src/main/scala/org/apache/spark/util
1350 93
core.py
in python/pyspark/sql/connect/client
1150 92
Optimizer.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer
1590 91
QueryParsingErrors.scala
in sql/api/src/main/scala/org/apache/spark/sql/errors
565 91
tuning.py
in python/pyspark/ml
1099 88
groupby.py
in python/pyspark/pandas
1638 84
BlockManager.scala
in core/src/main/scala/org/apache/spark/storage
1519 83
Files With Long Lines (Top 50)

There are 298 files with lines longer than 120 characters. In total, there are 817 long lines.

File# lines# units# long lines
UDFRegistration.scala
in sql/core/src/main/scala/org/apache/spark/sql
935 53 100
stagepage.js
in core/src/main/resources/org/apache/spark/ui/static
1040 61 47
functions.scala
in sql/core/src/main/scala/org/apache/spark/sql
2003 511 33
datetimeExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
2799 145 31
sharedParams.scala
in mllib/src/main/scala/org/apache/spark/ml/param/shared
149 32 22
StreamingQueryStatisticsPage.scala
in sql/core/src/main/scala/org/apache/spark/sql/streaming/ui
455 11 21
misc.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
389 16 19
executorspage-template.html
in core/src/main/resources/org/apache/spark/ui/static
140 - 15
executorspage.js
in core/src/main/resources/org/apache/spark/ui/static
701 46 13
SqlBaseParser.g4
in sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser
1695 - 13
LogDivertAppender.java
in sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/operation
227 28 12
stringExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
2504 148 11
linearRegression.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate
275 10 10
shared.py
in python/pyspark/ml/param
440 76 9
windowExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
797 28 9
ScalaUDF.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
1053 6 9
KryoSerializer.scala
in core/src/main/scala/org/apache/spark/serializer
569 30 8
1369 12 8
SparkConnectPlanner.scala
in connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner
2873 154 7
xpath.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/xml
195 16 7
327 - 6
generators.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
432 24 6
TimeWindow.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
211 8 6
DataFrameWriter.scala
in sql/core/src/main/scala/org/apache/spark/sql
444 31 5
DataSourceV2Strategy.scala
in sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2
502 14 5
SessionWindow.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
66 1 5
StreamingPage.scala
in streaming/src/main/scala/org/apache/spark/streaming/ui
417 13 4
v2Commands.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical
988 98 4
maskExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
253 8 4
QueryCompilationErrors.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/errors
3219 470 4
SparkCoreErrors.scala
in core/src/main/scala/org/apache/spark/errors
412 68 3
pyi
functions.pyi
in python/pyspark/sql/pandas
110 20 3
treeParams.scala
in mllib/src/main/scala/org/apache/spark/ml/tree
297 14 3
42 - 3
GangliaReporter.java
in connector/spark-ganglia-lgpl/src/main/java/com/codahale/metrics/ganglia
286 28 3
V2ExpressionBuilder.scala
in sql/core/src/main/scala/org/apache/spark/sql/catalyst/util
321 6 3
CatalogImpl.scala
in sql/core/src/main/scala/org/apache/spark/sql/internal
541 57 3
V2SessionCatalog.scala
in sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2
323 28 3
V2ScanRelationPushDown.scala
in sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2
429 15 3
ParquetFilters.scala
in sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet
666 15 3
InMemoryRelation.scala
in sql/core/src/main/scala/org/apache/spark/sql/execution/columnar
353 19 3
MicroBatchExecution.scala
in sql/core/src/main/scala/org/apache/spark/sql/execution/streaming
554 15 3
SparkOptimizer.scala
in sql/core/src/main/scala/org/apache/spark/sql/execution
74 - 3
AstBuilder.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser
3431 244 3
SerializerBuildHelper.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst
371 33 3
jsonExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
716 26 3
ToStringBase.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
364 4 3
higherOrderFunctions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
986 57 3
regexpExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
954 39 3
mathExpressions.scala
in sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions
1507 104 3
Correlations

File Size vs. Commits (all time): 3586 points

connector/connect/server/src/main/scala/org/apache/spark/sql/connect/config/Connect.scala x: 19 commits (all time) y: 157 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/CachedStreamResponse.scala x: 3 commits (all time) y: 8 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteGrpcResponseSender.scala x: 3 commits (all time) y: 191 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteResponseObserver.scala x: 5 commits (all time) y: 205 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteThreadRunner.scala x: 8 commits (all time) y: 149 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/ExecuteHolder.scala x: 9 commits (all time) y: 109 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectExecutePlanHandler.scala x: 4 commits (all time) y: 18 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectReattachExecuteHandler.scala x: 2 commits (all time) y: 33 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/functions.scala x: 53 commits (all time) y: 1323 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Dataset.scala x: 48 commits (all time) y: 947 lines of code project/MimaExcludes.scala x: 478 commits (all time) y: 163 lines of code sql/api/src/main/java/org/apache/spark/api/java/function/FlatMapGroupsWithStateFunction.java x: 1 commits (all time) y: 11 lines of code sql/api/src/main/scala/org/apache/spark/sql/streaming/GroupState.scala x: 1 commits (all time) y: 44 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/CodeGeneratorWithInterpretedFallback.scala x: 6 commits (all time) y: 28 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala x: 687 commits (all time) y: 4424 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/WholeStageCodegenExec.scala x: 84 commits (all time) y: 528 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala x: 814 commits (all time) y: 2820 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveLateralColumnAliasReference.scala x: 9 commits (all time) y: 138 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowConverters.scala x: 30 commits (all time) y: 341 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala x: 493 commits (all time) y: 3431 lines of code python/pyspark/pandas/base.py x: 52 commits (all time) y: 607 lines of code core/src/main/scala/org/apache/spark/deploy/history/EventLogFileCompactor.scala x: 3 commits (all time) y: 126 lines of code python/pyspark/errors/error_classes.py x: 54 commits (all time) y: 3 lines of code python/pyspark/worker.py x: 200 commits (all time) y: 777 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/SparkConnectPlanner.scala x: 184 commits (all time) y: 2873 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala x: 52 commits (all time) y: 378 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExec.scala x: 22 commits (all time) y: 187 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 57 commits (all time) y: 363 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala x: 30 commits (all time) y: 643 lines of code python/pyspark/ml/connect/tuning.py x: 3 commits (all time) y: 328 lines of code python/pyspark/ml/torch/distributor.py x: 27 commits (all time) y: 624 lines of code python/pyspark/ml/util.py x: 55 commits (all time) y: 388 lines of code python/pyspark/pandas/utils.py x: 42 commits (all time) y: 629 lines of code python/pyspark/sql/connect/session.py x: 84 commits (all time) y: 620 lines of code python/pyspark/sql/connect/udf.py x: 23 commits (all time) y: 212 lines of code python/pyspark/sql/connect/udtf.py x: 5 commits (all time) y: 147 lines of code python/pyspark/sql/session.py x: 142 commits (all time) y: 763 lines of code python/pyspark/sql/utils.py x: 67 commits (all time) y: 176 lines of code core/src/main/scala/org/apache/spark/ui/storage/StoragePage.scala x: 21 commits (all time) y: 194 lines of code sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala x: 392 commits (all time) y: 1462 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/JoinCodegenSupport.scala x: 6 commits (all time) y: 62 lines of code sql/api/src/main/scala/org/apache/spark/sql/execution/streaming/Triggers.scala x: 1 commits (all time) y: 61 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcExceptionConverter.scala x: 3 commits (all time) y: 69 lines of code dev/sparktestsupport/modules.py x: 221 commits (all time) y: 1015 lines of code python/pyspark/pandas/__init__.py x: 22 commits (all time) y: 112 lines of code python/pyspark/pandas/indexes/base.py x: 58 commits (all time) y: 1008 lines of code python/pyspark/pandas/indexes/category.py x: 24 commits (all time) y: 175 lines of code python/pyspark/pandas/indexes/datetimes.py x: 18 commits (all time) y: 266 lines of code python/pyspark/pandas/series.py x: 111 commits (all time) y: 2180 lines of code python/pyspark/pandas/spark/accessors.py x: 25 commits (all time) y: 242 lines of code python/pyspark/pandas/usage_logging/__init__.py x: 13 commits (all time) y: 100 lines of code python/pyspark/sql/pandas/serializers.py x: 35 commits (all time) y: 546 lines of code python/pyspark/sql/pandas/types.py x: 23 commits (all time) y: 599 lines of code core/src/main/scala/org/apache/spark/serializer/GenericAvroSerializer.scala x: 9 commits (all time) y: 96 lines of code core/src/main/scala/org/apache/spark/serializer/KryoSerializer.scala x: 148 commits (all time) y: 569 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/Encoders.scala x: 21 commits (all time) y: 92 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonParser.scala x: 70 commits (all time) y: 423 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/FailureSafeParser.scala x: 14 commits (all time) y: 59 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryExecutionErrors.scala x: 243 commits (all time) y: 2370 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/DataTypeErrors.scala x: 8 commits (all time) y: 238 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryErrorsBase.scala x: 26 commits (all time) y: 29 lines of code python/pyspark/testing/pandasutils.py x: 18 commits (all time) y: 440 lines of code project/SparkBuild.scala x: 1094 commits (all time) y: 1369 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala x: 181 commits (all time) y: 546 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/JavaTypeInference.scala x: 3 commits (all time) y: 102 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala x: 101 commits (all time) y: 755 lines of code core/src/main/scala/org/apache/spark/SparkConf.scala x: 165 commits (all time) y: 463 lines of code python/pyspark/sql/udtf.py x: 11 commits (all time) y: 279 lines of code python/pyspark/cloudpickle/cloudpickle_fast.py x: 7 commits (all time) y: 452 lines of code python/pyspark/sql/connect/plan.py x: 131 commits (all time) y: 1734 lines of code python/pyspark/sql/connect/client/core.py x: 19 commits (all time) y: 1150 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/CloseableIterator.scala x: 1 commits (all time) y: 19 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/CustomSparkConnectBlockingStub.scala x: 4 commits (all time) y: 53 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/ExecutePlanResponseReattachableIterator.scala x: 6 commits (all time) y: 181 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcRetryHandler.scala x: 5 commits (all time) y: 114 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkConnectClient.scala x: 33 commits (all time) y: 435 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkResult.scala x: 13 commits (all time) y: 225 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowSerializer.scala x: 5 commits (all time) y: 447 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala x: 118 commits (all time) y: 541 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileIndex.scala x: 9 commits (all time) y: 67 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileSourceStrategy.scala x: 74 commits (all time) y: 227 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningAwareFileIndex.scala x: 38 commits (all time) y: 157 lines of code python/pyspark/sql/worker/analyze_udtf.py x: 3 commits (all time) y: 108 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/UserDefinedPythonFunction.scala x: 16 commits (all time) y: 192 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/SortMergeJoinExec.scala x: 65 commits (all time) y: 1142 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala x: 513 commits (all time) y: 1590 lines of code python/pyspark/pandas/groupby.py x: 91 commits (all time) y: 1638 lines of code python/pyspark/pandas/namespace.py x: 63 commits (all time) y: 1460 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryListener.scala x: 3 commits (all time) y: 73 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/StreamingQueryListenerHelper.scala x: 2 commits (all time) y: 41 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SessionHolder.scala x: 15 commits (all time) y: 139 lines of code connector/connect/common/src/main/protobuf/spark/connect/base.proto x: 26 commits (all time) y: 662 lines of code python/pyspark/sql/connect/proto/base_pb2.pyi x: 32 commits (all time) y: 2137 lines of code python/pyspark/testing/utils.py x: 30 commits (all time) y: 367 lines of code core/src/main/scala/org/apache/spark/MapOutputTracker.scala x: 172 commits (all time) y: 1104 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryCompilationErrors.scala x: 244 commits (all time) y: 3219 lines of code python/pyspark/sql/types.py x: 148 commits (all time) y: 1478 lines of code python/pyspark/testing/connectutils.py x: 33 commits (all time) y: 135 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectService.scala x: 40 commits (all time) y: 283 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectStreamingQueryCache.scala x: 5 commits (all time) y: 133 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseLexer.g4 x: 2 commits (all time) y: 514 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseParser.g4 x: 3 commits (all time) y: 1695 lines of code core/src/main/scala/org/apache/spark/executor/Executor.scala x: 316 commits (all time) y: 875 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala x: 115 commits (all time) y: 901 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/JdbcDialects.scala x: 84 commits (all time) y: 354 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/PostgresDialect.scala x: 38 commits (all time) y: 213 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/SupportsAtomicPartitionManagement.java x: 8 commits (all time) y: 40 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/SupportsPartitionManagement.java x: 11 commits (all time) y: 48 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/TableCatalog.java x: 20 commits (all time) y: 65 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/sources/filters.scala x: 14 commits (all time) y: 174 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/dsl/package.scala x: 32 commits (all time) y: 979 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/ui/SparkConnectServerListener.scala x: 3 commits (all time) y: 344 lines of code core/src/main/scala/org/apache/spark/shuffle/IndexShuffleBlockResolver.scala x: 37 commits (all time) y: 429 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStore.scala x: 45 commits (all time) y: 458 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStoreConf.scala x: 16 commits (all time) y: 27 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/expressions/UserDefinedFunction.scala x: 11 commits (all time) y: 110 lines of code core/src/main/scala/org/apache/spark/util/Utils.scala x: 498 commits (all time) y: 2147 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala x: 66 commits (all time) y: 1053 lines of code python/setup.py x: 74 commits (all time) y: 276 lines of code core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala x: 210 commits (all time) y: 462 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/SparkConnectPlanExecution.scala x: 4 commits (all time) y: 187 lines of code python/pyspark/sql/connect/proto/base_pb2_grpc.py x: 8 commits (all time) y: 349 lines of code common/network-common/src/main/java/org/apache/spark/network/util/TransportConf.java x: 39 commits (all time) y: 178 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SQLExecution.scala x: 42 commits (all time) y: 164 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryManager.scala x: 6 commits (all time) y: 89 lines of code connector/connect/common/src/main/protobuf/spark/connect/commands.proto x: 32 commits (all time) y: 341 lines of code python/pyspark/sql/connect/proto/commands_pb2.pyi x: 32 commits (all time) y: 1509 lines of code python/pyspark/sql/connect/streaming/query.py x: 9 commits (all time) y: 219 lines of code python/pyspark/sql/streaming/listener.py x: 9 commits (all time) y: 609 lines of code python/pyspark/sql/streaming/query.py x: 17 commits (all time) y: 119 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/DeduplicateRelations.scala x: 21 commits (all time) y: 329 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/UDFRegistration.scala x: 2 commits (all time) y: 1078 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/UdfUtils.scala x: 4 commits (all time) y: 493 lines of code core/src/main/scala/org/apache/spark/scheduler/ShuffleMapTask.scala x: 118 commits (all time) y: 62 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala x: 131 commits (all time) y: 810 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasExec.scala x: 22 commits (all time) y: 37 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/window/WindowExec.scala x: 26 commits (all time) y: 35 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/subquery.scala x: 56 commits (all time) y: 139 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala x: 147 commits (all time) y: 2504 lines of code sql/core/src/main/scala/org/apache/spark/sql/functions.scala x: 442 commits (all time) y: 2003 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/artifact/SparkConnectArtifactManager.scala x: 14 commits (all time) y: 186 lines of code core/src/main/scala/org/apache/spark/internal/config/package.scala x: 255 commits (all time) y: 2224 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/ArtifactManager.scala x: 8 commits (all time) y: 252 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/SQLAppStatusListener.scala x: 40 commits (all time) y: 470 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/SQLListener.scala x: 40 commits (all time) y: 67 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/ui/SparkConnectServerAppStatusStore.scala x: 1 commits (all time) y: 93 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/ui/SparkConnectServerPage.scala x: 1 commits (all time) y: 442 lines of code core/src/main/scala/org/apache/spark/SparkContext.scala x: 730 commits (all time) y: 1860 lines of code core/src/main/scala/org/apache/spark/ui/SparkUI.scala x: 95 commits (all time) y: 169 lines of code common/utils/src/main/java/org/apache/spark/network/util/JavaUtils.java x: 1 commits (all time) y: 253 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowVectorReader.scala x: 3 commits (all time) y: 208 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/LiteralValueProtoConverter.scala x: 7 commits (all time) y: 313 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/SparkIntervalUtils.scala x: 2 commits (all time) y: 393 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/AnsiTypeCoercion.scala x: 26 commits (all time) y: 177 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala x: 270 commits (all time) y: 1828 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ToStringBase.scala x: 3 commits (all time) y: 364 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/object.scala x: 42 commits (all time) y: 596 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/IntervalUtils.scala x: 83 commits (all time) y: 711 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/package.scala x: 47 commits (all time) y: 146 lines of code mllib/src/main/scala/org/apache/spark/mllib/evaluation/RankingMetrics.scala x: 22 commits (all time) y: 155 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroDeserializer.scala x: 7 commits (all time) y: 455 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroSerializer.scala x: 3 commits (all time) y: 304 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/DataTypeProtoConverter.scala x: 4 commits (all time) y: 236 lines of code sql/api/src/main/scala/org/apache/spark/sql/Row.scala x: 2 commits (all time) y: 230 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/DataTypeAstBuilder.scala x: 2 commits (all time) y: 142 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/parsers.scala x: 3 commits (all time) y: 275 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/DateFormatter.scala x: 1 commits (all time) y: 140 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeFormatterHelper.scala x: 1 commits (all time) y: 228 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/RebaseDateTime.scala x: 1 commits (all time) y: 208 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/TimestampFormatter.scala x: 1 commits (all time) y: 419 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/ExecutionErrors.scala x: 1 commits (all time) y: 171 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/QueryParsingErrors.scala x: 2 commits (all time) y: 565 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/DataType.scala x: 2 commits (all time) y: 284 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/Decimal.scala x: 2 commits (all time) y: 473 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/StructType.scala x: 2 commits (all time) y: 367 lines of code sql/api/src/main/scala/org/apache/spark/sql/util/ArrowUtils.scala x: 2 commits (all time) y: 171 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/CSVOptions.scala x: 31 commits (all time) y: 267 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/UnivocityParser.scala x: 46 commits (all time) y: 299 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JSONOptions.scala x: 42 commits (all time) y: 188 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala x: 173 commits (all time) y: 379 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowWriter.scala x: 5 commits (all time) y: 358 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/Columnar.scala x: 28 commits (all time) y: 373 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceUtils.scala x: 28 commits (all time) y: 207 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFilters.scala x: 56 commits (all time) y: 666 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetReadSupport.scala x: 28 commits (all time) y: 365 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetUtils.scala x: 16 commits (all time) y: 333 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala x: 26 commits (all time) y: 323 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/parquet/ParquetScanBuilder.scala x: 18 commits (all time) y: 78 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/progress.scala x: 36 commits (all time) y: 181 lines of code python/pyspark/testing/__init__.py x: 4 commits (all time) y: 3 lines of code core/src/main/scala/org/apache/spark/executor/CoarseGrainedExecutorBackend.scala x: 178 commits (all time) y: 456 lines of code core/src/main/scala/org/apache/spark/scheduler/SchedulerBackend.scala x: 32 commits (all time) y: 27 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedClusterMessage.scala x: 69 commits (all time) y: 92 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala x: 237 commits (all time) y: 696 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2ScanExecBase.scala x: 16 commits (all time) y: 148 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveShim.scala x: 89 commits (all time) y: 1468 lines of code dev/appveyor-install-dependencies.ps1 x: 55 commits (all time) y: 112 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala x: 208 commits (all time) y: 654 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRunner.scala x: 67 commits (all time) y: 556 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/BatchEvalPythonUDTFExec.scala x: 5 commits (all time) y: 79 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveTimeWindows.scala x: 7 commits (all time) y: 223 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala x: 59 commits (all time) y: 408 lines of code python/pyspark/sql/connect/types.py x: 28 commits (all time) y: 230 lines of code python/pyspark/sql/connect/dataframe.py x: 167 commits (all time) y: 1749 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/bitmapExpressions.scala x: 5 commits (all time) y: 244 lines of code mllib/src/main/scala/org/apache/spark/ml/source/image/ImageFileFormat.scala x: 7 commits (all time) y: 71 lines of code mllib/src/main/scala/org/apache/spark/ml/source/libsvm/LibSVMRelation.scala x: 46 commits (all time) y: 138 lines of code sql/api/src/main/java/org/apache/spark/sql/types/DataTypes.java x: 1 commits (all time) y: 110 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/encoders/ExpressionEncoder.scala x: 72 commits (all time) y: 238 lines of code sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 145 commits (all time) y: 557 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkStrategies.scala x: 351 commits (all time) y: 685 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2CommandExec.scala x: 12 commits (all time) y: 27 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/FlatMapGroupsInPandasWithStateExec.scala x: 9 commits (all time) y: 166 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/MicroBatchExecution.scala x: 89 commits (all time) y: 554 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/sources/memory.scala x: 15 commits (all time) y: 140 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala x: 140 commits (all time) y: 1431 lines of code dev/run-tests.py x: 163 commits (all time) y: 427 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 7 commits (all time) y: 111 lines of code python/pyspark/util.py x: 40 commits (all time) y: 171 lines of code core/src/main/scala/org/apache/spark/package.scala x: 42 commits (all time) y: 12 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/AggregateInPandasExec.scala x: 18 commits (all time) y: 136 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/ArrowEvalPythonExec.scala x: 26 commits (all time) y: 85 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/FlatMapGroupsInPandasExec.scala x: 22 commits (all time) y: 55 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasEvaluatorFactory.scala x: 2 commits (all time) y: 245 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicLogicalOperators.scala x: 246 commits (all time) y: 1440 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statements.scala x: 93 commits (all time) y: 114 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala x: 133 commits (all time) y: 988 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/DataWritingCommand.scala x: 18 commits (all time) y: 59 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/createDataSourceTables.scala x: 72 commits (all time) y: 150 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala x: 207 commits (all time) y: 953 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/views.scala x: 86 commits (all time) y: 453 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/SaveIntoDataSourceCommand.scala x: 13 commits (all time) y: 36 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/CreateHiveTableAsSelectCommand.scala x: 36 commits (all time) y: 74 lines of code core/src/main/scala/org/apache/spark/ui/UIUtils.scala x: 114 commits (all time) y: 584 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/rules.scala x: 114 commits (all time) y: 421 lines of code connector/connect/common/src/main/protobuf/spark/connect/expressions.proto x: 31 commits (all time) y: 311 lines of code python/pyspark/sql/connect/expressions.py x: 33 commits (all time) y: 835 lines of code python/pyspark/sql/connect/functions.py x: 113 commits (all time) y: 2070 lines of code python/pyspark/sql/connect/proto/expressions_pb2.pyi x: 39 commits (all time) y: 1268 lines of code python/pyspark/sql/streaming/readwriter.py x: 10 commits (all time) y: 540 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Strategy.scala x: 205 commits (all time) y: 502 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Column.scala x: 13 commits (all time) y: 273 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/encoders/AgnosticEncoder.scala x: 1 commits (all time) y: 182 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ordering.scala x: 12 commits (all time) y: 70 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/CharVarcharUtils.scala x: 15 commits (all time) y: 219 lines of code core/src/main/resources/org/apache/spark/ui/static/stagepage.js x: 30 commits (all time) y: 1040 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/trees/TreeNode.scala x: 138 commits (all time) y: 824 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/StringUtils.scala x: 26 commits (all time) y: 88 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AdaptiveSparkPlanExec.scala x: 108 commits (all time) y: 564 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/QueryStageExec.scala x: 35 commits (all time) y: 179 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/basicPhysicalOperators.scala x: 88 commits (all time) y: 642 lines of code project/plugins.sbt x: 152 commits (all time) y: 14 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala x: 314 commits (all time) y: 1081 lines of code python/pyspark/sql/dataframe.py x: 348 commits (all time) y: 1405 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/BroadcastExchangeExec.scala x: 44 commits (all time) y: 148 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/HDFSBackedStateStoreProvider.scala x: 42 commits (all time) y: 553 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBFileManager.scala x: 14 commits (all time) y: 483 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBStateStoreProvider.scala x: 9 commits (all time) y: 275 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2SessionCatalog.scala x: 43 commits (all time) y: 323 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/RewriteRowLevelCommand.scala x: 6 commits (all time) y: 161 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/unresolved.scala x: 125 commits (all time) y: 382 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/dsl/package.scala x: 142 commits (all time) y: 376 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/HyperLogLogPlusPlus.scala x: 21 commits (all time) y: 83 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/PivotFirst.scala x: 11 commits (all time) y: 98 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/interfaces.scala x: 60 commits (all time) y: 255 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/objects.scala x: 17 commits (all time) y: 177 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/ParserUtils.scala x: 30 commits (all time) y: 76 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/QueryPlan.scala x: 116 commits (all time) y: 343 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/LocalRelation.scala x: 31 commits (all time) y: 69 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/ResolveDefaultColumnsUtil.scala x: 20 commits (all time) y: 287 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/HashAggregateExec.scala x: 93 commits (all time) y: 691 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/TungstenAggregationIterator.scala x: 44 commits (all time) y: 229 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningUtils.scala x: 79 commits (all time) y: 387 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2Writes.scala x: 12 commits (all time) y: 124 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/ShuffleExchangeExec.scala x: 42 commits (all time) y: 227 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/objects.scala x: 39 commits (all time) y: 449 lines of code core/src/main/scala/org/apache/spark/ui/jobs/StageTable.scala x: 127 commits (all time) y: 300 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/AllExecutionsPage.scala x: 32 commits (all time) y: 495 lines of code core/src/main/scala/org/apache/spark/SparkEnv.scala x: 219 commits (all time) y: 402 lines of code core/src/main/scala/org/apache/spark/api/python/PythonWorkerFactory.scala x: 57 commits (all time) y: 292 lines of code python/pyspark/daemon.py x: 43 commits (all time) y: 151 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/PythonUDF.scala x: 18 commits (all time) y: 146 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/subquery.scala x: 52 commits (all time) y: 242 lines of code python/pyspark/sql/__init__.py x: 29 commits (all time) y: 34 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ConcatenatingArrowStreamReader.scala x: 1 commits (all time) y: 131 lines of code core/src/main/scala/org/apache/spark/rdd/RDDBarrier.scala x: 7 commits (all time) y: 40 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/BatchEvalPythonExec.scala x: 22 commits (all time) y: 89 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/EvalPythonEvaluatorFactory.scala x: 1 commits (all time) y: 80 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/EvalPythonExec.scala x: 18 commits (all time) y: 22 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/debug/package.scala x: 63 commits (all time) y: 183 lines of code core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala x: 435 commits (all time) y: 2008 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AdaptiveSparkPlanHelper.scala x: 10 commits (all time) y: 58 lines of code python/pyspark/sql/connect/streaming/readwriter.py x: 11 commits (all time) y: 511 lines of code python/pyspark/version.py x: 17 commits (all time) y: 1 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/window/WindowExecBase.scala x: 22 commits (all time) y: 24 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/FunctionRegistry.scala x: 348 commits (all time) y: 856 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CountMinSketchAgg.scala x: 12 commits (all time) y: 169 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/generators.scala x: 69 commits (all time) y: 432 lines of code dev/merge_spark_pr.py x: 62 commits (all time) y: 406 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/ui/StreamingQueryStatusListener.scala x: 15 commits (all time) y: 114 lines of code connector/connect/common/src/main/protobuf/spark/connect/relations.proto x: 60 commits (all time) y: 796 lines of code python/pyspark/sql/connect/proto/relations_pb2.pyi x: 92 commits (all time) y: 2915 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/EnsureRequirements.scala x: 49 commits (all time) y: 428 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FilePartition.scala x: 9 commits (all time) y: 84 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DistributionAndOrderingUtils.scala x: 10 commits (all time) y: 76 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryListener.scala x: 22 commits (all time) y: 85 lines of code python/pyspark/sql/sql_formatter.py x: 4 commits (all time) y: 49 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/jsonExpressions.scala x: 94 commits (all time) y: 716 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/text/TextOptions.scala x: 10 commits (all time) y: 28 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala x: 220 commits (all time) y: 1204 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/AssignmentUtils.scala x: 6 commits (all time) y: 147 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TableOutputResolver.scala x: 25 commits (all time) y: 442 lines of code sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala x: 249 commits (all time) y: 444 lines of code sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriterV2.scala x: 25 commits (all time) y: 148 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/QueryExecution.scala x: 124 commits (all time) y: 354 lines of code python/pyspark/rdd.py x: 377 commits (all time) y: 1514 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/ExtractPythonUDFs.scala x: 47 commits (all time) y: 215 lines of code python/pyspark/sql/udf.py x: 62 commits (all time) y: 425 lines of code sql/core/src/main/scala/org/apache/spark/sql/internal/SessionState.scala x: 86 commits (all time) y: 108 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NestedColumnAliasing.scala x: 32 commits (all time) y: 256 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Implicits.scala x: 28 commits (all time) y: 149 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Implicits.scala x: 17 commits (all time) y: 96 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Relation.scala x: 23 commits (all time) y: 174 lines of code sql/core/src/main/scala/org/apache/spark/sql/RelationalGroupedDataset.scala x: 61 commits (all time) y: 342 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/udaf.scala x: 33 commits (all time) y: 415 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/SetCommand.scala x: 28 commits (all time) y: 137 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala x: 153 commits (all time) y: 707 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/functions.scala x: 45 commits (all time) y: 133 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormat.scala x: 31 commits (all time) y: 175 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/LogicalRelation.scala x: 36 commits (all time) y: 61 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala x: 48 commits (all time) y: 173 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala x: 116 commits (all time) y: 373 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/FileScan.scala x: 27 commits (all time) y: 145 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/PushDownUtils.scala x: 37 commits (all time) y: 128 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/memory.scala x: 72 commits (all time) y: 199 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/sources/ForeachWriterTable.scala x: 19 commits (all time) y: 112 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/SymmetricHashJoinStateManager.scala x: 23 commits (all time) y: 446 lines of code sql/core/src/main/scala/org/apache/spark/sql/internal/CatalogImpl.scala x: 86 commits (all time) y: 541 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamReader.scala x: 91 commits (all time) y: 135 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 78 commits (all time) y: 287 lines of code core/src/main/scala/org/apache/spark/storage/BlockInfoManager.scala x: 17 commits (all time) y: 324 lines of code core/src/main/scala/org/apache/spark/storage/BlockManager.scala x: 341 commits (all time) y: 1519 lines of code core/src/main/scala/org/apache/spark/storage/DiskBlockManager.scala x: 106 commits (all time) y: 249 lines of code core/src/main/scala/org/apache/spark/storage/DiskStore.scala x: 90 commits (all time) y: 261 lines of code core/src/main/scala/org/apache/spark/storage/ShuffleBlockFetcherIterator.scala x: 87 commits (all time) y: 1081 lines of code core/src/main/scala/org/apache/spark/storage/memory/MemoryStore.scala x: 36 commits (all time) y: 601 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala x: 200 commits (all time) y: 1003 lines of code core/src/main/resources/org/apache/spark/ui/static/executorspage.js x: 31 commits (all time) y: 701 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/DeserializerBuildHelper.scala x: 11 commits (all time) y: 349 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/SerializerBuildHelper.scala x: 17 commits (all time) y: 371 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/VectorizedParquetRecordReader.java x: 40 commits (all time) y: 299 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcDeserializer.scala x: 18 commits (all time) y: 222 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRowConverter.scala x: 46 commits (all time) y: 556 lines of code python/pyspark/context.py x: 309 commits (all time) y: 747 lines of code python/pyspark/pandas/supported_api_gen.py x: 14 commits (all time) y: 206 lines of code connector/protobuf/src/main/scala/org/apache/spark/sql/protobuf/utils/ProtobufUtils.scala x: 11 commits (all time) y: 194 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala x: 193 commits (all time) y: 4395 lines of code core/src/main/scala/org/apache/spark/errors/SparkCoreErrors.scala x: 14 commits (all time) y: 412 lines of code core/src/main/scala/org/apache/spark/shuffle/ShufflePartitionPairsWriter.scala x: 4 commits (all time) y: 104 lines of code python/pyspark/pandas/resample.py x: 6 commits (all time) y: 387 lines of code python/pyspark/pandas/spark/functions.py x: 27 commits (all time) y: 128 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/ToNumberParser.scala x: 8 commits (all time) y: 640 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/ParserInterface.scala x: 9 commits (all time) y: 20 lines of code common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java x: 31 commits (all time) y: 1586 lines of code sql/core/src/main/scala/org/apache/spark/sql/catalog/Catalog.scala x: 35 commits (all time) y: 155 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala x: 64 commits (all time) y: 238 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/trees/TreePatterns.scala x: 48 commits (all time) y: 126 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/csvExpressions.scala x: 25 commits (all time) y: 209 lines of code python/pyspark/sql/group.py x: 51 commits (all time) y: 88 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/ddl.scala x: 56 commits (all time) y: 104 lines of code sql/core/src/main/scala/org/apache/spark/sql/internal/BaseSessionStateBuilder.scala x: 78 commits (all time) y: 228 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSessionStateBuilder.scala x: 54 commits (all time) y: 156 lines of code R/pkg/pkgdown/_pkgdown_template.yml x: 8 commits (all time) y: 291 lines of code core/src/main/scala/org/apache/spark/api/java/JavaSparkContext.scala x: 126 commits (all time) y: 252 lines of code core/src/main/scala/org/apache/spark/status/AppStatusListener.scala x: 68 commits (all time) y: 1100 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectAnalyzeHandler.scala x: 11 commits (all time) y: 167 lines of code core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala x: 222 commits (all time) y: 1133 lines of code common/network-common/src/main/java/org/apache/spark/network/client/TransportClientFactory.java x: 20 commits (all time) y: 230 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveExternalCatalog.scala x: 130 commits (all time) y: 964 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/higherOrderFunctions.scala x: 17 commits (all time) y: 77 lines of code common/unsafe/src/main/java/org/apache/spark/unsafe/Platform.java x: 22 commits (all time) y: 240 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/DecimalPrecision.scala x: 25 commits (all time) y: 120 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/KeyValueGroupedDataset.scala x: 5 commits (all time) y: 413 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/MySQLDialect.scala x: 36 commits (all time) y: 243 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/IsolatedClientLoader.scala x: 79 commits (all time) y: 231 lines of code python/pyspark/pandas/window.py x: 30 commits (all time) y: 539 lines of code core/src/main/scala/org/apache/spark/api/python/PythonUtils.scala x: 36 commits (all time) y: 96 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/metric/SQLMetrics.scala x: 55 commits (all time) y: 168 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/namedExpressions.scala x: 121 commits (all time) y: 394 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/CodeGenerator.scala x: 247 commits (all time) y: 1157 lines of code core/src/main/resources/org/apache/spark/ui/static/timeline-view.js x: 18 commits (all time) y: 245 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/Config.scala x: 95 commits (all time) y: 661 lines of code core/src/main/scala/org/apache/spark/storage/BlockManagerMessages.scala x: 51 commits (all time) y: 82 lines of code core/src/main/scala/org/apache/spark/ui/exec/ExecutorsTab.scala x: 33 commits (all time) y: 38 lines of code core/src/main/scala/org/apache/spark/resource/ResourceProfile.scala x: 20 commits (all time) y: 348 lines of code core/src/main/scala/org/apache/spark/resource/ResourceUtils.scala x: 19 commits (all time) y: 349 lines of code python/pyspark/pandas/typedef/typehints.py x: 31 commits (all time) y: 393 lines of code core/src/main/scala/org/apache/spark/scheduler/DAGSchedulerEvent.scala x: 80 commits (all time) y: 75 lines of code core/src/main/scala/org/apache/spark/scheduler/ResultTask.scala x: 76 commits (all time) y: 47 lines of code core/src/main/scala/org/apache/spark/scheduler/Task.scala x: 101 commits (all time) y: 122 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala x: 221 commits (all time) y: 945 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FlatMapGroupsWithStateExec.scala x: 38 commits (all time) y: 357 lines of code python/pyspark/ml/base.py x: 20 commits (all time) y: 174 lines of code python/pyspark/ml/classification.py x: 182 commits (all time) y: 2099 lines of code python/pyspark/ml/clustering.py x: 88 commits (all time) y: 958 lines of code python/pyspark/ml/feature.py x: 179 commits (all time) y: 3363 lines of code python/pyspark/ml/fpm.py x: 30 commits (all time) y: 226 lines of code python/pyspark/ml/recommendation.py x: 48 commits (all time) y: 322 lines of code python/pyspark/ml/regression.py x: 137 commits (all time) y: 1523 lines of code python/pyspark/ml/tree.py x: 10 commits (all time) y: 252 lines of code python/pyspark/ml/tuning.py x: 71 commits (all time) y: 1099 lines of code python/pyspark/ml/wrapper.py x: 50 commits (all time) y: 213 lines of code python/pyspark/mllib/classification.py x: 74 commits (all time) y: 398 lines of code python/pyspark/mllib/clustering.py x: 83 commits (all time) y: 449 lines of code python/pyspark/mllib/feature.py x: 54 commits (all time) y: 346 lines of code python/pyspark/mllib/linalg/__init__.py x: 37 commits (all time) y: 908 lines of code python/pyspark/mllib/linalg/distributed.py x: 23 commits (all time) y: 365 lines of code python/pyspark/mllib/recommendation.py x: 55 commits (all time) y: 136 lines of code python/pyspark/mllib/regression.py x: 68 commits (all time) y: 371 lines of code python/pyspark/sql/observation.py x: 15 commits (all time) y: 49 lines of code python/pyspark/streaming/context.py x: 37 commits (all time) y: 210 lines of code python/pyspark/sql/connect/group.py x: 20 commits (all time) y: 311 lines of code python/pyspark/sql/connect/column.py x: 77 commits (all time) y: 379 lines of code core/src/main/protobuf/org/apache/spark/status/protobuf/store_types.proto x: 38 commits (all time) y: 740 lines of code core/src/main/scala/org/apache/spark/status/LiveEntity.scala x: 43 commits (all time) y: 817 lines of code core/src/main/scala/org/apache/spark/status/api/v1/api.scala x: 59 commits (all time) y: 467 lines of code connector/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/consumer/KafkaDataConsumer.scala x: 3 commits (all time) y: 442 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/windowExpressions.scala x: 79 commits (all time) y: 797 lines of code sql/core/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveSessionCatalog.scala x: 178 commits (all time) y: 508 lines of code python/pyspark/accumulators.py x: 79 commits (all time) y: 146 lines of code python/pyspark/ml/functions.py x: 20 commits (all time) y: 278 lines of code python/pyspark/sql/pandas/group_ops.py x: 20 commits (all time) y: 122 lines of code python/pyspark/sql/pandas/map_ops.py x: 16 commits (all time) y: 51 lines of code python/pyspark/pandas/data_type_ops/base.py x: 38 commits (all time) y: 366 lines of code python/pyspark/pandas/data_type_ops/boolean_ops.py x: 23 commits (all time) y: 334 lines of code python/pyspark/pandas/data_type_ops/num_ops.py x: 39 commits (all time) y: 429 lines of code python/pyspark/pandas/indexes/multi.py x: 36 commits (all time) y: 531 lines of code python/pyspark/pandas/indexing.py x: 37 commits (all time) y: 1202 lines of code python/pyspark/pandas/internal.py x: 37 commits (all time) y: 842 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala x: 136 commits (all time) y: 618 lines of code python/pyspark/pandas/generic.py x: 65 commits (all time) y: 938 lines of code python/pyspark/pandas/plot/matplotlib.py x: 14 commits (all time) y: 555 lines of code python/pyspark/pandas/strings.py x: 19 commits (all time) y: 315 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/ExpressionImplUtils.java x: 9 commits (all time) y: 187 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/misc.scala x: 84 commits (all time) y: 389 lines of code core/src/main/scala/org/apache/spark/rdd/CheckpointRDD.scala x: 78 commits (all time) y: 10 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/planning/patterns.scala x: 80 commits (all time) y: 269 lines of code python/pyspark/pandas/data_type_ops/categorical_ops.py x: 25 commits (all time) y: 84 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala x: 44 commits (all time) y: 391 lines of code common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java x: 55 commits (all time) y: 1093 lines of code python/pyspark/sql/context.py x: 126 commits (all time) y: 296 lines of code core/src/main/scala/org/apache/spark/scheduler/dynalloc/ExecutorMonitor.scala x: 21 commits (all time) y: 438 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSqlParser.scala x: 248 commits (all time) y: 763 lines of code common/network-yarn/src/main/java/org/apache/spark/network/yarn/YarnShuffleService.java x: 35 commits (all time) y: 405 lines of code python/pyspark/sql/connect/client/artifact.py x: 6 commits (all time) y: 254 lines of code python/pyspark/sql/catalog.py x: 63 commits (all time) y: 317 lines of code python/pyspark/sql/connect/catalog.py x: 17 commits (all time) y: 262 lines of code python/pyspark/sql/connect/proto/catalog_pb2.pyi x: 9 commits (all time) y: 910 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetOptions.scala x: 17 commits (all time) y: 60 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/joins.scala x: 47 commits (all time) y: 367 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/hiveUDFs.scala x: 75 commits (all time) y: 327 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/InsertAdaptiveSparkPlan.scala x: 32 commits (all time) y: 101 lines of code connector/protobuf/src/main/scala/org/apache/spark/sql/protobuf/ProtobufDeserializer.scala x: 9 commits (all time) y: 326 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveStrategies.scala x: 144 commits (all time) y: 222 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/ExecutionPage.scala x: 27 commits (all time) y: 172 lines of code python/pyspark/ml/param/_shared_params_code_gen.py x: 54 commits (all time) y: 301 lines of code python/pyspark/ml/param/shared.py x: 56 commits (all time) y: 440 lines of code resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocator.scala x: 51 commits (all time) y: 676 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SubqueryBroadcastExec.scala x: 14 commits (all time) y: 80 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/HashedRelation.scala x: 95 commits (all time) y: 795 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroUtils.scala x: 6 commits (all time) y: 239 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/subquery.scala x: 57 commits (all time) y: 546 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/InterpretedUnsafeProjection.scala x: 21 commits (all time) y: 210 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/datetimeExpressions.scala x: 193 commits (all time) y: 2799 lines of code core/src/main/scala/org/apache/spark/rdd/RDD.scala x: 317 commits (all time) y: 1072 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/vectorized/ArrowColumnVector.java x: 12 commits (all time) y: 453 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ColumnResolutionHelper.scala x: 6 commits (all time) y: 290 lines of code python/pyspark/sql/pandas/conversion.py x: 43 commits (all time) y: 451 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/mathExpressions.scala x: 71 commits (all time) y: 1507 lines of code core/src/main/scala/org/apache/spark/deploy/history/ApplicationCache.scala x: 10 commits (all time) y: 218 lines of code core/src/main/scala/org/apache/spark/rdd/HadoopRDD.scala x: 158 commits (all time) y: 328 lines of code core/src/main/scala/org/apache/spark/ui/JettyUtils.scala x: 82 commits (all time) y: 447 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/DecorrelateInnerQuery.scala x: 18 commits (all time) y: 389 lines of code core/src/main/scala/org/apache/spark/status/AppStatusStore.scala x: 53 commits (all time) y: 747 lines of code core/src/main/scala/org/apache/spark/io/CompressionCodec.scala x: 66 commits (all time) y: 142 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java x: 11 commits (all time) y: 618 lines of code core/src/main/scala/org/apache/spark/deploy/SparkHadoopUtil.scala x: 148 commits (all time) y: 317 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/TableReader.scala x: 78 commits (all time) y: 369 lines of code streaming/src/main/scala/org/apache/spark/streaming/Checkpoint.scala x: 132 commits (all time) y: 286 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/AsyncProgressTrackingMicroBatchExecution.scala x: 3 commits (all time) y: 219 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/complexTypeCreator.scala x: 105 commits (all time) y: 574 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/CheckpointFileManager.scala x: 14 commits (all time) y: 227 lines of code launcher/src/main/java/org/apache/spark/launcher/SparkSubmitCommandBuilder.java x: 40 commits (all time) y: 394 lines of code python/pyspark/shell.py x: 117 commits (all time) y: 72 lines of code python/pyspark/sql/column.py x: 78 commits (all time) y: 401 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQuery.scala x: 6 commits (all time) y: 138 lines of code core/src/main/scala/org/apache/spark/status/protobuf/StageDataWrapperSerializer.scala x: 7 commits (all time) y: 658 lines of code sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala x: 50 commits (all time) y: 180 lines of code core/src/main/scala/org/apache/spark/util/collection/OpenHashMap.scala x: 33 commits (all time) y: 118 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamExecution.scala x: 157 commits (all time) y: 427 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/BasicWriteStatsTracker.scala x: 16 commits (all time) y: 159 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala x: 14 commits (all time) y: 392 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/predicates.scala x: 168 commits (all time) y: 971 lines of code sql/core/src/main/scala/org/apache/spark/sql/Column.scala x: 140 commits (all time) y: 250 lines of code sql/core/src/main/scala/org/apache/spark/sql/KeyValueGroupedDataset.scala x: 36 commits (all time) y: 374 lines of code core/src/main/scala/org/apache/spark/util/JsonProtocol.scala x: 130 commits (all time) y: 1350 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkExecuteStatementOperation.scala x: 81 commits (all time) y: 315 lines of code core/src/main/scala/org/apache/spark/ui/scope/RDDOperationGraph.scala x: 29 commits (all time) y: 183 lines of code core/src/main/scala/org/apache/spark/util/collection/ExternalAppendOnlyMap.scala x: 108 commits (all time) y: 380 lines of code connector/protobuf/src/main/scala/org/apache/spark/sql/protobuf/utils/SchemaConverters.scala x: 11 commits (all time) y: 122 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/WriteToDataSourceV2Exec.scala x: 47 commits (all time) y: 412 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala x: 205 commits (all time) y: 929 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/VectorizedRleValuesReader.java x: 27 commits (all time) y: 724 lines of code common/network-common/src/main/java/org/apache/spark/network/server/TransportRequestHandler.java x: 18 commits (all time) y: 238 lines of code python/pyspark/broadcast.py x: 70 commits (all time) y: 167 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLCLIDriver.scala x: 110 commits (all time) y: 530 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/features/MountVolumesFeatureStep.scala x: 15 commits (all time) y: 94 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/regexpExpressions.scala x: 75 commits (all time) y: 954 lines of code python/pyspark/sql/connect/readwriter.py x: 35 commits (all time) y: 707 lines of code python/pyspark/sql/readwriter.py x: 192 commits (all time) y: 727 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JDBCRDD.scala x: 72 commits (all time) y: 182 lines of code python/pyspark/errors/exceptions/captured.py x: 5 commits (all time) y: 173 lines of code python/pyspark/serializers.py x: 157 commits (all time) y: 373 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/ProgressReporter.scala x: 52 commits (all time) y: 281 lines of code core/src/main/scala/org/apache/spark/storage/BlockId.scala x: 51 commits (all time) y: 201 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryManager.scala x: 54 commits (all time) y: 255 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Expression.scala x: 178 commits (all time) y: 780 lines of code core/src/main/scala/org/apache/spark/broadcast/Broadcast.scala x: 42 commits (all time) y: 45 lines of code core/src/main/scala/org/apache/spark/broadcast/TorrentBroadcast.scala x: 86 commits (all time) y: 253 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/package.scala x: 34 commits (all time) y: 202 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/LogicalPlan.scala x: 121 commits (all time) y: 197 lines of code python/pyspark/pandas/accessors.py x: 31 commits (all time) y: 434 lines of code python/pyspark/conf.py x: 41 commits (all time) y: 121 lines of code python/pyspark/java_gateway.py x: 105 commits (all time) y: 138 lines of code python/pyspark/profiler.py x: 12 commits (all time) y: 318 lines of code python/pyspark/taskcontext.py x: 30 commits (all time) y: 145 lines of code resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMaster.scala x: 73 commits (all time) y: 722 lines of code resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala x: 93 commits (all time) y: 1167 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/CLIService.java x: 10 commits (all time) y: 410 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/session/HiveSessionImpl.java x: 12 commits (all time) y: 767 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileScanRDD.scala x: 48 commits (all time) y: 216 lines of code resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/config.scala x: 33 commits (all time) y: 376 lines of code common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ExternalShuffleBlockResolver.java x: 33 commits (all time) y: 362 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/InjectRuntimeFilter.scala x: 14 commits (all time) y: 279 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveShim.scala x: 23 commits (all time) y: 139 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/InsertIntoHiveDirCommand.scala x: 18 commits (all time) y: 95 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/InsertIntoHiveTable.scala x: 120 commits (all time) y: 200 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/literals.scala x: 118 commits (all time) y: 431 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeRow.java x: 79 commits (all time) y: 452 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/ApproximatePercentile.scala x: 36 commits (all time) y: 262 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/ColumnAccessor.scala x: 17 commits (all time) y: 122 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/ColumnType.scala x: 23 commits (all time) y: 589 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/compression/compressionSchemes.scala x: 10 commits (all time) y: 672 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/SpecificParquetRecordReaderBase.java x: 42 commits (all time) y: 216 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/parquet/ParquetPartitionReaderFactory.scala x: 27 commits (all time) y: 261 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/QueryPlanConstraints.scala x: 13 commits (all time) y: 68 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/UnwrapCastInBinaryComparison.scala x: 16 commits (all time) y: 243 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLDriver.scala x: 32 commits (all time) y: 80 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveInspectors.scala x: 93 commits (all time) y: 872 lines of code connector/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaMicroBatchStream.scala x: 4 commits (all time) y: 288 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/arithmetic.scala x: 162 commits (all time) y: 1091 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveMetastoreCatalog.scala x: 268 commits (all time) y: 311 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/TypeUtils.scala x: 40 commits (all time) y: 92 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/IncrementalExecution.scala x: 54 commits (all time) y: 280 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/statefulOperators.scala x: 55 commits (all time) y: 746 lines of code python/pyspark/pandas/config.py x: 24 commits (all time) y: 335 lines of code core/src/main/scala/org/apache/spark/executor/TaskMetrics.scala x: 94 commits (all time) y: 172 lines of code core/src/main/scala/org/apache/spark/deploy/SparkSubmitArguments.scala x: 109 commits (all time) y: 521 lines of code python/pyspark/storagelevel.py x: 29 commits (all time) y: 61 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala x: 68 commits (all time) y: 314 lines of code core/src/main/scala/org/apache/spark/deploy/worker/Worker.scala x: 204 commits (all time) y: 750 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/HashJoin.scala x: 65 commits (all time) y: 647 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/KubernetesConf.scala x: 34 commits (all time) y: 210 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/MapInPandasExec.scala x: 7 commits (all time) y: 14 lines of code core/src/main/scala/org/apache/spark/scheduler/LiveListenerBus.scala x: 25 commits (all time) y: 170 lines of code sql/core/src/main/scala/org/apache/spark/sql/RuntimeConfig.scala x: 24 commits (all time) y: 65 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/hash.scala x: 46 commits (all time) y: 783 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/higherOrderFunctions.scala x: 62 commits (all time) y: 986 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamingSymmetricHashJoinExec.scala x: 26 commits (all time) y: 405 lines of code mllib/src/main/scala/org/apache/spark/mllib/regression/IsotonicRegression.scala x: 26 commits (all time) y: 267 lines of code mllib-local/src/main/scala/org/apache/spark/ml/linalg/BLAS.scala x: 19 commits (all time) y: 654 lines of code mllib-local/src/main/scala/org/apache/spark/ml/linalg/Matrices.scala x: 21 commits (all time) y: 813 lines of code mllib-local/src/main/scala/org/apache/spark/ml/linalg/Vectors.scala x: 24 commits (all time) y: 559 lines of code mllib/src/main/scala/org/apache/spark/ml/Model.scala x: 10 commits (all time) y: 11 lines of code mllib/src/main/scala/org/apache/spark/ml/Pipeline.scala x: 45 commits (all time) y: 222 lines of code mllib/src/main/scala/org/apache/spark/ml/Predictor.scala x: 30 commits (all time) y: 95 lines of code mllib/src/main/scala/org/apache/spark/ml/ann/Layer.scala x: 17 commits (all time) y: 425 lines of code mllib/src/main/scala/org/apache/spark/ml/attribute/AttributeGroup.scala x: 12 commits (all time) y: 154 lines of code mllib/src/main/scala/org/apache/spark/ml/attribute/attributes.scala x: 15 commits (all time) y: 353 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/Classifier.scala x: 35 commits (all time) y: 102 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/DecisionTreeClassifier.scala x: 62 commits (all time) y: 207 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/GBTClassifier.scala x: 71 commits (all time) y: 267 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/LinearSVC.scala x: 44 commits (all time) y: 296 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/LogisticRegression.scala x: 168 commits (all time) y: 903 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/MultilayerPerceptronClassifier.scala x: 46 commits (all time) y: 238 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/NaiveBayes.scala x: 55 commits (all time) y: 422 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/OneVsRest.scala x: 43 commits (all time) y: 334 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/RandomForestClassifier.scala x: 64 commits (all time) y: 341 lines of code mllib/src/main/scala/org/apache/spark/ml/clustering/BisectingKMeans.scala x: 44 commits (all time) y: 195 lines of code mllib/src/main/scala/org/apache/spark/ml/clustering/KMeans.scala x: 64 commits (all time) y: 509 lines of code mllib/src/main/scala/org/apache/spark/ml/clustering/LDA.scala x: 52 commits (all time) y: 517 lines of code mllib/src/main/scala/org/apache/spark/ml/evaluation/RegressionEvaluator.scala x: 29 commits (all time) y: 82 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/BucketedRandomProjectionLSH.scala x: 11 commits (all time) y: 153 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/ChiSqSelector.scala x: 36 commits (all time) y: 107 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizer.scala x: 35 commits (all time) y: 248 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/Imputer.scala x: 19 commits (all time) y: 196 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/MinMaxScaler.scala x: 36 commits (all time) y: 165 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/OneHotEncoder.scala x: 35 commits (all time) y: 376 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/PolynomialExpansion.scala x: 24 commits (all time) y: 131 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/QuantileDiscretizer.scala x: 41 commits (all time) y: 148 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/RFormula.scala x: 54 commits (all time) y: 378 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/RFormulaParser.scala x: 9 commits (all time) y: 196 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/RobustScaler.scala x: 11 commits (all time) y: 184 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/VectorAssembler.scala x: 40 commits (all time) y: 221 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/VectorSlicer.scala x: 20 commits (all time) y: 116 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/Word2Vec.scala x: 52 commits (all time) y: 255 lines of code mllib/src/main/scala/org/apache/spark/ml/param/params.scala x: 61 commits (all time) y: 595 lines of code mllib/src/main/scala/org/apache/spark/ml/param/shared/SharedParamsCodeGen.scala x: 56 commits (all time) y: 220 lines of code mllib/src/main/scala/org/apache/spark/ml/r/GeneralizedLinearRegressionWrapper.scala x: 20 commits (all time) y: 154 lines of code mllib/src/main/scala/org/apache/spark/ml/recommendation/ALS.scala x: 101 commits (all time) y: 1021 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/AFTSurvivalRegression.scala x: 68 commits (all time) y: 336 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/DecisionTreeRegressor.scala x: 59 commits (all time) y: 205 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/FMRegressor.scala x: 18 commits (all time) y: 462 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/GeneralizedLinearRegression.scala x: 72 commits (all time) y: 948 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/IsotonicRegression.scala x: 38 commits (all time) y: 200 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/LinearRegression.scala x: 139 commits (all time) y: 578 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/Regressor.scala x: 9 commits (all time) y: 11 lines of code mllib/src/main/scala/org/apache/spark/ml/stat/Summarizer.scala x: 22 commits (all time) y: 519 lines of code mllib/src/main/scala/org/apache/spark/ml/tree/Split.scala x: 13 commits (all time) y: 110 lines of code mllib/src/main/scala/org/apache/spark/ml/tree/impl/RandomForest.scala x: 50 commits (all time) y: 806 lines of code mllib/src/main/scala/org/apache/spark/ml/tree/treeModels.scala x: 29 commits (all time) y: 314 lines of code mllib/src/main/scala/org/apache/spark/ml/tuning/TrainValidationSplit.scala x: 29 commits (all time) y: 263 lines of code mllib/src/main/scala/org/apache/spark/ml/util/ReadWrite.scala x: 44 commits (all time) y: 351 lines of code mllib/src/main/scala/org/apache/spark/mllib/api/python/PythonMLLibAPI.scala x: 152 commits (all time) y: 1121 lines of code mllib/src/main/scala/org/apache/spark/mllib/classification/LogisticRegression.scala x: 63 commits (all time) y: 211 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/KMeans.scala x: 89 commits (all time) y: 323 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/LDAModel.scala x: 57 commits (all time) y: 539 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/LDAOptimizer.scala x: 43 commits (all time) y: 370 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala x: 32 commits (all time) y: 264 lines of code mllib/src/main/scala/org/apache/spark/mllib/feature/ChiSqSelector.scala x: 37 commits (all time) y: 201 lines of code mllib/src/main/scala/org/apache/spark/mllib/feature/StandardScaler.scala x: 25 commits (all time) y: 98 lines of code mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala x: 67 commits (all time) y: 516 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/BLAS.scala x: 31 commits (all time) y: 568 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/Matrices.scala x: 65 commits (all time) y: 772 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/Vectors.scala x: 82 commits (all time) y: 692 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala x: 62 commits (all time) y: 539 lines of code mllib/src/main/scala/org/apache/spark/mllib/optimization/Gradient.scala x: 39 commits (all time) y: 139 lines of code mllib/src/main/scala/org/apache/spark/mllib/optimization/GradientDescent.scala x: 44 commits (all time) y: 178 lines of code mllib/src/main/scala/org/apache/spark/mllib/random/RandomRDDs.scala x: 16 commits (all time) y: 506 lines of code mllib/src/main/scala/org/apache/spark/mllib/recommendation/ALS.scala x: 90 commits (all time) y: 221 lines of code mllib/src/main/scala/org/apache/spark/mllib/recommendation/MatrixFactorizationModel.scala x: 60 commits (all time) y: 234 lines of code mllib/src/main/scala/org/apache/spark/mllib/regression/LinearRegression.scala x: 47 commits (all time) y: 62 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/DecisionTree.scala x: 57 commits (all time) y: 110 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/RandomForest.scala x: 35 commits (all time) y: 131 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/configuration/BoostingStrategy.scala x: 18 commits (all time) y: 48 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/loss/AbsoluteError.scala x: 12 commits (all time) y: 13 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/loss/LogLoss.scala x: 14 commits (all time) y: 17 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/model/treeEnsembleModels.scala x: 35 commits (all time) y: 263 lines of code mllib/src/main/scala/org/apache/spark/mllib/util/LinearDataGenerator.scala x: 35 commits (all time) y: 107 lines of code mllib/src/main/scala/org/apache/spark/mllib/util/MFDataGenerator.scala x: 34 commits (all time) y: 57 lines of code mllib/src/main/scala/org/apache/spark/mllib/util/MLUtils.scala x: 66 commits (all time) y: 383 lines of code python/pyspark/ml/linalg/__init__.py x: 22 commits (all time) y: 779 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/CoalesceShufflePartitions.scala x: 26 commits (all time) y: 117 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskInfo.scala x: 47 commits (all time) y: 80 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JDBCOptions.scala x: 42 commits (all time) y: 192 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/H2Dialect.scala x: 25 commits (all time) y: 218 lines of code python/pyspark/__init__.py x: 96 commits (all time) y: 85 lines of code core/src/main/scala/org/apache/spark/scheduler/Stage.scala x: 63 commits (all time) y: 53 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/SparkPlanGraph.scala x: 33 commits (all time) y: 161 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/KubernetesClusterSchedulerBackend.scala x: 36 commits (all time) y: 256 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala x: 29 commits (all time) y: 103 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/CommandUtils.scala x: 32 commits (all time) y: 315 lines of code python/pyspark/sql/connect/proto/types_pb2.pyi x: 8 commits (all time) y: 876 lines of code core/src/main/scala/org/apache/spark/ExecutorAllocationManager.scala x: 97 commits (all time) y: 635 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveUtils.scala x: 75 commits (all time) y: 410 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/continuous/ContinuousExecution.scala x: 71 commits (all time) y: 323 lines of code core/src/main/scala/org/apache/spark/storage/BlockManagerMaster.scala x: 109 commits (all time) y: 199 lines of code core/src/main/scala/org/apache/spark/storage/BlockManagerMasterEndpoint.scala x: 69 commits (all time) y: 786 lines of code core/src/main/java/org/apache/spark/util/collection/TimSort.java x: 5 commits (all time) y: 470 lines of code python/pyspark/pandas/categorical.py x: 26 commits (all time) y: 234 lines of code python/pyspark/pandas/datetimes.py x: 13 commits (all time) y: 182 lines of code python/pyspark/pandas/missing/frame.py x: 15 commits (all time) y: 33 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskScheduler.scala x: 74 commits (all time) y: 32 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/TableChange.java x: 12 commits (all time) y: 405 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/CSVInferSchema.scala x: 26 commits (all time) y: 207 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JsonInferSchema.scala x: 27 commits (all time) y: 290 lines of code core/src/main/scala/org/apache/spark/deploy/master/Master.scala x: 238 commits (all time) y: 930 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaConverter.scala x: 35 commits (all time) y: 419 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVFileFormat.scala x: 44 commits (all time) y: 113 lines of code core/src/main/scala/org/apache/spark/util/collection/ExternalSorter.scala x: 75 commits (all time) y: 532 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkPlan.scala x: 139 commits (all time) y: 367 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala x: 48 commits (all time) y: 258 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Average.scala x: 43 commits (all time) y: 122 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Sum.scala x: 39 commits (all time) y: 149 lines of code sql/core/src/main/scala/org/apache/spark/sql/UDFRegistration.scala x: 62 commits (all time) y: 935 lines of code core/src/main/scala/org/apache/spark/api/r/SerDe.scala x: 25 commits (all time) y: 362 lines of code streaming/src/main/scala/org/apache/spark/streaming/StreamingContext.scala x: 194 commits (all time) y: 468 lines of code streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaStreamingContext.scala x: 137 commits (all time) y: 274 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/features/BasicDriverFeatureStep.scala x: 33 commits (all time) y: 151 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala x: 159 commits (all time) y: 581 lines of code common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ExternalBlockHandler.java x: 18 commits (all time) y: 481 lines of code common/kvstore/src/main/java/org/apache/spark/util/kvstore/InMemoryStore.java x: 10 commits (all time) y: 370 lines of code core/src/main/java/org/apache/spark/unsafe/map/BytesToBytesMap.java x: 56 commits (all time) y: 586 lines of code core/src/main/resources/org/apache/spark/ui/static/webui.css x: 58 commits (all time) y: 348 lines of code launcher/src/main/java/org/apache/spark/launcher/SparkLauncher.java x: 23 commits (all time) y: 263 lines of code core/src/main/scala/org/apache/spark/status/storeTypes.scala x: 29 commits (all time) y: 469 lines of code core/src/main/scala/org/apache/spark/scheduler/StageInfo.scala x: 67 commits (all time) y: 78 lines of code core/src/main/scala/org/apache/spark/storage/PushBasedFetchHelper.scala x: 7 commits (all time) y: 197 lines of code core/src/main/scala/org/apache/spark/ui/jobs/JobPage.scala x: 48 commits (all time) y: 472 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Projection.scala x: 62 commits (all time) y: 88 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/MonotonicallyIncreasingID.scala x: 24 commits (all time) y: 52 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/randomExpressions.scala x: 30 commits (all time) y: 107 lines of code core/src/main/scala/org/apache/spark/deploy/history/FsHistoryProvider.scala x: 136 commits (all time) y: 1190 lines of code core/src/main/scala/org/apache/spark/scheduler/SparkListener.scala x: 130 commits (all time) y: 317 lines of code core/src/main/scala/org/apache/spark/ui/PagedTable.scala x: 14 commits (all time) y: 272 lines of code core/src/main/scala/org/apache/spark/ui/storage/RDDPage.scala x: 52 commits (all time) y: 209 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskSet.scala x: 24 commits (all time) y: 13 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ExistingRDD.scala x: 86 commits (all time) y: 198 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/HDFSMetadataLog.scala x: 45 commits (all time) y: 245 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/conditionalExpressions.scala x: 64 commits (all time) y: 285 lines of code resource-managers/yarn/src/main/scala/org/apache/spark/scheduler/cluster/YarnSchedulerBackend.scala x: 28 commits (all time) y: 256 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/operation/LogDivertAppender.java x: 10 commits (all time) y: 227 lines of code appveyor.yml x: 36 commits (all time) y: 42 lines of code python/pyspark/shuffle.py x: 29 commits (all time) y: 446 lines of code core/src/main/scala/org/apache/spark/storage/BlockManagerDecommissioner.scala x: 19 commits (all time) y: 303 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcUtils.scala x: 39 commits (all time) y: 396 lines of code connector/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaOffsetReaderAdmin.scala x: 2 commits (all time) y: 412 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/ShufflePartitionsUtil.scala x: 21 commits (all time) y: 261 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskResult.scala x: 56 commits (all time) y: 70 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskResultGetter.scala x: 59 commits (all time) y: 119 lines of code core/src/main/scala/org/apache/spark/util/io/ChunkedByteBuffer.scala x: 22 commits (all time) y: 207 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/ParquetVectorUpdaterFactory.java x: 7 commits (all time) y: 996 lines of code sql/core/src/main/scala/org/apache/spark/sql/internal/SharedState.scala x: 58 commits (all time) y: 185 lines of code python/pyspark/sql/pandas/_typing/__init__.pyi x: 9 commits (all time) y: 308 lines of code core/src/main/scala/org/apache/spark/shuffle/BlockStoreShuffleReader.scala x: 42 commits (all time) y: 98 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/TimeWindow.scala x: 28 commits (all time) y: 211 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/AggUtils.scala x: 24 commits (all time) y: 425 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/SortOrder.scala x: 44 commits (all time) y: 172 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/xml/xpath.scala x: 12 commits (all time) y: 195 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/InMemoryCatalog.scala x: 71 commits (all time) y: 533 lines of code core/src/main/scala/org/apache/spark/util/SizeEstimator.scala x: 58 commits (all time) y: 232 lines of code python/run-tests.py x: 53 commits (all time) y: 276 lines of code core/src/main/scala/org/apache/spark/TaskEndReason.scala x: 73 commits (all time) y: 138 lines of code core/src/main/scala/org/apache/spark/deploy/ApplicationDescription.scala x: 34 commits (all time) y: 20 lines of code core/src/main/scala/org/apache/spark/deploy/master/ApplicationInfo.scala x: 56 commits (all time) y: 150 lines of code sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala x: 218 commits (all time) y: 265 lines of code core/src/main/scala/org/apache/spark/rdd/JdbcRDD.scala x: 38 commits (all time) y: 120 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveCatalogs.scala x: 62 commits (all time) y: 31 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/CatalystTypeConverters.scala x: 56 commits (all time) y: 396 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/features/BasicExecutorFeatureStep.scala x: 39 commits (all time) y: 241 lines of code core/src/main/scala/org/apache/spark/rdd/AsyncRDDActions.scala x: 48 commits (all time) y: 83 lines of code sql/core/src/main/scala/org/apache/spark/sql/catalyst/util/V2ExpressionBuilder.scala x: 23 commits (all time) y: 321 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/StandaloneSchedulerBackend.scala x: 54 commits (all time) y: 236 lines of code core/src/main/scala/org/apache/spark/TaskContext.scala x: 75 commits (all time) y: 94 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/ComplexTypes.scala x: 21 commits (all time) y: 34 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonGenerator.scala x: 33 commits (all time) y: 219 lines of code python/pyspark/mllib/tree.py x: 38 commits (all time) y: 321 lines of code core/src/main/scala/org/apache/spark/broadcast/BroadcastFactory.scala x: 34 commits (all time) y: 13 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/intervalExpressions.scala x: 40 commits (all time) y: 667 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/CostBasedJoinReorder.scala x: 24 commits (all time) y: 255 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVector.java x: 32 commits (all time) y: 568 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala x: 82 commits (all time) y: 200 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/linearRegression.scala x: 10 commits (all time) y: 275 lines of code core/src/main/scala/org/apache/spark/deploy/history/HistoryPage.scala x: 35 commits (all time) y: 74 lines of code core/src/main/scala/org/apache/spark/deploy/master/ui/ApplicationPage.scala x: 63 commits (all time) y: 130 lines of code core/src/main/scala/org/apache/spark/deploy/worker/ExecutorRunner.scala x: 89 commits (all time) y: 151 lines of code core/src/main/scala/org/apache/spark/scheduler/EventLoggingListener.scala x: 67 commits (all time) y: 214 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JDBCRelation.scala x: 47 commits (all time) y: 247 lines of code core/src/main/scala/org/apache/spark/rdd/BlockRDD.scala x: 48 commits (all time) y: 51 lines of code core/src/main/scala/org/apache/spark/rdd/NewHadoopRDD.scala x: 104 commits (all time) y: 264 lines of code core/src/main/scala/org/apache/spark/rdd/SubtractedRDD.scala x: 52 commits (all time) y: 87 lines of code core/src/main/scala/org/apache/spark/ui/jobs/AllJobsPage.scala x: 63 commits (all time) y: 514 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/Exchange.scala x: 21 commits (all time) y: 48 lines of code core/src/main/scala/org/apache/spark/Partitioner.scala x: 76 commits (all time) y: 230 lines of code core/src/main/java/org/apache/spark/util/collection/unsafe/sort/UnsafeExternalSorter.java x: 72 commits (all time) y: 578 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/thrift/ThriftCLIServiceClient.java x: 5 commits (all time) y: 390 lines of code core/src/main/scala/org/apache/spark/metrics/MetricsSystem.scala x: 61 commits (all time) y: 170 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/OffHeapColumnVector.java x: 44 commits (all time) y: 440 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/EvaluatePython.scala x: 17 commits (all time) y: 215 lines of code common/kvstore/src/main/java/org/apache/spark/util/kvstore/LevelDB.java x: 10 commits (all time) y: 317 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/ui/HiveThriftServer2Listener.scala x: 7 commits (all time) y: 271 lines of code common/network-common/src/main/java/org/apache/spark/network/client/TransportResponseHandler.java x: 13 commits (all time) y: 234 lines of code python/pyspark/ml/evaluation.py x: 52 commits (all time) y: 561 lines of code core/src/main/scala/org/apache/spark/shuffle/ShuffleBlockPusher.scala x: 14 commits (all time) y: 319 lines of code python/pyspark/streaming/dstream.py x: 28 commits (all time) y: 491 lines of code sql/core/src/main/scala/org/apache/spark/sql/SQLImplicits.scala x: 38 commits (all time) y: 72 lines of code core/src/main/scala/org/apache/spark/rpc/netty/NettyRpcEnv.scala x: 44 commits (all time) y: 533 lines of code streaming/src/main/scala/org/apache/spark/streaming/ui/StreamingPage.scala x: 41 commits (all time) y: 417 lines of code connector/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceProvider.scala x: 1 commits (all time) y: 552 lines of code connector/kafka-0-10-token-provider/src/main/scala/org/apache/spark/kafka010/KafkaTokenUtil.scala x: 1 commits (all time) y: 226 lines of code connector/spark-ganglia-lgpl/src/main/java/com/codahale/metrics/ganglia/GangliaReporter.java x: 1 commits (all time) y: 286 lines of code resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosSchedulerUtils.scala x: 20 commits (all time) y: 385 lines of code python/pyspark/ml/param/__init__.py x: 44 commits (all time) y: 324 lines of code core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala x: 33 commits (all time) y: 199 lines of code core/src/main/scala/org/apache/spark/metrics/sink/GraphiteSink.scala x: 26 commits (all time) y: 67 lines of code core/src/main/scala/org/apache/spark/metrics/MetricsConfig.scala x: 42 commits (all time) y: 77 lines of code resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosCoarseGrainedSchedulerBackend.scala x: 45 commits (all time) y: 568 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/commands.scala x: 64 commits (all time) y: 106 lines of code graphx/src/main/scala/org/apache/spark/graphx/impl/GraphImpl.scala x: 50 commits (all time) y: 244 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/GenerateColumnAccessor.scala x: 29 commits (all time) y: 171 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/GenerateExec.scala x: 26 commits (all time) y: 226 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/ui/StreamingQueryStatisticsPage.scala x: 11 commits (all time) y: 455 lines of code core/src/main/scala/org/apache/spark/shuffle/sort/SortShuffleManager.scala x: 36 commits (all time) y: 150 lines of code core/src/main/scala/org/apache/spark/deploy/FaultToleranceTest.scala x: 59 commits (all time) y: 328 lines of code core/src/main/scala/org/apache/spark/deploy/rest/RestSubmissionClient.scala x: 18 commits (all time) y: 334 lines of code streaming/src/main/scala/org/apache/spark/streaming/scheduler/JobGenerator.scala x: 83 commits (all time) y: 196 lines of code sql/core/src/main/scala/org/apache/spark/sql/package.scala x: 27 commits (all time) y: 13 lines of code core/src/main/scala/org/apache/spark/deploy/Client.scala x: 54 commits (all time) y: 216 lines of code core/src/main/scala/org/apache/spark/deploy/LocalSparkCluster.scala x: 71 commits (all time) y: 77 lines of code core/src/main/scala/org/apache/spark/serializer/JavaSerializer.scala x: 46 commits (all time) y: 114 lines of code core/src/main/scala/org/apache/spark/rdd/CoalescedRDD.scala x: 52 commits (all time) y: 220 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSessionCatalog.scala x: 78 commits (all time) y: 25 lines of code core/src/main/scala/org/apache/spark/util/ClosureCleaner.scala x: 52 commits (all time) y: 500 lines of code core/src/main/scala/org/apache/spark/scheduler/InputFormatInfo.scala x: 48 commits (all time) y: 111 lines of code core/src/main/scala/org/apache/spark/api/java/JavaRDDLike.scala x: 122 commits (all time) y: 285 lines of code graphx/src/main/scala/org/apache/spark/graphx/Pregel.scala x: 33 commits (all time) y: 53 lines of code core/src/main/scala/org/apache/spark/rdd/DoubleRDDFunctions.scala x: 53 commits (all time) y: 133 lines of code core/src/main/scala/org/apache/spark/rdd/PairRDDFunctions.scala x: 197 commits (all time) y: 538 lines of code core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala x: 60 commits (all time) y: 200 lines of code core/src/main/resources/org/apache/spark/ui/static/historypage.js x: 30 commits (all time) y: 198 lines of code core/src/main/resources/org/apache/spark/ui/static/spark-dag-viz.js x: 29 commits (all time) y: 343 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeTableCommand.scala x: 21 commits (all time) y: 11 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveContext.scala x: 211 commits (all time) y: 20 lines of code core/src/main/scala/org/apache/spark/rdd/OrderedRDDFunctions.scala x: 38 commits (all time) y: 45 lines of code core/src/main/scala/org/apache/spark/deploy/worker/DriverRunner.scala x: 55 commits (all time) y: 198 lines of code resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala x: 32 commits (all time) y: 680 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/BoundAttribute.scala x: 57 commits (all time) y: 61 lines of code sql/core/src/main/scala/org/apache/spark/sql/SQLContext.scala x: 324 commits (all time) y: 349 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/view.scala x: 18 commits (all time) y: 14 lines of code streaming/src/main/scala/org/apache/spark/streaming/util/RawTextSender.scala x: 35 commits (all time) y: 50 lines of code core/src/main/scala/org/apache/spark/scheduler/Pool.scala x: 52 commits (all time) y: 90 lines of code core/src/main/scala/org/apache/spark/scheduler/Schedulable.scala x: 31 commits (all time) y: 23 lines of code core/src/main/scala/org/apache/spark/rdd/SequenceFileRDDFunctions.scala x: 30 commits (all time) y: 37 lines of code resource-managers/mesos/src/main/scala/org/apache/spark/deploy/mesos/config.scala x: 14 commits (all time) y: 348 lines of code core/src/main/scala/org/apache/spark/api/java/JavaPairRDD.scala x: 112 commits (all time) y: 419 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/DStream.scala x: 68 commits (all time) y: 529 lines of code core/src/main/scala/org/apache/spark/rdd/ParallelCollectionRDD.scala x: 44 commits (all time) y: 102 lines of code core/src/main/scala/org/apache/spark/scheduler/SparkListenerBus.scala x: 75 commits (all time) y: 80 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateOrdering.scala x: 48 commits (all time) y: 149 lines of code streaming/src/main/scala/org/apache/spark/streaming/DStreamGraph.scala x: 64 commits (all time) y: 149 lines of code streaming/src/main/scala/org/apache/spark/streaming/scheduler/ReceiverTracker.scala x: 53 commits (all time) y: 449 lines of code core/src/main/scala/org/apache/spark/api/java/JavaRDD.scala x: 72 commits (all time) y: 89 lines of code core/src/main/scala/org/apache/spark/deploy/master/ZooKeeperPersistenceEngine.scala x: 59 commits (all time) y: 48 lines of code core/src/main/scala/org/apache/spark/ui/jobs/PoolTable.scala x: 50 commits (all time) y: 49 lines of code streaming/src/main/scala/org/apache/spark/streaming/receiver/BlockGenerator.scala x: 21 commits (all time) y: 179 lines of code streaming/src/main/scala/org/apache/spark/streaming/scheduler/JobScheduler.scala x: 79 commits (all time) y: 193 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateUnsafeProjection.scala x: 63 commits (all time) y: 298 lines of code graphx/src/main/scala/org/apache/spark/graphx/GraphOps.scala x: 49 commits (all time) y: 188 lines of code core/src/main/scala/org/apache/spark/FutureAction.scala x: 50 commits (all time) y: 153 lines of code core/src/main/scala/org/apache/spark/deploy/master/FileSystemPersistenceEngine.scala x: 39 commits (all time) y: 53 lines of code core/src/main/scala/org/apache/spark/deploy/master/ZooKeeperLeaderElectionAgent.scala x: 44 commits (all time) y: 57 lines of code core/src/main/scala/org/apache/spark/deploy/worker/WorkerArguments.scala x: 41 commits (all time) y: 126 lines of code core/src/main/scala/org/apache/spark/deploy/worker/ui/WorkerWebUI.scala x: 81 commits (all time) y: 31 lines of code core/src/main/scala/org/apache/spark/rdd/ShuffledRDD.scala x: 59 commits (all time) y: 67 lines of code core/src/main/scala/org/apache/spark/scheduler/JobWaiter.scala x: 40 commits (all time) y: 32 lines of code core/src/main/scala/org/apache/spark/ui/UIWorkloadGenerator.scala x: 69 commits (all time) y: 81 lines of code streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaDStreamLike.scala x: 64 commits (all time) y: 185 lines of code streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaPairDStream.scala x: 82 commits (all time) y: 380 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/QueueInputDStream.scala x: 29 commits (all time) y: 45 lines of code sql/core/src/main/scala/org/apache/spark/sql/sources/interfaces.scala x: 106 commits (all time) y: 81 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/WindowedDStream.scala x: 45 commits (all time) y: 35 lines of code graphx/src/main/scala/org/apache/spark/graphx/Graph.scala x: 48 commits (all time) y: 95 lines of code licenses-binary/LICENSE-javassist.html x: 1 commits (all time) y: 369 lines of code core/src/main/scala/org/apache/spark/api/java/JavaDoubleRDD.scala x: 65 commits (all time) y: 82 lines of code core/src/main/scala/org/apache/spark/storage/BlockManagerSource.scala x: 43 commits (all time) y: 33 lines of code core/src/main/scala/org/apache/spark/TaskState.scala x: 23 commits (all time) y: 8 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/TransformedDStream.scala x: 38 commits (all time) y: 34 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSQLParser.scala x: 3 commits (all time) y: 763 lines of code core/src/main/scala/org/apache/spark/scheduler/DAGSchedulerSource.scala x: 46 commits (all time) y: 25 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUdf.scala x: 23 commits (all time) y: 1053 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/hiveUdfs.scala x: 54 commits (all time) y: 327 lines of code core/src/main/scala/org/apache/spark/deploy/master/ApplicationSource.scala x: 29 commits (all time) y: 17 lines of code core/src/main/scala/org/apache/spark/deploy/master/WorkerState.scala x: 31 commits (all time) y: 5 lines of code sql/core/src/main/scala/org/apache/spark/sql/UdfRegistration.scala x: 9 commits (all time) y: 935 lines of code
4424.0
lines of code
  min: 1.0
  average: 151.84
  25th percentile: 24.0
  median: 67.0
  75th percentile: 161.0
  max: 4424.0
0 1094.0
commits (all time)
min: 1.0 | average: 21.58 | 25th percentile: 2.0 | median: 7.0 | 75th percentile: 22.0 | max: 1094.0

File Size vs. Contributors (all time): 3586 points

connector/connect/server/src/main/scala/org/apache/spark/sql/connect/config/Connect.scala x: 11 contributors (all time) y: 157 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/CachedStreamResponse.scala x: 1 contributors (all time) y: 8 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteGrpcResponseSender.scala x: 1 contributors (all time) y: 191 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteResponseObserver.scala x: 1 contributors (all time) y: 205 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteThreadRunner.scala x: 4 contributors (all time) y: 149 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/ExecuteHolder.scala x: 5 contributors (all time) y: 109 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectExecutePlanHandler.scala x: 2 contributors (all time) y: 18 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectReattachExecuteHandler.scala x: 1 contributors (all time) y: 33 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/functions.scala x: 9 contributors (all time) y: 1323 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Dataset.scala x: 15 contributors (all time) y: 947 lines of code project/MimaExcludes.scala x: 158 contributors (all time) y: 163 lines of code sql/api/src/main/scala/org/apache/spark/sql/streaming/GroupState.scala x: 1 contributors (all time) y: 44 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/CodeGeneratorWithInterpretedFallback.scala x: 6 contributors (all time) y: 28 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala x: 172 contributors (all time) y: 4424 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/WholeStageCodegenExec.scala x: 50 contributors (all time) y: 528 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala x: 170 contributors (all time) y: 2820 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveLateralColumnAliasReference.scala x: 2 contributors (all time) y: 138 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowConverters.scala x: 13 contributors (all time) y: 341 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala x: 114 contributors (all time) y: 3431 lines of code python/pyspark/pandas/base.py x: 11 contributors (all time) y: 607 lines of code python/pyspark/errors/error_classes.py x: 8 contributors (all time) y: 3 lines of code python/pyspark/worker.py x: 73 contributors (all time) y: 777 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/SparkConnectPlanner.scala x: 26 contributors (all time) y: 2873 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala x: 25 contributors (all time) y: 378 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExec.scala x: 15 contributors (all time) y: 187 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 19 contributors (all time) y: 363 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala x: 12 contributors (all time) y: 643 lines of code python/pyspark/ml/connect/io_utils.py x: 2 contributors (all time) y: 195 lines of code python/pyspark/ml/connect/tuning.py x: 2 contributors (all time) y: 328 lines of code python/pyspark/ml/torch/distributor.py x: 7 contributors (all time) y: 624 lines of code python/pyspark/ml/util.py x: 28 contributors (all time) y: 388 lines of code python/pyspark/pandas/utils.py x: 9 contributors (all time) y: 629 lines of code python/pyspark/sql/connect/session.py x: 16 contributors (all time) y: 620 lines of code python/pyspark/sql/connect/udf.py x: 4 contributors (all time) y: 212 lines of code python/pyspark/sql/connect/udtf.py x: 3 contributors (all time) y: 147 lines of code python/pyspark/sql/session.py x: 57 contributors (all time) y: 763 lines of code python/pyspark/sql/utils.py x: 27 contributors (all time) y: 176 lines of code core/src/main/scala/org/apache/spark/ui/storage/StoragePage.scala x: 17 contributors (all time) y: 194 lines of code sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala x: 128 contributors (all time) y: 1462 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/JoinCodegenSupport.scala x: 3 contributors (all time) y: 62 lines of code sql/api/src/main/scala/org/apache/spark/sql/execution/streaming/Triggers.scala x: 1 contributors (all time) y: 61 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcExceptionConverter.scala x: 2 contributors (all time) y: 69 lines of code dev/sparktestsupport/modules.py x: 70 contributors (all time) y: 1015 lines of code python/pyspark/pandas/__init__.py x: 8 contributors (all time) y: 112 lines of code python/pyspark/pandas/indexes/__init__.py x: 2 contributors (all time) y: 4 lines of code python/pyspark/pandas/indexes/base.py x: 11 contributors (all time) y: 1008 lines of code python/pyspark/pandas/indexes/category.py x: 7 contributors (all time) y: 175 lines of code python/pyspark/pandas/indexes/datetimes.py x: 7 contributors (all time) y: 266 lines of code python/pyspark/pandas/series.py x: 16 contributors (all time) y: 2180 lines of code python/pyspark/pandas/spark/accessors.py x: 10 contributors (all time) y: 242 lines of code python/pyspark/pandas/usage_logging/__init__.py x: 6 contributors (all time) y: 100 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/ExecutorPodsWatchSnapshotSource.scala x: 5 contributors (all time) y: 69 lines of code python/pyspark/sql/pandas/serializers.py x: 13 contributors (all time) y: 546 lines of code python/pyspark/sql/pandas/types.py x: 10 contributors (all time) y: 599 lines of code core/src/main/scala/org/apache/spark/serializer/GenericAvroSerializer.scala x: 8 contributors (all time) y: 96 lines of code core/src/main/scala/org/apache/spark/serializer/KryoSerializer.scala x: 73 contributors (all time) y: 569 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/Encoders.scala x: 13 contributors (all time) y: 92 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonParser.scala x: 27 contributors (all time) y: 423 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/BadRecordException.scala x: 4 contributors (all time) y: 21 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/FailureSafeParser.scala x: 11 contributors (all time) y: 59 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryExecutionErrors.scala x: 71 contributors (all time) y: 2370 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/DataTypeErrors.scala x: 3 contributors (all time) y: 238 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryErrorsBase.scala x: 8 contributors (all time) y: 29 lines of code python/pyspark/testing/pandasutils.py x: 8 contributors (all time) y: 440 lines of code project/SparkBuild.scala x: 209 contributors (all time) y: 1369 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala x: 75 contributors (all time) y: 546 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/JavaTypeInference.scala x: 2 contributors (all time) y: 102 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala x: 50 contributors (all time) y: 755 lines of code core/src/main/scala/org/apache/spark/SparkConf.scala x: 78 contributors (all time) y: 463 lines of code dev/sparktestsupport/utils.py x: 8 contributors (all time) y: 64 lines of code python/pyspark/sql/udtf.py x: 3 contributors (all time) y: 279 lines of code python/pyspark/cloudpickle/cloudpickle_fast.py x: 4 contributors (all time) y: 452 lines of code python/pyspark/sql/connect/plan.py x: 19 contributors (all time) y: 1734 lines of code python/pyspark/sql/connect/client/core.py x: 8 contributors (all time) y: 1150 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcRetryHandler.scala x: 2 contributors (all time) y: 114 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkConnectClient.scala x: 14 contributors (all time) y: 435 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkResult.scala x: 6 contributors (all time) y: 225 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowDeserializer.scala x: 2 contributors (all time) y: 447 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala x: 59 contributors (all time) y: 541 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileIndex.scala x: 6 contributors (all time) y: 67 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileSourceStrategy.scala x: 36 contributors (all time) y: 227 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningAwareFileIndex.scala x: 26 contributors (all time) y: 157 lines of code python/pyspark/sql/worker/analyze_udtf.py x: 1 contributors (all time) y: 108 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/UserDefinedPythonFunction.scala x: 9 contributors (all time) y: 192 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/SortMergeJoinExec.scala x: 33 contributors (all time) y: 1142 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala x: 134 contributors (all time) y: 1590 lines of code python/pyspark/pandas/groupby.py x: 17 contributors (all time) y: 1638 lines of code python/pyspark/pandas/namespace.py x: 17 contributors (all time) y: 1460 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryListener.scala x: 3 contributors (all time) y: 73 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SessionHolder.scala x: 8 contributors (all time) y: 139 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveReferencesInUpdate.scala x: 2 contributors (all time) y: 40 lines of code connector/connect/common/src/main/protobuf/spark/connect/base.proto x: 12 contributors (all time) y: 662 lines of code python/pyspark/sql/connect/proto/base_pb2.pyi x: 12 contributors (all time) y: 2137 lines of code python/pyspark/testing/utils.py x: 10 contributors (all time) y: 367 lines of code core/src/main/scala/org/apache/spark/MapOutputTracker.scala x: 78 contributors (all time) y: 1104 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/Metadata.scala x: 2 contributors (all time) y: 180 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/FunctionBuilderBase.scala x: 1 contributors (all time) y: 79 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryCompilationErrors.scala x: 58 contributors (all time) y: 3219 lines of code python/pyspark/sql/types.py x: 58 contributors (all time) y: 1478 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectService.scala x: 19 contributors (all time) y: 283 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectStreamingQueryCache.scala x: 4 contributors (all time) y: 133 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseLexer.g4 x: 2 contributors (all time) y: 514 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseParser.g4 x: 2 contributors (all time) y: 1695 lines of code core/src/main/scala/org/apache/spark/executor/Executor.scala x: 124 contributors (all time) y: 875 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala x: 59 contributors (all time) y: 901 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/JdbcDialects.scala x: 40 contributors (all time) y: 354 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/PostgresDialect.scala x: 27 contributors (all time) y: 213 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/SupportsAtomicPartitionManagement.java x: 6 contributors (all time) y: 40 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/SupportsPartitionManagement.java x: 4 contributors (all time) y: 48 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/TableCatalog.java x: 12 contributors (all time) y: 65 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/sources/filters.scala x: 9 contributors (all time) y: 174 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/dsl/package.scala x: 10 contributors (all time) y: 979 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/ui/SparkConnectServerListener.scala x: 2 contributors (all time) y: 344 lines of code core/src/main/scala/org/apache/spark/shuffle/IndexShuffleBlockResolver.scala x: 26 contributors (all time) y: 429 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStore.scala x: 21 contributors (all time) y: 458 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStoreConf.scala x: 12 contributors (all time) y: 27 lines of code common/utils/src/main/scala/org/apache/spark/util/SparkSerDeUtils.scala x: 3 contributors (all time) y: 26 lines of code core/src/main/scala/org/apache/spark/util/Utils.scala x: 192 contributors (all time) y: 2147 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala x: 34 contributors (all time) y: 1053 lines of code python/setup.py x: 29 contributors (all time) y: 276 lines of code core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala x: 86 contributors (all time) y: 462 lines of code python/pyspark/sql/connect/proto/base_pb2_grpc.py x: 6 contributors (all time) y: 349 lines of code common/network-common/src/main/java/org/apache/spark/network/util/TransportConf.java x: 26 contributors (all time) y: 178 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SQLExecution.scala x: 24 contributors (all time) y: 164 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryManager.scala x: 4 contributors (all time) y: 89 lines of code connector/connect/common/src/main/protobuf/spark/connect/commands.proto x: 14 contributors (all time) y: 341 lines of code python/pyspark/sql/connect/proto/commands_pb2.pyi x: 12 contributors (all time) y: 1509 lines of code python/pyspark/sql/connect/streaming/query.py x: 3 contributors (all time) y: 219 lines of code python/pyspark/sql/streaming/listener.py x: 3 contributors (all time) y: 609 lines of code python/pyspark/sql/streaming/query.py x: 7 contributors (all time) y: 119 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/DeduplicateRelations.scala x: 14 contributors (all time) y: 329 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/UDFRegistration.scala x: 1 contributors (all time) y: 1078 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/UdfUtils.scala x: 3 contributors (all time) y: 493 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/encoders/RowEncoder.scala x: 2 contributors (all time) y: 71 lines of code core/src/main/scala/org/apache/spark/scheduler/ShuffleMapTask.scala x: 53 contributors (all time) y: 62 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala x: 52 contributors (all time) y: 810 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasExec.scala x: 18 contributors (all time) y: 37 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/window/WindowGroupLimitExec.scala x: 3 contributors (all time) y: 186 lines of code core/src/main/scala/org/apache/spark/util/SparkUncaughtExceptionHandler.scala x: 9 contributors (all time) y: 44 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/subquery.scala x: 31 contributors (all time) y: 139 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala x: 65 contributors (all time) y: 2504 lines of code sql/core/src/main/scala/org/apache/spark/sql/functions.scala x: 145 contributors (all time) y: 2003 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/artifact/SparkConnectArtifactManager.scala x: 5 contributors (all time) y: 186 lines of code core/src/main/scala/org/apache/spark/internal/config/package.scala x: 108 contributors (all time) y: 2224 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/ArtifactManager.scala x: 5 contributors (all time) y: 252 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/SQLAppStatusListener.scala x: 26 contributors (all time) y: 470 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/SQLListener.scala x: 20 contributors (all time) y: 67 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/ProtoUtils.scala x: 4 contributors (all time) y: 67 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/ui/SparkConnectServerAppStatusStore.scala x: 1 contributors (all time) y: 93 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/ui/SparkConnectServerPage.scala x: 1 contributors (all time) y: 442 lines of code core/src/main/scala/org/apache/spark/SparkContext.scala x: 219 contributors (all time) y: 1860 lines of code core/src/main/scala/org/apache/spark/ui/SparkUI.scala x: 56 contributors (all time) y: 169 lines of code common/utils/src/main/java/org/apache/spark/network/util/JavaUtils.java x: 1 contributors (all time) y: 253 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/protobuf/functions.scala x: 3 contributors (all time) y: 112 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/LiteralValueProtoConverter.scala x: 4 contributors (all time) y: 313 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/SparkIntervalUtils.scala x: 2 contributors (all time) y: 393 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/AnsiTypeCoercion.scala x: 12 contributors (all time) y: 177 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala x: 81 contributors (all time) y: 1828 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ToStringBase.scala x: 3 contributors (all time) y: 364 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/object.scala x: 20 contributors (all time) y: 596 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/IntervalUtils.scala x: 19 contributors (all time) y: 711 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/package.scala x: 27 contributors (all time) y: 146 lines of code mllib/src/main/scala/org/apache/spark/mllib/evaluation/RankingMetrics.scala x: 17 contributors (all time) y: 155 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroDeserializer.scala x: 6 contributors (all time) y: 455 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroSerializer.scala x: 3 contributors (all time) y: 304 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/progress.scala x: 5 contributors (all time) y: 192 lines of code sql/api/src/main/scala/org/apache/spark/sql/Row.scala x: 2 contributors (all time) y: 230 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/ScalaReflection.scala x: 1 contributors (all time) y: 309 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/DataTypeAstBuilder.scala x: 2 contributors (all time) y: 142 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/parsers.scala x: 2 contributors (all time) y: 275 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/DateFormatter.scala x: 1 contributors (all time) y: 140 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeFormatterHelper.scala x: 1 contributors (all time) y: 228 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/TimestampFormatter.scala x: 1 contributors (all time) y: 419 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/ExecutionErrors.scala x: 1 contributors (all time) y: 171 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/QueryParsingErrors.scala x: 2 contributors (all time) y: 565 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/DataType.scala x: 2 contributors (all time) y: 284 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/Decimal.scala x: 2 contributors (all time) y: 473 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/StructType.scala x: 2 contributors (all time) y: 367 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CTESubstitution.scala x: 17 contributors (all time) y: 175 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/CSVOptions.scala x: 18 contributors (all time) y: 267 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/UnivocityParser.scala x: 20 contributors (all time) y: 299 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala x: 58 contributors (all time) y: 379 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/Columnar.scala x: 20 contributors (all time) y: 373 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFilters.scala x: 24 contributors (all time) y: 666 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetReadSupport.scala x: 17 contributors (all time) y: 365 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala x: 12 contributors (all time) y: 323 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/parquet/ParquetScanBuilder.scala x: 12 contributors (all time) y: 78 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/progress.scala x: 23 contributors (all time) y: 181 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/MapInBatchExec.scala x: 7 contributors (all time) y: 51 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveInlineTables.scala x: 14 contributors (all time) y: 80 lines of code core/src/main/scala/org/apache/spark/executor/CoarseGrainedExecutorBackend.scala x: 85 contributors (all time) y: 456 lines of code core/src/main/scala/org/apache/spark/scheduler/SchedulerBackend.scala x: 25 contributors (all time) y: 27 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedClusterMessage.scala x: 44 contributors (all time) y: 92 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala x: 104 contributors (all time) y: 696 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2ScanExecBase.scala x: 12 contributors (all time) y: 148 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveShim.scala x: 44 contributors (all time) y: 1468 lines of code dev/appveyor-install-dependencies.ps1 x: 18 contributors (all time) y: 112 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala x: 74 contributors (all time) y: 654 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRunner.scala x: 35 contributors (all time) y: 556 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala x: 31 contributors (all time) y: 408 lines of code python/pyspark/sql/connect/types.py x: 8 contributors (all time) y: 230 lines of code python/pyspark/sql/connect/dataframe.py x: 16 contributors (all time) y: 1749 lines of code mllib/src/main/scala/org/apache/spark/ml/source/image/ImageFileFormat.scala x: 6 contributors (all time) y: 71 lines of code mllib/src/main/scala/org/apache/spark/ml/source/libsvm/LibSVMRelation.scala x: 28 contributors (all time) y: 138 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/encoders/ExpressionEncoder.scala x: 26 contributors (all time) y: 238 lines of code sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 63 contributors (all time) y: 557 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkStrategies.scala x: 100 contributors (all time) y: 685 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2CommandExec.scala x: 11 contributors (all time) y: 27 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/FlatMapGroupsInPandasWithStateExec.scala x: 6 contributors (all time) y: 166 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/sources/memory.scala x: 9 contributors (all time) y: 140 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala x: 49 contributors (all time) y: 1431 lines of code dev/run-tests.py x: 52 contributors (all time) y: 427 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 6 contributors (all time) y: 111 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/rows.scala x: 19 contributors (all time) y: 133 lines of code python/pyspark/util.py x: 18 contributors (all time) y: 171 lines of code core/src/main/scala/org/apache/spark/package.scala x: 25 contributors (all time) y: 12 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/AggregateInPandasExec.scala x: 14 contributors (all time) y: 136 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/ArrowEvalPythonExec.scala x: 15 contributors (all time) y: 85 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/ArrowPythonRunner.scala x: 14 contributors (all time) y: 42 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/FlatMapCoGroupsInPandasExec.scala x: 7 contributors (all time) y: 61 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/FlatMapGroupsInPandasExec.scala x: 14 contributors (all time) y: 55 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasEvaluatorFactory.scala x: 2 contributors (all time) y: 245 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicLogicalOperators.scala x: 84 contributors (all time) y: 1440 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statements.scala x: 21 contributors (all time) y: 114 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala x: 32 contributors (all time) y: 988 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/InsertIntoDataSourceDirCommand.scala x: 8 contributors (all time) y: 40 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/createDataSourceTables.scala x: 30 contributors (all time) y: 150 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala x: 66 contributors (all time) y: 953 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/views.scala x: 37 contributors (all time) y: 453 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/CreateHiveTableAsSelectCommand.scala x: 24 contributors (all time) y: 74 lines of code core/src/main/scala/org/apache/spark/ui/UIUtils.scala x: 72 contributors (all time) y: 584 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/rules.scala x: 46 contributors (all time) y: 421 lines of code connector/connect/common/src/main/protobuf/spark/connect/expressions.proto x: 10 contributors (all time) y: 311 lines of code python/pyspark/sql/connect/expressions.py x: 7 contributors (all time) y: 835 lines of code python/pyspark/sql/connect/functions.py x: 13 contributors (all time) y: 2070 lines of code python/pyspark/sql/connect/proto/expressions_pb2.pyi x: 11 contributors (all time) y: 1268 lines of code python/pyspark/sql/streaming/readwriter.py x: 4 contributors (all time) y: 540 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Strategy.scala x: 49 contributors (all time) y: 502 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Column.scala x: 5 contributors (all time) y: 273 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ordering.scala x: 10 contributors (all time) y: 70 lines of code core/src/main/resources/org/apache/spark/ui/static/stagepage.js x: 20 contributors (all time) y: 1040 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/trees/TreeNode.scala x: 67 contributors (all time) y: 824 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/StringUtils.scala x: 23 contributors (all time) y: 88 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AdaptiveSparkPlanExec.scala x: 32 contributors (all time) y: 564 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/QueryStageExec.scala x: 18 contributors (all time) y: 179 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/basicPhysicalOperators.scala x: 41 contributors (all time) y: 642 lines of code project/plugins.sbt x: 63 contributors (all time) y: 14 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/parquet/ParquetScan.scala x: 18 contributors (all time) y: 102 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala x: 85 contributors (all time) y: 1081 lines of code python/pyspark/sql/dataframe.py x: 128 contributors (all time) y: 1405 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/BroadcastExchangeExec.scala x: 26 contributors (all time) y: 148 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/HDFSBackedStateStoreProvider.scala x: 23 contributors (all time) y: 553 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBFileManager.scala x: 7 contributors (all time) y: 483 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBStateStoreProvider.scala x: 8 contributors (all time) y: 275 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2SessionCatalog.scala x: 19 contributors (all time) y: 323 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/RewriteRowLevelCommand.scala x: 2 contributors (all time) y: 161 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/unresolved.scala x: 61 contributors (all time) y: 382 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/dsl/package.scala x: 66 contributors (all time) y: 376 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/HyperLogLogPlusPlus.scala x: 17 contributors (all time) y: 83 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/PivotFirst.scala x: 7 contributors (all time) y: 98 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/interfaces.scala x: 34 contributors (all time) y: 255 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/ParserUtils.scala x: 22 contributors (all time) y: 76 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/QueryPlan.scala x: 56 contributors (all time) y: 343 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/LocalRelation.scala x: 21 contributors (all time) y: 69 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/types/DataTypeUtils.scala x: 3 contributors (all time) y: 130 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/ResolveDefaultColumnsUtil.scala x: 5 contributors (all time) y: 287 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/HashAggregateExec.scala x: 45 contributors (all time) y: 691 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/ObjectAggregationIterator.scala x: 12 contributors (all time) y: 226 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/TungstenAggregationIterator.scala x: 24 contributors (all time) y: 229 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/InMemoryRelation.scala x: 34 contributors (all time) y: 353 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningUtils.scala x: 41 contributors (all time) y: 387 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2Writes.scala x: 6 contributors (all time) y: 124 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/ShuffleExchangeExec.scala x: 22 contributors (all time) y: 227 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/objects.scala x: 22 contributors (all time) y: 449 lines of code core/src/main/scala/org/apache/spark/ui/jobs/StageTable.scala x: 61 contributors (all time) y: 300 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/AllExecutionsPage.scala x: 22 contributors (all time) y: 495 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/ui/ThriftServerPage.scala x: 22 contributors (all time) y: 346 lines of code core/src/main/scala/org/apache/spark/SparkEnv.scala x: 86 contributors (all time) y: 402 lines of code core/src/main/scala/org/apache/spark/api/python/PythonWorkerFactory.scala x: 38 contributors (all time) y: 292 lines of code python/pyspark/daemon.py x: 28 contributors (all time) y: 151 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/PythonUDF.scala x: 11 contributors (all time) y: 146 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/subquery.scala x: 30 contributors (all time) y: 242 lines of code python/pyspark/sql/__init__.py x: 15 contributors (all time) y: 34 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ConcatenatingArrowStreamReader.scala x: 1 contributors (all time) y: 131 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/EvalPythonExec.scala x: 13 contributors (all time) y: 22 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/debug/package.scala x: 34 contributors (all time) y: 183 lines of code core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala x: 141 contributors (all time) y: 2008 lines of code python/pyspark/sql/connect/streaming/readwriter.py x: 4 contributors (all time) y: 511 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/ApplyInPandasWithStatePythonRunner.scala x: 7 contributors (all time) y: 163 lines of code python/pyspark/version.py x: 12 contributors (all time) y: 1 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/FunctionRegistry.scala x: 119 contributors (all time) y: 856 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CountMinSketchAgg.scala x: 10 contributors (all time) y: 169 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/generators.scala x: 38 contributors (all time) y: 432 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/maskExpressions.scala x: 4 contributors (all time) y: 253 lines of code dev/merge_spark_pr.py x: 33 contributors (all time) y: 406 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/ui/StreamingQueryStatusListener.scala x: 9 contributors (all time) y: 114 lines of code connector/connect/common/src/main/protobuf/spark/connect/relations.proto x: 18 contributors (all time) y: 796 lines of code python/pyspark/sql/connect/proto/relations_pb2.pyi x: 17 contributors (all time) y: 2915 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DistributionAndOrderingUtils.scala x: 5 contributors (all time) y: 76 lines of code python/pyspark/sql/sql_formatter.py x: 3 contributors (all time) y: 49 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MaxByAndMinBy.scala x: 7 contributors (all time) y: 87 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/jsonExpressions.scala x: 39 contributors (all time) y: 716 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/text/TextOptions.scala x: 10 contributors (all time) y: 28 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala x: 77 contributors (all time) y: 1204 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TableOutputResolver.scala x: 12 contributors (all time) y: 442 lines of code sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala x: 90 contributors (all time) y: 444 lines of code sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriterV2.scala x: 15 contributors (all time) y: 148 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/QueryExecution.scala x: 64 contributors (all time) y: 354 lines of code python/pyspark/rdd.py x: 135 contributors (all time) y: 1514 lines of code python/pyspark/sql/_typing.pyi x: 5 contributors (all time) y: 52 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/pythonLogicalOperators.scala x: 12 contributors (all time) y: 122 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/ExtractPythonUDFs.scala x: 24 contributors (all time) y: 215 lines of code python/pyspark/sql/udf.py x: 23 contributors (all time) y: 425 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/PythonForeachWriter.scala x: 5 contributors (all time) y: 102 lines of code sql/core/src/main/scala/org/apache/spark/sql/internal/SessionState.scala x: 33 contributors (all time) y: 108 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/v2ResolutionPlans.scala x: 11 contributors (all time) y: 137 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NestedColumnAliasing.scala x: 15 contributors (all time) y: 256 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Implicits.scala x: 12 contributors (all time) y: 96 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Relation.scala x: 16 contributors (all time) y: 174 lines of code sql/core/src/main/scala/org/apache/spark/sql/RelationalGroupedDataset.scala x: 39 contributors (all time) y: 342 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/udaf.scala x: 19 contributors (all time) y: 415 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/SetCommand.scala x: 17 contributors (all time) y: 137 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala x: 49 contributors (all time) y: 707 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/functions.scala x: 24 contributors (all time) y: 133 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormat.scala x: 20 contributors (all time) y: 175 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/LogicalRelation.scala x: 24 contributors (all time) y: 61 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala x: 27 contributors (all time) y: 173 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala x: 57 contributors (all time) y: 373 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/FileScan.scala x: 16 contributors (all time) y: 145 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/PushDownUtils.scala x: 13 contributors (all time) y: 128 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2ScanRelationPushDown.scala x: 15 contributors (all time) y: 429 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamingRelation.scala x: 16 contributors (all time) y: 77 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/memory.scala x: 32 contributors (all time) y: 199 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/FlatMapGroupsWithStateExecHelper.scala x: 5 contributors (all time) y: 152 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/SymmetricHashJoinStateManager.scala x: 16 contributors (all time) y: 446 lines of code sql/core/src/main/scala/org/apache/spark/sql/internal/CatalogImpl.scala x: 33 contributors (all time) y: 541 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamReader.scala x: 39 contributors (all time) y: 135 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 43 contributors (all time) y: 287 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/SQLAppStatusStore.scala x: 10 contributors (all time) y: 115 lines of code core/src/main/scala/org/apache/spark/storage/BlockManager.scala x: 115 contributors (all time) y: 1519 lines of code core/src/main/scala/org/apache/spark/storage/DiskBlockManager.scala x: 52 contributors (all time) y: 249 lines of code core/src/main/scala/org/apache/spark/storage/DiskBlockObjectWriter.scala x: 19 contributors (all time) y: 212 lines of code core/src/main/scala/org/apache/spark/storage/DiskStore.scala x: 47 contributors (all time) y: 261 lines of code core/src/main/scala/org/apache/spark/storage/FallbackStorage.scala x: 7 contributors (all time) y: 150 lines of code core/src/main/scala/org/apache/spark/storage/ShuffleBlockFetcherIterator.scala x: 55 contributors (all time) y: 1081 lines of code core/src/main/scala/org/apache/spark/storage/memory/MemoryStore.scala x: 21 contributors (all time) y: 601 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala x: 74 contributors (all time) y: 1003 lines of code core/src/main/resources/org/apache/spark/ui/static/executorspage.js x: 22 contributors (all time) y: 701 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/DeserializerBuildHelper.scala x: 7 contributors (all time) y: 349 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/SerializerBuildHelper.scala x: 9 contributors (all time) y: 371 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/datasources/orc/OrcColumnarBatchReader.java x: 9 contributors (all time) y: 138 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcDeserializer.scala x: 13 contributors (all time) y: 222 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRowConverter.scala x: 21 contributors (all time) y: 556 lines of code python/pyspark/context.py x: 111 contributors (all time) y: 747 lines of code python/pyspark/pandas/supported_api_gen.py x: 7 contributors (all time) y: 206 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala x: 68 contributors (all time) y: 4395 lines of code core/src/main/scala/org/apache/spark/errors/SparkCoreErrors.scala x: 8 contributors (all time) y: 412 lines of code core/src/main/scala/org/apache/spark/shuffle/ShufflePartitionPairsWriter.scala x: 3 contributors (all time) y: 104 lines of code core/src/main/resources/org/apache/spark/ui/static/utils.js x: 13 contributors (all time) y: 195 lines of code python/pyspark/pandas/resample.py x: 4 contributors (all time) y: 387 lines of code python/pyspark/pandas/spark/functions.py x: 7 contributors (all time) y: 128 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/ToNumberParser.scala x: 4 contributors (all time) y: 640 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroOptions.scala x: 8 contributors (all time) y: 78 lines of code common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java x: 15 contributors (all time) y: 1586 lines of code core/src/main/scala/org/apache/spark/util/ThreadUtils.scala x: 20 contributors (all time) y: 233 lines of code sql/core/src/main/scala/org/apache/spark/sql/catalog/Catalog.scala x: 20 contributors (all time) y: 155 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala x: 37 contributors (all time) y: 238 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/trees/TreePatterns.scala x: 22 contributors (all time) y: 126 lines of code python/pyspark/sql/group.py x: 27 contributors (all time) y: 88 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/ddl.scala x: 26 contributors (all time) y: 104 lines of code sql/core/src/main/scala/org/apache/spark/sql/internal/BaseSessionStateBuilder.scala x: 41 contributors (all time) y: 228 lines of code R/pkg/pkgdown/_pkgdown_template.yml x: 4 contributors (all time) y: 291 lines of code core/src/main/scala/org/apache/spark/api/java/JavaSparkContext.scala x: 53 contributors (all time) y: 252 lines of code core/src/main/scala/org/apache/spark/status/AppStatusListener.scala x: 40 contributors (all time) y: 1100 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/ShuffledHashJoinExec.scala x: 18 contributors (all time) y: 505 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/datasketchesAggregates.scala x: 3 contributors (all time) y: 204 lines of code core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala x: 100 contributors (all time) y: 1133 lines of code core/src/main/scala/org/apache/spark/deploy/security/HadoopDelegationTokenManager.scala x: 11 contributors (all time) y: 196 lines of code common/network-common/src/main/java/org/apache/spark/network/client/TransportClientFactory.java x: 16 contributors (all time) y: 230 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveExternalCatalog.scala x: 57 contributors (all time) y: 964 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/higherOrderFunctions.scala x: 11 contributors (all time) y: 77 lines of code common/unsafe/src/main/java/org/apache/spark/unsafe/Platform.java x: 18 contributors (all time) y: 240 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/DecimalPrecision.scala x: 17 contributors (all time) y: 120 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/KeyValueGroupedDataset.scala x: 2 contributors (all time) y: 413 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/MySQLDialect.scala x: 17 contributors (all time) y: 243 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/IsolatedClientLoader.scala x: 33 contributors (all time) y: 231 lines of code python/pyspark/pandas/window.py x: 10 contributors (all time) y: 539 lines of code core/src/main/scala/org/apache/spark/api/python/PythonUtils.scala x: 22 contributors (all time) y: 96 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/metric/SQLMetrics.scala x: 28 contributors (all time) y: 168 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/namedExpressions.scala x: 54 contributors (all time) y: 394 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/CodeGenerator.scala x: 68 contributors (all time) y: 1157 lines of code core/src/main/resources/org/apache/spark/ui/static/timeline-view.js x: 10 contributors (all time) y: 245 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/Config.scala x: 37 contributors (all time) y: 661 lines of code core/src/main/scala/org/apache/spark/internal/config/UI.scala x: 10 contributors (all time) y: 192 lines of code core/src/main/scala/org/apache/spark/storage/BlockManagerMessages.scala x: 35 contributors (all time) y: 82 lines of code core/src/main/scala/org/apache/spark/ui/exec/ExecutorsTab.scala x: 22 contributors (all time) y: 38 lines of code core/src/main/scala/org/apache/spark/resource/ResourceProfile.scala x: 8 contributors (all time) y: 348 lines of code core/src/main/scala/org/apache/spark/resource/ResourceUtils.scala x: 10 contributors (all time) y: 349 lines of code python/pyspark/pandas/typedef/typehints.py x: 12 contributors (all time) y: 393 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/RewriteMergeIntoTable.scala x: 1 contributors (all time) y: 357 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/parameters.scala x: 4 contributors (all time) y: 74 lines of code core/src/main/scala/org/apache/spark/scheduler/DAGSchedulerEvent.scala x: 46 contributors (all time) y: 75 lines of code core/src/main/scala/org/apache/spark/scheduler/ResultTask.scala x: 44 contributors (all time) y: 47 lines of code core/src/main/scala/org/apache/spark/scheduler/Task.scala x: 55 contributors (all time) y: 122 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala x: 103 contributors (all time) y: 945 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FlatMapGroupsWithStateExec.scala x: 18 contributors (all time) y: 357 lines of code python/pyspark/ml/classification.py x: 48 contributors (all time) y: 2099 lines of code python/pyspark/ml/clustering.py x: 33 contributors (all time) y: 958 lines of code python/pyspark/ml/feature.py x: 66 contributors (all time) y: 3363 lines of code python/pyspark/ml/fpm.py x: 15 contributors (all time) y: 226 lines of code python/pyspark/ml/recommendation.py x: 25 contributors (all time) y: 322 lines of code python/pyspark/ml/regression.py x: 40 contributors (all time) y: 1523 lines of code python/pyspark/ml/tree.py x: 6 contributors (all time) y: 252 lines of code python/pyspark/ml/tuning.py x: 41 contributors (all time) y: 1099 lines of code python/pyspark/ml/wrapper.py x: 22 contributors (all time) y: 213 lines of code python/pyspark/mllib/classification.py x: 37 contributors (all time) y: 398 lines of code python/pyspark/mllib/clustering.py x: 41 contributors (all time) y: 449 lines of code python/pyspark/mllib/evaluation.py x: 19 contributors (all time) y: 254 lines of code python/pyspark/mllib/feature.py x: 26 contributors (all time) y: 346 lines of code python/pyspark/mllib/linalg/__init__.py x: 23 contributors (all time) y: 908 lines of code python/pyspark/mllib/linalg/distributed.py x: 15 contributors (all time) y: 365 lines of code python/pyspark/mllib/regression.py x: 36 contributors (all time) y: 371 lines of code python/pyspark/streaming/context.py x: 21 contributors (all time) y: 210 lines of code python/pyspark/sql/connect/group.py x: 6 contributors (all time) y: 311 lines of code python/pyspark/sql/connect/column.py x: 7 contributors (all time) y: 379 lines of code core/src/main/protobuf/org/apache/spark/status/protobuf/store_types.proto x: 7 contributors (all time) y: 740 lines of code core/src/main/scala/org/apache/spark/SparkStatusTracker.scala x: 9 contributors (all time) y: 55 lines of code core/src/main/scala/org/apache/spark/status/LiveEntity.scala x: 25 contributors (all time) y: 817 lines of code core/src/main/scala/org/apache/spark/status/api/v1/api.scala x: 40 contributors (all time) y: 467 lines of code connector/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/consumer/KafkaDataConsumer.scala x: 3 contributors (all time) y: 442 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/windowExpressions.scala x: 38 contributors (all time) y: 797 lines of code sql/core/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveSessionCatalog.scala x: 41 contributors (all time) y: 508 lines of code python/pyspark/accumulators.py x: 41 contributors (all time) y: 146 lines of code python/pyspark/ml/functions.py x: 11 contributors (all time) y: 278 lines of code python/pyspark/pandas/data_type_ops/base.py x: 8 contributors (all time) y: 366 lines of code python/pyspark/pandas/data_type_ops/boolean_ops.py x: 7 contributors (all time) y: 334 lines of code python/pyspark/pandas/data_type_ops/num_ops.py x: 9 contributors (all time) y: 429 lines of code python/pyspark/pandas/indexing.py x: 10 contributors (all time) y: 1202 lines of code python/pyspark/pandas/internal.py x: 9 contributors (all time) y: 842 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala x: 51 contributors (all time) y: 618 lines of code python/pyspark/pandas/generic.py x: 15 contributors (all time) y: 938 lines of code python/pyspark/pandas/plot/matplotlib.py x: 6 contributors (all time) y: 555 lines of code python/pyspark/pandas/strings.py x: 7 contributors (all time) y: 315 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/ExpressionImplUtils.java x: 4 contributors (all time) y: 187 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/misc.scala x: 43 contributors (all time) y: 389 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/stat/StatFunctions.scala x: 29 contributors (all time) y: 171 lines of code core/src/main/scala/org/apache/spark/rdd/CheckpointRDD.scala x: 31 contributors (all time) y: 10 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/planning/patterns.scala x: 43 contributors (all time) y: 269 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala x: 20 contributors (all time) y: 391 lines of code common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java x: 35 contributors (all time) y: 1093 lines of code python/pyspark/sql/context.py x: 49 contributors (all time) y: 296 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/ExpressionInfo.java x: 12 contributors (all time) y: 157 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/PlanHelper.scala x: 5 contributors (all time) y: 30 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkOptimizer.scala x: 31 contributors (all time) y: 74 lines of code core/src/main/scala/org/apache/spark/scheduler/dynalloc/ExecutorMonitor.scala x: 14 contributors (all time) y: 438 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSqlParser.scala x: 82 contributors (all time) y: 763 lines of code common/network-yarn/src/main/java/org/apache/spark/network/yarn/YarnShuffleService.java x: 24 contributors (all time) y: 405 lines of code python/pyspark/sql/connect/client/artifact.py x: 3 contributors (all time) y: 254 lines of code python/pyspark/sql/catalog.py x: 26 contributors (all time) y: 317 lines of code python/pyspark/sql/connect/catalog.py x: 6 contributors (all time) y: 262 lines of code python/pyspark/sql/connect/proto/catalog_pb2.pyi x: 3 contributors (all time) y: 910 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/joins.scala x: 28 contributors (all time) y: 367 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/hints.scala x: 14 contributors (all time) y: 97 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/hiveUDFs.scala x: 42 contributors (all time) y: 327 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/InsertAdaptiveSparkPlan.scala x: 17 contributors (all time) y: 101 lines of code connector/protobuf/src/main/scala/org/apache/spark/sql/protobuf/ProtobufDeserializer.scala x: 4 contributors (all time) y: 326 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/SchemaConverters.scala x: 4 contributors (all time) y: 197 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FallBackFileSourceV2.scala x: 7 contributors (all time) y: 21 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveStrategies.scala x: 49 contributors (all time) y: 222 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/submit/KubernetesClientApplication.scala x: 19 contributors (all time) y: 185 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/ExecutionPage.scala x: 19 contributors (all time) y: 172 lines of code python/pyspark/ml/param/_shared_params_code_gen.py x: 18 contributors (all time) y: 301 lines of code python/pyspark/ml/param/shared.py x: 18 contributors (all time) y: 440 lines of code resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocator.scala x: 34 contributors (all time) y: 676 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/HashedRelation.scala x: 39 contributors (all time) y: 795 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/KubernetesExecutorBuilder.scala x: 13 contributors (all time) y: 62 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroUtils.scala x: 6 contributors (all time) y: 239 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/subquery.scala x: 28 contributors (all time) y: 546 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/InterpretedUnsafeProjection.scala x: 11 contributors (all time) y: 210 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/datetimeExpressions.scala x: 62 contributors (all time) y: 2799 lines of code core/src/main/scala/org/apache/spark/rdd/RDD.scala x: 130 contributors (all time) y: 1072 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/vectorized/ArrowColumnVector.java x: 11 contributors (all time) y: 453 lines of code python/pyspark/sql/connect/conversion.py x: 4 contributors (all time) y: 344 lines of code connector/protobuf/src/main/scala/org/apache/spark/sql/protobuf/ProtobufDataToCatalyst.scala x: 4 contributors (all time) y: 118 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/mathExpressions.scala x: 41 contributors (all time) y: 1507 lines of code core/src/main/scala/org/apache/spark/deploy/history/ApplicationCache.scala x: 8 contributors (all time) y: 218 lines of code core/src/main/scala/org/apache/spark/rdd/HadoopRDD.scala x: 69 contributors (all time) y: 328 lines of code core/src/main/scala/org/apache/spark/ui/JettyUtils.scala x: 50 contributors (all time) y: 447 lines of code core/src/main/scala/org/apache/spark/ui/ToolTips.scala x: 16 contributors (all time) y: 62 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/DecorrelateInnerQuery.scala x: 9 contributors (all time) y: 389 lines of code core/src/main/scala/org/apache/spark/status/AppStatusStore.scala x: 27 contributors (all time) y: 747 lines of code core/src/main/scala/org/apache/spark/io/CompressionCodec.scala x: 38 contributors (all time) y: 142 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java x: 5 contributors (all time) y: 618 lines of code core/src/main/scala/org/apache/spark/deploy/SparkHadoopUtil.scala x: 66 contributors (all time) y: 317 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/MsSqlServerDialect.scala x: 19 contributors (all time) y: 141 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/TableReader.scala x: 49 contributors (all time) y: 369 lines of code streaming/src/main/scala/org/apache/spark/streaming/Checkpoint.scala x: 60 contributors (all time) y: 286 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/complexTypeCreator.scala x: 45 contributors (all time) y: 574 lines of code python/pyspark/shell.py x: 56 contributors (all time) y: 72 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/BlockMatrix.scala x: 20 contributors (all time) y: 343 lines of code python/pyspark/sql/column.py x: 41 contributors (all time) y: 401 lines of code core/src/main/scala/org/apache/spark/scheduler/SplitInfo.scala x: 11 contributors (all time) y: 52 lines of code core/src/main/scala/org/apache/spark/status/protobuf/StageDataWrapperSerializer.scala x: 3 contributors (all time) y: 658 lines of code sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala x: 29 contributors (all time) y: 180 lines of code core/src/main/scala/org/apache/spark/network/netty/NettyBlockTransferService.scala x: 25 contributors (all time) y: 169 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala x: 10 contributors (all time) y: 392 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/predicates.scala x: 66 contributors (all time) y: 971 lines of code sql/core/src/main/scala/org/apache/spark/sql/Column.scala x: 63 contributors (all time) y: 250 lines of code sql/core/src/main/scala/org/apache/spark/sql/KeyValueGroupedDataset.scala x: 21 contributors (all time) y: 374 lines of code core/src/main/scala/org/apache/spark/util/JsonProtocol.scala x: 65 contributors (all time) y: 1350 lines of code core/src/main/scala/org/apache/spark/ui/env/EnvironmentPage.scala x: 13 contributors (all time) y: 164 lines of code core/src/main/scala/org/apache/spark/memory/UnifiedMemoryManager.scala x: 13 contributors (all time) y: 140 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkExecuteStatementOperation.scala x: 45 contributors (all time) y: 315 lines of code python/pyspark/sql/pandas/functions.py x: 10 contributors (all time) y: 129 lines of code core/src/main/scala/org/apache/spark/util/collection/ExternalAppendOnlyMap.scala x: 37 contributors (all time) y: 380 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/jdbc/JDBCTableCatalog.scala x: 10 contributors (all time) y: 265 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/WriteToDataSourceV2Exec.scala x: 22 contributors (all time) y: 412 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FileStreamSource.scala x: 34 contributors (all time) y: 411 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala x: 98 contributors (all time) y: 929 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/VectorizedRleValuesReader.java x: 15 contributors (all time) y: 724 lines of code python/pyspark/broadcast.py x: 42 contributors (all time) y: 167 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLCLIDriver.scala x: 56 contributors (all time) y: 530 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/features/MountVolumesFeatureStep.scala x: 10 contributors (all time) y: 94 lines of code core/src/main/scala/org/apache/spark/executor/ProcfsMetricsGetter.scala x: 9 contributors (all time) y: 190 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/regexpExpressions.scala x: 40 contributors (all time) y: 954 lines of code python/pyspark/sql/connect/readwriter.py x: 7 contributors (all time) y: 707 lines of code python/pyspark/sql/readwriter.py x: 79 contributors (all time) y: 727 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JDBCRDD.scala x: 39 contributors (all time) y: 182 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/ProgressReporter.scala x: 23 contributors (all time) y: 281 lines of code core/src/main/scala/org/apache/spark/storage/BlockId.scala x: 29 contributors (all time) y: 201 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryManager.scala x: 30 contributors (all time) y: 255 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Expression.scala x: 64 contributors (all time) y: 780 lines of code core/src/main/scala/org/apache/spark/broadcast/Broadcast.scala x: 23 contributors (all time) y: 45 lines of code core/src/main/scala/org/apache/spark/broadcast/TorrentBroadcast.scala x: 45 contributors (all time) y: 253 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/package.scala x: 22 contributors (all time) y: 202 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/LogicalPlan.scala x: 52 contributors (all time) y: 197 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/InlineCTE.scala x: 5 contributors (all time) y: 127 lines of code python/pyspark/pandas/accessors.py x: 7 contributors (all time) y: 434 lines of code python/pyspark/pandas/numpy_compat.py x: 7 contributors (all time) y: 213 lines of code python/pyspark/conf.py x: 22 contributors (all time) y: 121 lines of code python/pyspark/java_gateway.py x: 51 contributors (all time) y: 138 lines of code python/pyspark/profiler.py x: 10 contributors (all time) y: 318 lines of code python/pyspark/taskcontext.py x: 18 contributors (all time) y: 145 lines of code resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMaster.scala x: 41 contributors (all time) y: 722 lines of code resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala x: 47 contributors (all time) y: 1167 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/CLIService.java x: 9 contributors (all time) y: 410 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/session/HiveSessionImpl.java x: 10 contributors (all time) y: 767 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileScanRDD.scala x: 31 contributors (all time) y: 216 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/NoSuchItemException.scala x: 15 contributors (all time) y: 228 lines of code resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/config.scala x: 19 contributors (all time) y: 376 lines of code common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ExternalShuffleBlockResolver.java x: 20 contributors (all time) y: 362 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/InjectRuntimeFilter.scala x: 8 contributors (all time) y: 279 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/InsertIntoHiveTable.scala x: 57 contributors (all time) y: 200 lines of code project/MimaBuild.scala x: 22 contributors (all time) y: 59 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/literals.scala x: 50 contributors (all time) y: 431 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeRow.java x: 28 contributors (all time) y: 452 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/InternalRow.scala x: 16 contributors (all time) y: 112 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ColumnVectorUtils.java x: 15 contributors (all time) y: 192 lines of code core/src/main/scala/org/apache/spark/util/SparkExitCode.scala x: 3 contributors (all time) y: 12 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/ApproxCountDistinctForIntervals.scala x: 8 contributors (all time) y: 194 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/ApproximatePercentile.scala x: 26 contributors (all time) y: 262 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/ColumnType.scala x: 16 contributors (all time) y: 589 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/compression/compressionSchemes.scala x: 10 contributors (all time) y: 672 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/SpecificParquetRecordReaderBase.java x: 29 contributors (all time) y: 216 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/parquet/ParquetPartitionReaderFactory.scala x: 16 contributors (all time) y: 261 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLDriver.scala x: 23 contributors (all time) y: 80 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveInspectors.scala x: 38 contributors (all time) y: 872 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/arithmetic.scala x: 61 contributors (all time) y: 1091 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveMetastoreCatalog.scala x: 73 contributors (all time) y: 311 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/decimalExpressions.scala x: 20 contributors (all time) y: 221 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/TypeUtils.scala x: 21 contributors (all time) y: 92 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/IncrementalExecution.scala x: 25 contributors (all time) y: 280 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/statefulOperators.scala x: 20 contributors (all time) y: 746 lines of code python/pyspark/pandas/config.py x: 11 contributors (all time) y: 335 lines of code core/src/main/scala/org/apache/spark/executor/TaskMetrics.scala x: 41 contributors (all time) y: 172 lines of code core/src/main/scala/org/apache/spark/deploy/SparkSubmitArguments.scala x: 65 contributors (all time) y: 521 lines of code sql/core/src/main/scala/org/apache/spark/sql/SparkSessionExtensions.scala x: 15 contributors (all time) y: 135 lines of code python/pyspark/storagelevel.py x: 23 contributors (all time) y: 61 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/DerbyDialect.scala x: 10 contributors (all time) y: 48 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/write/streaming/StreamingWrite.java x: 5 contributors (all time) y: 14 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/noop/NoopDataSource.scala x: 10 contributors (all time) y: 64 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala x: 40 contributors (all time) y: 314 lines of code core/src/main/scala/org/apache/spark/deploy/worker/Worker.scala x: 85 contributors (all time) y: 750 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/BroadcastHashJoinExec.scala x: 22 contributors (all time) y: 173 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/HashJoin.scala x: 26 contributors (all time) y: 647 lines of code python/pyspark/sql/avro/functions.py x: 13 contributors (all time) y: 73 lines of code python/pyspark/sql/window.py x: 20 contributors (all time) y: 103 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/thrift/ThriftHttpServlet.java x: 6 contributors (all time) y: 393 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/MapInPandasExec.scala x: 4 contributors (all time) y: 14 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/StopWordsRemover.scala x: 21 contributors (all time) y: 134 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/hash.scala x: 32 contributors (all time) y: 783 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/higherOrderFunctions.scala x: 34 contributors (all time) y: 986 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/ParquetColumnVector.java x: 5 contributors (all time) y: 225 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamingSymmetricHashJoinExec.scala x: 14 contributors (all time) y: 405 lines of code mllib/src/main/scala/org/apache/spark/mllib/regression/IsotonicRegression.scala x: 16 contributors (all time) y: 267 lines of code mllib-local/src/main/scala/org/apache/spark/ml/linalg/BLAS.scala x: 12 contributors (all time) y: 654 lines of code mllib-local/src/main/scala/org/apache/spark/ml/linalg/Matrices.scala x: 17 contributors (all time) y: 813 lines of code mllib-local/src/main/scala/org/apache/spark/ml/linalg/Vectors.scala x: 16 contributors (all time) y: 559 lines of code mllib/src/main/scala/org/apache/spark/ml/ann/Layer.scala x: 12 contributors (all time) y: 425 lines of code mllib/src/main/scala/org/apache/spark/ml/attribute/attributes.scala x: 12 contributors (all time) y: 353 lines of code mllib/src/main/scala/org/apache/spark/ml/attribute/package.scala x: 6 contributors (all time) y: 2 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/DecisionTreeClassifier.scala x: 24 contributors (all time) y: 207 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/GBTClassifier.scala x: 30 contributors (all time) y: 267 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/LinearSVC.scala x: 15 contributors (all time) y: 296 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/LogisticRegression.scala x: 48 contributors (all time) y: 903 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/MultilayerPerceptronClassifier.scala x: 23 contributors (all time) y: 238 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/OneVsRest.scala x: 23 contributors (all time) y: 334 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/ProbabilisticClassifier.scala x: 15 contributors (all time) y: 164 lines of code mllib/src/main/scala/org/apache/spark/ml/classification/RandomForestClassifier.scala x: 29 contributors (all time) y: 341 lines of code mllib/src/main/scala/org/apache/spark/ml/clustering/BisectingKMeans.scala x: 20 contributors (all time) y: 195 lines of code mllib/src/main/scala/org/apache/spark/ml/clustering/KMeans.scala x: 31 contributors (all time) y: 509 lines of code mllib/src/main/scala/org/apache/spark/ml/clustering/LDA.scala x: 29 contributors (all time) y: 517 lines of code mllib/src/main/scala/org/apache/spark/ml/evaluation/BinaryClassificationEvaluator.scala x: 16 contributors (all time) y: 90 lines of code mllib/src/main/scala/org/apache/spark/ml/evaluation/Evaluator.scala x: 7 contributors (all time) y: 17 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/Bucketizer.scala x: 21 contributors (all time) y: 188 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizer.scala x: 24 contributors (all time) y: 248 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/Imputer.scala x: 12 contributors (all time) y: 196 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/PCA.scala x: 18 contributors (all time) y: 135 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/QuantileDiscretizer.scala x: 22 contributors (all time) y: 148 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/RFormula.scala x: 27 contributors (all time) y: 378 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/Selector.scala x: 4 contributors (all time) y: 228 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/StringIndexer.scala x: 29 contributors (all time) y: 461 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/UnivariateFeatureSelector.scala x: 5 contributors (all time) y: 306 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/VectorAssembler.scala x: 23 contributors (all time) y: 221 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/VectorIndexer.scala x: 22 contributors (all time) y: 360 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/VectorSlicer.scala x: 14 contributors (all time) y: 116 lines of code mllib/src/main/scala/org/apache/spark/ml/feature/Word2Vec.scala x: 31 contributors (all time) y: 255 lines of code mllib/src/main/scala/org/apache/spark/ml/fpm/FPGrowth.scala x: 17 contributors (all time) y: 247 lines of code mllib/src/main/scala/org/apache/spark/ml/param/params.scala x: 25 contributors (all time) y: 595 lines of code mllib/src/main/scala/org/apache/spark/ml/python/MLSerDe.scala x: 3 contributors (all time) y: 168 lines of code mllib/src/main/scala/org/apache/spark/ml/r/AFTSurvivalRegressionWrapper.scala x: 9 contributors (all time) y: 103 lines of code mllib/src/main/scala/org/apache/spark/ml/r/GeneralizedLinearRegressionWrapper.scala x: 8 contributors (all time) y: 154 lines of code mllib/src/main/scala/org/apache/spark/ml/recommendation/ALS.scala x: 43 contributors (all time) y: 1021 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/FMRegressor.scala x: 5 contributors (all time) y: 462 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/GBTRegressor.scala x: 25 contributors (all time) y: 235 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/GeneralizedLinearRegression.scala x: 28 contributors (all time) y: 948 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/IsotonicRegression.scala x: 23 contributors (all time) y: 200 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/LinearRegression.scala x: 47 contributors (all time) y: 578 lines of code mllib/src/main/scala/org/apache/spark/ml/regression/RandomForestRegressor.scala x: 26 contributors (all time) y: 216 lines of code mllib/src/main/scala/org/apache/spark/ml/stat/Summarizer.scala x: 12 contributors (all time) y: 519 lines of code mllib/src/main/scala/org/apache/spark/ml/tree/Node.scala x: 12 contributors (all time) y: 236 lines of code mllib/src/main/scala/org/apache/spark/ml/tree/impl/RandomForest.scala x: 27 contributors (all time) y: 806 lines of code mllib/src/main/scala/org/apache/spark/ml/tree/treeModels.scala x: 15 contributors (all time) y: 314 lines of code mllib/src/main/scala/org/apache/spark/ml/tuning/CrossValidator.scala x: 29 contributors (all time) y: 290 lines of code mllib/src/main/scala/org/apache/spark/ml/tuning/TrainValidationSplit.scala x: 19 contributors (all time) y: 263 lines of code mllib/src/main/scala/org/apache/spark/ml/util/ReadWrite.scala x: 21 contributors (all time) y: 351 lines of code mllib/src/main/scala/org/apache/spark/ml/util/SchemaUtils.scala x: 11 contributors (all time) y: 109 lines of code mllib/src/main/scala/org/apache/spark/mllib/api/python/PythonMLLibAPI.scala x: 58 contributors (all time) y: 1121 lines of code mllib/src/main/scala/org/apache/spark/mllib/classification/LogisticRegression.scala x: 36 contributors (all time) y: 211 lines of code mllib/src/main/scala/org/apache/spark/mllib/classification/NaiveBayes.scala x: 33 contributors (all time) y: 272 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/BisectingKMeans.scala x: 15 contributors (all time) y: 344 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/GaussianMixtureModel.scala x: 19 contributors (all time) y: 121 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/LDAModel.scala x: 24 contributors (all time) y: 539 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/LDAOptimizer.scala x: 24 contributors (all time) y: 370 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/LocalKMeans.scala x: 15 contributors (all time) y: 90 lines of code mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala x: 22 contributors (all time) y: 264 lines of code mllib/src/main/scala/org/apache/spark/mllib/evaluation/RegressionMetrics.scala x: 17 contributors (all time) y: 65 lines of code mllib/src/main/scala/org/apache/spark/mllib/feature/IDF.scala x: 10 contributors (all time) y: 146 lines of code mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala x: 37 contributors (all time) y: 516 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/BLAS.scala x: 22 contributors (all time) y: 568 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/Matrices.scala x: 38 contributors (all time) y: 772 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/Vectors.scala x: 44 contributors (all time) y: 692 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/IndexedRowMatrix.scala x: 17 contributors (all time) y: 160 lines of code mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala x: 31 contributors (all time) y: 539 lines of code mllib/src/main/scala/org/apache/spark/mllib/optimization/Optimizer.scala x: 11 contributors (all time) y: 6 lines of code mllib/src/main/scala/org/apache/spark/mllib/random/RandomRDDs.scala x: 11 contributors (all time) y: 506 lines of code mllib/src/main/scala/org/apache/spark/mllib/rdd/RDDFunctions.scala x: 9 contributors (all time) y: 18 lines of code mllib/src/main/scala/org/apache/spark/mllib/recommendation/ALS.scala x: 44 contributors (all time) y: 221 lines of code mllib/src/main/scala/org/apache/spark/mllib/regression/GeneralizedLinearAlgorithm.scala x: 25 contributors (all time) y: 155 lines of code mllib/src/main/scala/org/apache/spark/mllib/regression/LabeledPoint.scala x: 19 contributors (all time) y: 41 lines of code mllib/src/main/scala/org/apache/spark/mllib/regression/Lasso.scala x: 25 contributors (all time) y: 62 lines of code mllib/src/main/scala/org/apache/spark/mllib/regression/LinearRegression.scala x: 30 contributors (all time) y: 62 lines of code mllib/src/main/scala/org/apache/spark/mllib/regression/RidgeRegression.scala x: 26 contributors (all time) y: 62 lines of code mllib/src/main/scala/org/apache/spark/mllib/stat/distribution/MultivariateGaussian.scala x: 13 contributors (all time) y: 47 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/DecisionTree.scala x: 24 contributors (all time) y: 110 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/GradientBoostedTrees.scala x: 18 contributors (all time) y: 66 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/RandomForest.scala x: 25 contributors (all time) y: 131 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/loss/AbsoluteError.scala x: 9 contributors (all time) y: 13 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/loss/SquaredError.scala x: 10 contributors (all time) y: 13 lines of code mllib/src/main/scala/org/apache/spark/mllib/tree/model/treeEnsembleModels.scala x: 20 contributors (all time) y: 263 lines of code mllib/src/main/scala/org/apache/spark/mllib/util/KMeansDataGenerator.scala x: 12 contributors (all time) y: 46 lines of code mllib/src/main/scala/org/apache/spark/mllib/util/MLUtils.scala x: 43 contributors (all time) y: 383 lines of code mllib/src/main/scala/org/apache/spark/mllib/util/SVMDataGenerator.scala x: 15 contributors (all time) y: 39 lines of code python/pyspark/ml/linalg/__init__.py x: 13 contributors (all time) y: 779 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskInfo.scala x: 25 contributors (all time) y: 80 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JDBCOptions.scala x: 26 contributors (all time) y: 192 lines of code python/pyspark/__init__.py x: 53 contributors (all time) y: 85 lines of code core/src/main/scala/org/apache/spark/scheduler/Stage.scala x: 33 contributors (all time) y: 53 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/InMemoryTableScanExec.scala x: 27 contributors (all time) y: 117 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/SparkPlanGraph.scala x: 20 contributors (all time) y: 161 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/CommandUtils.scala x: 24 contributors (all time) y: 315 lines of code core/src/main/java/org/apache/spark/memory/TaskMemoryManager.java x: 24 contributors (all time) y: 274 lines of code python/pyspark/sql/connect/proto/types_pb2.pyi x: 4 contributors (all time) y: 876 lines of code core/src/main/scala/org/apache/spark/ExecutorAllocationManager.scala x: 59 contributors (all time) y: 635 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveUtils.scala x: 35 contributors (all time) y: 410 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/continuous/ContinuousExecution.scala x: 28 contributors (all time) y: 323 lines of code core/src/main/scala/org/apache/spark/storage/BlockManagerMaster.scala x: 53 contributors (all time) y: 199 lines of code core/src/main/scala/org/apache/spark/storage/BlockManagerMasterEndpoint.scala x: 37 contributors (all time) y: 786 lines of code core/src/main/scala/org/apache/spark/TestUtils.scala x: 32 contributors (all time) y: 361 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskScheduler.scala x: 40 contributors (all time) y: 32 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/TableChange.java x: 6 contributors (all time) y: 405 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2AlterTableCommands.scala x: 5 contributors (all time) y: 163 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/JoinEstimation.scala x: 11 contributors (all time) y: 243 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JsonInferSchema.scala x: 14 contributors (all time) y: 290 lines of code core/src/main/scala/org/apache/spark/deploy/master/Master.scala x: 92 contributors (all time) y: 930 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaConverter.scala x: 22 contributors (all time) y: 419 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/complexTypeExtractors.scala x: 30 contributors (all time) y: 342 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/connector/util/V2ExpressionSQLBuilder.java x: 5 contributors (all time) y: 355 lines of code core/src/main/scala/org/apache/spark/util/collection/ExternalSorter.scala x: 43 contributors (all time) y: 532 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkPlan.scala x: 54 contributors (all time) y: 367 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/SortAggregateExec.scala x: 21 contributors (all time) y: 85 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala x: 27 contributors (all time) y: 258 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/HadoopFsRelation.scala x: 14 contributors (all time) y: 30 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Average.scala x: 23 contributors (all time) y: 122 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Sum.scala x: 21 contributors (all time) y: 149 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/nullExpressions.scala x: 29 contributors (all time) y: 375 lines of code sql/core/src/main/scala/org/apache/spark/sql/UDFRegistration.scala x: 32 contributors (all time) y: 935 lines of code core/src/main/scala/org/apache/spark/api/r/SerDe.scala x: 16 contributors (all time) y: 362 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/KubernetesUtils.scala x: 17 contributors (all time) y: 314 lines of code streaming/src/main/scala/org/apache/spark/streaming/StreamingContext.scala x: 68 contributors (all time) y: 468 lines of code streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaStreamingContext.scala x: 44 contributors (all time) y: 274 lines of code core/src/main/scala/org/apache/spark/deploy/worker/WorkerWatcher.scala x: 16 contributors (all time) y: 47 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala x: 59 contributors (all time) y: 581 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVDataSource.scala x: 13 contributors (all time) y: 191 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/orc/OrcFileFormat.scala x: 26 contributors (all time) y: 284 lines of code common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ExternalBlockHandler.java x: 11 contributors (all time) y: 481 lines of code common/kvstore/src/main/java/org/apache/spark/util/kvstore/RocksDBIterator.java x: 2 contributors (all time) y: 214 lines of code core/src/main/java/org/apache/spark/unsafe/map/BytesToBytesMap.java x: 26 contributors (all time) y: 586 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/BaseScriptTransformationExec.scala x: 9 contributors (all time) y: 320 lines of code core/src/main/resources/org/apache/spark/ui/static/webui.css x: 37 contributors (all time) y: 348 lines of code launcher/src/main/java/org/apache/spark/launcher/SparkLauncher.java x: 15 contributors (all time) y: 263 lines of code core/src/main/scala/org/apache/spark/status/storeTypes.scala x: 19 contributors (all time) y: 469 lines of code core/src/main/scala/org/apache/spark/executor/ExecutorSource.scala x: 25 contributors (all time) y: 110 lines of code core/src/main/scala/org/apache/spark/executor/ShuffleReadMetrics.scala x: 6 contributors (all time) y: 179 lines of code core/src/main/scala/org/apache/spark/scheduler/StageInfo.scala x: 30 contributors (all time) y: 78 lines of code core/src/main/scala/org/apache/spark/storage/PushBasedFetchHelper.scala x: 6 contributors (all time) y: 197 lines of code core/src/main/scala/org/apache/spark/ui/jobs/JobPage.scala x: 36 contributors (all time) y: 472 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/PushDownLeftSemiAntiJoin.scala x: 8 contributors (all time) y: 170 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Projection.scala x: 24 contributors (all time) y: 88 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/MonotonicallyIncreasingID.scala x: 20 contributors (all time) y: 52 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/randomExpressions.scala x: 20 contributors (all time) y: 107 lines of code core/src/main/scala/org/apache/spark/deploy/history/FsHistoryProvider.scala x: 78 contributors (all time) y: 1190 lines of code core/src/main/scala/org/apache/spark/status/KVUtils.scala x: 6 contributors (all time) y: 146 lines of code core/src/main/scala/org/apache/spark/scheduler/SparkListener.scala x: 56 contributors (all time) y: 317 lines of code core/src/main/scala/org/apache/spark/ui/PagedTable.scala x: 13 contributors (all time) y: 272 lines of code core/src/main/scala/org/apache/spark/ui/storage/RDDPage.scala x: 36 contributors (all time) y: 209 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskSet.scala x: 17 contributors (all time) y: 13 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ExistingRDD.scala x: 37 contributors (all time) y: 198 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/HDFSMetadataLog.scala x: 20 contributors (all time) y: 245 lines of code python/pyspark/pandas/plot/core.py x: 11 contributors (all time) y: 424 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/conditionalExpressions.scala x: 32 contributors (all time) y: 285 lines of code core/src/main/scala/org/apache/spark/status/api/v1/StagesResource.scala x: 9 contributors (all time) y: 227 lines of code core/src/main/scala/org/apache/spark/HeartbeatReceiver.scala x: 26 contributors (all time) y: 137 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/package.scala x: 11 contributors (all time) y: 91 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/numberFormatExpressions.scala x: 5 contributors (all time) y: 236 lines of code core/src/main/scala/org/apache/spark/storage/BlockManagerDecommissioner.scala x: 9 contributors (all time) y: 303 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcUtils.scala x: 22 contributors (all time) y: 396 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/ShufflePartitionsUtil.scala x: 7 contributors (all time) y: 261 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/InsertIntoHadoopFsRelationCommand.scala x: 30 contributors (all time) y: 197 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala x: 13 contributors (all time) y: 514 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskResult.scala x: 34 contributors (all time) y: 70 lines of code core/src/main/scala/org/apache/spark/scheduler/TaskResultGetter.scala x: 31 contributors (all time) y: 119 lines of code core/src/main/scala/org/apache/spark/util/io/ChunkedByteBuffer.scala x: 16 contributors (all time) y: 207 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/ParquetVectorUpdaterFactory.java x: 4 contributors (all time) y: 996 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/RewriteDistinctAggregates.scala x: 14 contributors (all time) y: 161 lines of code sql/core/src/main/scala/org/apache/spark/sql/internal/SharedState.scala x: 30 contributors (all time) y: 185 lines of code core/src/main/scala/org/apache/spark/shuffle/BlockStoreShuffleReader.scala x: 29 contributors (all time) y: 98 lines of code python/pyspark/cloudpickle/cloudpickle.py x: 3 contributors (all time) y: 469 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLEnv.scala x: 28 contributors (all time) y: 55 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/AggUtils.scala x: 19 contributors (all time) y: 425 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/stat/FrequentItems.scala x: 16 contributors (all time) y: 131 lines of code core/src/main/scala/org/apache/spark/util/SizeEstimator.scala x: 38 contributors (all time) y: 232 lines of code python/run-tests.py x: 28 contributors (all time) y: 276 lines of code core/src/main/scala/org/apache/spark/TaskEndReason.scala x: 38 contributors (all time) y: 138 lines of code core/src/main/scala/org/apache/spark/deploy/ApplicationDescription.scala x: 20 contributors (all time) y: 20 lines of code core/src/main/scala/org/apache/spark/deploy/master/ApplicationInfo.scala x: 34 contributors (all time) y: 150 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CentralMomentAgg.scala x: 16 contributors (all time) y: 284 lines of code sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala x: 74 contributors (all time) y: 265 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/BroadcastNestedLoopJoinExec.scala x: 15 contributors (all time) y: 443 lines of code core/src/main/scala/org/apache/spark/rdd/JdbcRDD.scala x: 28 contributors (all time) y: 120 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveCatalogs.scala x: 17 contributors (all time) y: 31 lines of code common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/OneForOneBlockFetcher.java x: 14 contributors (all time) y: 271 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/CatalystTypeConverters.scala x: 25 contributors (all time) y: 396 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Covariance.scala x: 12 contributors (all time) y: 116 lines of code core/src/main/scala/org/apache/spark/util/collection/BitSet.scala x: 23 contributors (all time) y: 151 lines of code sql/core/src/main/scala/org/apache/spark/sql/catalyst/util/V2ExpressionBuilder.scala x: 6 contributors (all time) y: 321 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/StandaloneSchedulerBackend.scala x: 35 contributors (all time) y: 236 lines of code python/pyspark/files.py x: 26 contributors (all time) y: 35 lines of code core/src/main/scala/org/apache/spark/TaskContext.scala x: 40 contributors (all time) y: 94 lines of code core/src/main/scala/org/apache/spark/TaskContextImpl.scala x: 23 contributors (all time) y: 157 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonGenerator.scala x: 16 contributors (all time) y: 219 lines of code python/pyspark/mllib/tree.py x: 22 contributors (all time) y: 321 lines of code core/src/main/scala/org/apache/spark/broadcast/BroadcastFactory.scala x: 21 contributors (all time) y: 13 lines of code core/src/main/scala/org/apache/spark/ui/WebUI.scala x: 32 contributors (all time) y: 153 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/intervalExpressions.scala x: 19 contributors (all time) y: 667 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/CostBasedJoinReorder.scala x: 14 contributors (all time) y: 255 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PruneFileSourcePartitions.scala x: 19 contributors (all time) y: 62 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVector.java x: 21 contributors (all time) y: 568 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala x: 42 contributors (all time) y: 200 lines of code core/src/main/scala/org/apache/spark/serializer/SerializationDebugger.scala x: 9 contributors (all time) y: 262 lines of code common/network-common/src/main/java/org/apache/spark/network/crypto/TransportCipher.java x: 7 contributors (all time) y: 284 lines of code core/src/main/scala/org/apache/spark/deploy/history/HistoryPage.scala x: 26 contributors (all time) y: 74 lines of code core/src/main/scala/org/apache/spark/deploy/DeployMessage.scala x: 37 contributors (all time) y: 141 lines of code core/src/main/scala/org/apache/spark/deploy/client/StandaloneAppClient.scala x: 10 contributors (all time) y: 226 lines of code core/src/main/scala/org/apache/spark/deploy/master/ui/ApplicationPage.scala x: 42 contributors (all time) y: 130 lines of code core/src/main/scala/org/apache/spark/deploy/worker/ExecutorRunner.scala x: 51 contributors (all time) y: 151 lines of code core/src/main/scala/org/apache/spark/scheduler/EventLoggingListener.scala x: 47 contributors (all time) y: 214 lines of code core/src/main/scala/org/apache/spark/rdd/BlockRDD.scala x: 29 contributors (all time) y: 51 lines of code core/src/main/scala/org/apache/spark/rdd/CoGroupedRDD.scala x: 40 contributors (all time) y: 122 lines of code core/src/main/scala/org/apache/spark/rdd/NewHadoopRDD.scala x: 54 contributors (all time) y: 264 lines of code core/src/main/scala/org/apache/spark/rdd/SubtractedRDD.scala x: 29 contributors (all time) y: 87 lines of code core/src/main/scala/org/apache/spark/ui/jobs/AllJobsPage.scala x: 43 contributors (all time) y: 514 lines of code core/src/main/scala/org/apache/spark/SecurityManager.scala x: 28 contributors (all time) y: 220 lines of code core/src/main/scala/org/apache/spark/Partitioner.scala x: 44 contributors (all time) y: 230 lines of code core/src/main/java/org/apache/spark/util/collection/unsafe/sort/UnsafeExternalSorter.java x: 28 contributors (all time) y: 578 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/ui/ThriftServerSessionPage.scala x: 19 contributors (all time) y: 84 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeArrayData.java x: 19 contributors (all time) y: 393 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/OffHeapColumnVector.java x: 24 contributors (all time) y: 440 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/OnHeapColumnVector.java x: 23 contributors (all time) y: 448 lines of code python/pyspark/ml/evaluation.py x: 26 contributors (all time) y: 561 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/InMemoryFileIndex.scala x: 23 contributors (all time) y: 113 lines of code python/pyspark/streaming/dstream.py x: 19 contributors (all time) y: 491 lines of code python/pyspark/streaming/kinesis.py x: 13 contributors (all time) y: 110 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/UnivocityGenerator.scala x: 9 contributors (all time) y: 83 lines of code sql/core/src/main/scala/org/apache/spark/sql/SQLImplicits.scala x: 18 contributors (all time) y: 72 lines of code core/src/main/java/org/apache/spark/shuffle/sort/UnsafeShuffleWriter.java x: 24 contributors (all time) y: 421 lines of code core/src/main/scala/org/apache/spark/rpc/netty/NettyRpcEnv.scala x: 22 contributors (all time) y: 533 lines of code streaming/src/main/scala/org/apache/spark/streaming/ui/StreamingPage.scala x: 26 contributors (all time) y: 417 lines of code core/src/main/scala/org/apache/spark/ContextCleaner.scala x: 24 contributors (all time) y: 189 lines of code connector/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceProvider.scala x: 1 contributors (all time) y: 552 lines of code connector/kafka-0-10-token-provider/src/main/scala/org/apache/spark/kafka010/KafkaTokenUtil.scala x: 1 contributors (all time) y: 226 lines of code connector/spark-ganglia-lgpl/src/main/java/com/codahale/metrics/ganglia/GangliaReporter.java x: 1 contributors (all time) y: 286 lines of code core/src/main/java/org/apache/spark/io/ReadAheadInputStream.java x: 6 contributors (all time) y: 292 lines of code core/src/main/scala/org/apache/spark/deploy/master/ui/MasterPage.scala x: 29 contributors (all time) y: 311 lines of code resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosCoarseGrainedSchedulerBackend.scala x: 28 contributors (all time) y: 568 lines of code python/pyspark/mllib/util.py x: 19 contributors (all time) y: 196 lines of code core/src/main/scala/org/apache/spark/deploy/FaultToleranceTest.scala x: 37 contributors (all time) y: 328 lines of code core/src/main/scala/org/apache/spark/internal/config/ConfigBuilder.scala x: 11 contributors (all time) y: 190 lines of code core/src/main/scala/org/apache/spark/rdd/PipedRDD.scala x: 33 contributors (all time) y: 173 lines of code streaming/src/main/scala/org/apache/spark/streaming/scheduler/JobGenerator.scala x: 31 contributors (all time) y: 196 lines of code streaming/src/main/scala/org/apache/spark/streaming/util/StateMap.scala x: 8 contributors (all time) y: 244 lines of code dev/create-release/releaseutils.py x: 14 contributors (all time) y: 182 lines of code sql/core/src/main/scala/org/apache/spark/sql/package.scala x: 16 contributors (all time) y: 13 lines of code core/src/main/scala/org/apache/spark/deploy/Client.scala x: 33 contributors (all time) y: 216 lines of code core/src/main/scala/org/apache/spark/deploy/LocalSparkCluster.scala x: 41 contributors (all time) y: 77 lines of code core/src/main/scala/org/apache/spark/rdd/CoalescedRDD.scala x: 35 contributors (all time) y: 220 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/ExternalCatalogUtils.scala x: 17 contributors (all time) y: 216 lines of code python/pyspark/mllib/__init__.py x: 14 contributors (all time) y: 17 lines of code core/src/main/scala/org/apache/spark/util/AccumulatorV2.scala x: 18 contributors (all time) y: 249 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSessionCatalog.scala x: 33 contributors (all time) y: 25 lines of code core/src/main/scala/org/apache/spark/util/ClosureCleaner.scala x: 33 contributors (all time) y: 500 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/console.scala x: 15 contributors (all time) y: 54 lines of code sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/HiveThriftServer2.scala x: 36 contributors (all time) y: 109 lines of code core/src/main/scala/org/apache/spark/api/java/JavaRDDLike.scala x: 52 contributors (all time) y: 285 lines of code sql/core/src/main/java/org/apache/spark/sql/execution/UnsafeKVExternalSorter.java x: 18 contributors (all time) y: 215 lines of code core/src/main/scala/org/apache/spark/rdd/EmptyRDD.scala x: 15 contributors (all time) y: 10 lines of code core/src/main/scala/org/apache/spark/rdd/PairRDDFunctions.scala x: 76 contributors (all time) y: 538 lines of code core/src/main/scala/org/apache/spark/serializer/SerializerManager.scala x: 29 contributors (all time) y: 129 lines of code core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala x: 40 contributors (all time) y: 200 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/TypedAggregateExpression.scala x: 14 contributors (all time) y: 219 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveContext.scala x: 54 contributors (all time) y: 20 lines of code launcher/src/main/java/org/apache/spark/launcher/LauncherServer.java x: 12 contributors (all time) y: 273 lines of code core/src/main/scala/org/apache/spark/rdd/OrderedRDDFunctions.scala x: 27 contributors (all time) y: 45 lines of code core/src/main/scala/org/apache/spark/metrics/sink/JmxSink.scala x: 13 contributors (all time) y: 15 lines of code core/src/main/scala/org/apache/spark/metrics/sink/MetricsServlet.scala x: 21 contributors (all time) y: 33 lines of code resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala x: 25 contributors (all time) y: 680 lines of code core/src/main/scala/org/apache/spark/internal/io/SparkHadoopWriter.scala x: 9 contributors (all time) y: 248 lines of code sql/core/src/main/scala/org/apache/spark/sql/SQLContext.scala x: 75 contributors (all time) y: 349 lines of code core/src/main/scala/org/apache/spark/api/java/JavaPairRDD.scala x: 49 contributors (all time) y: 419 lines of code core/src/main/scala/org/apache/spark/deploy/JsonProtocol.scala x: 29 contributors (all time) y: 110 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/DStream.scala x: 33 contributors (all time) y: 529 lines of code core/src/main/scala/org/apache/spark/storage/StorageUtils.scala x: 30 contributors (all time) y: 139 lines of code core/src/main/scala/org/apache/spark/deploy/master/ui/MasterWebUI.scala x: 40 contributors (all time) y: 81 lines of code core/src/main/scala/org/apache/spark/rdd/ParallelCollectionRDD.scala x: 30 contributors (all time) y: 102 lines of code streaming/src/main/scala/org/apache/spark/streaming/scheduler/ReceivedBlockTracker.scala x: 16 contributors (all time) y: 179 lines of code core/src/main/resources/org/apache/spark/ui/static/sorttable.js x: 13 contributors (all time) y: 352 lines of code core/src/main/java/org/apache/spark/util/collection/unsafe/sort/UnsafeSorterSpillReader.java x: 15 contributors (all time) y: 110 lines of code streaming/src/main/scala/org/apache/spark/streaming/scheduler/ReceiverTracker.scala x: 26 contributors (all time) y: 449 lines of code core/src/main/scala/org/apache/spark/api/java/JavaRDD.scala x: 34 contributors (all time) y: 89 lines of code core/src/main/scala/org/apache/spark/deploy/master/ZooKeeperPersistenceEngine.scala x: 32 contributors (all time) y: 48 lines of code core/src/main/scala/org/apache/spark/ui/jobs/PoolTable.scala x: 24 contributors (all time) y: 49 lines of code graphx/src/main/scala/org/apache/spark/graphx/impl/EdgePartition.scala x: 12 contributors (all time) y: 338 lines of code core/src/main/scala/org/apache/spark/deploy/ExecutorState.scala x: 18 contributors (all time) y: 7 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateUnsafeProjection.scala x: 22 contributors (all time) y: 298 lines of code graphx/src/main/scala/org/apache/spark/graphx/GraphOps.scala x: 25 contributors (all time) y: 188 lines of code core/src/main/scala/org/apache/spark/deploy/master/WorkerInfo.scala x: 27 contributors (all time) y: 123 lines of code core/src/main/scala/org/apache/spark/deploy/worker/ui/WorkerWebUI.scala x: 38 contributors (all time) y: 31 lines of code core/src/main/scala/org/apache/spark/rdd/MapPartitionsRDD.scala x: 23 contributors (all time) y: 28 lines of code core/src/main/scala/org/apache/spark/rdd/ShuffledRDD.scala x: 32 contributors (all time) y: 67 lines of code core/src/main/scala/org/apache/spark/scheduler/JobWaiter.scala x: 26 contributors (all time) y: 32 lines of code core/src/main/scala/org/apache/spark/ui/UIWorkloadGenerator.scala x: 33 contributors (all time) y: 81 lines of code core/src/main/scala/org/apache/spark/util/collection/OpenHashSet.scala x: 28 contributors (all time) y: 181 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/ConstantInputDStream.scala x: 19 contributors (all time) y: 14 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/InputDStream.scala x: 28 contributors (all time) y: 51 lines of code sql/core/src/main/scala/org/apache/spark/sql/sources/interfaces.scala x: 27 contributors (all time) y: 81 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/PairDStreamFunctions.scala x: 22 contributors (all time) y: 370 lines of code graphx/src/main/scala/org/apache/spark/graphx/impl/VertexRDDImpl.scala x: 10 contributors (all time) y: 190 lines of code sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/xml/UDFXPathUtil.java x: 4 contributors (all time) y: 163 lines of code licenses-binary/LICENSE-javassist.html x: 1 contributors (all time) y: 369 lines of code core/src/main/scala/org/apache/spark/api/java/JavaDoubleRDD.scala x: 32 contributors (all time) y: 82 lines of code core/src/main/scala/org/apache/spark/rdd/PartitionPruningRDD.scala x: 21 contributors (all time) y: 36 lines of code streaming/src/main/scala/org/apache/spark/streaming/dstream/TransformedDStream.scala x: 22 contributors (all time) y: 34 lines of code core/src/main/scala/org/apache/spark/Aggregator.scala x: 24 contributors (all time) y: 32 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSQLParser.scala x: 2 contributors (all time) y: 763 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUdf.scala x: 9 contributors (all time) y: 1053 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/hiveUdfs.scala x: 18 contributors (all time) y: 327 lines of code sql/core/src/main/scala/org/apache/spark/sql/UdfRegistration.scala x: 6 contributors (all time) y: 935 lines of code
4424.0
lines of code
  min: 1.0
  average: 151.84
  25th percentile: 24.0
  median: 67.0
  75th percentile: 161.0
  max: 4424.0
0 219.0
contributors (all time)
min: 1.0 | average: 10.91 | 25th percentile: 2.0 | median: 5.0 | 75th percentile: 13.0 | max: 219.0

File Size vs. Commits (30 days): 617 points

connector/connect/server/src/main/scala/org/apache/spark/sql/connect/config/Connect.scala x: 6 commits (30d) y: 157 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/CachedStreamResponse.scala x: 3 commits (30d) y: 8 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteGrpcResponseSender.scala x: 3 commits (30d) y: 191 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteResponseObserver.scala x: 5 commits (30d) y: 205 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteThreadRunner.scala x: 8 commits (30d) y: 149 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/ExecuteHolder.scala x: 9 commits (30d) y: 109 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectExecutePlanHandler.scala x: 4 commits (30d) y: 18 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectReattachExecuteHandler.scala x: 2 commits (30d) y: 33 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/functions.scala x: 7 commits (30d) y: 1323 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Dataset.scala x: 5 commits (30d) y: 947 lines of code project/MimaExcludes.scala x: 11 commits (30d) y: 163 lines of code sql/api/src/main/java/org/apache/spark/api/java/function/FlatMapGroupsWithStateFunction.java x: 1 commits (30d) y: 11 lines of code sql/api/src/main/scala/org/apache/spark/sql/streaming/GroupState.scala x: 1 commits (30d) y: 44 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/CodeGeneratorWithInterpretedFallback.scala x: 1 commits (30d) y: 28 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala x: 13 commits (30d) y: 4424 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/WholeStageCodegenExec.scala x: 2 commits (30d) y: 528 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala x: 11 commits (30d) y: 2820 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveLateralColumnAliasReference.scala x: 1 commits (30d) y: 138 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowConverters.scala x: 2 commits (30d) y: 341 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala x: 6 commits (30d) y: 3431 lines of code python/pyspark/pandas/base.py x: 2 commits (30d) y: 607 lines of code python/pyspark/errors/error_classes.py x: 16 commits (30d) y: 3 lines of code python/pyspark/worker.py x: 9 commits (30d) y: 777 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/SparkConnectPlanner.scala x: 17 commits (30d) y: 2873 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala x: 2 commits (30d) y: 378 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExec.scala x: 2 commits (30d) y: 187 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 12 commits (30d) y: 363 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala x: 3 commits (30d) y: 643 lines of code python/pyspark/ml/connect/io_utils.py x: 2 commits (30d) y: 195 lines of code python/pyspark/ml/connect/tuning.py x: 3 commits (30d) y: 328 lines of code python/pyspark/ml/torch/distributor.py x: 4 commits (30d) y: 624 lines of code python/pyspark/ml/util.py x: 1 commits (30d) y: 388 lines of code python/pyspark/pandas/utils.py x: 1 commits (30d) y: 629 lines of code python/pyspark/sql/connect/session.py x: 10 commits (30d) y: 620 lines of code python/pyspark/sql/connect/udf.py x: 3 commits (30d) y: 212 lines of code python/pyspark/sql/connect/udtf.py x: 5 commits (30d) y: 147 lines of code python/pyspark/sql/session.py x: 3 commits (30d) y: 763 lines of code python/pyspark/sql/utils.py x: 4 commits (30d) y: 176 lines of code core/src/main/scala/org/apache/spark/ui/storage/StoragePage.scala x: 1 commits (30d) y: 194 lines of code sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala x: 6 commits (30d) y: 1462 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/JoinCodegenSupport.scala x: 1 commits (30d) y: 62 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcExceptionConverter.scala x: 2 commits (30d) y: 69 lines of code dev/sparktestsupport/modules.py x: 12 commits (30d) y: 1015 lines of code python/pyspark/pandas/__init__.py x: 1 commits (30d) y: 112 lines of code python/pyspark/pandas/indexes/base.py x: 1 commits (30d) y: 1008 lines of code python/pyspark/pandas/indexes/category.py x: 1 commits (30d) y: 175 lines of code python/pyspark/pandas/indexes/datetimes.py x: 1 commits (30d) y: 266 lines of code python/pyspark/pandas/series.py x: 2 commits (30d) y: 2180 lines of code python/pyspark/pandas/spark/accessors.py x: 1 commits (30d) y: 242 lines of code python/pyspark/pandas/usage_logging/__init__.py x: 1 commits (30d) y: 100 lines of code python/pyspark/sql/pandas/serializers.py x: 6 commits (30d) y: 546 lines of code python/pyspark/sql/pandas/types.py x: 1 commits (30d) y: 599 lines of code core/src/main/scala/org/apache/spark/serializer/KryoSerializer.scala x: 1 commits (30d) y: 569 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonParser.scala x: 3 commits (30d) y: 423 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryExecutionErrors.scala x: 12 commits (30d) y: 2370 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/DataTypeErrors.scala x: 6 commits (30d) y: 238 lines of code python/pyspark/testing/pandasutils.py x: 3 commits (30d) y: 440 lines of code project/SparkBuild.scala x: 7 commits (30d) y: 1369 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala x: 3 commits (30d) y: 546 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/JavaTypeInference.scala x: 3 commits (30d) y: 102 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala x: 2 commits (30d) y: 755 lines of code core/src/main/scala/org/apache/spark/SparkConf.scala x: 1 commits (30d) y: 463 lines of code core/src/main/scala/org/apache/spark/api/python/StreamingPythonRunner.scala x: 6 commits (30d) y: 66 lines of code dev/sparktestsupport/utils.py x: 3 commits (30d) y: 64 lines of code python/pyspark/sql/udtf.py x: 8 commits (30d) y: 279 lines of code python/pyspark/cloudpickle/cloudpickle_fast.py x: 1 commits (30d) y: 452 lines of code python/pyspark/sql/connect/plan.py x: 3 commits (30d) y: 1734 lines of code python/pyspark/sql/connect/client/core.py x: 8 commits (30d) y: 1150 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/ExecutePlanResponseReattachableIterator.scala x: 6 commits (30d) y: 181 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcRetryHandler.scala x: 4 commits (30d) y: 114 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkConnectClient.scala x: 6 commits (30d) y: 435 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkResult.scala x: 8 commits (30d) y: 225 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowDeserializer.scala x: 6 commits (30d) y: 447 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowEncoderUtils.scala x: 3 commits (30d) y: 27 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowSerializer.scala x: 5 commits (30d) y: 447 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileSourceStrategy.scala x: 2 commits (30d) y: 227 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningAwareFileIndex.scala x: 1 commits (30d) y: 157 lines of code python/pyspark/sql/worker/analyze_udtf.py x: 3 commits (30d) y: 108 lines of code python/pyspark/worker_util.py x: 2 commits (30d) y: 107 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/UserDefinedPythonFunction.scala x: 4 commits (30d) y: 192 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/SortMergeJoinExec.scala x: 2 commits (30d) y: 1142 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala x: 5 commits (30d) y: 1590 lines of code python/pyspark/pandas/groupby.py x: 1 commits (30d) y: 1638 lines of code python/pyspark/pandas/namespace.py x: 1 commits (30d) y: 1460 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryListener.scala x: 3 commits (30d) y: 73 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/StreamingQueryListenerHelper.scala x: 2 commits (30d) y: 41 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SessionHolder.scala x: 8 commits (30d) y: 139 lines of code connector/connect/common/src/main/protobuf/spark/connect/base.proto x: 4 commits (30d) y: 662 lines of code python/pyspark/sql/connect/proto/base_pb2.pyi x: 4 commits (30d) y: 2137 lines of code python/pyspark/testing/utils.py x: 14 commits (30d) y: 367 lines of code core/src/main/scala/org/apache/spark/MapOutputTracker.scala x: 1 commits (30d) y: 1104 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/FunctionBuilderBase.scala x: 2 commits (30d) y: 79 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryCompilationErrors.scala x: 12 commits (30d) y: 3219 lines of code python/pyspark/sql/types.py x: 1 commits (30d) y: 1478 lines of code python/pyspark/testing/connectutils.py x: 3 commits (30d) y: 135 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectService.scala x: 9 commits (30d) y: 283 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectStreamingQueryCache.scala x: 4 commits (30d) y: 133 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseLexer.g4 x: 2 commits (30d) y: 514 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseParser.g4 x: 3 commits (30d) y: 1695 lines of code core/src/main/scala/org/apache/spark/executor/Executor.scala x: 3 commits (30d) y: 875 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala x: 3 commits (30d) y: 901 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/JdbcDialects.scala x: 1 commits (30d) y: 354 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/PostgresDialect.scala x: 1 commits (30d) y: 213 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/dsl/package.scala x: 1 commits (30d) y: 979 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/ui/SparkConnectServerListener.scala x: 3 commits (30d) y: 344 lines of code core/src/main/scala/org/apache/spark/shuffle/IndexShuffleBlockResolver.scala x: 2 commits (30d) y: 429 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStore.scala x: 4 commits (30d) y: 458 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/expressions/UserDefinedFunction.scala x: 6 commits (30d) y: 110 lines of code core/src/main/scala/org/apache/spark/util/Utils.scala x: 8 commits (30d) y: 2147 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala x: 1 commits (30d) y: 1053 lines of code python/setup.py x: 7 commits (30d) y: 276 lines of code core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala x: 2 commits (30d) y: 462 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SQLExecution.scala x: 4 commits (30d) y: 164 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryManager.scala x: 4 commits (30d) y: 89 lines of code connector/connect/common/src/main/protobuf/spark/connect/commands.proto x: 5 commits (30d) y: 341 lines of code python/pyspark/sql/connect/proto/commands_pb2.pyi x: 5 commits (30d) y: 1509 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/UDFRegistration.scala x: 2 commits (30d) y: 1078 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/UdfUtils.scala x: 1 commits (30d) y: 493 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala x: 2 commits (30d) y: 810 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasExec.scala x: 3 commits (30d) y: 37 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala x: 1 commits (30d) y: 2504 lines of code sql/core/src/main/scala/org/apache/spark/sql/functions.scala x: 6 commits (30d) y: 2003 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ColumnarEvaluatorFactory.scala x: 1 commits (30d) y: 79 lines of code core/src/main/scala/org/apache/spark/internal/config/package.scala x: 4 commits (30d) y: 2224 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/ArtifactManager.scala x: 2 commits (30d) y: 252 lines of code core/src/main/scala/org/apache/spark/SparkContext.scala x: 4 commits (30d) y: 1860 lines of code common/utils/src/main/java/org/apache/spark/network/util/JavaUtils.java x: 1 commits (30d) y: 253 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowVectorReader.scala x: 3 commits (30d) y: 208 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/LiteralValueProtoConverter.scala x: 2 commits (30d) y: 313 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala x: 2 commits (30d) y: 1828 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/object.scala x: 5 commits (30d) y: 596 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/IntervalUtils.scala x: 2 commits (30d) y: 711 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/package.scala x: 2 commits (30d) y: 146 lines of code mllib/src/main/scala/org/apache/spark/mllib/evaluation/RankingMetrics.scala x: 1 commits (30d) y: 155 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroSerializer.scala x: 1 commits (30d) y: 304 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/parsers.scala x: 3 commits (30d) y: 275 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/SparkDateTimeUtils.scala x: 1 commits (30d) y: 347 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/TimestampFormatter.scala x: 1 commits (30d) y: 419 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/QueryParsingErrors.scala x: 2 commits (30d) y: 565 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/DataType.scala x: 2 commits (30d) y: 284 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/Decimal.scala x: 2 commits (30d) y: 473 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/DecimalType.scala x: 2 commits (30d) y: 122 lines of code sql/api/src/main/scala/org/apache/spark/sql/util/ArrowUtils.scala x: 2 commits (30d) y: 171 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/CSVOptions.scala x: 2 commits (30d) y: 267 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala x: 1 commits (30d) y: 379 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowWriter.scala x: 2 commits (30d) y: 358 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/Columnar.scala x: 3 commits (30d) y: 373 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFilters.scala x: 1 commits (30d) y: 666 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala x: 1 commits (30d) y: 323 lines of code python/pyspark/testing/__init__.py x: 2 commits (30d) y: 3 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/MapInBatchExec.scala x: 5 commits (30d) y: 51 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala x: 1 commits (30d) y: 696 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala x: 2 commits (30d) y: 654 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRunner.scala x: 5 commits (30d) y: 556 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala x: 2 commits (30d) y: 408 lines of code python/pyspark/sql/connect/dataframe.py x: 1 commits (30d) y: 1749 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/bitmapExpressions.scala x: 4 commits (30d) y: 244 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/encoders/ExpressionEncoder.scala x: 3 commits (30d) y: 238 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkStrategies.scala x: 3 commits (30d) y: 685 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/MicroBatchExecution.scala x: 1 commits (30d) y: 554 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala x: 3 commits (30d) y: 1431 lines of code python/pyspark/ml/deepspeed/deepspeed_distributor.py x: 4 commits (30d) y: 87 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicLogicalOperators.scala x: 4 commits (30d) y: 1440 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala x: 1 commits (30d) y: 953 lines of code core/src/main/scala/org/apache/spark/ui/UIUtils.scala x: 3 commits (30d) y: 584 lines of code python/pyspark/sql/connect/expressions.py x: 1 commits (30d) y: 835 lines of code python/pyspark/sql/connect/functions.py x: 7 commits (30d) y: 2070 lines of code python/pyspark/sql/connect/proto/expressions_pb2.pyi x: 1 commits (30d) y: 1268 lines of code python/pyspark/sql/streaming/readwriter.py x: 2 commits (30d) y: 540 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Strategy.scala x: 2 commits (30d) y: 502 lines of code core/src/main/resources/org/apache/spark/ui/static/stagepage.js x: 1 commits (30d) y: 1040 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/window/WindowEvaluatorFactory.scala x: 2 commits (30d) y: 97 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala x: 6 commits (30d) y: 1081 lines of code python/pyspark/sql/dataframe.py x: 1 commits (30d) y: 1405 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBFileManager.scala x: 1 commits (30d) y: 483 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2SessionCatalog.scala x: 2 commits (30d) y: 323 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/HashAggregateExec.scala x: 2 commits (30d) y: 691 lines of code core/src/main/scala/org/apache/spark/SparkEnv.scala x: 1 commits (30d) y: 402 lines of code core/src/main/scala/org/apache/spark/api/python/PythonWorkerFactory.scala x: 4 commits (30d) y: 292 lines of code core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala x: 1 commits (30d) y: 2008 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/FunctionRegistry.scala x: 1 commits (30d) y: 856 lines of code connector/connect/common/src/main/protobuf/spark/connect/relations.proto x: 2 commits (30d) y: 796 lines of code python/pyspark/sql/connect/proto/relations_pb2.pyi x: 2 commits (30d) y: 2915 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/jsonExpressions.scala x: 1 commits (30d) y: 716 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala x: 2 commits (30d) y: 1204 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TableOutputResolver.scala x: 2 commits (30d) y: 442 lines of code python/pyspark/rdd.py x: 1 commits (30d) y: 1514 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala x: 1 commits (30d) y: 707 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 1 commits (30d) y: 287 lines of code core/src/main/scala/org/apache/spark/storage/BlockManager.scala x: 1 commits (30d) y: 1519 lines of code core/src/main/scala/org/apache/spark/storage/ShuffleBlockFetcherIterator.scala x: 1 commits (30d) y: 1081 lines of code python/pyspark/context.py x: 1 commits (30d) y: 747 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala x: 1 commits (30d) y: 4395 lines of code
4424.0
lines of code
  min: 1.0
  average: 312.87
  25th percentile: 40.0
  median: 131.0
  75th percentile: 353.5
  max: 4424.0
0 17.0
commits (30d)
min: 1.0 | average: 1.93 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 17.0

File Size vs. Contributors (30 days): 617 points

connector/connect/server/src/main/scala/org/apache/spark/sql/connect/config/Connect.scala x: 5 contributors (30d) y: 157 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/CachedStreamResponse.scala x: 1 contributors (30d) y: 8 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteGrpcResponseSender.scala x: 1 contributors (30d) y: 191 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteResponseObserver.scala x: 1 contributors (30d) y: 205 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteThreadRunner.scala x: 4 contributors (30d) y: 149 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/ExecuteHolder.scala x: 5 contributors (30d) y: 109 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectExecutePlanHandler.scala x: 2 contributors (30d) y: 18 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectReattachExecuteHandler.scala x: 1 contributors (30d) y: 33 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/functions.scala x: 4 contributors (30d) y: 1323 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Dataset.scala x: 5 contributors (30d) y: 947 lines of code project/MimaExcludes.scala x: 4 contributors (30d) y: 163 lines of code sql/api/src/main/scala/org/apache/spark/sql/streaming/GroupState.scala x: 1 contributors (30d) y: 44 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala x: 10 contributors (30d) y: 4424 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/WholeStageCodegenExec.scala x: 2 contributors (30d) y: 528 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala x: 9 contributors (30d) y: 2820 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveLateralColumnAliasReference.scala x: 1 contributors (30d) y: 138 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowConverters.scala x: 2 contributors (30d) y: 341 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala x: 4 contributors (30d) y: 3431 lines of code python/pyspark/pandas/base.py x: 1 contributors (30d) y: 607 lines of code python/pyspark/errors/error_classes.py x: 5 contributors (30d) y: 3 lines of code python/pyspark/worker.py x: 3 contributors (30d) y: 777 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/SparkConnectPlanner.scala x: 13 contributors (30d) y: 2873 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala x: 2 contributors (30d) y: 378 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExec.scala x: 2 contributors (30d) y: 187 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 6 contributors (30d) y: 363 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala x: 3 contributors (30d) y: 643 lines of code python/pyspark/ml/connect/io_utils.py x: 2 contributors (30d) y: 195 lines of code python/pyspark/ml/connect/tuning.py x: 2 contributors (30d) y: 328 lines of code python/pyspark/ml/torch/distributor.py x: 2 contributors (30d) y: 624 lines of code python/pyspark/ml/util.py x: 1 contributors (30d) y: 388 lines of code python/pyspark/pandas/utils.py x: 1 contributors (30d) y: 629 lines of code python/pyspark/sql/connect/session.py x: 6 contributors (30d) y: 620 lines of code python/pyspark/sql/connect/udf.py x: 2 contributors (30d) y: 212 lines of code python/pyspark/sql/connect/udtf.py x: 3 contributors (30d) y: 147 lines of code python/pyspark/sql/session.py x: 1 contributors (30d) y: 763 lines of code sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala x: 4 contributors (30d) y: 1462 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/JoinCodegenSupport.scala x: 1 contributors (30d) y: 62 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcExceptionConverter.scala x: 2 contributors (30d) y: 69 lines of code dev/sparktestsupport/modules.py x: 8 contributors (30d) y: 1015 lines of code python/pyspark/pandas/__init__.py x: 1 contributors (30d) y: 112 lines of code python/pyspark/pandas/indexes/base.py x: 1 contributors (30d) y: 1008 lines of code python/pyspark/pandas/indexes/datetimes.py x: 1 contributors (30d) y: 266 lines of code python/pyspark/pandas/series.py x: 1 contributors (30d) y: 2180 lines of code python/pyspark/pandas/spark/accessors.py x: 1 contributors (30d) y: 242 lines of code python/pyspark/pandas/usage_logging/__init__.py x: 1 contributors (30d) y: 100 lines of code python/pyspark/sql/pandas/serializers.py x: 4 contributors (30d) y: 546 lines of code core/src/main/scala/org/apache/spark/serializer/KryoSerializer.scala x: 1 contributors (30d) y: 569 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonParser.scala x: 3 contributors (30d) y: 423 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryExecutionErrors.scala x: 8 contributors (30d) y: 2370 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/DataTypeErrors.scala x: 3 contributors (30d) y: 238 lines of code python/pyspark/testing/pandasutils.py x: 1 contributors (30d) y: 440 lines of code project/SparkBuild.scala x: 4 contributors (30d) y: 1369 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala x: 3 contributors (30d) y: 546 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/JavaTypeInference.scala x: 2 contributors (30d) y: 102 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala x: 2 contributors (30d) y: 755 lines of code core/src/main/scala/org/apache/spark/SparkConf.scala x: 1 contributors (30d) y: 463 lines of code core/src/main/scala/org/apache/spark/api/python/StreamingPythonRunner.scala x: 3 contributors (30d) y: 66 lines of code python/pyspark/sql/udtf.py x: 2 contributors (30d) y: 279 lines of code python/pyspark/sql/connect/plan.py x: 2 contributors (30d) y: 1734 lines of code python/pyspark/sql/connect/client/core.py x: 5 contributors (30d) y: 1150 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkResult.scala x: 3 contributors (30d) y: 225 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowDeserializer.scala x: 2 contributors (30d) y: 447 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileSourceStrategy.scala x: 2 contributors (30d) y: 227 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningAwareFileIndex.scala x: 1 contributors (30d) y: 157 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/SortMergeJoinExec.scala x: 2 contributors (30d) y: 1142 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala x: 5 contributors (30d) y: 1590 lines of code python/pyspark/pandas/groupby.py x: 1 contributors (30d) y: 1638 lines of code python/pyspark/pandas/namespace.py x: 1 contributors (30d) y: 1460 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryListener.scala x: 3 contributors (30d) y: 73 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SessionHolder.scala x: 4 contributors (30d) y: 139 lines of code connector/connect/common/src/main/protobuf/spark/connect/base.proto x: 1 contributors (30d) y: 662 lines of code python/pyspark/sql/connect/proto/base_pb2.pyi x: 1 contributors (30d) y: 2137 lines of code python/pyspark/testing/utils.py x: 1 contributors (30d) y: 367 lines of code core/src/main/scala/org/apache/spark/MapOutputTracker.scala x: 1 contributors (30d) y: 1104 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/FunctionBuilderBase.scala x: 1 contributors (30d) y: 79 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryCompilationErrors.scala x: 8 contributors (30d) y: 3219 lines of code python/pyspark/sql/types.py x: 1 contributors (30d) y: 1478 lines of code python/pyspark/testing/connectutils.py x: 2 contributors (30d) y: 135 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectService.scala x: 7 contributors (30d) y: 283 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectStreamingQueryCache.scala x: 3 contributors (30d) y: 133 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseLexer.g4 x: 2 contributors (30d) y: 514 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseParser.g4 x: 2 contributors (30d) y: 1695 lines of code core/src/main/scala/org/apache/spark/executor/Executor.scala x: 2 contributors (30d) y: 875 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala x: 3 contributors (30d) y: 901 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/JdbcDialects.scala x: 1 contributors (30d) y: 354 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/PostgresDialect.scala x: 1 contributors (30d) y: 213 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/dsl/package.scala x: 1 contributors (30d) y: 979 lines of code core/src/main/scala/org/apache/spark/shuffle/IndexShuffleBlockResolver.scala x: 2 contributors (30d) y: 429 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStore.scala x: 3 contributors (30d) y: 458 lines of code common/utils/src/main/scala/org/apache/spark/util/SparkSerDeUtils.scala x: 3 contributors (30d) y: 26 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/expressions/UserDefinedFunction.scala x: 4 contributors (30d) y: 110 lines of code core/src/main/scala/org/apache/spark/util/Utils.scala x: 6 contributors (30d) y: 2147 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala x: 1 contributors (30d) y: 1053 lines of code python/setup.py x: 5 contributors (30d) y: 276 lines of code core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala x: 2 contributors (30d) y: 462 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SQLExecution.scala x: 3 contributors (30d) y: 164 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryManager.scala x: 3 contributors (30d) y: 89 lines of code connector/connect/common/src/main/protobuf/spark/connect/commands.proto x: 4 contributors (30d) y: 341 lines of code python/pyspark/sql/connect/proto/commands_pb2.pyi x: 4 contributors (30d) y: 1509 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/UDFRegistration.scala x: 1 contributors (30d) y: 1078 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/UdfUtils.scala x: 1 contributors (30d) y: 493 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/encoders/RowEncoder.scala x: 2 contributors (30d) y: 71 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala x: 2 contributors (30d) y: 810 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasExec.scala x: 2 contributors (30d) y: 37 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala x: 1 contributors (30d) y: 2504 lines of code sql/core/src/main/scala/org/apache/spark/sql/functions.scala x: 4 contributors (30d) y: 2003 lines of code core/src/main/scala/org/apache/spark/internal/config/package.scala x: 3 contributors (30d) y: 2224 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/ArtifactManager.scala x: 2 contributors (30d) y: 252 lines of code core/src/main/scala/org/apache/spark/SparkContext.scala x: 4 contributors (30d) y: 1860 lines of code common/utils/src/main/java/org/apache/spark/network/util/JavaUtils.java x: 1 contributors (30d) y: 253 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/LiteralValueProtoConverter.scala x: 1 contributors (30d) y: 313 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala x: 2 contributors (30d) y: 1828 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/object.scala x: 2 contributors (30d) y: 596 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/IntervalUtils.scala x: 2 contributors (30d) y: 711 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/package.scala x: 2 contributors (30d) y: 146 lines of code mllib/src/main/scala/org/apache/spark/mllib/evaluation/RankingMetrics.scala x: 1 contributors (30d) y: 155 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/parsers.scala x: 2 contributors (30d) y: 275 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/SparkDateTimeUtils.scala x: 1 contributors (30d) y: 347 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/TimestampFormatter.scala x: 1 contributors (30d) y: 419 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/QueryParsingErrors.scala x: 2 contributors (30d) y: 565 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/Decimal.scala x: 2 contributors (30d) y: 473 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/UnivocityParser.scala x: 2 contributors (30d) y: 299 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala x: 1 contributors (30d) y: 323 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/MapInBatchExec.scala x: 3 contributors (30d) y: 51 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala x: 1 contributors (30d) y: 696 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala x: 2 contributors (30d) y: 654 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala x: 2 contributors (30d) y: 408 lines of code python/pyspark/sql/connect/dataframe.py x: 1 contributors (30d) y: 1749 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/bitmapExpressions.scala x: 4 contributors (30d) y: 244 lines of code sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 2 contributors (30d) y: 557 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkStrategies.scala x: 3 contributors (30d) y: 685 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/MicroBatchExecution.scala x: 1 contributors (30d) y: 554 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala x: 3 contributors (30d) y: 1431 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 3 contributors (30d) y: 111 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicLogicalOperators.scala x: 4 contributors (30d) y: 1440 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala x: 1 contributors (30d) y: 953 lines of code core/src/main/scala/org/apache/spark/ui/UIUtils.scala x: 1 contributors (30d) y: 584 lines of code python/pyspark/sql/connect/expressions.py x: 1 contributors (30d) y: 835 lines of code python/pyspark/sql/connect/functions.py x: 4 contributors (30d) y: 2070 lines of code python/pyspark/sql/connect/proto/expressions_pb2.pyi x: 1 contributors (30d) y: 1268 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Strategy.scala x: 2 contributors (30d) y: 502 lines of code core/src/main/resources/org/apache/spark/ui/static/stagepage.js x: 1 contributors (30d) y: 1040 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala x: 5 contributors (30d) y: 1081 lines of code python/pyspark/sql/dataframe.py x: 1 contributors (30d) y: 1405 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBFileManager.scala x: 1 contributors (30d) y: 483 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/ResolveDefaultColumnsUtil.scala x: 1 contributors (30d) y: 287 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/HashAggregateExec.scala x: 2 contributors (30d) y: 691 lines of code core/src/main/scala/org/apache/spark/SparkEnv.scala x: 1 contributors (30d) y: 402 lines of code core/src/main/scala/org/apache/spark/api/python/PythonWorkerFactory.scala x: 3 contributors (30d) y: 292 lines of code core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala x: 1 contributors (30d) y: 2008 lines of code python/pyspark/sql/connect/streaming/readwriter.py x: 1 contributors (30d) y: 511 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/ApplyInPandasWithStatePythonRunner.scala x: 2 contributors (30d) y: 163 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/FunctionRegistry.scala x: 1 contributors (30d) y: 856 lines of code connector/connect/common/src/main/protobuf/spark/connect/relations.proto x: 2 contributors (30d) y: 796 lines of code python/pyspark/sql/connect/proto/relations_pb2.pyi x: 2 contributors (30d) y: 2915 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/jsonExpressions.scala x: 1 contributors (30d) y: 716 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala x: 2 contributors (30d) y: 1204 lines of code python/pyspark/rdd.py x: 1 contributors (30d) y: 1514 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala x: 1 contributors (30d) y: 707 lines of code core/src/main/scala/org/apache/spark/storage/BlockManager.scala x: 1 contributors (30d) y: 1519 lines of code core/src/main/scala/org/apache/spark/storage/ShuffleBlockFetcherIterator.scala x: 1 contributors (30d) y: 1081 lines of code python/pyspark/context.py x: 1 contributors (30d) y: 747 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala x: 1 contributors (30d) y: 4395 lines of code
4424.0
lines of code
  min: 1.0
  average: 312.87
  25th percentile: 40.0
  median: 131.0
  75th percentile: 353.5
  max: 4424.0
0 13.0
contributors (30d)
min: 1.0 | average: 1.51 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 13.0

File Size vs. Commits (90 days): 931 points

connector/connect/server/src/main/scala/org/apache/spark/sql/connect/config/Connect.scala x: 7 commits (90d) y: 157 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/CachedStreamResponse.scala x: 3 commits (90d) y: 8 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteGrpcResponseSender.scala x: 3 commits (90d) y: 191 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteResponseObserver.scala x: 5 commits (90d) y: 205 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteThreadRunner.scala x: 8 commits (90d) y: 149 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/ExecuteHolder.scala x: 9 commits (90d) y: 109 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectExecutePlanHandler.scala x: 4 commits (90d) y: 18 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectReattachExecuteHandler.scala x: 2 commits (90d) y: 33 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/functions.scala x: 33 commits (90d) y: 1323 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Dataset.scala x: 10 commits (90d) y: 947 lines of code project/MimaExcludes.scala x: 20 commits (90d) y: 163 lines of code sql/api/src/main/java/org/apache/spark/api/java/function/FlatMapGroupsWithStateFunction.java x: 1 commits (90d) y: 11 lines of code sql/api/src/main/scala/org/apache/spark/sql/streaming/GroupState.scala x: 1 commits (90d) y: 44 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/CodeGeneratorWithInterpretedFallback.scala x: 1 commits (90d) y: 28 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala x: 25 commits (90d) y: 4424 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/WholeStageCodegenExec.scala x: 2 commits (90d) y: 528 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala x: 27 commits (90d) y: 2820 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveLateralColumnAliasReference.scala x: 1 commits (90d) y: 138 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowConverters.scala x: 3 commits (90d) y: 341 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala x: 21 commits (90d) y: 3431 lines of code python/pyspark/pandas/base.py x: 4 commits (90d) y: 607 lines of code python/pyspark/errors/error_classes.py x: 19 commits (90d) y: 3 lines of code python/pyspark/worker.py x: 18 commits (90d) y: 777 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/SparkConnectPlanner.scala x: 54 commits (90d) y: 2873 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala x: 2 commits (90d) y: 378 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExec.scala x: 2 commits (90d) y: 187 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 16 commits (90d) y: 363 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala x: 5 commits (90d) y: 643 lines of code python/pyspark/ml/connect/io_utils.py x: 4 commits (90d) y: 195 lines of code python/pyspark/ml/connect/tuning.py x: 3 commits (90d) y: 328 lines of code python/pyspark/ml/torch/distributor.py x: 13 commits (90d) y: 624 lines of code python/pyspark/ml/util.py x: 3 commits (90d) y: 388 lines of code python/pyspark/pandas/utils.py x: 2 commits (90d) y: 629 lines of code python/pyspark/sql/connect/session.py x: 25 commits (90d) y: 620 lines of code python/pyspark/sql/connect/udf.py x: 5 commits (90d) y: 212 lines of code python/pyspark/sql/connect/udtf.py x: 5 commits (90d) y: 147 lines of code python/pyspark/sql/session.py x: 8 commits (90d) y: 763 lines of code python/pyspark/sql/utils.py x: 9 commits (90d) y: 176 lines of code core/src/main/scala/org/apache/spark/ui/storage/StoragePage.scala x: 1 commits (90d) y: 194 lines of code sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala x: 10 commits (90d) y: 1462 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/JoinCodegenSupport.scala x: 1 commits (90d) y: 62 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcExceptionConverter.scala x: 3 commits (90d) y: 69 lines of code dev/sparktestsupport/modules.py x: 27 commits (90d) y: 1015 lines of code python/pyspark/pandas/__init__.py x: 1 commits (90d) y: 112 lines of code python/pyspark/pandas/indexes/base.py x: 4 commits (90d) y: 1008 lines of code python/pyspark/pandas/indexes/datetimes.py x: 3 commits (90d) y: 266 lines of code python/pyspark/pandas/series.py x: 7 commits (90d) y: 2180 lines of code python/pyspark/pandas/spark/accessors.py x: 3 commits (90d) y: 242 lines of code python/pyspark/pandas/usage_logging/__init__.py x: 1 commits (90d) y: 100 lines of code python/pyspark/sql/pandas/serializers.py x: 16 commits (90d) y: 546 lines of code python/pyspark/sql/pandas/types.py x: 6 commits (90d) y: 599 lines of code core/src/main/scala/org/apache/spark/serializer/KryoSerializer.scala x: 3 commits (90d) y: 569 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Encoders.scala x: 2 commits (90d) y: 54 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonParser.scala x: 5 commits (90d) y: 423 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/BadRecordException.scala x: 3 commits (90d) y: 21 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryExecutionErrors.scala x: 30 commits (90d) y: 2370 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/DataTypeErrors.scala x: 8 commits (90d) y: 238 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryErrorsBase.scala x: 5 commits (90d) y: 29 lines of code python/pyspark/testing/pandasutils.py x: 3 commits (90d) y: 440 lines of code project/SparkBuild.scala x: 22 commits (90d) y: 1369 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala x: 6 commits (90d) y: 546 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/JavaTypeInference.scala x: 3 commits (90d) y: 102 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala x: 2 commits (90d) y: 755 lines of code core/src/main/scala/org/apache/spark/SparkConf.scala x: 1 commits (90d) y: 463 lines of code core/src/main/scala/org/apache/spark/api/python/StreamingPythonRunner.scala x: 6 commits (90d) y: 66 lines of code dev/sparktestsupport/utils.py x: 4 commits (90d) y: 64 lines of code python/pyspark/sql/udtf.py x: 11 commits (90d) y: 279 lines of code python/pyspark/cloudpickle/cloudpickle_fast.py x: 2 commits (90d) y: 452 lines of code python/pyspark/sql/connect/plan.py x: 12 commits (90d) y: 1734 lines of code python/pyspark/sql/connect/client/core.py x: 19 commits (90d) y: 1150 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/ExecutePlanResponseReattachableIterator.scala x: 6 commits (90d) y: 181 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcRetryHandler.scala x: 5 commits (90d) y: 114 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkConnectClient.scala x: 9 commits (90d) y: 435 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkResult.scala x: 10 commits (90d) y: 225 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowDeserializer.scala x: 6 commits (90d) y: 447 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowSerializer.scala x: 5 commits (90d) y: 447 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala x: 4 commits (90d) y: 541 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningAwareFileIndex.scala x: 1 commits (90d) y: 157 lines of code python/pyspark/sql/worker/analyze_udtf.py x: 3 commits (90d) y: 108 lines of code python/pyspark/worker_util.py x: 2 commits (90d) y: 107 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/UserDefinedPythonFunction.scala x: 6 commits (90d) y: 192 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/SortMergeJoinExec.scala x: 2 commits (90d) y: 1142 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala x: 7 commits (90d) y: 1590 lines of code python/pyspark/pandas/groupby.py x: 3 commits (90d) y: 1638 lines of code python/pyspark/pandas/namespace.py x: 4 commits (90d) y: 1460 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryListener.scala x: 3 commits (90d) y: 73 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/StreamingQueryListenerHelper.scala x: 2 commits (90d) y: 41 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SessionHolder.scala x: 14 commits (90d) y: 139 lines of code connector/connect/common/src/main/protobuf/spark/connect/base.proto x: 6 commits (90d) y: 662 lines of code python/pyspark/sql/connect/proto/base_pb2.pyi x: 5 commits (90d) y: 2137 lines of code python/pyspark/testing/utils.py x: 15 commits (90d) y: 367 lines of code core/src/main/scala/org/apache/spark/MapOutputTracker.scala x: 3 commits (90d) y: 1104 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/FunctionBuilderBase.scala x: 2 commits (90d) y: 79 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryCompilationErrors.scala x: 40 commits (90d) y: 3219 lines of code python/pyspark/sql/types.py x: 4 commits (90d) y: 1478 lines of code python/pyspark/testing/connectutils.py x: 3 commits (90d) y: 135 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectStreamingQueryCache.scala x: 4 commits (90d) y: 133 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseLexer.g4 x: 2 commits (90d) y: 514 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseParser.g4 x: 3 commits (90d) y: 1695 lines of code core/src/main/scala/org/apache/spark/executor/Executor.scala x: 6 commits (90d) y: 875 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala x: 3 commits (90d) y: 901 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/JdbcDialects.scala x: 1 commits (90d) y: 354 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/PostgresDialect.scala x: 2 commits (90d) y: 213 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/dsl/package.scala x: 1 commits (90d) y: 979 lines of code core/src/main/scala/org/apache/spark/shuffle/IndexShuffleBlockResolver.scala x: 2 commits (90d) y: 429 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStore.scala x: 4 commits (90d) y: 458 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/expressions/UserDefinedFunction.scala x: 8 commits (90d) y: 110 lines of code core/src/main/scala/org/apache/spark/util/Utils.scala x: 16 commits (90d) y: 2147 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala x: 2 commits (90d) y: 1053 lines of code python/setup.py x: 9 commits (90d) y: 276 lines of code core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala x: 2 commits (90d) y: 462 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/SparkConnectPlanExecution.scala x: 4 commits (90d) y: 187 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SQLExecution.scala x: 4 commits (90d) y: 164 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryManager.scala x: 6 commits (90d) y: 89 lines of code connector/connect/common/src/main/protobuf/spark/connect/commands.proto x: 10 commits (90d) y: 341 lines of code python/pyspark/sql/connect/proto/commands_pb2.pyi x: 9 commits (90d) y: 1509 lines of code python/pyspark/sql/connect/streaming/query.py x: 1 commits (90d) y: 219 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/DeduplicateRelations.scala x: 4 commits (90d) y: 329 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/UDFRegistration.scala x: 2 commits (90d) y: 1078 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/UdfUtils.scala x: 3 commits (90d) y: 493 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala x: 4 commits (90d) y: 810 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasExec.scala x: 4 commits (90d) y: 37 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala x: 3 commits (90d) y: 2504 lines of code sql/core/src/main/scala/org/apache/spark/sql/functions.scala x: 32 commits (90d) y: 2003 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/artifact/SparkConnectArtifactManager.scala x: 12 commits (90d) y: 186 lines of code core/src/main/scala/org/apache/spark/internal/config/package.scala x: 5 commits (90d) y: 2224 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/ArtifactManager.scala x: 5 commits (90d) y: 252 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/ui/SparkConnectServerPage.scala x: 1 commits (90d) y: 442 lines of code core/src/main/scala/org/apache/spark/SparkContext.scala x: 13 commits (90d) y: 1860 lines of code common/utils/src/main/java/org/apache/spark/network/util/JavaUtils.java x: 1 commits (90d) y: 253 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/arrow/ArrowVectorReader.scala x: 3 commits (90d) y: 208 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/LiteralValueProtoConverter.scala x: 3 commits (90d) y: 313 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/SparkIntervalUtils.scala x: 2 commits (90d) y: 393 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/StringUtils.scala x: 5 commits (90d) y: 66 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala x: 2 commits (90d) y: 1828 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ToStringBase.scala x: 2 commits (90d) y: 364 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/object.scala x: 7 commits (90d) y: 596 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/IntervalUtils.scala x: 3 commits (90d) y: 711 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/package.scala x: 4 commits (90d) y: 146 lines of code mllib/src/main/scala/org/apache/spark/mllib/evaluation/RankingMetrics.scala x: 1 commits (90d) y: 155 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroSerializer.scala x: 1 commits (90d) y: 304 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/DataTypeProtoConverter.scala x: 1 commits (90d) y: 236 lines of code sql/api/src/main/scala/org/apache/spark/sql/Row.scala x: 2 commits (90d) y: 230 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/ScalaReflection.scala x: 2 commits (90d) y: 309 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/DataTypeAstBuilder.scala x: 2 commits (90d) y: 142 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/SparkDateTimeUtils.scala x: 2 commits (90d) y: 347 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/TimestampFormatter.scala x: 1 commits (90d) y: 419 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/QueryParsingErrors.scala x: 2 commits (90d) y: 565 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/DataType.scala x: 2 commits (90d) y: 284 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/Decimal.scala x: 2 commits (90d) y: 473 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/DecimalType.scala x: 2 commits (90d) y: 122 lines of code sql/api/src/main/scala/org/apache/spark/sql/util/ArrowUtils.scala x: 2 commits (90d) y: 171 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/CSVOptions.scala x: 2 commits (90d) y: 267 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JSONOptions.scala x: 1 commits (90d) y: 188 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala x: 3 commits (90d) y: 379 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowWriter.scala x: 3 commits (90d) y: 358 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/Columnar.scala x: 4 commits (90d) y: 373 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFilters.scala x: 1 commits (90d) y: 666 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetUtils.scala x: 1 commits (90d) y: 333 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala x: 1 commits (90d) y: 323 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/parquet/ParquetScanBuilder.scala x: 1 commits (90d) y: 78 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/MapInBatchExec.scala x: 6 commits (90d) y: 51 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveInlineTables.scala x: 5 commits (90d) y: 80 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala x: 1 commits (90d) y: 696 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveShim.scala x: 3 commits (90d) y: 1468 lines of code dev/appveyor-install-dependencies.ps1 x: 7 commits (90d) y: 112 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala x: 2 commits (90d) y: 654 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRunner.scala x: 9 commits (90d) y: 556 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala x: 3 commits (90d) y: 408 lines of code python/pyspark/sql/connect/dataframe.py x: 8 commits (90d) y: 1749 lines of code sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 5 commits (90d) y: 557 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkStrategies.scala x: 6 commits (90d) y: 685 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/MicroBatchExecution.scala x: 2 commits (90d) y: 554 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala x: 3 commits (90d) y: 1431 lines of code python/pyspark/ml/deepspeed/deepspeed_distributor.py x: 4 commits (90d) y: 87 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 6 commits (90d) y: 111 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasEvaluatorFactory.scala x: 2 commits (90d) y: 245 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicLogicalOperators.scala x: 9 commits (90d) y: 1440 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala x: 4 commits (90d) y: 988 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala x: 1 commits (90d) y: 953 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/views.scala x: 3 commits (90d) y: 453 lines of code core/src/main/scala/org/apache/spark/ui/UIUtils.scala x: 3 commits (90d) y: 584 lines of code python/pyspark/sql/connect/expressions.py x: 1 commits (90d) y: 835 lines of code python/pyspark/sql/connect/functions.py x: 30 commits (90d) y: 2070 lines of code python/pyspark/sql/connect/proto/expressions_pb2.pyi x: 1 commits (90d) y: 1268 lines of code python/pyspark/sql/streaming/readwriter.py x: 3 commits (90d) y: 540 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Strategy.scala x: 6 commits (90d) y: 502 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Column.scala x: 1 commits (90d) y: 273 lines of code core/src/main/resources/org/apache/spark/ui/static/stagepage.js x: 1 commits (90d) y: 1040 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/trees/TreeNode.scala x: 5 commits (90d) y: 824 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/StringUtils.scala x: 5 commits (90d) y: 88 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/basicPhysicalOperators.scala x: 1 commits (90d) y: 642 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/window/WindowEvaluatorFactory.scala x: 2 commits (90d) y: 97 lines of code project/plugins.sbt x: 4 commits (90d) y: 14 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/ExecuteEventsManager.scala x: 2 commits (90d) y: 199 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala x: 27 commits (90d) y: 1081 lines of code python/pyspark/sql/dataframe.py x: 6 commits (90d) y: 1405 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/HDFSBackedStateStoreProvider.scala x: 1 commits (90d) y: 553 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2SessionCatalog.scala x: 5 commits (90d) y: 323 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/RewriteRowLevelCommand.scala x: 3 commits (90d) y: 161 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/unresolved.scala x: 5 commits (90d) y: 382 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/dsl/package.scala x: 1 commits (90d) y: 376 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/types/DataTypeUtils.scala x: 7 commits (90d) y: 130 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/ResolveDefaultColumnsUtil.scala x: 5 commits (90d) y: 287 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/HashAggregateExec.scala x: 3 commits (90d) y: 691 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningUtils.scala x: 1 commits (90d) y: 387 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/AllExecutionsPage.scala x: 2 commits (90d) y: 495 lines of code core/src/main/scala/org/apache/spark/SparkEnv.scala x: 2 commits (90d) y: 402 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/PythonUDF.scala x: 6 commits (90d) y: 146 lines of code core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala x: 4 commits (90d) y: 2008 lines of code python/pyspark/sql/connect/streaming/readwriter.py x: 3 commits (90d) y: 511 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/FunctionRegistry.scala x: 3 commits (90d) y: 856 lines of code dev/merge_spark_pr.py x: 5 commits (90d) y: 406 lines of code connector/connect/common/src/main/protobuf/spark/connect/relations.proto x: 8 commits (90d) y: 796 lines of code python/pyspark/sql/connect/proto/relations_pb2.pyi x: 7 commits (90d) y: 2915 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/jsonExpressions.scala x: 1 commits (90d) y: 716 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala x: 3 commits (90d) y: 1204 lines of code sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala x: 4 commits (90d) y: 444 lines of code sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriterV2.scala x: 3 commits (90d) y: 148 lines of code python/pyspark/rdd.py x: 4 commits (90d) y: 1514 lines of code python/pyspark/sql/_typing.pyi x: 3 commits (90d) y: 52 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/ExtractPythonUDFs.scala x: 3 commits (90d) y: 215 lines of code python/pyspark/sql/udf.py x: 6 commits (90d) y: 425 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/udaf.scala x: 1 commits (90d) y: 415 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala x: 5 commits (90d) y: 707 lines of code sql/core/src/main/scala/org/apache/spark/sql/internal/CatalogImpl.scala x: 7 commits (90d) y: 541 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 6 commits (90d) y: 287 lines of code core/src/main/scala/org/apache/spark/storage/BlockManager.scala x: 2 commits (90d) y: 1519 lines of code core/src/main/scala/org/apache/spark/storage/ShuffleBlockFetcherIterator.scala x: 1 commits (90d) y: 1081 lines of code core/src/main/scala/org/apache/spark/storage/memory/MemoryStore.scala x: 1 commits (90d) y: 601 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala x: 1 commits (90d) y: 1003 lines of code core/src/main/resources/org/apache/spark/ui/static/executorspage.js x: 6 commits (90d) y: 701 lines of code python/pyspark/context.py x: 4 commits (90d) y: 747 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala x: 1 commits (90d) y: 4395 lines of code core/src/main/scala/org/apache/spark/errors/SparkCoreErrors.scala x: 4 commits (90d) y: 412 lines of code python/pyspark/pandas/spark/functions.py x: 10 commits (90d) y: 128 lines of code common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java x: 1 commits (90d) y: 1586 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/trees/TreePatterns.scala x: 5 commits (90d) y: 126 lines of code R/pkg/pkgdown/_pkgdown_template.yml x: 1 commits (90d) y: 291 lines of code core/src/main/scala/org/apache/spark/status/AppStatusListener.scala x: 2 commits (90d) y: 1100 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/internal/CatalogImpl.scala x: 5 commits (90d) y: 303 lines of code core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala x: 3 commits (90d) y: 1133 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveExternalCatalog.scala x: 2 commits (90d) y: 964 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectAddArtifactsHandler.scala x: 5 commits (90d) y: 179 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/CodeGenerator.scala x: 3 commits (90d) y: 1157 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/Config.scala x: 1 commits (90d) y: 661 lines of code python/pyspark/ml/classification.py x: 1 commits (90d) y: 2099 lines of code python/pyspark/ml/clustering.py x: 1 commits (90d) y: 958 lines of code python/pyspark/ml/feature.py x: 1 commits (90d) y: 3363 lines of code python/pyspark/ml/regression.py x: 1 commits (90d) y: 1523 lines of code python/pyspark/ml/tuning.py x: 1 commits (90d) y: 1099 lines of code python/pyspark/mllib/linalg/__init__.py x: 1 commits (90d) y: 908 lines of code core/src/main/protobuf/org/apache/spark/status/protobuf/store_types.proto x: 1 commits (90d) y: 740 lines of code core/src/main/scala/org/apache/spark/status/LiveEntity.scala x: 1 commits (90d) y: 817 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/windowExpressions.scala x: 1 commits (90d) y: 797 lines of code python/pyspark/pandas/data_type_ops/num_ops.py x: 3 commits (90d) y: 429 lines of code python/pyspark/pandas/indexes/multi.py x: 1 commits (90d) y: 531 lines of code python/pyspark/pandas/indexing.py x: 1 commits (90d) y: 1202 lines of code python/pyspark/pandas/internal.py x: 3 commits (90d) y: 842 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala x: 2 commits (90d) y: 618 lines of code python/pyspark/pandas/generic.py x: 2 commits (90d) y: 938 lines of code python/pyspark/pandas/strings.py x: 2 commits (90d) y: 315 lines of code common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java x: 2 commits (90d) y: 1093 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSqlParser.scala x: 4 commits (90d) y: 763 lines of code python/pyspark/sql/connect/client/artifact.py x: 6 commits (90d) y: 254 lines of code python/pyspark/sql/connect/catalog.py x: 4 commits (90d) y: 262 lines of code python/pyspark/sql/connect/proto/catalog_pb2.pyi x: 4 commits (90d) y: 910 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/datetimeExpressions.scala x: 1 commits (90d) y: 2799 lines of code core/src/main/scala/org/apache/spark/rdd/RDD.scala x: 1 commits (90d) y: 1072 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/mathExpressions.scala x: 1 commits (90d) y: 1507 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java x: 1 commits (90d) y: 618 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/complexTypeCreator.scala x: 1 commits (90d) y: 574 lines of code core/src/main/scala/org/apache/spark/util/JsonProtocol.scala x: 1 commits (90d) y: 1350 lines of code
4424.0
lines of code
  min: 1.0
  average: 298.67
  25th percentile: 49.0
  median: 137.0
  75th percentile: 347.0
  max: 4424.0
0 54.0
commits (90d)
min: 1.0 | average: 2.61 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 3.0 | max: 54.0

File Size vs. Contributors (90 days): 931 points

connector/connect/server/src/main/scala/org/apache/spark/sql/connect/config/Connect.scala x: 6 contributors (90d) y: 157 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/CachedStreamResponse.scala x: 1 contributors (90d) y: 8 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteGrpcResponseSender.scala x: 1 contributors (90d) y: 191 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteResponseObserver.scala x: 1 contributors (90d) y: 205 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/execution/ExecuteThreadRunner.scala x: 4 contributors (90d) y: 149 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/ExecuteHolder.scala x: 5 contributors (90d) y: 109 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectExecutePlanHandler.scala x: 2 contributors (90d) y: 18 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectReattachExecuteHandler.scala x: 1 contributors (90d) y: 33 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/functions.scala x: 6 contributors (90d) y: 1323 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Dataset.scala x: 8 contributors (90d) y: 947 lines of code sql/api/src/main/scala/org/apache/spark/sql/streaming/GroupState.scala x: 1 contributors (90d) y: 44 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala x: 16 contributors (90d) y: 4424 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/WholeStageCodegenExec.scala x: 2 contributors (90d) y: 528 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala x: 18 contributors (90d) y: 2820 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveLateralColumnAliasReference.scala x: 1 contributors (90d) y: 138 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowConverters.scala x: 3 contributors (90d) y: 341 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala x: 11 contributors (90d) y: 3431 lines of code python/pyspark/pandas/base.py x: 1 contributors (90d) y: 607 lines of code python/pyspark/errors/error_classes.py x: 7 contributors (90d) y: 3 lines of code python/pyspark/worker.py x: 5 contributors (90d) y: 777 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/SparkConnectPlanner.scala x: 19 contributors (90d) y: 2873 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala x: 2 contributors (90d) y: 378 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExec.scala x: 2 contributors (90d) y: 187 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/SparkSession.scala x: 8 contributors (90d) y: 363 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala x: 4 contributors (90d) y: 643 lines of code python/pyspark/ml/connect/io_utils.py x: 2 contributors (90d) y: 195 lines of code python/pyspark/ml/connect/tuning.py x: 2 contributors (90d) y: 328 lines of code python/pyspark/ml/torch/distributor.py x: 3 contributors (90d) y: 624 lines of code python/pyspark/ml/util.py x: 2 contributors (90d) y: 388 lines of code python/pyspark/pandas/utils.py x: 2 contributors (90d) y: 629 lines of code python/pyspark/sql/connect/session.py x: 10 contributors (90d) y: 620 lines of code python/pyspark/sql/connect/udf.py x: 2 contributors (90d) y: 212 lines of code python/pyspark/sql/connect/udtf.py x: 3 contributors (90d) y: 147 lines of code python/pyspark/sql/session.py x: 5 contributors (90d) y: 763 lines of code python/pyspark/sql/utils.py x: 4 contributors (90d) y: 176 lines of code sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala x: 8 contributors (90d) y: 1462 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/JoinCodegenSupport.scala x: 1 contributors (90d) y: 62 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcExceptionConverter.scala x: 2 contributors (90d) y: 69 lines of code dev/sparktestsupport/modules.py x: 10 contributors (90d) y: 1015 lines of code python/pyspark/pandas/__init__.py x: 1 contributors (90d) y: 112 lines of code python/pyspark/pandas/indexes/base.py x: 1 contributors (90d) y: 1008 lines of code python/pyspark/pandas/indexes/datetimes.py x: 1 contributors (90d) y: 266 lines of code python/pyspark/pandas/series.py x: 1 contributors (90d) y: 2180 lines of code python/pyspark/pandas/spark/accessors.py x: 1 contributors (90d) y: 242 lines of code python/pyspark/pandas/usage_logging/__init__.py x: 1 contributors (90d) y: 100 lines of code python/pyspark/sql/pandas/serializers.py x: 5 contributors (90d) y: 546 lines of code python/pyspark/sql/pandas/types.py x: 3 contributors (90d) y: 599 lines of code core/src/main/scala/org/apache/spark/serializer/KryoSerializer.scala x: 3 contributors (90d) y: 569 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonParser.scala x: 3 contributors (90d) y: 423 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryExecutionErrors.scala x: 18 contributors (90d) y: 2370 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/DataTypeErrors.scala x: 3 contributors (90d) y: 238 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryErrorsBase.scala x: 3 contributors (90d) y: 29 lines of code python/pyspark/testing/pandasutils.py x: 1 contributors (90d) y: 440 lines of code project/SparkBuild.scala x: 10 contributors (90d) y: 1369 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/JavaTypeInference.scala x: 2 contributors (90d) y: 102 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala x: 2 contributors (90d) y: 755 lines of code core/src/main/scala/org/apache/spark/SparkConf.scala x: 1 contributors (90d) y: 463 lines of code core/src/main/scala/org/apache/spark/api/python/StreamingPythonRunner.scala x: 3 contributors (90d) y: 66 lines of code python/pyspark/sql/udtf.py x: 3 contributors (90d) y: 279 lines of code python/pyspark/cloudpickle/cloudpickle_fast.py x: 2 contributors (90d) y: 452 lines of code python/pyspark/sql/connect/plan.py x: 7 contributors (90d) y: 1734 lines of code python/pyspark/sql/connect/client/core.py x: 8 contributors (90d) y: 1150 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/GrpcRetryHandler.scala x: 2 contributors (90d) y: 114 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkConnectClient.scala x: 6 contributors (90d) y: 435 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/SparkResult.scala x: 5 contributors (90d) y: 225 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala x: 4 contributors (90d) y: 541 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningAwareFileIndex.scala x: 1 contributors (90d) y: 157 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/UserDefinedPythonFunction.scala x: 3 contributors (90d) y: 192 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/joins/SortMergeJoinExec.scala x: 2 contributors (90d) y: 1142 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala x: 7 contributors (90d) y: 1590 lines of code python/pyspark/pandas/groupby.py x: 1 contributors (90d) y: 1638 lines of code python/pyspark/pandas/namespace.py x: 1 contributors (90d) y: 1460 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryListener.scala x: 3 contributors (90d) y: 73 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SessionHolder.scala x: 8 contributors (90d) y: 139 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveReferencesInUpdate.scala x: 2 contributors (90d) y: 40 lines of code connector/connect/common/src/main/protobuf/spark/connect/base.proto x: 2 contributors (90d) y: 662 lines of code python/pyspark/sql/connect/proto/base_pb2.pyi x: 2 contributors (90d) y: 2137 lines of code python/pyspark/testing/utils.py x: 1 contributors (90d) y: 367 lines of code core/src/main/scala/org/apache/spark/MapOutputTracker.scala x: 3 contributors (90d) y: 1104 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/FunctionBuilderBase.scala x: 1 contributors (90d) y: 79 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryCompilationErrors.scala x: 19 contributors (90d) y: 3219 lines of code python/pyspark/sql/types.py x: 2 contributors (90d) y: 1478 lines of code python/pyspark/testing/connectutils.py x: 2 contributors (90d) y: 135 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectService.scala x: 9 contributors (90d) y: 283 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/service/SparkConnectStreamingQueryCache.scala x: 3 contributors (90d) y: 133 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseLexer.g4 x: 2 contributors (90d) y: 514 lines of code sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseParser.g4 x: 2 contributors (90d) y: 1695 lines of code core/src/main/scala/org/apache/spark/executor/Executor.scala x: 4 contributors (90d) y: 875 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala x: 3 contributors (90d) y: 901 lines of code sql/core/src/main/scala/org/apache/spark/sql/jdbc/JdbcDialects.scala x: 1 contributors (90d) y: 354 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/dsl/package.scala x: 1 contributors (90d) y: 979 lines of code connector/connect/server/src/main/scala/org/apache/spark/sql/connect/ui/SparkConnectServerListener.scala x: 2 contributors (90d) y: 344 lines of code core/src/main/scala/org/apache/spark/shuffle/IndexShuffleBlockResolver.scala x: 2 contributors (90d) y: 429 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStore.scala x: 3 contributors (90d) y: 458 lines of code core/src/main/scala/org/apache/spark/util/Utils.scala x: 7 contributors (90d) y: 2147 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala x: 2 contributors (90d) y: 1053 lines of code python/setup.py x: 6 contributors (90d) y: 276 lines of code core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala x: 2 contributors (90d) y: 462 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SQLExecution.scala x: 3 contributors (90d) y: 164 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryManager.scala x: 4 contributors (90d) y: 89 lines of code connector/connect/common/src/main/protobuf/spark/connect/commands.proto x: 6 contributors (90d) y: 341 lines of code python/pyspark/sql/connect/proto/commands_pb2.pyi x: 5 contributors (90d) y: 1509 lines of code python/pyspark/sql/connect/streaming/query.py x: 1 contributors (90d) y: 219 lines of code python/pyspark/sql/streaming/listener.py x: 2 contributors (90d) y: 609 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/DeduplicateRelations.scala x: 3 contributors (90d) y: 329 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/UDFRegistration.scala x: 1 contributors (90d) y: 1078 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/UdfUtils.scala x: 3 contributors (90d) y: 493 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/encoders/RowEncoder.scala x: 2 contributors (90d) y: 71 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala x: 3 contributors (90d) y: 810 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasExec.scala x: 3 contributors (90d) y: 37 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala x: 3 contributors (90d) y: 2504 lines of code sql/core/src/main/scala/org/apache/spark/sql/functions.scala x: 5 contributors (90d) y: 2003 lines of code core/src/main/scala/org/apache/spark/internal/config/package.scala x: 4 contributors (90d) y: 2224 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/connect/client/ArtifactManager.scala x: 5 contributors (90d) y: 252 lines of code core/src/main/scala/org/apache/spark/SparkContext.scala x: 8 contributors (90d) y: 1860 lines of code common/utils/src/main/java/org/apache/spark/network/util/JavaUtils.java x: 1 contributors (90d) y: 253 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/protobuf/functions.scala x: 3 contributors (90d) y: 112 lines of code connector/connect/common/src/main/scala/org/apache/spark/sql/connect/common/LiteralValueProtoConverter.scala x: 2 contributors (90d) y: 313 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala x: 2 contributors (90d) y: 1828 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ToStringBase.scala x: 2 contributors (90d) y: 364 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/IntervalUtils.scala x: 3 contributors (90d) y: 711 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/package.scala x: 2 contributors (90d) y: 146 lines of code mllib/src/main/scala/org/apache/spark/mllib/evaluation/RankingMetrics.scala x: 1 contributors (90d) y: 155 lines of code connector/avro/src/main/scala/org/apache/spark/sql/avro/AvroSerializer.scala x: 1 contributors (90d) y: 304 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/progress.scala x: 4 contributors (90d) y: 192 lines of code sql/api/src/main/scala/org/apache/spark/sql/Row.scala x: 2 contributors (90d) y: 230 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/parsers.scala x: 2 contributors (90d) y: 275 lines of code sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/TimestampFormatter.scala x: 1 contributors (90d) y: 419 lines of code sql/api/src/main/scala/org/apache/spark/sql/errors/QueryParsingErrors.scala x: 2 contributors (90d) y: 565 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/DataType.scala x: 2 contributors (90d) y: 284 lines of code sql/api/src/main/scala/org/apache/spark/sql/types/Decimal.scala x: 2 contributors (90d) y: 473 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CTESubstitution.scala x: 3 contributors (90d) y: 175 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/UnivocityParser.scala x: 3 contributors (90d) y: 299 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala x: 3 contributors (90d) y: 379 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFilters.scala x: 1 contributors (90d) y: 666 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetUtils.scala x: 1 contributors (90d) y: 333 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala x: 1 contributors (90d) y: 323 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/MapInBatchExec.scala x: 4 contributors (90d) y: 51 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveInlineTables.scala x: 4 contributors (90d) y: 80 lines of code core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala x: 1 contributors (90d) y: 696 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveShim.scala x: 3 contributors (90d) y: 1468 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala x: 2 contributors (90d) y: 654 lines of code core/src/main/scala/org/apache/spark/api/python/PythonRunner.scala x: 6 contributors (90d) y: 556 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala x: 3 contributors (90d) y: 408 lines of code python/pyspark/sql/connect/dataframe.py x: 4 contributors (90d) y: 1749 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkStrategies.scala x: 4 contributors (90d) y: 685 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/MicroBatchExecution.scala x: 2 contributors (90d) y: 554 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala x: 3 contributors (90d) y: 1431 lines of code connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 6 contributors (90d) y: 111 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/FlatMapGroupsInPandasExec.scala x: 4 contributors (90d) y: 55 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/WindowInPandasEvaluatorFactory.scala x: 2 contributors (90d) y: 245 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicLogicalOperators.scala x: 8 contributors (90d) y: 1440 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala x: 4 contributors (90d) y: 988 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala x: 1 contributors (90d) y: 953 lines of code core/src/main/scala/org/apache/spark/ui/UIUtils.scala x: 1 contributors (90d) y: 584 lines of code python/pyspark/sql/connect/expressions.py x: 1 contributors (90d) y: 835 lines of code python/pyspark/sql/connect/functions.py x: 7 contributors (90d) y: 2070 lines of code python/pyspark/sql/connect/proto/expressions_pb2.pyi x: 1 contributors (90d) y: 1268 lines of code python/pyspark/sql/streaming/readwriter.py x: 1 contributors (90d) y: 540 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DataSourceV2Strategy.scala x: 5 contributors (90d) y: 502 lines of code core/src/main/resources/org/apache/spark/ui/static/stagepage.js x: 1 contributors (90d) y: 1040 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/trees/TreeNode.scala x: 4 contributors (90d) y: 824 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/StringUtils.scala x: 5 contributors (90d) y: 88 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/basicPhysicalOperators.scala x: 1 contributors (90d) y: 642 lines of code project/plugins.sbt x: 2 contributors (90d) y: 14 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala x: 10 contributors (90d) y: 1081 lines of code python/pyspark/sql/dataframe.py x: 5 contributors (90d) y: 1405 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/RewriteRowLevelCommand.scala x: 2 contributors (90d) y: 161 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/unresolved.scala x: 5 contributors (90d) y: 382 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/HashAggregateExec.scala x: 3 contributors (90d) y: 691 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningUtils.scala x: 1 contributors (90d) y: 387 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/ui/AllExecutionsPage.scala x: 1 contributors (90d) y: 495 lines of code core/src/main/scala/org/apache/spark/SparkEnv.scala x: 2 contributors (90d) y: 402 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/python/BatchEvalPythonExec.scala x: 3 contributors (90d) y: 89 lines of code core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala x: 3 contributors (90d) y: 2008 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/FunctionRegistry.scala x: 2 contributors (90d) y: 856 lines of code connector/connect/common/src/main/protobuf/spark/connect/relations.proto x: 6 contributors (90d) y: 796 lines of code python/pyspark/sql/connect/proto/relations_pb2.pyi x: 5 contributors (90d) y: 2915 lines of code python/pyspark/pandas/sql_formatter.py x: 4 contributors (90d) y: 105 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/jsonExpressions.scala x: 1 contributors (90d) y: 716 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala x: 3 contributors (90d) y: 1204 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TableOutputResolver.scala x: 4 contributors (90d) y: 442 lines of code python/pyspark/rdd.py x: 3 contributors (90d) y: 1514 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/udaf.scala x: 1 contributors (90d) y: 415 lines of code sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala x: 5 contributors (90d) y: 287 lines of code core/src/main/scala/org/apache/spark/storage/BlockManager.scala x: 2 contributors (90d) y: 1519 lines of code core/src/main/scala/org/apache/spark/storage/ShuffleBlockFetcherIterator.scala x: 1 contributors (90d) y: 1081 lines of code python/pyspark/context.py x: 3 contributors (90d) y: 747 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala x: 1 contributors (90d) y: 4395 lines of code common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java x: 1 contributors (90d) y: 1586 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/trees/TreePatterns.scala x: 4 contributors (90d) y: 126 lines of code R/pkg/pkgdown/_pkgdown_template.yml x: 1 contributors (90d) y: 291 lines of code core/src/main/scala/org/apache/spark/status/AppStatusListener.scala x: 1 contributors (90d) y: 1100 lines of code core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala x: 3 contributors (90d) y: 1133 lines of code sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveExternalCatalog.scala x: 1 contributors (90d) y: 964 lines of code python/pyspark/pandas/window.py x: 1 contributors (90d) y: 539 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/CodeGenerator.scala x: 3 contributors (90d) y: 1157 lines of code resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/Config.scala x: 1 contributors (90d) y: 661 lines of code python/pyspark/ml/classification.py x: 1 contributors (90d) y: 2099 lines of code python/pyspark/ml/feature.py x: 1 contributors (90d) y: 3363 lines of code python/pyspark/ml/regression.py x: 1 contributors (90d) y: 1523 lines of code python/pyspark/mllib/linalg/__init__.py x: 1 contributors (90d) y: 908 lines of code core/src/main/protobuf/org/apache/spark/status/protobuf/store_types.proto x: 1 contributors (90d) y: 740 lines of code core/src/main/scala/org/apache/spark/status/LiveEntity.scala x: 1 contributors (90d) y: 817 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/windowExpressions.scala x: 1 contributors (90d) y: 797 lines of code sql/core/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveSessionCatalog.scala x: 3 contributors (90d) y: 508 lines of code python/pyspark/pandas/indexing.py x: 1 contributors (90d) y: 1202 lines of code python/pyspark/pandas/internal.py x: 1 contributors (90d) y: 842 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala x: 2 contributors (90d) y: 618 lines of code python/pyspark/pandas/generic.py x: 1 contributors (90d) y: 938 lines of code common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java x: 2 contributors (90d) y: 1093 lines of code sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSqlParser.scala x: 4 contributors (90d) y: 763 lines of code python/pyspark/sql/connect/client/artifact.py x: 3 contributors (90d) y: 254 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/datetimeExpressions.scala x: 1 contributors (90d) y: 2799 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/mathExpressions.scala x: 1 contributors (90d) y: 1507 lines of code sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java x: 1 contributors (90d) y: 618 lines of code sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/complexTypeCreator.scala x: 1 contributors (90d) y: 574 lines of code core/src/main/scala/org/apache/spark/util/JsonProtocol.scala x: 1 contributors (90d) y: 1350 lines of code
4424.0
lines of code
  min: 1.0
  average: 298.67
  25th percentile: 49.0
  median: 137.0
  75th percentile: 347.0
  max: 4424.0
0 19.0
contributors (90d)
min: 1.0 | average: 1.78 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 19.0