apache / datafu
File Size

The distribution of file sizes, measured in lines of code.

File Size Overall
1001+: 0% | 501-1000: 0% | 201-500: 39% | 101-200: 29% | 1-100: 31%
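As a rough illustration of how a distribution like this can be computed, the sketch below assigns each file to one of the five size buckets and reports each bucket's share of the total. It assumes the percentages are shares of lines of code rather than of file count; the report does not state this explicitly. The names `bucket_label` and `size_distribution` are hypothetical and do not belong to any particular tool.

```python
# Bucket boundaries taken from the report legend: (label, lower bound, upper bound).
BUCKETS = [
    ("1001+", 1001, None),
    ("501-1000", 501, 1000),
    ("201-500", 201, 500),
    ("101-200", 101, 200),
    ("1-100", 1, 100),
]


def bucket_label(loc):
    """Return the legend label for a file with `loc` lines of code, or None if empty."""
    for label, low, high in BUCKETS:
        if loc >= low and (high is None or loc <= high):
            return label
    return None


def size_distribution(file_locs):
    """Return each bucket's percentage share of the total lines of code.

    Assumption: shares are weighted by lines of code, not by number of files.
    """
    totals = {label: 0 for label, _, _ in BUCKETS}
    for loc in file_locs:
        label = bucket_label(loc)
        if label is not None:
            totals[label] += loc
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {label: 0.0 for label in totals}
    return {label: 100.0 * count / grand_total for label, count in totals.items()}


if __name__ == "__main__":
    # Made-up line counts, purely for illustration.
    for label, share in size_distribution([492, 150, 50, 8]).items():
        print(f"{label}: {share:.0f}%")
```

Weighting by lines of code means a single large file can dominate a bucket even when most files in the project are small.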


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
java | 0% | 0% | 41% | 28% | 29%
css | 0% | 0% | 100% | 0% | 0%
scala | 0% | 0% | 34% | 58% | 7%
xsl | 0% | 0% | 0% | 100% | 0%
groovy | 0% | 0% | 0% | 74% | 25%
erb | 0% | 0% | 0% | 0% | 100%
pig | 0% | 0% | 0% | 0% | 100%
py | 0% | 0% | 0% | 0% | 100%
rb | 0% | 0% | 0% | 0% | 100%
less | 0% | 0% | 0% | 0% | 100%
rdf | 0% | 0% | 0% | 0% | 100%
builder | 0% | 0% | 0% | 0% | 100%
html | 0% | 0% | 0% | 0% | 100%
js | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
datafu-pig | 0% | 0% | 36% | 29% | 34%
datafu-hourglass | 0% | 0% | 50% | 25% | 23%
site | 0% | 0% | 41% | 0% | 58%
datafu-spark | 0% | 0% | 30% | 52% | 16%
gradle | 0% | 0% | 0% | 100% | 0%
buildSrc | 0% | 0% | 0% | 74% | 25%
build-plugin | 0% | 0% | 0% | 0% | 100%
ROOT | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | # lines | # units
StagedOutputJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
492 14
PageRankImpl.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
391 33
AbstractPartitionPreservingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
387 20
AbstractPartitionCollapsingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
349 24
bootstrap-theme.css
in site/source/stylesheets
340 -
SparkDFUtils.scala
in datafu-spark/src/main/scala/datafu/spark
337 25
SimpleRandomSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
319 21
PartitionCollapsingExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
315 21
PageRank.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
312 8
VAR.java
in datafu-pig/src/main/java/datafu/pig/stats
300 16
FloatVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
289 15
LongVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15
IntVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15
DoubleVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15
StreamingQuantile.java
in datafu-pig/src/main/java/datafu/pig/stats
276 16
AbstractJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
257 28
ExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
234 25
ReservoirSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
230 23
EmpiricalCountEntropy.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
229 20
CollapsingReducer.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
222 12
BagJoin.java
in datafu-pig/src/main/java/datafu/pig/bags
219 7
AbstractNonIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
206 11
CollapsingMapper.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
202 17
PathUtils.java
in datafu-hourglass/src/main/java/datafu/hourglass/fs
192 11
WeightedReservoirSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
181 13
ExtremalTupleByNthField.java
in datafu-pig/src/main/java/datafu/org/apache/pig/piggybank/evaluation
177 19
PartitionCollapsingSchemas.java
in datafu-hourglass/src/main/java/datafu/hourglass/schemas
160 10
AliasableEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
160 23
PartitionPreservingExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
156 13
CountDistinctUpTo.java
in datafu-pig/src/main/java/datafu/pig/bags
156 19
rat-output-to-html.xsl
in gradle/resources
153 -
CollapsingCombiner.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
152 9
HyperLogLogPlusPlus.java
in datafu-pig/src/main/java/datafu/pig/stats
152 13
SimpleRandomSampleWithReplacementElect.java
in datafu-pig/src/main/java/datafu/pig/sampling
143 10
SetDifference.java
in datafu-pig/src/main/java/datafu/pig/sets
140 8
SparkOverwriteUDAFs.scala
in datafu-spark/src/main/scala/spark/utils/overwrites
135 9
ChaoShenEntropyEstimator.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
134 11
ReduceEstimator.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
130 7
Aggregators.scala
in datafu-spark/src/main/scala/datafu/spark
129 12
Coalesce.java
in datafu-pig/src/main/java/datafu/pig/util
128 4
TupleDiff.java
in datafu-pig/src/main/java/datafu/pig/util
128 8
DataTypeUtil.java
in datafu-pig/src/main/java/datafu/pig/hash/lsh/util
126 6
Autojar.groovy
in buildSrc/src/main/groovy/datafu/autojar/task
125 10
DateRangePlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
124 1
SimpleRandomSampleWithReplacementVote.java
in datafu-pig/src/main/java/datafu/pig/sampling
121 2
BagGroup.java
in datafu-pig/src/main/java/datafu/pig/bags
117 4
CondEntropy.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
117 7
PartitionPreservingSchemas.java
in datafu-hourglass/src/main/java/datafu/hourglass/schemas
114 9
PartitionCollapsingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
109 19
PartitioningReducer.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
108 8
Files With Most Units (Top 50)
File | # lines | # units
PageRankImpl.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
391 33
AbstractJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
257 28
ExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
234 25
SparkDFUtils.scala
in datafu-spark/src/main/scala/datafu/spark
337 25
AbstractPartitionCollapsingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
349 24
AliasableEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
160 23
ReservoirSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
230 23
PartitionCollapsingExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
315 21
SimpleRandomSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
319 21
AbstractPartitionPreservingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
387 20
EmpiricalCountEntropy.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
229 20
PartitionCollapsingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
109 19
CountDistinctUpTo.java
in datafu-pig/src/main/java/datafu/pig/bags
156 19
ExtremalTupleByNthField.java
in datafu-pig/src/main/java/datafu/org/apache/pig/piggybank/evaluation
177 19
CollapsingMapper.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
202 17
VAR.java
in datafu-pig/src/main/java/datafu/pig/stats
300 16
StreamingQuantile.java
in datafu-pig/src/main/java/datafu/pig/stats
276 16
df_utils.py
in datafu-spark/src/main/resources/pyspark_utils
62 16
PartitionPreservingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
88 15
LongVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15
IntVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15
FloatVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
289 15
DoubleVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15
StagedOutputJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
492 14
PartitionPreservingExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
156 13
WeightedReservoirSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
181 13
HyperLogLogPlusPlus.java
in datafu-pig/src/main/java/datafu/pig/stats
152 13
TimeBasedJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
99 12
CollapsingReducer.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
222 12
Aggregators.scala
in datafu-spark/src/main/scala/datafu/spark
129 12
PathUtils.java
in datafu-hourglass/src/main/java/datafu/hourglass/fs
192 11
IncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
86 11
AbstractNonIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
206 11
PartitioningMapper.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
106 11
ScoredTuple.java
in datafu-pig/src/main/java/datafu/pig/sampling
71 11
ChaoShenEntropyEstimator.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
134 11
PartitionCollapsingSchemas.java
in datafu-hourglass/src/main/java/datafu/hourglass/schemas
160 10
SimpleRandomSampleWithReplacementElect.java
in datafu-pig/src/main/java/datafu/pig/sampling
143 10
Autojar.groovy
in buildSrc/src/main/groovy/datafu/autojar/task
125 10
CollapsingCombiner.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
152 9
PartitionPreservingSchemas.java
in datafu-hourglass/src/main/java/datafu/hourglass/schemas
114 9
DataFuException.java
in datafu-pig/src/main/java/datafu/pig/util
68 9
SparkOverwriteUDAFs.scala
in datafu-spark/src/main/scala/spark/utils/overwrites
135 9
PartitioningReducer.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
108 8
TaskSchemas.java
in datafu-hourglass/src/main/java/datafu/hourglass/schemas
63 8
SetDifference.java
in datafu-pig/src/main/java/datafu/pig/sets
140 8
PageRank.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
312 8
URLInfo.java
in datafu-pig/src/main/java/datafu/pig/urls
102 8
TupleDiff.java
in datafu-pig/src/main/java/datafu/pig/util
128 8
Hasher.java
in datafu-pig/src/main/java/datafu/pig/hash
76 8
Files With Long Lines (Top 50)

There are 61 files with lines longer than 120 characters. In total, there are 188 long lines.
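The two totals above (files containing at least one long line, and long lines overall) can be derived with a sketch along the following lines. The 120-character threshold comes from the report; the function names and the in-memory `files` mapping are assumptions made for illustration, and a real tool may count characters differently (for example in how it treats tabs or trailing whitespace).

```python
def count_long_lines(text, limit=120):
    """Count the lines in `text` that are longer than `limit` characters."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def long_line_summary(files, limit=120):
    """Summarize long lines across `files`, a mapping of path -> file content.

    Returns (number of files with long lines, total long lines, ranking),
    where ranking lists (path, count) pairs, highest count first.
    """
    per_file = {}
    for path, text in files.items():
        count = count_long_lines(text, limit)
        if count > 0:
            per_file[path] = count
    ranking = sorted(per_file.items(), key=lambda item: (-item[1], item[0]))
    return len(per_file), sum(per_file.values()), ranking


if __name__ == "__main__":
    # Tiny made-up inputs, not taken from the repository.
    sample = {
        "A.java": "x" * 121 + "\n" + "y" * 120,
        "b.py": "short line\n",
    }
    print(long_line_summary(sample))
```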

File | # lines | # units | # long lines
bootstrap-theme.css
in site/source/stylesheets
340 - 30
AliasableEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
160 23 16
rat-output-to-html.xsl
in gradle/resources
153 - 10
Coalesce.java
in datafu-pig/src/main/java/datafu/pig/util
128 4 8
Hasher.java
in datafu-pig/src/main/java/datafu/pig/hash
76 8 7
StagedOutputJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
492 14 6
AbstractPartitionCollapsingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
349 24 6
AbstractNonIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
206 11 5
SimpleEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
101 5 5
config.rb
in site
37 3 5
index.markdown.erb
in site/source
49 - 5
df_utils.py
in datafu-spark/src/main/resources/pyspark_utils
62 16 5
PartitionCollapsingExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
315 21 4
SparkOverwriteUDAFs.scala
in datafu-spark/src/main/scala/spark/utils/overwrites
135 9 4
Aggregators.scala
in datafu-spark/src/main/scala/datafu/spark
129 12 4
AbstractPartitionPreservingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
387 20 3
WeightedReservoirSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
181 13 3
BagGroup.java
in datafu-pig/src/main/java/datafu/pig/bags
117 4 3
BagConcat.java
in datafu-pig/src/main/java/datafu/pig/bags
94 4 3
_footer.erb
in site/source/layouts
27 - 3
SparkDFUtils.scala
in datafu-spark/src/main/scala/datafu/spark
337 25 3
ExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
234 25 2
PartitionCollapsingSchemas.java
in datafu-hourglass/src/main/java/datafu/hourglass/schemas
160 10 2
PartitionPreservingSchemas.java
in datafu-hourglass/src/main/java/datafu/hourglass/schemas
114 9 2
AvroDateRangeMetadata.java
in datafu-hourglass/src/main/java/datafu/hourglass/avro
40 2 2
PageRankImpl.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
391 33 2
MetricUDF.java
in datafu-pig/src/main/java/datafu/pig/hash/lsh/metric
95 4 2
SimpleRandomSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
319 21 2
BagJoin.java
in datafu-pig/src/main/java/datafu/pig/bags
219 7 2
VAR.java
in datafu-pig/src/main/java/datafu/pig/stats
300 16 2
35 - 2
DateRangePlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
124 1 1
TimePartitioner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
78 3 1
ReduceEstimator.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
130 7 1
DelegatingMapper.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
32 3 1
AvroKeyValueIdentityMapper.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
17 1 1
CollapsingCombiner.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
152 9 1
CollapsingReducer.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
222 12 1
DelegatingCombiner.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
32 3 1
CollapsingMapper.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
202 17 1
DelegatingReducer.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
32 3 1
PartitioningMapper.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
106 11 1
CombinedAvroKeyInputFormat.java
in datafu-hourglass/src/main/java/datafu/hourglass/avro
55 2 1
Sessionize.java
in datafu-pig/src/main/java/datafu/pig/sessions
107 5 1
PageRank.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
312 8 1
CachedFile.java
in datafu-pig/src/main/java/datafu/pig/text/opennlp
18 1 1
ContextualEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
44 7 1
TransposeTupleToBag.java
in datafu-pig/src/main/java/datafu/pig/util
61 2 1
LSHFunc.java
in datafu-pig/src/main/java/datafu/pig/hash/lsh
102 5 1
HasherRand.java
in datafu-pig/src/main/java/datafu/pig/hash
51 5 1
Correlations

File Size vs. Commits (all time): 225 points

[Scatter plot: one point per file; x-axis: commits (all time), y-axis: lines of code.]

lines of code: min: 1.0 | average: 76.26 | 25th percentile: 16.5 | median: 42.0 | 75th percentile: 102.5 | max: 492.0
commits (all time): min: 1.0 | average: 2.56 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 3.0 | max: 16.0
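
Summary statistics of this kind (minimum, average, quartiles, maximum) can be reproduced from a list of per-file values with a sketch like the one below. It uses Python's standard `statistics` module; the "inclusive" interpolation method is an assumption, since the report does not say how its percentiles are interpolated, so results may differ slightly from the figures shown. The function name `summarize` is hypothetical.

```python
import statistics


def summarize(values):
    """Return min, average, quartiles and max for a list of at least two numbers."""
    ordered = sorted(values)
    # quantiles(n=4) returns the three cut points: Q1, median, Q3.
    q1, median, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    return {
        "min": float(ordered[0]),
        "average": round(statistics.fmean(ordered), 2),
        "25th percentile": q1,
        "median": median,
        "75th percentile": q3,
        "max": float(ordered[-1]),
    }


if __name__ == "__main__":
    # Made-up values, purely for illustration.
    stats = summarize([1, 2, 3, 4, 5])
    print(" | ".join(f"{name}: {value}" for name, value in stats.items()))
```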

File Size vs. Contributors (all time): 225 points

datafu-spark/src/main/resources/pyspark_utils/df_utils.py x: 5 contributors (all time) y: 62 lines of code datafu-spark/src/main/scala/datafu/spark/SparkDFUtils.scala x: 9 contributors (all time) y: 337 lines of code site/source/layouts/layout.erb x: 3 contributors (all time) y: 39 lines of code datafu-spark/src/main/scala/spark/utils/overwrites/SparkOverwriteUDAFs.scala x: 5 contributors (all time) y: 135 lines of code datafu-spark/src/main/resources/pyspark_utils/bridge_utils.py x: 3 contributors (all time) y: 41 lines of code datafu-spark/src/main/scala/datafu/spark/DataFrameOps.scala x: 5 contributors (all time) y: 75 lines of code site/source/index.markdown.erb x: 5 contributors (all time) y: 49 lines of code site/source/layouts/_docs_nav.erb x: 6 contributors (all time) y: 55 lines of code site/source/layouts/_footer.erb x: 6 contributors (all time) y: 27 lines of code datafu-spark/src/main/scala/datafu/spark/Aggregators.scala x: 1 contributors (all time) y: 129 lines of code datafu-spark/src/main/scala/datafu/spark/ScalaPythonBridge.scala x: 4 contributors (all time) y: 103 lines of code datafu-spark/src/main/scala/spark/utils/overwrites/SparkPythonRunner.scala x: 4 contributors (all time) y: 101 lines of code doap_DataFu.rdf x: 3 contributors (all time) y: 35 lines of code datafu-spark/src/main/scala/datafu/spark/PythonPathsManager.scala x: 3 contributors (all time) y: 102 lines of code site/config.rb x: 5 contributors (all time) y: 37 lines of code buildSrc/src/main/groovy/datafu/autojar/task/Autojar.groovy x: 2 contributors (all time) y: 125 lines of code datafu-pig/src/main/java/datafu/org/apache/pig/piggybank/evaluation/ExtremalTupleByNthField.java x: 2 contributors (all time) y: 177 lines of code datafu-pig/src/main/java/datafu/pig/bags/CountDistinctUpTo.java x: 2 contributors (all time) y: 156 lines of code datafu-pig/src/main/java/datafu/pig/hash/Hasher.java x: 2 contributors (all time) y: 76 lines of code 
datafu-pig/src/main/java/datafu/pig/hash/HasherRand.java x: 2 contributors (all time) y: 51 lines of code build-plugin/src/main/java/org/adrianwalker/multilinestring/EcjMultilineProcessor.java x: 2 contributors (all time) y: 41 lines of code build-plugin/src/main/java/org/adrianwalker/multilinestring/JavacMultilineProcessor.java x: 2 contributors (all time) y: 39 lines of code build-plugin/src/main/java/org/adrianwalker/multilinestring/MultilineProcessor.java x: 3 contributors (all time) y: 33 lines of code datafu-spark/src/main/resources/pyspark_utils/__init__.py x: 1 contributors (all time) y: 1 lines of code datafu-spark/src/main/resources/pyspark_utils/init_spark_context.py x: 1 contributors (all time) y: 3 lines of code site/source/sitemap.xml.builder x: 4 contributors (all time) y: 14 lines of code site/source/stylesheets/all.less x: 4 contributors (all time) y: 53 lines of code datafu-pig/src/main/resources/datafu/count_macros.pig x: 3 contributors (all time) y: 38 lines of code datafu-pig/src/main/resources/datafu/left_outer_join.pig x: 2 contributors (all time) y: 37 lines of code datafu-pig/src/main/resources/datafu/dedup.pig x: 1 contributors (all time) y: 34 lines of code datafu-pig/src/main/java/datafu/pig/stats/HyperLogLogPlusPlus.java x: 5 contributors (all time) y: 152 lines of code buildSrc/src/main/groovy/datafu/autojar/GradleAutojarPlugin.groovy x: 1 contributors (all time) y: 14 lines of code buildSrc/src/main/groovy/datafu/autojar/task/ExtractAutojar.groovy x: 1 contributors (all time) y: 29 lines of code site/source/layouts/_header.erb x: 3 contributors (all time) y: 24 lines of code datafu-hourglass/src/main/java/datafu/hourglass/avro/AvroMultipleInputsUtil.java x: 3 contributors (all time) y: 78 lines of code datafu-pig/src/main/java/datafu/pig/util/ContextualEvalFunc.java x: 3 contributors (all time) y: 44 lines of code datafu-pig/src/main/java/datafu/pig/util/SimpleEvalFunc.java x: 2 contributors (all time) y: 101 lines of code 
datafu-pig/src/main/resources/datafu/tf_idf.pig x: 1 contributors (all time) y: 84 lines of code
datafu-pig/src/main/java/datafu/pig/sessions/SessionCount.java x: 2 contributors (all time) y: 48 lines of code
datafu-pig/src/main/java/datafu/pig/sessions/Sessionize.java x: 3 contributors (all time) y: 107 lines of code
datafu-pig/src/main/java/datafu/pig/util/AliasableEvalFunc.java x: 5 contributors (all time) y: 160 lines of code
datafu-pig/src/main/java/datafu/pig/bags/TupleFromBag.java x: 3 contributors (all time) y: 62 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/AbstractJob.java x: 3 contributors (all time) y: 257 lines of code
datafu-pig/src/main/java/datafu/pig/bags/FirstTupleFromBag.java x: 2 contributors (all time) y: 50 lines of code
datafu-pig/src/main/java/datafu/pig/text/opennlp/TokenizeSimple.java x: 2 contributors (all time) y: 55 lines of code
datafu-pig/src/main/java/datafu/pig/text/opennlp/POSTag.java x: 2 contributors (all time) y: 106 lines of code
datafu-pig/src/main/java/datafu/pig/text/opennlp/SentenceDetect.java x: 2 contributors (all time) y: 74 lines of code
site/lib/pig.rb x: 2 contributors (all time) y: 54 lines of code
datafu-hourglass/find_dupes.rb x: 1 contributors (all time) y: 9 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/fs/PathUtils.java x: 3 contributors (all time) y: 192 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/mapreduce/DistributedCacheHelper.java x: 3 contributors (all time) y: 56 lines of code
datafu-hourglass/overview.html x: 2 contributors (all time) y: 3 lines of code
site/source/blog/index.html.erb x: 2 contributors (all time) y: 36 lines of code
site/source/javascripts/all.js x: 2 contributors (all time) y: 1 lines of code
site/source/layouts/docs.erb x: 2 contributors (all time) y: 33 lines of code
site/source/stylesheets/highlight.css.erb x: 2 contributors (all time) y: 19 lines of code
gradle/resources/rat-output-to-html.xsl x: 1 contributors (all time) y: 153 lines of code
datafu-pig/src/main/java/datafu/pig/bags/ZipBags.java x: 1 contributors (all time) y: 58 lines of code
datafu-pig/src/main/java/datafu/pig/hash/lsh/p_stable/L2LSH.java x: 4 contributors (all time) y: 18 lines of code
datafu-pig/src/main/java/datafu/pig/sampling/SampleByKey.java x: 4 contributors (all time) y: 58 lines of code
datafu-pig/src/main/java/datafu/pig/sampling/WeightedReservoirSample.java x: 3 contributors (all time) y: 181 lines of code
datafu-pig/src/main/java/datafu/pig/util/InUDF.java x: 3 contributors (all time) y: 19 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/fs/DateRange.java x: 2 contributors (all time) y: 20 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/AbstractNonIncrementalJob.java x: 3 contributors (all time) y: 206 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/AbstractPartitionCollapsingIncrementalJob.java x: 3 contributors (all time) y: 349 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/AbstractPartitionPreservingIncrementalJob.java x: 3 contributors (all time) y: 387 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/DateRangeConfigurable.java x: 2 contributors (all time) y: 6 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/ExecutionPlanner.java x: 3 contributors (all time) y: 234 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/FileCleaner.java x: 3 contributors (all time) y: 45 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/IncrementalJob.java x: 2 contributors (all time) y: 86 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/PartitionCollapsingExecutionPlanner.java x: 3 contributors (all time) y: 315 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/PartitionCollapsingIncrementalJob.java x: 3 contributors (all time) y: 109 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/PartitionPreservingExecutionPlanner.java x: 3 contributors (all time) y: 156 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/PartitionPreservingIncrementalJob.java x: 3 contributors (all time) y: 88 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/ReduceEstimator.java x: 2 contributors (all time) y: 130 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/TimeBasedJob.java x: 2 contributors (all time) y: 99 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/TimePartitioner.java x: 2 contributors (all time) y: 78 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/mapreduce/AvroKeyValueIdentityMapper.java x: 2 contributors (all time) y: 17 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/mapreduce/CollapsingCombiner.java x: 3 contributors (all time) y: 152 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/mapreduce/CollapsingMapper.java x: 3 contributors (all time) y: 202 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/mapreduce/CollapsingReducer.java x: 3 contributors (all time) y: 222 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/mapreduce/DelegatingCombiner.java x: 2 contributors (all time) y: 32 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/mapreduce/ObjectReducer.java x: 2 contributors (all time) y: 8 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/mapreduce/PartitioningMapper.java x: 3 contributors (all time) y: 106 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/model/KeyValueCollector.java x: 3 contributors (all time) y: 6 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/schemas/PartitionCollapsingSchemas.java x: 2 contributors (all time) y: 160 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/schemas/PartitionPreservingSchemas.java x: 2 contributors (all time) y: 114 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/schemas/TaskSchemas.java x: 2 contributors (all time) y: 63 lines of code
datafu-pig/src/main/java/datafu/pig/bags/BagConcat.java x: 2 contributors (all time) y: 94 lines of code
datafu-pig/src/main/java/datafu/pig/bags/BagGroup.java x: 5 contributors (all time) y: 117 lines of code
datafu-pig/src/main/java/datafu/pig/bags/BagJoin.java x: 3 contributors (all time) y: 219 lines of code
datafu-pig/src/main/java/datafu/pig/hash/lsh/interfaces/LSH.java x: 3 contributors (all time) y: 16 lines of code
datafu-pig/src/main/java/datafu/pig/hash/lsh/interfaces/LSHCreator.java x: 3 contributors (all time) y: 61 lines of code
datafu-pig/src/main/java/datafu/pig/hash/lsh/metric/Cosine.java x: 3 contributors (all time) y: 14 lines of code
datafu-pig/src/main/java/datafu/pig/hash/lsh/metric/MetricUDF.java x: 3 contributors (all time) y: 95 lines of code
datafu-pig/src/main/java/datafu/pig/hash/lsh/util/DataTypeUtil.java x: 3 contributors (all time) y: 126 lines of code
datafu-pig/src/main/java/datafu/pig/sampling/ReservoirSample.java x: 2 contributors (all time) y: 230 lines of code
datafu-pig/src/main/java/datafu/pig/sampling/SimpleRandomSample.java x: 3 contributors (all time) y: 319 lines of code
datafu-pig/src/main/java/datafu/pig/sampling/SimpleRandomSampleWithReplacementElect.java x: 2 contributors (all time) y: 143 lines of code
datafu-pig/src/main/java/datafu/pig/sampling/SimpleRandomSampleWithReplacementVote.java x: 3 contributors (all time) y: 121 lines of code
datafu-pig/src/main/java/datafu/pig/sets/SetDifference.java x: 3 contributors (all time) y: 140 lines of code
datafu-pig/src/main/java/datafu/pig/util/FieldNotFound.java x: 2 contributors (all time) y: 12 lines of code
datafu-pig/src/main/java/datafu/pig/util/SelectStringFieldByName.java x: 1 contributors (all time) y: 32 lines of code
datafu-pig/src/main/java/datafu/pig/urls/URLInfo.java x: 1 contributors (all time) y: 102 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/StagedOutputJob.java x: 2 contributors (all time) y: 492 lines of code
datafu-pig/src/main/java/datafu/pig/bags/DistinctBy.java x: 2 contributors (all time) y: 98 lines of code
datafu-pig/src/main/java/datafu/pig/bags/Enumerate.java x: 2 contributors (all time) y: 88 lines of code
datafu-pig/src/main/java/datafu/pig/bags/UnorderedPairs.java x: 2 contributors (all time) y: 80 lines of code
datafu-pig/src/main/java/datafu/pig/geo/HaversineDistInMiles.java x: 2 contributors (all time) y: 25 lines of code
datafu-pig/src/main/java/datafu/pig/linkanalysis/PageRank.java x: 2 contributors (all time) y: 312 lines of code
datafu-pig/src/main/java/datafu/pig/linkanalysis/PageRankImpl.java x: 2 contributors (all time) y: 391 lines of code
datafu-pig/src/main/java/datafu/pig/stats/Quantile.java x: 2 contributors (all time) y: 103 lines of code
datafu-pig/src/main/java/datafu/pig/stats/QuantileUtil.java x: 2 contributors (all time) y: 44 lines of code
datafu-pig/src/main/java/datafu/pig/stats/StreamingQuantile.java x: 2 contributors (all time) y: 276 lines of code
datafu-pig/src/main/java/datafu/pig/stats/VAR.java x: 2 contributors (all time) y: 300 lines of code
datafu-pig/src/main/java/datafu/pig/stats/entropy/CondEntropy.java x: 2 contributors (all time) y: 117 lines of code
datafu-pig/src/main/java/datafu/pig/util/AssertUDF.java x: 2 contributors (all time) y: 23 lines of code
datafu-pig/src/main/java/datafu/pig/util/Coalesce.java x: 2 contributors (all time) y: 128 lines of code
datafu-pig/src/main/java/datafu/pig/util/DataFuException.java x: 2 contributors (all time) y: 68 lines of code
datafu-pig/src/main/java/datafu/pig/util/TransposeTupleToBag.java x: 2 contributors (all time) y: 61 lines of code
datafu-pig/src/main/java/datafu/pig/util/Base64Decode.java x: 2 contributors (all time) y: 14 lines of code
datafu-pig/src/main/java/datafu/pig/hash/SHA.java x: 2 contributors (all time) y: 28 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/avro/AvroKeyValueWithMetadataOutputFormat.java x: 1 contributors (all time) y: 21 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/avro/AvroKeyValueWithMetadataRecordWriter.java x: 1 contributors (all time) y: 55 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/avro/AvroMultipleInputsKeyInputFormat.java x: 1 contributors (all time) y: 24 lines of code
datafu-hourglass/src/main/java/datafu/hourglass/jobs/MaxInputDataExceededException.java x: 1 contributors (all time) y: 11 lines of code
datafu-pig/src/main/java/datafu/pig/bags/AppendToBag.java x: 1 contributors (all time) y: 27 lines of code
datafu-pig/src/main/java/datafu/pig/bags/BagSplit.java x: 1 contributors (all time) y: 94 lines of code
datafu-pig/src/main/java/datafu/pig/bags/EmptyBagToNull.java x: 1 contributors (all time) y: 35 lines of code
datafu-pig/src/main/java/datafu/pig/linkanalysis/ProgressIndicator.java x: 1 contributors (all time) y: 5 lines of code
datafu-pig/src/main/java/datafu/pig/sampling/ScoredTuple.java x: 1 contributors (all time) y: 71 lines of code
datafu-pig/src/main/java/datafu/pig/sampling/WeightedSample.java x: 1 contributors (all time) y: 104 lines of code
datafu-pig/src/main/java/datafu/pig/sets/SetIntersect.java x: 1 contributors (all time) y: 75 lines of code
datafu-pig/src/main/java/datafu/pig/stats/DoubleVAR.java x: 1 contributors (all time) y: 288 lines of code
datafu-pig/src/main/java/datafu/pig/stats/FloatVAR.java x: 1 contributors (all time) y: 289 lines of code
datafu-pig/src/main/java/datafu/pig/stats/MarkovPairs.java x: 1 contributors (all time) y: 92 lines of code
datafu-pig/src/main/java/datafu/pig/stats/entropy/ChaoShenEntropyEstimator.java x: 1 contributors (all time) y: 134 lines of code
datafu-pig/src/main/java/datafu/pig/text/opennlp/CachedFile.java x: 1 contributors (all time) y: 18 lines of code
site/source/stylesheets/bootstrap-theme.css x: 1 contributors (all time) y: 340 lines of code
lines of code: min: 1.0 | average: 76.26 | 25th percentile: 16.5 | median: 42.0 | 75th percentile: 102.5 | max: 492.0
contributors (all time): min: 1.0 | average: 2.18 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 3.0 | max: 9.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 2 points

datafu-spark/src/main/resources/pyspark_utils/df_utils.py x: 1 commits (90d) y: 62 lines of code
datafu-spark/src/main/scala/datafu/spark/SparkDFUtils.scala x: 1 commits (90d) y: 337 lines of code
lines of code: min: 62.0 | average: 199.5 | 25th percentile: 62.0 | median: 199.5 | 75th percentile: 337.0 | max: 337.0
commits (90d): min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

File Size vs. Contributors (90 days): 2 points

datafu-spark/src/main/resources/pyspark_utils/df_utils.py x: 1 contributors (90d) y: 62 lines of code
datafu-spark/src/main/scala/datafu/spark/SparkDFUtils.scala x: 1 contributors (90d) y: 337 lines of code
lines of code: min: 62.0 | average: 199.5 | 25th percentile: 62.0 | median: 199.5 | 75th percentile: 337.0 | max: 337.0
contributors (90d): min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0
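
The 90-day summary figures can be reproduced by hand from the two data points (62 and 337 lines of code). The Python sketch below is illustrative only: the `percentile` and `summarize` helpers are hypothetical names, and the percentile rule (position `p * (n + 1) / 100`, clamped to the smallest and largest value) is an assumption chosen because it matches the reported 25th and 75th percentiles of 62.0 and 337.0. The report itself does not state which rule it uses.

```python
from statistics import mean


def percentile(values, p):
    """Percentile via the p * (n + 1) / 100 position rule (assumed, not confirmed)."""
    data = sorted(values)
    n = len(data)
    pos = p * (n + 1) / 100  # 1-based fractional position in the sorted data
    if pos < 1:
        return float(data[0])   # below the first element: clamp to min
    if pos >= n:
        return float(data[-1])  # at or beyond the last element: clamp to max
    lower = int(pos)            # 1-based index of the lower neighbour
    frac = pos - lower
    # Linear interpolation between the two neighbouring values
    return data[lower - 1] + frac * (data[lower] - data[lower - 1])


def summarize(values):
    """Return the same six summary figures the report lists for each axis."""
    return {
        "min": float(min(values)),
        "average": float(mean(values)),
        "25th percentile": percentile(values, 25),
        "median": percentile(values, 50),
        "75th percentile": percentile(values, 75),
        "max": float(max(values)),
    }


# Lines of code for the two files changed in the last 90 days
loc_90d = [62, 337]
for name, value in summarize(loc_90d).items():
    print(f"{name}: {value}")
```

With only two points, the quartiles collapse onto the minimum and maximum under this rule, which is why the 25th percentile equals the min and the 75th percentile equals the max in the 90-day charts.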