apache / datafu
File Age & Freshness

File age measurements show the distribution of file ages (days since the first commit) and the file freshness (days since the latest commit).

Summary
File Change History Overall
File Age Distribution Overall
Days since first update
  • There are 225 files with 17,139 lines of code in files.
    • 225 files that are 366+ days old (17,139 lines of code)
    • 0 files that are 181-365 days old (0 lines of code)
    • 0 files that are 91-180 days old (0 lines of code)
    • 0 files that are 31-90 days old (0 lines of code)
    • 0 files that are 1-30 days old (0 lines of code)
100% | 0% | 0% | 0% | 0%
Legend:
366+
181-365
91-180
31-90
1-30

explore: grouped by folders | grouped by age
File Freshness Distribution Overall
Days since last update
  • There are 225 files with 17,139 lines of code in files.
    • 216 files have been last changed 366+ days ago (16,268 lines of code)
    • 9 files have been last changed 181-365 days ago (871 lines of code)
    • 0 files have been last changed 91-180 days ago (0 lines of code)
    • 0 files have been last changed 31-90 days ago (0 lines of code)
    • 0 files have been last changed 1-30 days ago (0 lines of code)
94% | 5% | 0% | 0% | 0%
Legend:
366+
181-365
91-180
31-90
1-30

explore: grouped by folders | grouped by freshness
File Change History per File Extension
java, erb, markdown, gradle, scala, pig, md, py, gitignore, css, js, rb, sh, groovy, yml, svg, properties, xsl, builder, json, html, less, txt, rdf
File Age Distribution per Extension
Days since first update
366+
181-365
91-180
31-90
1-30
java100% | 0% | 0% | 0% | 0%
scala100% | 0% | 0% | 0% | 0%
css100% | 0% | 0% | 0% | 0%
erb100% | 0% | 0% | 0% | 0%
pig100% | 0% | 0% | 0% | 0%
groovy100% | 0% | 0% | 0% | 0%
xsl100% | 0% | 0% | 0% | 0%
rb100% | 0% | 0% | 0% | 0%
py100% | 0% | 0% | 0% | 0%
less100% | 0% | 0% | 0% | 0%
rdf100% | 0% | 0% | 0% | 0%
builder100% | 0% | 0% | 0% | 0%
html100% | 0% | 0% | 0% | 0%
js100% | 0% | 0% | 0% | 0%
File Freshness Distribution per Extension
Days since last update
366+
181-365
91-180
31-90
1-30
java100% | 0% | 0% | 0% | 0%
css100% | 0% | 0% | 0% | 0%
scala27% | 72% | 0% | 0% | 0%
pig100% | 0% | 0% | 0% | 0%
erb75% | 24% | 0% | 0% | 0%
groovy100% | 0% | 0% | 0% | 0%
xsl100% | 0% | 0% | 0% | 0%
rb100% | 0% | 0% | 0% | 0%
less100% | 0% | 0% | 0% | 0%
rdf100% | 0% | 0% | 0% | 0%
builder100% | 0% | 0% | 0% | 0%
py4% | 95% | 0% | 0% | 0%
html100% | 0% | 0% | 0% | 0%
js100% | 0% | 0% | 0% | 0%
File Change History per Logical Decomposition
primary
primary (file age distribution)
Days since first update
366+
181-365
91-180
31-90
1-30
datafu-pig100% | 0% | 0% | 0% | 0%
datafu-hourglass100% | 0% | 0% | 0% | 0%
datafu-spark100% | 0% | 0% | 0% | 0%
site100% | 0% | 0% | 0% | 0%
buildSrc100% | 0% | 0% | 0% | 0%
gradle100% | 0% | 0% | 0% | 0%
build-plugin100% | 0% | 0% | 0% | 0%
ROOT100% | 0% | 0% | 0% | 0%
primary (file freshness distribution)
Days since last update
366+
181-365
91-180
31-90
1-30
datafu-pig100% | 0% | 0% | 0% | 0%
datafu-hourglass100% | 0% | 0% | 0% | 0%
site90% | 9% | 0% | 0% | 0%
datafu-spark25% | 74% | 0% | 0% | 0%
buildSrc100% | 0% | 0% | 0% | 0%
gradle100% | 0% | 0% | 0% | 0%
build-plugin100% | 0% | 0% | 0% | 0%
ROOT100% | 0% | 0% | 0% | 0%
Oldest Files (Top 50)
File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
bootstrap-theme.css
in site/source/stylesheets
340 - 2014-01-23 2014-01-23 1 1 mhayes@linkedin.com mhayes@linkedin.com
_docs_nav.erb
in site/source/layouts
55 - 2014-01-23 2023-01-24 15 6 mhayes@linkedin.com eyal@apache.org
pig.rb
in site/lib
54 1 2014-01-23 2015-10-16 3 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
all.less
in site/source/stylesheets
53 - 2014-01-23 2019-01-29 5 4 mhayes@linkedin.com eyal@apache.org
layout.erb
in site/source/layouts
51 - 2014-01-23 2015-12-10 3 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
index.markdown.erb
in site/source
49 - 2014-01-23 2019-10-10 10 5 mhayes@linkedin.com eyal@apache.org
blog.erb
in site/source/layouts
41 - 2014-01-23 2018-07-05 5 3 mhayes@linkedin.com mhayes@apache.org
config.rb
in site
37 3 2014-01-23 2021-11-06 12 5 mhayes@linkedin.com eyal@apache.org
index.html.erb
in site/source/blog
36 - 2014-01-23 2014-11-25 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
docs.erb
in site/source/layouts
33 - 2014-01-23 2014-11-25 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
_footer.erb
in site/source/layouts
27 - 2014-01-23 2023-01-24 15 6 mhayes@linkedin.com eyal@apache.org
_header.erb
in site/source/layouts
24 - 2014-01-23 2018-03-17 6 3 mhayes@linkedin.com mhayes@apache.org
highlight.css.erb
in site/source/stylesheets
19 - 2014-01-23 2014-11-25 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
builder
sitemap.xml.builder
in site/source
14 - 2014-01-23 2019-06-02 5 4 mhayes@linkedin.com eyal@apache.org
all.js
in site/source/javascripts
1 - 2014-01-23 2014-11-25 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
PageRankImpl.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
391 33 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
SimpleRandomSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
319 21 2014-03-03 2014-11-24 3 3 mhayes@linkedin.com jghoman@gmail.com
PageRank.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
312 8 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
VAR.java
in datafu-pig/src/main/java/datafu/pig/stats
300 16 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
FloatVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
289 15 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
LongVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
DoubleVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
IntVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
StreamingQuantile.java
in datafu-pig/src/main/java/datafu/pig/stats
276 16 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
ReservoirSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
230 23 2014-03-03 2014-11-24 2 2 mhayes@linkedin.com jghoman@gmail.com
EmpiricalCountEntropy.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
229 20 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
WeightedReservoirSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
181 13 2014-03-03 2014-11-24 3 3 mhayes@linkedin.com jghoman@gmail.com
AliasableEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
160 23 2014-03-03 2016-10-26 6 5 mhayes@linkedin.com eyal@apache.org
HyperLogLogPlusPlus.java
in datafu-pig/src/main/java/datafu/pig/stats
152 13 2014-03-03 2018-07-10 5 5 mhayes@linkedin.com mhayes@apache.org
SimpleRandomSampleWithReplacementElect.java
in datafu-pig/src/main/java/datafu/pig/sampling
143 10 2014-03-03 2014-11-24 2 2 mhayes@linkedin.com jghoman@gmail.com
SetDifference.java
in datafu-pig/src/main/java/datafu/pig/sets
140 8 2014-03-03 2014-11-24 3 3 mhayes@linkedin.com jghoman@gmail.com
ChaoShenEntropyEstimator.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
134 11 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
Coalesce.java
in datafu-pig/src/main/java/datafu/pig/util
128 4 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
SimpleRandomSampleWithReplacementVote.java
in datafu-pig/src/main/java/datafu/pig/sampling
121 2 2014-03-03 2014-11-24 3 3 mhayes@linkedin.com jghoman@gmail.com
BagGroup.java
in datafu-pig/src/main/java/datafu/pig/bags
117 4 2014-03-03 2014-11-24 5 5 mhayes@linkedin.com jghoman@gmail.com
CondEntropy.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
117 7 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
Sessionize.java
in datafu-pig/src/main/java/datafu/pig/sessions
107 5 2014-03-03 2017-06-23 3 3 mhayes@linkedin.com jtolar@yahoo-inc.com
POSTag.java
in datafu-pig/src/main/java/datafu/pig/text/opennlp
106 4 2014-03-03 2015-11-06 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
WeightedSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
104 5 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
Quantile.java
in datafu-pig/src/main/java/datafu/pig/stats
103 5 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
CountEach.java
in datafu-pig/src/main/java/datafu/pig/bags
102 6 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
SimpleEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
101 5 2014-03-03 2017-11-17 4 2 mhayes@linkedin.com flip@infochimps.org
Entropy.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
99 7 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
DistinctBy.java
in datafu-pig/src/main/java/datafu/pig/bags
98 6 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
BagConcat.java
in datafu-pig/src/main/java/datafu/pig/bags
94 4 2014-03-03 2014-11-24 2 2 mhayes@linkedin.com jghoman@gmail.com
BagSplit.java
in datafu-pig/src/main/java/datafu/pig/bags
94 4 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
MarkovPairs.java
in datafu-pig/src/main/java/datafu/pig/stats
92 5 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
Enumerate.java
in datafu-pig/src/main/java/datafu/pig/bags
88 6 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
UnorderedPairs.java
in datafu-pig/src/main/java/datafu/pig/bags
80 2 2014-03-03 2014-08-02 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
SetIntersect.java
in datafu-pig/src/main/java/datafu/pig/sets
75 4 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
Files Not Recently Changed (Top 50)
File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
bootstrap-theme.css
in site/source/stylesheets
340 - 2014-01-23 2014-01-23 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/stats
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/bags
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/random
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/sets
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/urls
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/sessions
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/sampling
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/geo
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/hash
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/util
1 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
ProgressIndicator.java
in datafu-pig/src/main/java/datafu/pig/linkanalysis
5 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
In.java
in datafu-pig/src/main/java/datafu/pig/util
5 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
Assert.java
in datafu-pig/src/main/java/datafu/pig/util
5 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
IntToBool.java
in datafu-pig/src/main/java/datafu/pig/util
8 1 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
BoolToInt.java
in datafu-pig/src/main/java/datafu/pig/util
8 1 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
Multiline.java
in build-plugin/src/main/java/org/adrianwalker/multilinestring
9 - 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
CachedFile.java
in datafu-pig/src/main/java/datafu/pig/text/opennlp
18 1 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
EntropyUtil.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
20 2 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
RandomUUID.java
in datafu-pig/src/main/java/datafu/pig/random
20 2 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
EmpiricalEntropyEstimator.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
25 3 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
Reservoir.java
in datafu-pig/src/main/java/datafu/pig/sampling
25 2 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
NullToEmptyBag.java
in datafu-pig/src/main/java/datafu/pig/bags
26 2 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
AppendToBag.java
in datafu-pig/src/main/java/datafu/pig/bags
27 2 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
EntropyEstimator.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
29 2 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
SetUnion.java
in datafu-pig/src/main/java/datafu/pig/sets
33 1 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
EmptyBagToNull.java
in datafu-pig/src/main/java/datafu/pig/bags
35 2 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
EmptyBagToNullFields.java
in datafu-pig/src/main/java/datafu/pig/bags
58 2 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
ScoredTuple.java
in datafu-pig/src/main/java/datafu/pig/sampling
71 11 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
SetIntersect.java
in datafu-pig/src/main/java/datafu/pig/sets
75 4 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
MarkovPairs.java
in datafu-pig/src/main/java/datafu/pig/stats
92 5 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
BagSplit.java
in datafu-pig/src/main/java/datafu/pig/bags
94 4 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
WeightedSample.java
in datafu-pig/src/main/java/datafu/pig/sampling
104 5 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
ChaoShenEntropyEstimator.java
in datafu-pig/src/main/java/datafu/pig/stats/entropy
134 11 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
IntVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
DoubleVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
LongVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
288 15 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
FloatVAR.java
in datafu-pig/src/main/java/datafu/pig/stats
289 15 2014-03-03 2014-03-03 1 1 mhayes@linkedin.com mhayes@linkedin.com
package-info.java
in datafu-pig/src/main/java/datafu/pig/hash/lsh/util
1 - 2014-05-13 2014-05-13 1 1 cestella@gmail.com cestella@gmail.com
package-info.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
1 - 2014-05-18 2014-05-18 1 1 jarcec@apache.org jarcec@apache.org
package-info.java
in datafu-hourglass/src/main/java/datafu/hourglass/fs
1 - 2014-05-18 2014-05-18 1 1 jarcec@apache.org jarcec@apache.org
package-info.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
1 - 2014-05-18 2014-05-18 1 1 jarcec@apache.org jarcec@apache.org
package-info.java
in datafu-hourglass/src/main/java/datafu/hourglass/model
1 - 2014-05-18 2014-05-18 1 1 jarcec@apache.org jarcec@apache.org
package-info.java
in datafu-hourglass/src/main/java/datafu/hourglass/avro
1 - 2014-05-18 2014-05-18 1 1 jarcec@apache.org jarcec@apache.org
package-info.java
in datafu-hourglass/src/main/java/datafu/hourglass/schemas
1 - 2014-05-18 2014-05-18 1 1 jarcec@apache.org jarcec@apache.org
MaxInputDataExceededException.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
11 2 2014-05-18 2014-05-18 1 1 jarcec@apache.org jarcec@apache.org
AvroKeyValueWithMetadataOutputFormat.java
in datafu-hourglass/src/main/java/datafu/hourglass/avro
21 1 2014-05-18 2014-05-18 1 1 jarcec@apache.org jarcec@apache.org
AvroMultipleInputsKeyInputFormat.java
in datafu-hourglass/src/main/java/datafu/hourglass/avro
24 - 2014-05-18 2014-05-18 1 1 jarcec@apache.org jarcec@apache.org
SHA.java
in datafu-pig/src/main/java/datafu/pig/hash
28 3 2014-03-03 2014-05-18 2 2 mhayes@linkedin.com flip@infochimps.org
Most Recently Created Files (Top 50)
File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
SparkDFUtils.scala
in datafu-spark/src/main/scala/datafu/spark
320 23 2019-07-17 2022-12-14 12 8 oraviv@paypal.com saharon@paypal.com
SparkUDAFs.scala
in datafu-spark/src/main/scala/datafu/spark
176 18 2019-07-17 2019-07-17 1 1 oraviv@paypal.com oraviv@paypal.com
ScalaPythonBridge.scala
in datafu-spark/src/main/scala/datafu/spark
103 8 2019-07-17 2023-01-26 4 4 oraviv@paypal.com arpitbhardwaj09@gmail.com
PythonPathsManager.scala
in datafu-spark/src/main/scala/datafu/spark
102 1 2019-07-17 2022-10-12 2 3 oraviv@paypal.com arpitbhardwaj09@gmail.com
SparkPythonRunner.scala
in datafu-spark/src/main/scala/spark/utils/overwrites
101 4 2019-07-17 2023-01-26 5 4 oraviv@paypal.com arpitbhardwaj09@gmail.com
SparkOverwriteUDAFs.scala
in datafu-spark/src/main/scala/spark/utils/overwrites
89 5 2019-07-17 2022-08-04 3 3 oraviv@paypal.com brahamim@paypal.com
DataFrameOps.scala
in datafu-spark/src/main/scala/datafu/spark
73 1 2019-07-17 2022-12-14 7 5 oraviv@paypal.com saharon@paypal.com
df_utils.py
in datafu-spark/src/main/resources/pyspark_utils
53 14 2019-07-17 2022-12-14 7 4 oraviv@paypal.com saharon@paypal.com
bridge_utils.py
in datafu-spark/src/main/resources/pyspark_utils
37 3 2019-07-17 2023-01-26 2 2 oraviv@paypal.com arpitbhardwaj09@gmail.com
init_spark_context.py
in datafu-spark/src/main/resources/pyspark_utils
3 - 2019-07-17 2019-07-17 1 1 oraviv@paypal.com oraviv@paypal.com
__init__.py
in datafu-spark/src/main/resources/pyspark_utils
1 - 2019-07-17 2019-07-17 1 1 oraviv@paypal.com oraviv@paypal.com
pig
left_outer_join.pig
in datafu-pig/src/main/resources/datafu
37 - 2018-11-28 2019-01-07 2 2 eyal@apache.org mhayes@apache.org
ExtremalTupleByNthField.java
in datafu-pig/src/main/java/datafu/org/apache/pig/piggybank/evaluation
177 19 2018-10-11 2020-03-31 2 2 eyal@apache.org mhayes@apache.org
pig
dedup.pig
in datafu-pig/src/main/resources/datafu
34 - 2018-10-11 2018-10-11 1 1 eyal@apache.org eyal@apache.org
pig
sample_by_keys.pig
in datafu-pig/src/main/resources/datafu
38 - 2018-07-10 2019-01-07 3 3 eallweil@paypal.com mhayes@apache.org
Autojar.groovy
in buildSrc/src/main/groovy/datafu/autojar/task
125 10 2018-07-05 2021-01-11 2 2 mhayes@apache.org venkatasubrahmanian.narayan...
ExtractAutojar.groovy
in buildSrc/src/main/groovy/datafu/autojar/task
29 1 2018-07-05 2018-07-05 1 1 mhayes@apache.org mhayes@apache.org
GradleAutojarPlugin.groovy
in buildSrc/src/main/groovy/datafu/autojar
14 1 2018-07-05 2018-07-05 1 1 mhayes@apache.org mhayes@apache.org
Hasher.java
in datafu-pig/src/main/java/datafu/pig/hash
76 8 2017-12-05 2020-03-31 2 2 flip@infochimps.org mhayes@apache.org
HasherRand.java
in datafu-pig/src/main/java/datafu/pig/hash
51 5 2017-12-05 2020-03-31 4 2 flip@infochimps.org mhayes@apache.org
TupleDiff.java
in datafu-pig/src/main/java/datafu/pig/util
128 8 2017-09-11 2017-09-11 1 1 eallweil@paypal.com eallweil@paypal.com
pig
diff_macros.pig
in datafu-pig/src/main/resources/datafu
34 - 2017-09-11 2019-01-07 3 3 eallweil@paypal.com mhayes@apache.org
pig
tf_idf.pig
in datafu-pig/src/main/resources/datafu
84 - 2017-08-06 2017-08-06 1 1 russell.jurney@gmail.com russell.jurney@gmail.com
pig
count_macros.pig
in datafu-pig/src/main/resources/datafu
38 - 2017-08-03 2019-01-07 3 3 eallweil@paypal.com mhayes@apache.org
CountDistinctUpTo.java
in datafu-pig/src/main/java/datafu/pig/bags
156 19 2016-06-08 2020-03-31 2 2 eallweil@paypal.com mhayes@apache.org
TupleFromBag.java
in datafu-pig/src/main/java/datafu/pig/bags
62 4 2015-08-03 2016-03-03 2 3 matthew.terence.hayes@gmail... eallweil@paypal.com
find_dupes.rb
in datafu-hourglass
9 - 2015-05-23 2015-05-27 2 1 matthew.terence.hayes@gmail... matthew.terence.hayes@gmail...
xsl
rat-output-to-html.xsl
in gradle/resources
153 - 2014-11-23 2014-11-23 1 1 matthew.terence.hayes@gmail... matthew.terence.hayes@gmail...
SelectStringFieldByName.java
in datafu-pig/src/main/java/datafu/pig/util
32 1 2014-11-03 2014-11-03 1 1 russell.jurney@gmail.com russell.jurney@gmail.com
BagJoin.java
in datafu-pig/src/main/java/datafu/pig/bags
219 7 2014-10-20 2014-11-24 3 3 jasonr@netflix.com jghoman@gmail.com
ZipBags.java
in datafu-pig/src/main/java/datafu/pig/bags
58 2 2014-09-15 2014-09-15 1 1 ajoseph4@binghamton.edu ajoseph4@binghamton.edu
URLInfo.java
in datafu-pig/src/main/java/datafu/pig/urls
102 8 2014-08-10 2014-08-10 1 1 jbanerjee1@gmail.com jbanerjee1@gmail.com
35 - 2014-08-04 2014-08-04 1 1 mhayes@linkedin.com mhayes@linkedin.com
Base64Decode.java
in datafu-pig/src/main/java/datafu/pig/util
14 1 2014-07-01 2014-07-07 2 2 russell.jurney@gmail.com matthew.terence.hayes@gmail...
Base64Encode.java
in datafu-pig/src/main/java/datafu/pig/util
14 1 2014-07-01 2014-07-07 2 2 russell.jurney@gmail.com matthew.terence.hayes@gmail...
StagedOutputJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
492 14 2014-05-18 2014-08-02 2 2 jarcec@apache.org matthew.terence.hayes@gmail...
AbstractPartitionPreservingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
387 20 2014-05-18 2014-11-24 3 3 jarcec@apache.org jghoman@gmail.com
AbstractPartitionCollapsingIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
349 24 2014-05-18 2014-11-24 3 3 jarcec@apache.org jghoman@gmail.com
PartitionCollapsingExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
315 21 2014-05-18 2014-11-24 3 3 jarcec@apache.org jghoman@gmail.com
AbstractJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
257 28 2014-05-18 2016-03-09 5 3 jarcec@apache.org matthew.terence.hayes@gmail...
ExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
234 25 2014-05-18 2014-11-24 3 3 jarcec@apache.org jghoman@gmail.com
CollapsingReducer.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
222 12 2014-05-18 2014-11-24 3 3 jarcec@apache.org jghoman@gmail.com
AbstractNonIncrementalJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
206 11 2014-05-18 2014-11-24 3 3 jarcec@apache.org jghoman@gmail.com
CollapsingMapper.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
202 17 2014-05-18 2014-11-24 3 3 jarcec@apache.org jghoman@gmail.com
PathUtils.java
in datafu-hourglass/src/main/java/datafu/hourglass/fs
192 11 2014-05-18 2015-05-23 5 3 jarcec@apache.org matthew.terence.hayes@gmail...
PartitionCollapsingSchemas.java
in datafu-hourglass/src/main/java/datafu/hourglass/schemas
160 10 2014-05-18 2014-11-24 2 2 jarcec@apache.org jghoman@gmail.com
PartitionPreservingExecutionPlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
156 13 2014-05-18 2014-11-24 3 3 jarcec@apache.org jghoman@gmail.com
CollapsingCombiner.java
in datafu-hourglass/src/main/java/datafu/hourglass/mapreduce
152 9 2014-05-18 2014-11-24 3 3 jarcec@apache.org jghoman@gmail.com
ReduceEstimator.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
130 7 2014-05-18 2014-11-24 2 2 jarcec@apache.org jghoman@gmail.com
DateRangePlanner.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
124 1 2014-05-18 2014-11-24 2 2 jarcec@apache.org jghoman@gmail.com
Most Recently Changed Files (Top 50)
File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
ScalaPythonBridge.scala
in datafu-spark/src/main/scala/datafu/spark
103 8 2019-07-17 2023-01-26 4 4 oraviv@paypal.com arpitbhardwaj09@gmail.com
SparkPythonRunner.scala
in datafu-spark/src/main/scala/spark/utils/overwrites
101 4 2019-07-17 2023-01-26 5 4 oraviv@paypal.com arpitbhardwaj09@gmail.com
bridge_utils.py
in datafu-spark/src/main/resources/pyspark_utils
37 3 2019-07-17 2023-01-26 2 2 oraviv@paypal.com arpitbhardwaj09@gmail.com
_docs_nav.erb
in site/source/layouts
55 - 2014-01-23 2023-01-24 15 6 mhayes@linkedin.com eyal@apache.org
_footer.erb
in site/source/layouts
27 - 2014-01-23 2023-01-24 15 6 mhayes@linkedin.com eyal@apache.org
SparkDFUtils.scala
in datafu-spark/src/main/scala/datafu/spark
320 23 2019-07-17 2022-12-14 12 8 oraviv@paypal.com saharon@paypal.com
DataFrameOps.scala
in datafu-spark/src/main/scala/datafu/spark
73 1 2019-07-17 2022-12-14 7 5 oraviv@paypal.com saharon@paypal.com
df_utils.py
in datafu-spark/src/main/resources/pyspark_utils
53 14 2019-07-17 2022-12-14 7 4 oraviv@paypal.com saharon@paypal.com
PythonPathsManager.scala
in datafu-spark/src/main/scala/datafu/spark
102 1 2019-07-17 2022-10-12 2 3 oraviv@paypal.com arpitbhardwaj09@gmail.com
SparkOverwriteUDAFs.scala
in datafu-spark/src/main/scala/spark/utils/overwrites
89 5 2019-07-17 2022-08-04 3 3 oraviv@paypal.com brahamim@paypal.com
config.rb
in site
37 3 2014-01-23 2021-11-06 12 5 mhayes@linkedin.com eyal@apache.org
Autojar.groovy
in buildSrc/src/main/groovy/datafu/autojar/task
125 10 2018-07-05 2021-01-11 2 2 mhayes@apache.org venkatasubrahmanian.narayan...
ExtremalTupleByNthField.java
in datafu-pig/src/main/java/datafu/org/apache/pig/piggybank/evaluation
177 19 2018-10-11 2020-03-31 2 2 eyal@apache.org mhayes@apache.org
CountDistinctUpTo.java
in datafu-pig/src/main/java/datafu/pig/bags
156 19 2016-06-08 2020-03-31 2 2 eallweil@paypal.com mhayes@apache.org
Hasher.java
in datafu-pig/src/main/java/datafu/pig/hash
76 8 2017-12-05 2020-03-31 2 2 flip@infochimps.org mhayes@apache.org
HasherRand.java
in datafu-pig/src/main/java/datafu/pig/hash
51 5 2017-12-05 2020-03-31 4 2 flip@infochimps.org mhayes@apache.org
EcjMultilineProcessor.java
in build-plugin/src/main/java/org/adrianwalker/multilinestring
41 2 2014-03-03 2020-02-06 2 2 mhayes@linkedin.com mhayes@apache.org
JavacMultilineProcessor.java
in build-plugin/src/main/java/org/adrianwalker/multilinestring
39 2 2014-03-03 2020-02-06 2 2 mhayes@linkedin.com mhayes@apache.org
MultilineProcessor.java
in build-plugin/src/main/java/org/adrianwalker/multilinestring
33 2 2014-03-03 2020-02-06 7 3 mhayes@linkedin.com mhayes@apache.org
index.markdown.erb
in site/source
49 - 2014-01-23 2019-10-10 10 5 mhayes@linkedin.com eyal@apache.org
SparkUDAFs.scala
in datafu-spark/src/main/scala/datafu/spark
176 18 2019-07-17 2019-07-17 1 1 oraviv@paypal.com oraviv@paypal.com
init_spark_context.py
in datafu-spark/src/main/resources/pyspark_utils
3 - 2019-07-17 2019-07-17 1 1 oraviv@paypal.com oraviv@paypal.com
__init__.py
in datafu-spark/src/main/resources/pyspark_utils
1 - 2019-07-17 2019-07-17 1 1 oraviv@paypal.com oraviv@paypal.com
builder
sitemap.xml.builder
in site/source
14 - 2014-01-23 2019-06-02 5 4 mhayes@linkedin.com eyal@apache.org
all.less
in site/source/stylesheets
53 - 2014-01-23 2019-01-29 5 4 mhayes@linkedin.com eyal@apache.org
pig
count_macros.pig
in datafu-pig/src/main/resources/datafu
38 - 2017-08-03 2019-01-07 3 3 eallweil@paypal.com mhayes@apache.org
pig
sample_by_keys.pig
in datafu-pig/src/main/resources/datafu
38 - 2018-07-10 2019-01-07 3 3 eallweil@paypal.com mhayes@apache.org
pig
left_outer_join.pig
in datafu-pig/src/main/resources/datafu
37 - 2018-11-28 2019-01-07 2 2 eyal@apache.org mhayes@apache.org
pig
diff_macros.pig
in datafu-pig/src/main/resources/datafu
34 - 2017-09-11 2019-01-07 3 3 eallweil@paypal.com mhayes@apache.org
pig
dedup.pig
in datafu-pig/src/main/resources/datafu
34 - 2018-10-11 2018-10-11 1 1 eyal@apache.org eyal@apache.org
HyperLogLogPlusPlus.java
in datafu-pig/src/main/java/datafu/pig/stats
152 13 2014-03-03 2018-07-10 5 5 mhayes@linkedin.com mhayes@apache.org
blog.erb
in site/source/layouts
41 - 2014-01-23 2018-07-05 5 3 mhayes@linkedin.com mhayes@apache.org
ExtractAutojar.groovy
in buildSrc/src/main/groovy/datafu/autojar/task
29 1 2018-07-05 2018-07-05 1 1 mhayes@apache.org mhayes@apache.org
GradleAutojarPlugin.groovy
in buildSrc/src/main/groovy/datafu/autojar
14 1 2018-07-05 2018-07-05 1 1 mhayes@apache.org mhayes@apache.org
_header.erb
in site/source/layouts
24 - 2014-01-23 2018-03-17 6 3 mhayes@linkedin.com mhayes@apache.org
AvroMultipleInputsUtil.java
in datafu-hourglass/src/main/java/datafu/hourglass/avro
78 3 2014-05-18 2018-01-29 4 3 jarcec@apache.org mhayes@apache.org
SimpleEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
101 5 2014-03-03 2017-11-17 4 2 mhayes@linkedin.com flip@infochimps.org
ContextualEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
44 7 2014-03-03 2017-11-17 6 3 mhayes@linkedin.com flip@infochimps.org
TupleDiff.java
in datafu-pig/src/main/java/datafu/pig/util
128 8 2017-09-11 2017-09-11 1 1 eallweil@paypal.com eallweil@paypal.com
pig
tf_idf.pig
in datafu-pig/src/main/resources/datafu
84 - 2017-08-06 2017-08-06 1 1 russell.jurney@gmail.com russell.jurney@gmail.com
Sessionize.java
in datafu-pig/src/main/java/datafu/pig/sessions
107 5 2014-03-03 2017-06-23 3 3 mhayes@linkedin.com jtolar@yahoo-inc.com
SessionCount.java
in datafu-pig/src/main/java/datafu/pig/sessions
48 4 2014-03-03 2017-06-23 2 2 mhayes@linkedin.com jtolar@yahoo-inc.com
AliasableEvalFunc.java
in datafu-pig/src/main/java/datafu/pig/util
160 23 2014-03-03 2016-10-26 6 5 mhayes@linkedin.com eyal@apache.org
AbstractJob.java
in datafu-hourglass/src/main/java/datafu/hourglass/jobs
257 28 2014-05-18 2016-03-09 5 3 jarcec@apache.org matthew.terence.hayes@gmail...
TupleFromBag.java
in datafu-pig/src/main/java/datafu/pig/bags
62 4 2015-08-03 2016-03-03 2 3 matthew.terence.hayes@gmail... eallweil@paypal.com
FirstTupleFromBag.java
in datafu-pig/src/main/java/datafu/pig/bags
50 5 2014-03-03 2016-02-05 2 2 mhayes@linkedin.com eyal_allweil@yahoo.com
layout.erb
in site/source/layouts
51 - 2014-01-23 2015-12-10 3 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
TokenizeSimple.java
in datafu-pig/src/main/java/datafu/pig/text/opennlp
55 2 2014-03-03 2015-11-07 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
POSTag.java
in datafu-pig/src/main/java/datafu/pig/text/opennlp
106 4 2014-03-03 2015-11-06 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...
SentenceDetect.java
in datafu-pig/src/main/java/datafu/pig/text/opennlp
74 4 2014-03-03 2015-11-06 2 2 mhayes@linkedin.com matthew.terence.hayes@gmail...