apache / incubator-stormcrawler
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 6% | 23% | 39% | 30%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
java0% | 6% | 24% | 39% | 29%
flux0% | 0% | 0% | 100% | 0%
yaml0% | 0% | 0% | 0% | 100%
xml0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
core0% | 6% | 24% | 39% | 29%
archetype0% | 0% | 0% | 52% | 47%
Longest Files (Top 50)
File# lines# units
FetcherBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
714 23
HttpProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol/okhttp
475 10
SimpleFetcherBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
397 7
JSoupParserBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
368 7
BasicURLNormalizer.java
in core/src/main/java/org/apache/stormcrawler/filtering/basic
297 8
SiteMapParserBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
287 7
HttpProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol/httpclient
283 8
FastURLFilter.java
in core/src/main/java/org/apache/stormcrawler/filtering/regex
245 12
AbstractIndexerBolt.java
in core/src/main/java/org/apache/stormcrawler/indexing
208 15
DelegatorProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol
192 12
RegexURLNormalizer.java
in core/src/main/java/org/apache/stormcrawler/filtering/regex
186 6
FeedParserBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
184 5
XPathFilter.java
in core/src/main/java/org/apache/stormcrawler/parse/filter
175 7
Metadata.java
in core/src/main/java/org/apache/stormcrawler
172 23
FileSpout.java
in core/src/main/java/org/apache/stormcrawler/spout
169 14
AbstractStatusUpdaterBolt.java
in core/src/main/java/org/apache/stormcrawler/persistence
166 5
AbstractQueryingSpout.java
in core/src/main/java/org/apache/stormcrawler/persistence
166 15
CharsetIdentification.java
in core/src/main/java/org/apache/stormcrawler/util
157 8
TextExtractor.java
in core/src/main/java/org/apache/stormcrawler/parse
155 7
HttpRobotRulesParser.java
in core/src/main/java/org/apache/stormcrawler/protocol
151 5
URLFilters.java
in core/src/main/java/org/apache/stormcrawler/filtering
144 7
DefaultScheduler.java
in core/src/main/java/org/apache/stormcrawler/persistence
138 6
ConfUtils.java
in core/src/main/java/org/apache/stormcrawler/util
138 15
MultiProxyManager.java
in core/src/main/java/org/apache/stormcrawler/proxy
137 9
ParseFilters.java
in core/src/main/java/org/apache/stormcrawler/parse
134 10
AbstractHttpProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol
127 9
CollectionTagger.java
in core/src/main/java/org/apache/stormcrawler/parse/filter
125 10
InitialisationUtil.java
in core/src/main/java/org/apache/stormcrawler/util
125 10
AdaptiveScheduler.java
in core/src/main/java/org/apache/stormcrawler/persistence
124 2
MemorySpout.java
in core/src/main/java/org/apache/stormcrawler/spout
122 10
flux
crawler.flux
in archetype/src/main/resources/archetype-resources
115 -
URLPartitionerBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
115 3
JSoupFilters.java
in core/src/main/java/org/apache/stormcrawler/parse
112 8
SchedulingURLBuffer.java
in core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer
112 6
RobotRulesParser.java
in core/src/main/java/org/apache/stormcrawler/protocol
111 6
FileResponse.java
in core/src/main/java/org/apache/stormcrawler/protocol/file
107 5
RegexURLFilterBase.java
in core/src/main/java/org/apache/stormcrawler/filtering/regex
107 4
Protocol.java
in core/src/main/java/org/apache/stormcrawler/protocol
105 1
SCProxy.java
in core/src/main/java/org/apache/stormcrawler/proxy
104 15
RobotsTags.java
in core/src/main/java/org/apache/stormcrawler/util
101 10
URLUtil.java
in core/src/main/java/org/apache/stormcrawler/util
100 8
RemoteDriverProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol/selenium
90 3
CookieConverter.java
in core/src/main/java/org/apache/stormcrawler/util
88 2
LDJsonParseFilter.java
in core/src/main/java/org/apache/stormcrawler/parse/filter
87 6
XPathFilter.java
in core/src/main/java/org/apache/stormcrawler/jsoup
87 6
crawler-default.yaml
in core/src/main/resources
85 -
MetadataTransfer.java
in core/src/main/java/org/apache/stormcrawler/util
84 5
HostURLFilter.java
in core/src/main/java/org/apache/stormcrawler/filtering/host
80 1
ParseResult.java
in core/src/main/java/org/apache/stormcrawler/parse
79 13
LDJsonParseFilter.java
in core/src/main/java/org/apache/stormcrawler/jsoup
79 5
Files With Most Units (Top 50)
File# lines# units
Metadata.java
in core/src/main/java/org/apache/stormcrawler
172 23
FetcherBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
714 23
AbstractIndexerBolt.java
in core/src/main/java/org/apache/stormcrawler/indexing
208 15
AbstractQueryingSpout.java
in core/src/main/java/org/apache/stormcrawler/persistence
166 15
ConfUtils.java
in core/src/main/java/org/apache/stormcrawler/util
138 15
SCProxy.java
in core/src/main/java/org/apache/stormcrawler/proxy
104 15
FileSpout.java
in core/src/main/java/org/apache/stormcrawler/spout
169 14
ParseResult.java
in core/src/main/java/org/apache/stormcrawler/parse
79 13
RobotRules.java
in core/src/main/java/org/apache/stormcrawler/protocol
60 13
DelegatorProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol
192 12
FastURLFilter.java
in core/src/main/java/org/apache/stormcrawler/filtering/regex
245 12
ParseFilters.java
in core/src/main/java/org/apache/stormcrawler/parse
134 10
CollectionTagger.java
in core/src/main/java/org/apache/stormcrawler/parse/filter
125 10
ParseData.java
in core/src/main/java/org/apache/stormcrawler/parse
45 10
InitialisationUtil.java
in core/src/main/java/org/apache/stormcrawler/util
125 10
RobotsTags.java
in core/src/main/java/org/apache/stormcrawler/util
101 10
HttpProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol/okhttp
475 10
MemorySpout.java
in core/src/main/java/org/apache/stormcrawler/spout
122 10
Outlink.java
in core/src/main/java/org/apache/stormcrawler/parse
35 9
MultiProxyManager.java
in core/src/main/java/org/apache/stormcrawler/proxy
137 9
AbstractHttpProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol
127 9
JSoupFilters.java
in core/src/main/java/org/apache/stormcrawler/parse
112 8
CharsetIdentification.java
in core/src/main/java/org/apache/stormcrawler/util
157 8
URLUtil.java
in core/src/main/java/org/apache/stormcrawler/util
100 8
HttpProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol/httpclient
283 8
BasicURLNormalizer.java
in core/src/main/java/org/apache/stormcrawler/filtering/basic
297 8
XPathFilter.java
in core/src/main/java/org/apache/stormcrawler/parse/filter
175 7
TextExtractor.java
in core/src/main/java/org/apache/stormcrawler/parse
155 7
AbstractURLBuffer.java
in core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer
64 7
JSoupParserBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
368 7
SiteMapParserBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
287 7
SimpleFetcherBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
397 7
URLFilters.java
in core/src/main/java/org/apache/stormcrawler/filtering
144 7
LDJsonParseFilter.java
in core/src/main/java/org/apache/stormcrawler/parse/filter
87 6
DocumentFragmentBuilder.java
in core/src/main/java/org/apache/stormcrawler/parse
69 6
XPathFilter.java
in core/src/main/java/org/apache/stormcrawler/jsoup
87 6
DefaultScheduler.java
in core/src/main/java/org/apache/stormcrawler/persistence
138 6
SchedulingURLBuffer.java
in core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer
112 6
FileProtocol.java
in core/src/main/java/org/apache/stormcrawler/protocol/file
32 6
RobotRulesParser.java
in core/src/main/java/org/apache/stormcrawler/protocol
111 6
RegexURLNormalizer.java
in core/src/main/java/org/apache/stormcrawler/filtering/regex
186 6
LDJsonParseFilter.java
in core/src/main/java/org/apache/stormcrawler/jsoup
79 5
PriorityURLBuffer.java
in core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer
73 5
AbstractStatusUpdaterBolt.java
in core/src/main/java/org/apache/stormcrawler/persistence
166 5
MetadataTransfer.java
in core/src/main/java/org/apache/stormcrawler/util
84 5
Configurable.java
in core/src/main/java/org/apache/stormcrawler/util
41 5
URLFilterBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
78 5
StatusEmitterBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
73 5
FeedParserBolt.java
in core/src/main/java/org/apache/stormcrawler/bolt
184 5
ProtocolFactory.java
in core/src/main/java/org/apache/stormcrawler/protocol
71 5
Files With Long Lines (Top 4)

There are 4 files with lines longer than 120 characters. In total, there are 4 long lines.

File# lines# units# long lines
archetype-metadata.xml
in archetype/src/main/resources/META-INF/maven
37 - 1
InitialisationUtil.java
in core/src/main/java/org/apache/stormcrawler/util
125 10 1
SCProxy.java
in core/src/main/java/org/apache/stormcrawler/proxy
104 15 1
RobotRulesParser.java
in core/src/main/java/org/apache/stormcrawler/protocol
111 6 1
Correlations

File Size vs. Commits (all time): 110 points

core/src/main/java/org/apache/stormcrawler/filtering/sitemap/SitemapFilter.java x: 4 commits (all time) y: 49 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/FastURLFilter.java x: 5 commits (all time) y: 245 lines of code core/src/main/java/org/apache/stormcrawler/persistence/DefaultScheduler.java x: 5 commits (all time) y: 138 lines of code core/src/main/java/org/apache/stormcrawler/bolt/FetcherBolt.java x: 4 commits (all time) y: 714 lines of code core/src/main/java/org/apache/stormcrawler/bolt/JSoupParserBolt.java x: 7 commits (all time) y: 368 lines of code core/src/main/java/org/apache/stormcrawler/bolt/SimpleFetcherBolt.java x: 4 commits (all time) y: 397 lines of code core/src/main/java/org/apache/stormcrawler/filtering/basic/BasicURLNormalizer.java x: 3 commits (all time) y: 297 lines of code core/src/main/java/org/apache/stormcrawler/persistence/AbstractStatusUpdaterBolt.java x: 3 commits (all time) y: 166 lines of code core/src/main/java/org/apache/stormcrawler/protocol/ProtocolResponse.java x: 3 commits (all time) y: 37 lines of code core/src/main/java/org/apache/stormcrawler/util/CharsetIdentification.java x: 3 commits (all time) y: 157 lines of code archetype/src/main/resources/META-INF/maven/archetype-metadata.xml x: 11 commits (all time) y: 37 lines of code core/src/main/java/org/apache/stormcrawler/parse/filter/CollectionTagger.java x: 4 commits (all time) y: 125 lines of code core/src/main/java/org/apache/stormcrawler/parse/ParseFilter.java x: 4 commits (all time) y: 13 lines of code core/src/main/java/org/apache/stormcrawler/util/URLUtil.java x: 4 commits (all time) y: 100 lines of code core/src/main/java/org/apache/stormcrawler/bolt/FeedParserBolt.java x: 4 commits (all time) y: 184 lines of code core/src/main/java/org/apache/stormcrawler/bolt/SiteMapParserBolt.java x: 5 commits (all time) y: 287 lines of code core/src/main/java/org/apache/stormcrawler/parse/ParseFilters.java x: 4 commits (all time) y: 134 lines of code core/src/main/java/org/apache/stormcrawler/protocol/httpclient/HttpProtocol.java x: 4 commits (all time) y: 283 lines of code core/src/main/java/org/apache/stormcrawler/protocol/okhttp/HttpProtocol.java x: 4 commits (all time) y: 475 lines of code core/src/main/java/org/apache/stormcrawler/spout/FileSpout.java x: 5 commits (all time) y: 169 lines of code core/src/main/java/org/apache/stormcrawler/filtering/basic/BasicURLFilter.java x: 3 commits (all time) y: 58 lines of code core/src/main/java/org/apache/stormcrawler/persistence/Scheduler.java x: 3 commits (all time) y: 23 lines of code core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer/URLBuffer.java x: 3 commits (all time) y: 42 lines of code core/src/main/java/org/apache/stormcrawler/util/InitialisationUtil.java x: 3 commits (all time) y: 125 lines of code core/src/main/java/org/apache/stormcrawler/util/MetadataTransfer.java x: 3 commits (all time) y: 84 lines of code core/src/main/resources/crawler-default.yaml x: 98 commits (all time) y: 85 lines of code archetype/src/main/resources/archetype-resources/crawler-conf.yaml x: 60 commits (all time) y: 62 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/RegexURLNormalizer.java x: 3 commits (all time) y: 186 lines of code core/src/main/java/org/apache/stormcrawler/parse/JSoupFilters.java x: 3 commits (all time) y: 112 lines of code core/src/main/java/org/apache/stormcrawler/parse/filter/MimeTypeNormalization.java x: 3 commits (all time) y: 36 lines of code core/src/main/java/org/apache/stormcrawler/protocol/file/FileResponse.java x: 3 commits (all time) y: 107 lines of code core/src/main/java/org/apache/stormcrawler/protocol/selenium/SeleniumProtocol.java x: 3 commits (all time) y: 68 lines of code core/src/main/java/org/apache/stormcrawler/proxy/MultiProxyManager.java x: 3 commits (all time) y: 137 lines of code core/src/main/java/org/apache/stormcrawler/util/CookieConverter.java x: 3 commits (all time) y: 88 lines of code core/src/main/java/org/apache/stormcrawler/ConfigurableTopology.java x: 2 commits (all time) y: 66 lines of code core/src/main/java/org/apache/stormcrawler/Constants.java x: 2 commits (all time) y: 19 lines of code core/src/main/java/org/apache/stormcrawler/JSONResource.java x: 2 commits (all time) y: 16 lines of code core/src/main/java/org/apache/stormcrawler/Metadata.java x: 2 commits (all time) y: 172 lines of code core/src/main/java/org/apache/stormcrawler/bolt/StatusEmitterBolt.java x: 2 commits (all time) y: 73 lines of code core/src/main/java/org/apache/stormcrawler/bolt/URLFilterBolt.java x: 2 commits (all time) y: 78 lines of code core/src/main/java/org/apache/stormcrawler/bolt/URLPartitionerBolt.java x: 2 commits (all time) y: 115 lines of code core/src/main/java/org/apache/stormcrawler/filtering/URLFilter.java x: 2 commits (all time) y: 13 lines of code core/src/main/java/org/apache/stormcrawler/filtering/URLFilters.java x: 2 commits (all time) y: 144 lines of code core/src/main/java/org/apache/stormcrawler/filtering/basic/SelfURLFilter.java x: 2 commits (all time) y: 23 lines of code core/src/main/java/org/apache/stormcrawler/filtering/depth/MaxDepthFilter.java x: 2 commits (all time) y: 49 lines of code core/src/main/java/org/apache/stormcrawler/filtering/host/HostURLFilter.java x: 2 commits (all time) y: 80 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/RegexRule.java x: 2 commits (all time) y: 11 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/RegexURLFilter.java x: 2 commits (all time) y: 22 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/RegexURLFilterBase.java x: 2 commits (all time) y: 107 lines of code core/src/main/java/org/apache/stormcrawler/filtering/robots/RobotsFilter.java x: 2 commits (all time) y: 51 lines of code core/src/main/java/org/apache/stormcrawler/indexing/AbstractIndexerBolt.java x: 2 commits (all time) y: 208 lines of code core/src/main/java/org/apache/stormcrawler/indexing/DummyIndexer.java x: 2 commits (all time) y: 27 lines of code core/src/main/java/org/apache/stormcrawler/indexing/StdOutIndexer.java x: 2 commits (all time) y: 54 lines of code core/src/main/java/org/apache/stormcrawler/jsoup/LinkParseFilter.java x: 2 commits (all time) y: 74 lines of code core/src/main/java/org/apache/stormcrawler/jsoup/XPathFilter.java x: 2 commits (all time) y: 87 lines of code core/src/main/java/org/apache/stormcrawler/parse/DocumentFragmentBuilder.java x: 2 commits (all time) y: 69 lines of code core/src/main/java/org/apache/stormcrawler/parse/Outlink.java x: 2 commits (all time) y: 35 lines of code core/src/main/java/org/apache/stormcrawler/parse/ParseData.java x: 2 commits (all time) y: 45 lines of code core/src/main/java/org/apache/stormcrawler/parse/filter/CommaSeparatedToMultivaluedMetadata.java x: 2 commits (all time) y: 42 lines of code core/src/main/java/org/apache/stormcrawler/parse/filter/DebugParseFilter.java x: 2 commits (all time) y: 39 lines of code core/src/main/java/org/apache/stormcrawler/parse/filter/MD5SignatureParseFilter.java x: 2 commits (all time) y: 57 lines of code core/src/main/java/org/apache/stormcrawler/parse/filter/XPathFilter.java x: 2 commits (all time) y: 175 lines of code core/src/main/java/org/apache/stormcrawler/persistence/AbstractQueryingSpout.java x: 2 commits (all time) y: 166 lines of code core/src/main/java/org/apache/stormcrawler/persistence/EmptyQueueListener.java x: 2 commits (all time) y: 5 lines of code core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer/AbstractURLBuffer.java x: 2 commits (all time) y: 64 lines of code core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer/SchedulingURLBuffer.java x: 2 commits (all time) y: 112 lines of code core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer/SimpleURLBuffer.java x: 2 commits (all time) y: 43 lines of code core/src/main/java/org/apache/stormcrawler/protocol/AbstractHttpProtocol.java x: 2 commits (all time) y: 127 lines of code core/src/main/java/org/apache/stormcrawler/protocol/DelegatorProtocol.java x: 2 commits (all time) y: 192 lines of code core/src/main/java/org/apache/stormcrawler/protocol/HttpHeaders.java x: 2 commits (all time) y: 33 lines of code core/src/main/java/org/apache/stormcrawler/protocol/HttpRobotRulesParser.java x: 2 commits (all time) y: 151 lines of code core/src/main/java/org/apache/stormcrawler/protocol/Protocol.java x: 2 commits (all time) y: 105 lines of code core/src/main/java/org/apache/stormcrawler/protocol/RobotRules.java x: 2 commits (all time) y: 60 lines of code core/src/main/java/org/apache/stormcrawler/protocol/selenium/RemoteDriverProtocol.java x: 2 commits (all time) y: 90 lines of code core/src/main/java/org/apache/stormcrawler/proxy/ProxyManager.java x: 2 commits (all time) y: 7 lines of code core/src/main/java/org/apache/stormcrawler/proxy/SCProxy.java x: 2 commits (all time) y: 104 lines of code core/src/main/java/org/apache/stormcrawler/spout/MemorySpout.java x: 2 commits (all time) y: 122 lines of code core/src/main/java/org/apache/stormcrawler/util/ConfUtils.java x: 2 commits (all time) y: 138 lines of code core/src/main/java/org/apache/stormcrawler/util/RobotsTags.java x: 2 commits (all time) y: 101 lines of code archetype/src/main/resources/archetype-resources/crawler.flux x: 13 commits (all time) y: 115 lines of code
714.0
lines of code
  min: 3.0
  average: 98.25
  25th percentile: 36.0
  median: 72.0
  75th percentile: 125.0
  max: 714.0
0 98.0
commits (all time)
min: 2.0 | average: 4.11 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 3.0 | max: 98.0

File Size vs. Contributors (all time): 110 points

core/src/main/java/org/apache/stormcrawler/filtering/sitemap/SitemapFilter.java x: 3 contributors (all time) y: 49 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/FastURLFilter.java x: 3 contributors (all time) y: 245 lines of code core/src/main/java/org/apache/stormcrawler/persistence/DefaultScheduler.java x: 3 contributors (all time) y: 138 lines of code core/src/main/java/org/apache/stormcrawler/bolt/FetcherBolt.java x: 2 contributors (all time) y: 714 lines of code core/src/main/java/org/apache/stormcrawler/bolt/JSoupParserBolt.java x: 3 contributors (all time) y: 368 lines of code core/src/main/java/org/apache/stormcrawler/bolt/SimpleFetcherBolt.java x: 2 contributors (all time) y: 397 lines of code core/src/main/java/org/apache/stormcrawler/filtering/basic/BasicURLNormalizer.java x: 2 contributors (all time) y: 297 lines of code core/src/main/java/org/apache/stormcrawler/persistence/AbstractStatusUpdaterBolt.java x: 2 contributors (all time) y: 166 lines of code core/src/main/java/org/apache/stormcrawler/protocol/ProtocolResponse.java x: 2 contributors (all time) y: 37 lines of code core/src/main/java/org/apache/stormcrawler/util/CharsetIdentification.java x: 2 contributors (all time) y: 157 lines of code archetype/src/main/resources/META-INF/maven/archetype-metadata.xml x: 3 contributors (all time) y: 37 lines of code core/src/main/java/org/apache/stormcrawler/parse/filter/CollectionTagger.java x: 3 contributors (all time) y: 125 lines of code core/src/main/java/org/apache/stormcrawler/parse/ParseFilter.java x: 3 contributors (all time) y: 13 lines of code core/src/main/java/org/apache/stormcrawler/util/URLUtil.java x: 3 contributors (all time) y: 100 lines of code core/src/main/java/org/apache/stormcrawler/bolt/FeedParserBolt.java x: 3 contributors (all time) y: 184 lines of code core/src/main/java/org/apache/stormcrawler/bolt/SiteMapParserBolt.java x: 4 contributors (all time) y: 287 lines of code core/src/main/java/org/apache/stormcrawler/parse/ParseFilters.java x: 3 contributors (all time) y: 134 lines of code core/src/main/java/org/apache/stormcrawler/protocol/httpclient/HttpProtocol.java x: 3 contributors (all time) y: 283 lines of code core/src/main/java/org/apache/stormcrawler/protocol/okhttp/HttpProtocol.java x: 3 contributors (all time) y: 475 lines of code core/src/main/java/org/apache/stormcrawler/spout/FileSpout.java x: 3 contributors (all time) y: 169 lines of code core/src/main/java/org/apache/stormcrawler/filtering/basic/BasicURLFilter.java x: 2 contributors (all time) y: 58 lines of code core/src/main/java/org/apache/stormcrawler/persistence/Scheduler.java x: 2 contributors (all time) y: 23 lines of code core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer/URLBuffer.java x: 2 contributors (all time) y: 42 lines of code core/src/main/java/org/apache/stormcrawler/util/InitialisationUtil.java x: 2 contributors (all time) y: 125 lines of code core/src/main/java/org/apache/stormcrawler/util/MetadataTransfer.java x: 2 contributors (all time) y: 84 lines of code core/src/main/resources/crawler-default.yaml x: 11 contributors (all time) y: 85 lines of code archetype/src/main/resources/archetype-resources/crawler-conf.yaml x: 7 contributors (all time) y: 62 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/RegexURLNormalizer.java x: 3 contributors (all time) y: 186 lines of code core/src/main/java/org/apache/stormcrawler/parse/JSoupFilters.java x: 3 contributors (all time) y: 112 lines of code core/src/main/java/org/apache/stormcrawler/parse/TextExtractor.java x: 3 contributors (all time) y: 155 lines of code core/src/main/java/org/apache/stormcrawler/parse/filter/MimeTypeNormalization.java x: 3 contributors (all time) y: 36 lines of code core/src/main/java/org/apache/stormcrawler/protocol/file/FileResponse.java x: 3 contributors (all time) y: 107 lines of code core/src/main/java/org/apache/stormcrawler/protocol/selenium/SeleniumProtocol.java x: 3 contributors (all time) y: 68 lines of code core/src/main/java/org/apache/stormcrawler/proxy/MultiProxyManager.java x: 3 contributors (all time) y: 137 lines of code core/src/main/java/org/apache/stormcrawler/util/CookieConverter.java x: 3 contributors (all time) y: 88 lines of code core/src/main/java/org/apache/stormcrawler/ConfigurableTopology.java x: 2 contributors (all time) y: 66 lines of code core/src/main/java/org/apache/stormcrawler/Constants.java x: 2 contributors (all time) y: 19 lines of code core/src/main/java/org/apache/stormcrawler/JSONResource.java x: 2 contributors (all time) y: 16 lines of code core/src/main/java/org/apache/stormcrawler/Metadata.java x: 2 contributors (all time) y: 172 lines of code core/src/main/java/org/apache/stormcrawler/bolt/StatusEmitterBolt.java x: 2 contributors (all time) y: 73 lines of code core/src/main/java/org/apache/stormcrawler/bolt/URLFilterBolt.java x: 2 contributors (all time) y: 78 lines of code core/src/main/java/org/apache/stormcrawler/bolt/URLPartitionerBolt.java x: 2 contributors (all time) y: 115 lines of code core/src/main/java/org/apache/stormcrawler/filtering/URLFilter.java x: 2 contributors (all time) y: 13 lines of code core/src/main/java/org/apache/stormcrawler/filtering/URLFilters.java x: 2 contributors (all time) y: 144 lines of code core/src/main/java/org/apache/stormcrawler/filtering/depth/MaxDepthFilter.java x: 2 contributors (all time) y: 49 lines of code core/src/main/java/org/apache/stormcrawler/filtering/host/HostURLFilter.java x: 2 contributors (all time) y: 80 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/RegexRule.java x: 2 contributors (all time) y: 11 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/RegexURLFilter.java x: 2 contributors (all time) y: 22 lines of code core/src/main/java/org/apache/stormcrawler/filtering/regex/RegexURLFilterBase.java x: 2 contributors (all time) y: 107 lines of code core/src/main/java/org/apache/stormcrawler/filtering/robots/RobotsFilter.java x: 2 contributors (all time) y: 51 lines of code core/src/main/java/org/apache/stormcrawler/indexing/AbstractIndexerBolt.java x: 2 contributors (all time) y: 208 lines of code core/src/main/java/org/apache/stormcrawler/indexing/DummyIndexer.java x: 2 contributors (all time) y: 27 lines of code core/src/main/java/org/apache/stormcrawler/indexing/StdOutIndexer.java x: 2 contributors (all time) y: 54 lines of code core/src/main/java/org/apache/stormcrawler/jsoup/LinkParseFilter.java x: 2 contributors (all time) y: 74 lines of code core/src/main/java/org/apache/stormcrawler/jsoup/XPathFilter.java x: 2 contributors (all time) y: 87 lines of code core/src/main/java/org/apache/stormcrawler/parse/DocumentFragmentBuilder.java x: 2 contributors (all time) y: 69 lines of code core/src/main/java/org/apache/stormcrawler/parse/Outlink.java x: 2 contributors (all time) y: 35 lines of code core/src/main/java/org/apache/stormcrawler/parse/ParseData.java x: 2 contributors (all time) y: 45 lines of code core/src/main/java/org/apache/stormcrawler/parse/filter/XPathFilter.java x: 2 contributors (all time) y: 175 lines of code core/src/main/java/org/apache/stormcrawler/persistence/EmptyQueueListener.java x: 2 contributors (all time) y: 5 lines of code core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer/AbstractURLBuffer.java x: 2 contributors (all time) y: 64 lines of code core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer/SchedulingURLBuffer.java x: 2 contributors (all time) y: 112 lines of code core/src/main/java/org/apache/stormcrawler/persistence/urlbuffer/SimpleURLBuffer.java x: 2 contributors (all time) y: 43 lines of code core/src/main/java/org/apache/stormcrawler/protocol/AbstractHttpProtocol.java x: 2 contributors (all time) y: 127 lines of code core/src/main/java/org/apache/stormcrawler/protocol/DelegatorProtocol.java x: 2 contributors (all time) y: 192 lines of code core/src/main/java/org/apache/stormcrawler/protocol/HttpHeaders.java x: 2 contributors (all time) y: 33 lines of code core/src/main/java/org/apache/stormcrawler/protocol/HttpRobotRulesParser.java x: 2 contributors (all time) y: 151 lines of code core/src/main/java/org/apache/stormcrawler/protocol/Protocol.java x: 2 contributors (all time) y: 105 lines of code core/src/main/java/org/apache/stormcrawler/protocol/RobotRules.java x: 2 contributors (all time) y: 60 lines of code core/src/main/java/org/apache/stormcrawler/protocol/selenium/RemoteDriverProtocol.java x: 2 contributors (all time) y: 90 lines of code core/src/main/java/org/apache/stormcrawler/proxy/ProxyManager.java x: 2 contributors (all time) y: 7 lines of code core/src/main/java/org/apache/stormcrawler/proxy/SCProxy.java x: 2 contributors (all time) y: 104 lines of code core/src/main/java/org/apache/stormcrawler/spout/MemorySpout.java x: 2 contributors (all time) y: 122 lines of code core/src/main/java/org/apache/stormcrawler/util/ConfUtils.java x: 2 contributors (all time) y: 138 lines of code core/src/main/java/org/apache/stormcrawler/util/RobotsTags.java x: 2 contributors (all time) y: 101 lines of code archetype/src/main/resources/archetype-resources/src/main/resources/default-regex-normalizers.xml x: 1 contributors (all time) y: 3 lines of code
714.0
lines of code
  min: 3.0
  average: 98.25
  25th percentile: 36.0
  median: 72.0
  75th percentile: 125.0
  max: 714.0
0 11.0
contributors (all time)
min: 1.0 | average: 2.36 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 3.0 | max: 11.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 1 points

core/src/main/java/org/apache/stormcrawler/filtering/sitemap/SitemapFilter.java x: 1 commits (90d) y: 49 lines of code
49.0
lines of code
  min: 49.0
  average: 49.0
  25th percentile: 49.0
  median: 49.0
  75th percentile: 49.0
  max: 49.0
0 1.0
commits (90d)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

File Size vs. Contributors (90 days): 1 points

core/src/main/java/org/apache/stormcrawler/filtering/sitemap/SitemapFilter.java x: 1 contributors (90d) y: 49 lines of code
49.0
lines of code
  min: 49.0
  average: 49.0
  25th percentile: 49.0
  median: 49.0
  75th percentile: 49.0
  max: 49.0
0 1.0
contributors (90d)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0