duplicated block id: 1 size: 43 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMContentUtils.java (152:201) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMContentUtils.java (154:203) duplicated block id: 2 size: 36 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMContentUtils.java (328:377) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMContentUtils.java (330:379) duplicated block id: 3 size: 24 cleaned lines of code in 3 files: - src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/HttpResponse.java (498:525) - src/plugin/protocol-selenium/src/java/org/apache/nutch/protocol/selenium/HttpResponse.java (448:475) - src/plugin/protocol-htmlunit/src/java/org/apache/nutch/protocol/htmlunit/HttpResponse.java (563:590) duplicated block id: 4 size: 24 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMBuilder.java (141:174) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMBuilder.java (136:169) duplicated block id: 5 size: 21 cleaned lines of code in 3 files: - src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/HttpResponse.java (439:463) - src/plugin/protocol-selenium/src/java/org/apache/nutch/protocol/selenium/HttpResponse.java (389:413) - src/plugin/protocol-htmlunit/src/java/org/apache/nutch/protocol/htmlunit/HttpResponse.java (501:525) duplicated block id: 6 size: 21 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMContentUtils.java (284:315) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMContentUtils.java (286:317) duplicated block id: 7 size: 17 cleaned lines of code in 2 files: - src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/HttpResponse.java (466:496) - src/plugin/protocol-selenium/src/java/org/apache/nutch/protocol/selenium/HttpResponse.java (416:446) duplicated block id: 8 size: 17 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMBuilder.java (408:429) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMBuilder.java (398:419) duplicated block id: 9 size: 16 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMContentUtils.java (254:277) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMContentUtils.java (256:279) duplicated block id: 10 size: 14 cleaned lines of code in 3 files: - src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/HttpResponse.java (416:437) - src/plugin/protocol-selenium/src/java/org/apache/nutch/protocol/selenium/HttpResponse.java (366:387) - src/plugin/protocol-htmlunit/src/java/org/apache/nutch/protocol/htmlunit/HttpResponse.java (478:499) duplicated block id: 11 size: 13 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMContentUtils.java (211:226) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMContentUtils.java (213:228) duplicated block id: 12 size: 10 cleaned lines of code in 5 files: - src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/DummyX509TrustManager.java (44:55) - src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/DummyX509TrustManager.java (44:55) - src/plugin/protocol-selenium/src/java/org/apache/nutch/protocol/selenium/DummyX509TrustManager.java (44:55) - src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http/DummyX509TrustManager.java (51:62) - src/plugin/protocol-htmlunit/src/java/org/apache/nutch/protocol/htmlunit/DummyX509TrustManager.java (44:55) duplicated block id: 13 size: 10 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMBuilder.java (645:659) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMBuilder.java (643:657) duplicated block id: 14 size: 8 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMBuilder.java (443:454) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMBuilder.java (435:446) duplicated block id: 15 size: 8 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/XMLCharacterRecognizer.java (98:110) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/XMLCharacterRecognizer.java (98:110) duplicated block id: 16 size: 7 cleaned lines of code in 2 files: - src/plugin/indexer-opensearch-1x/src/java/org/apache/nutch/indexwriter/opensearch1x/OpenSearch1xIndexWriter.java (376:386) - src/plugin/indexer-elastic/src/java/org/apache/nutch/indexwriter/elastic/ElasticIndexWriter.java (311:321) duplicated block id: 17 size: 7 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMContentUtils.java (235:243) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMContentUtils.java (237:245) duplicated block id: 18 size: 7 cleaned lines of code in 2 files: - src/plugin/urlfilter-domain/src/java/org/apache/nutch/urlfilter/domain/DomainURLFilter.java (87:98) - src/plugin/urlfilter-domaindenylist/src/java/org/apache/nutch/urlfilter/domaindenylist/DomainDenylistURLFilter.java (87:98) duplicated block id: 19 size: 6 cleaned lines of code in 3 files: - src/java/org/apache/nutch/util/DomainStatistics.java (219:228) - src/java/org/apache/nutch/util/CrawlCompletionStats.java (233:242) - src/java/org/apache/nutch/util/ProtocolStatusStatistics.java (154:163) duplicated block id: 20 size: 6 cleaned lines of code in 3 files: - src/java/org/apache/nutch/util/DomainStatistics.java (234:242) - src/java/org/apache/nutch/util/CrawlCompletionStats.java (248:256) - src/java/org/apache/nutch/util/ProtocolStatusStatistics.java (169:177) duplicated block id: 21 size: 6 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMContentUtils.java (317:324) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMContentUtils.java (319:326) duplicated block id: 22 size: 6 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/XMLCharacterRecognizer.java (60:70) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/XMLCharacterRecognizer.java (60:70) duplicated block id: 23 size: 6 cleaned lines of code in 2 files: - src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/XMLCharacterRecognizer.java (79:89) - src/plugin/parse-html/src/java/org/apache/nutch/parse/html/XMLCharacterRecognizer.java (79:89)