duplicated block id: 1
size: 28 cleaned lines of code
in 2 files:
- tika-core/src/main/java/org/apache/tika/sax/DIFContentHandler.java (55:85)
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-miscoffice-module/src/main/java/org/apache/tika/parser/dif/DIFContentHandler.java (54:84)

duplicated block id: 2
size: 21 cleaned lines of code
in 2 files:
- tika-core/src/main/java/org/apache/tika/sax/DIFContentHandler.java (111:133)
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-miscoffice-module/src/main/java/org/apache/tika/parser/dif/DIFContentHandler.java (110:132)

duplicated block id: 3
size: 20 cleaned lines of code
in 2 files:
- tika-example/src/main/java/org/apache/tika/example/GrabPhoneNumbersExample.java (65:89)
- tika-example/src/main/java/org/apache/tika/example/StandardsExtractionExample.java (70:94)

duplicated block id: 4
size: 14 cleaned lines of code
in 2 files:
- tika-core/src/main/java/org/apache/tika/sax/DIFContentHandler.java (93:108)
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-miscoffice-module/src/main/java/org/apache/tika/parser/dif/DIFContentHandler.java (92:107)

duplicated block id: 5
size: 14 cleaned lines of code
in 2 files:
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/SXWPFWordExtractorDecorator.java (269:286)
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/XWPFWordExtractorDecorator.java (550:567)

duplicated block id: 6
size: 12 cleaned lines of code
in 2 files:
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/onenote/fsshttpb/streamobj/space/ObjectSpaceObjectStreamOfOIDs.java (56:71)
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/onenote/fsshttpb/streamobj/space/ObjectSpaceObjectStreamOfContextIDs.java (55:70)

duplicated block id: 7
size: 11 cleaned lines of code
in 2 files:
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/chm/ChmItspHeader.java (149:162)
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/chm/ChmItsfHeader.java (377:391)

duplicated block id: 8
size: 10 cleaned lines of code
in 2 files:
- tika-pipes/tika-fetchers/tika-fetcher-s3/src/main/java/org/apache/tika/pipes/fetcher/s3/S3Fetcher.java (264:275)
- tika-pipes/tika-fetchers/tika-fetcher-microsoft-graph/src/main/java/org/apache/tika/pipes/fetchers/microsoftgraph/MicrosoftGraphFetcher.java (68:79)

duplicated block id: 9
size: 8 cleaned lines of code
in 2 files:
- tika-serialization/src/main/java/org/apache/tika/serialization/pipes/JsonFetchEmitTuple.java (118:127)
- tika-serialization/src/main/java/org/apache/tika/serialization/ParseContextDeserializer.java (68:77)

duplicated block id: 10
size: 8 cleaned lines of code
in 2 files:
- tika-langdetect/tika-langdetect-mitll-text/src/main/java/org/apache/tika/langdetect/mitll/TextLangDetector.java (113:122)
- tika-langdetect/tika-langdetect-lingo24/src/main/java/org/apache/tika/langdetect/lingo24/Lingo24LangDetector.java (126:135)

duplicated block id: 11
size: 6 cleaned lines of code
in 2 files:
- tika-core/src/main/java/org/apache/tika/utils/XMLReaderUtils.java (891:898)
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/xps/XPSPageContentHandler.java (136:143)

duplicated block id: 12
size: 6 cleaned lines of code
in 2 files:
- tika-core/src/main/java/org/apache/tika/fork/ContentHandlerResource.java (35:42)
- tika-core/src/main/java/org/apache/tika/fork/RecursiveMetadataContentHandlerResource.java (40:47)

duplicated block id: 13
size: 6 cleaned lines of code
in 2 files:
- tika-core/src/main/java/org/apache/tika/parser/ParseContext.java (79:86)
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-zip-commons/src/main/java/org/apache/tika/detect/zip/StreamingDetectContext.java (70:77)

duplicated block id: 14
size: 6 cleaned lines of code
in 2 files:
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/onenote/fsshttpb/streamobj/StreamObjectHeaderEnd16bit.java (39:47)
- tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/onenote/fsshttpb/streamobj/StreamObjectHeaderEnd8bit.java (37:46)

duplicated block id: 15
size: 6 cleaned lines of code
in 2 files:
- tika-batch/src/main/java/org/apache/tika/batch/fs/RecursiveParserWrapperFSConsumer.java (58:65)
- tika-batch/src/main/java/org/apache/tika/batch/fs/StreamOutRPWFSConsumer.java (57:64)

duplicated block id: 16
size: 6 cleaned lines of code
in 2 files:
- tika-pipes/tika-pipes-iterators/tika-pipes-iterator-kafka/src/main/java/org/apache/tika/pipes/pipesiterator/kafka/KafkaPipesIterator.java (133:140)
- tika-pipes/tika-emitters/tika-emitter-kafka/src/main/java/org/apache/tika/pipes/emitter/kafka/KafkaEmitter.java (272:279)

duplicated block id: 17
size: 6 cleaned lines of code
in 2 files:
- tika-eval/tika-eval-app/src/main/java/org/apache/tika/eval/app/batch/ExtractProfilerBuilder.java (87:94)
- tika-eval/tika-eval-app/src/main/java/org/apache/tika/eval/app/batch/FileProfilerBuilder.java (64:71)