duplicated block id: 1 size: 1175 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (1:1388) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (1:1388) duplicated block id: 2 size: 202 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/type.proto (1:241) - gluten-core/src/main/resources/substrait/proto/substrait/type.proto (1:241) duplicated block id: 3 size: 193 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (308:623) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (310:625) duplicated block id: 4 size: 193 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/stat/StatFunctions.scala (41:354) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/stat/StatFunctions.scala (41:362) duplicated block id: 5 size: 134 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/type_expressions.proto (1:160) - gluten-core/src/main/resources/substrait/proto/substrait/type_expressions.proto (1:160) duplicated block id: 6 size: 121 cleaned lines of code in 2 files: - shims/spark33/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (28:221) - shims/spark34/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (28:221) duplicated block id: 7 size: 115 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/function.proto (1:148) - gluten-core/src/main/resources/substrait/proto/substrait/function.proto (1:148) duplicated block id: 8 size: 106 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/parameterized_types.proto (1:128) - gluten-core/src/main/resources/substrait/proto/substrait/parameterized_types.proto (1:128) duplicated block id: 9 size: 105 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (287:441) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (279:433) duplicated block id: 10 size: 99 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (463:623) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (449:609) duplicated block id: 11 size: 98 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/InsertIntoHadoopFsRelationCommand.scala (140:286) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/InsertIntoHadoopFsRelationCommand.scala (142:288) duplicated block id: 12 size: 93 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (51:209) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (44:202) duplicated block id: 13 size: 92 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (530:662) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (493:625) duplicated block id: 14 size: 92 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (530:662) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (491:623) duplicated block id: 15 size: 81 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (280:395) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (301:415) duplicated block id: 16 size: 79 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (56:163) - shims/spark33/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (54:160) duplicated block id: 17 size: 77 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/BasicWriteStatsTracker.scala (47:180) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/BasicWriteStatsTracker.scala (39:172) duplicated block id: 18 size: 73 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (60:165) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (52:157) duplicated block id: 19 size: 69 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (194:281) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (265:352) duplicated block id: 20 size: 68 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/plan.proto (1:82) - gluten-core/src/main/resources/substrait/proto/substrait/plan.proto (1:82) duplicated block id: 21 size: 64 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/extensions/extensions.proto (1:81) - gluten-core/src/main/resources/substrait/proto/substrait/extensions/extensions.proto (1:81) duplicated block id: 22 size: 63 cleaned lines of code in 2 files: - shims/spark32/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (63:157) - shims/spark34/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (70:164) duplicated block id: 23 size: 63 cleaned lines of code in 2 files: - shims/spark32/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (63:157) - shims/spark33/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (70:164) duplicated block id: 24 size: 63 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/DeltaMergeTreeFileFormat.scala (32:108) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/DeltaMergeTreeFileFormat.scala (31:107) duplicated block id: 25 size: 58 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/delta/files/MergeTreeCommitProtocol.scala (37:131) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/delta/files/MergeTreeCommitProtocol.scala (37:136) duplicated block id: 26 size: 56 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (107:192) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (178:263) duplicated block id: 27 size: 55 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (74:146) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (75:147) duplicated block id: 28 size: 55 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (227:300) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (191:264) duplicated block id: 29 size: 54 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (66:154) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (58:146) duplicated block id: 30 size: 54 cleaned lines of code in 2 files: - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornColumnarBatchSerializer.scala (175:263) - gluten-data/src/main/scala/io/glutenproject/vectorized/ColumnarBatchSerializer.scala (156:250) duplicated block id: 31 size: 52 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/InsertIntoHadoopFsRelationCommand.scala (67:135) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/InsertIntoHadoopFsRelationCommand.scala (69:137) duplicated block id: 32 size: 48 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/columnarbatch/IndicatorVector.java (80:157) - gluten-data/src/main/java/io/glutenproject/columnarbatch/PlaceholderVector.java (38:115) duplicated block id: 33 size: 48 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (397:462) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (417:482) duplicated block id: 34 size: 45 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (26:91) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (26:91) duplicated block id: 35 size: 43 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (224:308) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (202:286) duplicated block id: 36 size: 43 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/FileSourceScanMetricsUpdater.scala (27:76) - gluten-data/src/main/scala/io/glutenproject/metrics/HiveTableScanMetricsUpdater.scala (23:71) duplicated block id: 37 size: 41 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/extended_expression.proto (1:51) - gluten-core/src/main/resources/substrait/proto/substrait/extended_expression.proto (1:51) duplicated block id: 38 size: 40 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/delta/files/MergeTreeCommitProtocol.scala (159:230) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/delta/files/MergeTreeCommitProtocol.scala (160:232) duplicated block id: 39 size: 38 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (170:223) - shims/spark33/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (182:235) duplicated block id: 40 size: 37 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (45:81) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (84:120) duplicated block id: 41 size: 37 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (6:42) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (84:120) duplicated block id: 42 size: 37 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (6:42) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (45:81) duplicated block id: 43 size: 36 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (90:125) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (131:166) duplicated block id: 44 size: 36 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (157:216) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (153:212) duplicated block id: 45 size: 36 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (223:263) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (218:258) duplicated block id: 46 size: 35 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/BasicWriteStatsTracker.scala (213:258) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/BasicWriteStatsTracker.scala (205:250) duplicated block id: 47 size: 35 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/ds/TpcdsSuite.scala (28:63) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/h/TpchSuite.scala (28:63) duplicated block id: 48 size: 34 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (106:154) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (85:133) duplicated block id: 49 size: 33 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (398:442) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (369:413) duplicated block id: 50 size: 33 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/vectorized/CHColumnarBatchSerializer.scala (130:185) - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerializer.scala (191:246) duplicated block id: 51 size: 33 cleaned lines of code in 2 files: - shims/spark32/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (157:209) - shims/spark33/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (169:221) duplicated block id: 52 size: 33 cleaned lines of code in 2 files: - shims/spark32/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (157:209) - shims/spark34/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (169:221) duplicated block id: 53 size: 33 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (398:442) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (367:411) duplicated block id: 54 size: 32 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/WriteFiles.scala (45:90) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/WriteFiles.scala (45:90) duplicated block id: 55 size: 32 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (284:329) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (363:408) duplicated block id: 56 size: 31 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q14a.sql (2:32) - tools/gluten-it/common/src/main/resources/tpcds-queries/q14b.sql (2:32) duplicated block id: 57 size: 31 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/Spark33Shims.scala (46:83) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (46:84) duplicated block id: 58 size: 29 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (491:520) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (408:437) duplicated block id: 59 size: 29 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (151:197) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (233:279) duplicated block id: 60 size: 28 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (185:220) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (150:185) duplicated block id: 61 size: 28 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/utils/CHInputPartitionsUtil.scala (86:122) - gluten-core/src/main/scala/io/glutenproject/utils/InputPartitionsUtil.scala (74:109) duplicated block id: 62 size: 27 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/BatchScanMetricsUpdater.scala (23:56) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/FileSourceScanMetricsUpdater.scala (27:60) duplicated block id: 63 size: 27 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/Spark33Shims.scala (85:116) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (86:117) duplicated block id: 64 size: 27 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (32:64) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (33:65) duplicated block id: 65 size: 27 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (51:92) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (57:98) duplicated block id: 66 size: 26 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (214:259) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (207:252) duplicated block id: 67 size: 26 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q24a.sql (2:27) - tools/gluten-it/common/src/main/resources/tpcds-queries/q24b.sql (2:27) duplicated block id: 68 size: 26 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/Serializations/ExcelDecimalSerialization.h (34:84) - cpp-ch/local-engine/Storages/Serializations/ExcelSerialization.h (42:92) duplicated block id: 69 size: 26 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (229:274) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (214:259) duplicated block id: 70 size: 26 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q14a.sql (34:59) - tools/gluten-it/common/src/main/resources/tpcds-queries/q14b.sql (34:59) duplicated block id: 71 size: 25 cleaned lines of code in 2 files: - substrait/substrait-spark/src/main/spark-3.3/io/substrait/spark/logical/AbstractLogicalPlanVisitor.scala (24:71) - substrait/substrait-spark/src/main/spark-3.4/io/substrait/spark/logical/AbstractLogicalPlanVisitor.scala (24:71) duplicated block id: 72 size: 25 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (71:104) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (73:106) duplicated block id: 73 size: 24 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/metrics/MetricsData.java (75:120) - backends-clickhouse/src/main/java/io/glutenproject/metrics/MetricsProcessor.java (58:103) duplicated block id: 74 size: 24 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/metadata/AddFileTags.scala (31:54) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/metadata/AddFileTags.scala (90:113) duplicated block id: 75 size: 23 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/JoinSelectionShim.scala (25:47) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/JoinSelectionShim.scala (25:47) duplicated block id: 76 size: 23 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (42:67) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (43:68) duplicated block id: 77 size: 23 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (12:34) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (12:34) duplicated block id: 78 size: 23 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (268:291) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (208:231) duplicated block id: 79 size: 23 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/capabilities.proto (1:29) - gluten-core/src/main/resources/substrait/proto/substrait/capabilities.proto (1:29) duplicated block id: 80 size: 23 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHFilterExecTransformer.scala (26:57) - backends-velox/src/main/scala/io/glutenproject/execution/FilterExecTransformer.scala (26:58) duplicated block id: 81 size: 22 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q39a.sql (2:23) - tools/gluten-it/common/src/main/resources/tpcds-queries/q39b.sql (2:23) duplicated block id: 82 size: 22 cleaned lines of code in 2 files: - substrait/substrait-spark/src/main/spark-3.2/io/substrait/spark/logical/AbstractLogicalPlanVisitor.scala (29:71) - substrait/substrait-spark/src/main/spark-3.4/io/substrait/spark/logical/AbstractLogicalPlanVisitor.scala (29:71) duplicated block id: 83 size: 22 cleaned lines of code in 2 files: - substrait/substrait-spark/src/main/spark-3.2/io/substrait/spark/logical/AbstractLogicalPlanVisitor.scala (29:71) - substrait/substrait-spark/src/main/spark-3.3/io/substrait/spark/logical/AbstractLogicalPlanVisitor.scala (29:71) duplicated block id: 84 size: 22 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (151:172) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (215:236) duplicated block id: 85 size: 22 cleaned lines of code in 2 files: - cpp/CMake/BuildQATZstd.cmake (1:24) - cpp/CMake/BuildQATzip.cmake (1:24) duplicated block id: 86 size: 22 cleaned lines of code in 2 files: - cpp/CMake/BuildMemkind.cmake (1:24) - cpp/CMake/BuildQATZstd.cmake (1:24) duplicated block id: 87 size: 22 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (36:57) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (60:81) duplicated block id: 88 size: 22 cleaned lines of code in 2 files: - cpp/CMake/BuildMemkind.cmake (1:24) - cpp/CMake/BuildQATzip.cmake (1:24) duplicated block id: 89 size: 21 cleaned lines of code in 2 files: - shims/spark32/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (26:63) - shims/spark34/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (28:65) duplicated block id: 90 size: 21 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (33:55) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (29:51) duplicated block id: 91 size: 21 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (123:149) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (216:242) duplicated block id: 92 size: 21 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (32:54) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (29:51) duplicated block id: 93 size: 21 cleaned lines of code in 2 files: - shims/spark32/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (26:63) - shims/spark33/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVectorShim.java (28:65) duplicated block id: 94 size: 20 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (593:615) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (566:588) duplicated block id: 95 size: 20 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q23a.sql (10:29) - tools/gluten-it/common/src/main/resources/tpcds-queries/q23b.sql (10:29) duplicated block id: 96 size: 20 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (114:139) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (124:149) duplicated block id: 97 size: 20 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (566:588) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (593:615) duplicated block id: 98 size: 20 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (331:365) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (411:445) duplicated block id: 99 size: 20 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (566:588) - cpp-ch/local-engine/proto/substrait/algebra.proto (593:615) duplicated block id: 100 size: 20 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (80:99) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (108:127) duplicated block id: 101 size: 20 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (341:383) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (312:354) duplicated block id: 102 size: 20 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (114:139) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (217:242) duplicated block id: 103 size: 20 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (566:588) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (593:615) duplicated block id: 104 size: 20 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (341:383) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (310:352) duplicated block id: 105 size: 19 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (335:357) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (411:433) duplicated block id: 106 size: 19 cleaned lines of code in 2 files: - tools/workload/tpcds/run_tpcds/tpcds_parquet.scala (107:128) - tools/workload/tpch/run_tpch/tpch_parquet.scala (67:88) duplicated block id: 107 size: 19 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/ddl.proto (1:25) - gluten-core/src/main/resources/substrait/proto/substrait/ddl.proto (1:25) duplicated block id: 108 size: 19 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/metrics/Metrics.java (102:120) - gluten-data/src/main/java/io/glutenproject/metrics/OperatorMetrics.java (96:114) duplicated block id: 109 size: 19 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (304:325) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (402:423) duplicated block id: 110 size: 19 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndex.scala (27:46) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndex.scala (27:46) duplicated block id: 111 size: 19 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (335:357) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (431:453) duplicated block id: 112 size: 18 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScan.scala (27:44) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScan.scala (26:43) duplicated block id: 113 size: 18 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (69:90) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (70:91) duplicated block id: 114 size: 18 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Queries.scala (49:70) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/QueriesCompare.scala (54:75) duplicated block id: 115 size: 18 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (69:90) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (70:91) duplicated block id: 116 size: 17 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (225:243) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (193:211) duplicated block id: 117 size: 17 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionArrayDistinct.cpp (227:253) - cpp-ch/local-engine/Functions/SparkFunctionArrayDistinct.cpp (281:307) duplicated block id: 118 size: 17 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (184:208) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (169:193) duplicated block id: 119 size: 17 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (124:141) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (95:112) duplicated block id: 120 size: 17 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:77) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:172) duplicated block id: 121 size: 17 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:17) - cpp/CMake/BuildGoogleBenchmark.cmake (1:18) duplicated block id: 122 size: 17 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (123:139) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (154:170) duplicated block id: 123 size: 17 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:44) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:44) duplicated block id: 124 size: 17 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionFloor.h (138:158) - cpp-ch/local-engine/Functions/SparkFunctionRoundHalfUp.h (89:115) duplicated block id: 125 size: 17 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (636:665) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (648:677) duplicated block id: 126 size: 17 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/parameterized_types.proto (14:30) - cpp-ch/local-engine/proto/substrait/type_expressions.proto (14:30) duplicated block id: 127 size: 17 cleaned lines of code in 2 files: - cpp/CMake/BuildMemkind.cmake (1:18) - cpp/CMake/BuildQpl.cmake (1:18) duplicated block id: 128 size: 17 cleaned lines of code in 2 files: - cpp/velox/benchmarks/common/OrcReaderIterator.h (81:105) - cpp/velox/benchmarks/common/ParquetReaderIterator.h (78:102) duplicated block id: 129 size: 17 cleaned lines of code in 2 files: - cpp/CMake/BuildQATZstd.cmake (1:18) - cpp/CMake/BuildQpl.cmake (1:18) duplicated block id: 130 size: 17 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/InsertIntoHadoopFsRelationCommand.scala (46:64) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/InsertIntoHadoopFsRelationCommand.scala (48:66) duplicated block id: 131 size: 17 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/type_expressions.proto (14:30) - gluten-core/src/main/resources/substrait/proto/substrait/parameterized_types.proto (14:30) duplicated block id: 132 size: 17 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (168:186) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (193:211) duplicated block id: 133 size: 17 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (18:34) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (42:58) duplicated block id: 134 size: 17 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Queries.scala (27:45) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/QueriesCompare.scala (27:45) duplicated block id: 135 size: 17 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (184:208) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (162:186) duplicated block id: 136 size: 17 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/proto/substrait/parameterized_types.proto (14:30) - gluten-core/src/main/resources/substrait/proto/substrait/type_expressions.proto (14:30) duplicated block id: 137 size: 17 cleaned lines of code in 2 files: - cpp/CMake/BuildQATzip.cmake (1:18) - cpp/CMake/BuildQpl.cmake (1:18) duplicated block id: 138 size: 17 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (168:186) - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (225:243) duplicated block id: 139 size: 17 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/TaskContextUtils.scala (26:42) - shims/spark33/src/main/scala/org/apache/spark/TaskContextUtils.scala (26:42) duplicated block id: 140 size: 17 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q94.sql (2:18) - tools/gluten-it/common/src/main/resources/tpcds-queries/q95.sql (7:23) duplicated block id: 141 size: 17 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (123:140) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (95:112) duplicated block id: 142 size: 17 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (18:34) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (42:58) duplicated block id: 143 size: 17 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/parameterized_types.proto (14:30) - gluten-core/src/main/resources/substrait/proto/substrait/type_expressions.proto (14:30) duplicated block id: 144 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildMemkind.cmake (1:16) - cpp/CMake/Findjemalloc_pic.cmake (1:16) duplicated block id: 145 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:39) duplicated block id: 146 size: 16 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (225:241) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (284:300) duplicated block id: 147 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATzip.cmake (1:16) - cpp/CMake/Findglog.cmake (1:16) duplicated block id: 148 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/BatchScanMetricsUpdater.scala (40:56) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/HiveTableScanMetricsUpdater.scala (37:52) duplicated block id: 149 size: 16 cleaned lines of code in 2 files: - cpp/CMake/FindZstd.cmake (1:16) - cpp/CMake/Findglog.cmake (1:16) duplicated block id: 150 size: 16 cleaned lines of code in 2 files: - cpp/CMake/FindZstd.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 151 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:16) - cpp/CMake/ConfigArrow.cmake (1:16) duplicated block id: 152 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGoogleBenchmark.cmake (1:16) - cpp/CMake/Findglog.cmake (1:16) duplicated block id: 153 size: 16 cleaned lines of code in 2 files: - cpp/CMake/Findglog.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 154 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQpl.cmake (1:16) - cpp/CMake/FindZstd.cmake (1:16) duplicated block id: 155 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (493:525) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (460:492) duplicated block id: 156 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATzip.cmake (1:16) - cpp/CMake/ConfigArrow.cmake (1:16) duplicated block id: 157 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATZstd.cmake (1:16) - cpp/CMake/ConfigArrow.cmake (1:16) duplicated block id: 158 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (445:465) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (416:436) duplicated block id: 159 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGoogleBenchmark.cmake (1:16) - cpp/CMake/BuildMemkind.cmake (1:16) duplicated block id: 160 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:39) duplicated block id: 161 size: 16 cleaned lines of code in 2 files: - cpp/CMake/Findjemalloc_pic.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 162 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:39) duplicated block id: 163 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/GenerateMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:39) duplicated block id: 164 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGoogleBenchmark.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 165 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:39) duplicated block id: 166 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATZstd.cmake (1:16) - cpp/CMake/FindZstd.cmake (1:16) duplicated block id: 167 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGoogleBenchmark.cmake (1:16) - cpp/CMake/BuildQpl.cmake (1:16) duplicated block id: 168 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQpl.cmake (1:16) - cpp/CMake/Findglog.cmake (1:16) duplicated block id: 169 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQpl.cmake (1:16) - cpp/CMake/ConfigArrow.cmake (1:16) duplicated block id: 170 size: 16 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/Spark33Shims.scala (132:152) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (133:154) duplicated block id: 171 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/FileSourceScanMetricsUpdater.scala (44:60) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/HiveTableScanMetricsUpdater.scala (37:52) duplicated block id: 172 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATzip.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 173 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQpl.cmake (1:16) - cpp/CMake/Findjemalloc_pic.cmake (1:16) duplicated block id: 174 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGoogleBenchmark.cmake (1:16) - cpp/CMake/BuildQATZstd.cmake (1:16) duplicated block id: 175 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:16) - cpp/CMake/FindZstd.cmake (1:16) duplicated block id: 176 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:39) duplicated block id: 177 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGoogleBenchmark.cmake (1:16) - cpp/CMake/FindZstd.cmake (1:16) duplicated block id: 178 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATZstd.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 179 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/GenerateMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:39) duplicated block id: 180 size: 16 cleaned lines of code in 2 files: - cpp/CMake/ConfigArrow.cmake (1:16) - cpp/CMake/Findglog.cmake (1:16) duplicated block id: 181 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:16) - cpp/CMake/BuildQpl.cmake (1:16) duplicated block id: 182 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/GenerateMetricsUpdater.scala (23:39) duplicated block id: 183 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (493:525) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (458:490) duplicated block id: 184 size: 16 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (168:184) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (284:300) duplicated block id: 185 size: 16 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (275:291) - cpp/core/benchmarks/CompressionBenchmark.cc (351:367) duplicated block id: 186 size: 16 cleaned lines of code in 2 files: - cpp/CMake/FindZstd.cmake (1:16) - cpp/CMake/Findjemalloc_pic.cmake (1:16) duplicated block id: 187 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:39) duplicated block id: 188 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:39) duplicated block id: 189 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/utils/CHInputPartitionsUtil.scala (54:72) - gluten-core/src/main/scala/io/glutenproject/utils/InputPartitionsUtil.scala (54:72) duplicated block id: 190 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATZstd.cmake (1:16) - cpp/CMake/Findglog.cmake (1:16) duplicated block id: 191 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 192 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:39) duplicated block id: 193 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildMemkind.cmake (1:16) - cpp/CMake/Findglog.cmake (1:16) duplicated block id: 194 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGoogleBenchmark.cmake (1:16) - cpp/CMake/ConfigArrow.cmake (1:16) duplicated block id: 195 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildMemkind.cmake (1:16) - cpp/CMake/FindZstd.cmake (1:16) duplicated block id: 196 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATZstd.cmake (1:16) - cpp/CMake/Findjemalloc_pic.cmake (1:16) duplicated block id: 197 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:16) - cpp/CMake/BuildQATzip.cmake (1:16) duplicated block id: 198 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:39) duplicated block id: 199 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:16) - cpp/CMake/BuildMemkind.cmake (1:16) duplicated block id: 200 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:16) - cpp/CMake/Findjemalloc_pic.cmake (1:16) duplicated block id: 201 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:39) duplicated block id: 202 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGoogleBenchmark.cmake (1:16) - cpp/CMake/BuildQATzip.cmake (1:16) duplicated block id: 203 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/GenerateMetricsUpdater.scala (23:39) duplicated block id: 204 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATzip.cmake (1:16) - cpp/CMake/FindZstd.cmake (1:16) duplicated block id: 205 size: 16 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/BasicWriteStatsTracker.scala (182:211) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/BasicWriteStatsTracker.scala (174:203) duplicated block id: 206 size: 16 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (193:209) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (284:300) duplicated block id: 207 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildMemkind.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 208 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/GenerateMetricsUpdater.scala (23:39) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:39) duplicated block id: 209 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/MetricsUtil.scala (109:125) - gluten-data/src/main/scala/io/glutenproject/metrics/MetricsUtil.scala (294:310) duplicated block id: 210 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildMemkind.cmake (1:16) - cpp/CMake/ConfigArrow.cmake (1:16) duplicated block id: 211 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQpl.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 212 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:16) - cpp/CMake/Findglog.cmake (1:16) duplicated block id: 213 size: 16 cleaned lines of code in 2 files: - cpp/CMake/ConfigArrow.cmake (1:16) - cpp/CMake/ResolveDependency.cmake (1:16) duplicated block id: 214 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGflags.cmake (1:16) - cpp/CMake/BuildQATZstd.cmake (1:16) duplicated block id: 215 size: 16 cleaned lines of code in 2 files: - cpp/CMake/ConfigArrow.cmake (1:16) - cpp/CMake/FindZstd.cmake (1:16) duplicated block id: 216 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildQATzip.cmake (1:16) - cpp/CMake/Findjemalloc_pic.cmake (1:16) duplicated block id: 217 size: 16 cleaned lines of code in 2 files: - cpp/CMake/Findglog.cmake (1:16) - cpp/CMake/Findjemalloc_pic.cmake (1:16) duplicated block id: 218 size: 16 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (445:465) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (414:434) duplicated block id: 219 size: 16 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (43:70) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (49:75) duplicated block id: 220 size: 16 cleaned lines of code in 2 files: - cpp/CMake/ConfigArrow.cmake (1:16) - cpp/CMake/Findjemalloc_pic.cmake (1:16) duplicated block id: 221 size: 16 cleaned lines of code in 2 files: - cpp/CMake/BuildGoogleBenchmark.cmake (1:16) - cpp/CMake/Findjemalloc_pic.cmake (1:16) duplicated block id: 222 size: 15 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (55:69) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (43:57) duplicated block id: 223 size: 15 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (78:107) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (77:106) duplicated block id: 224 size: 15 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (28:42) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (137:151) duplicated block id: 225 size: 15 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (604:618) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (636:650) duplicated block id: 226 size: 15 cleaned lines of code in 2 files: - gluten-core/src/main/scala/org/apache/spark/sql/execution/GlutenExplainUtils.scala (75:90) - gluten-core/src/main/scala/org/apache/spark/sql/execution/GlutenImplicits.scala (98:113) duplicated block id: 227 size: 15 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q24a.sql (29:43) - tools/gluten-it/common/src/main/resources/tpcds-queries/q24b.sql (29:43) duplicated block id: 228 size: 15 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (521:538) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1190:1207) duplicated block id: 229 size: 15 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Parameterized.scala (152:169) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Queries.scala (52:69) duplicated block id: 230 size: 15 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (258:274) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (326:342) duplicated block id: 231 size: 15 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (78:107) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (70:99) duplicated block id: 232 size: 15 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (67:81) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (137:151) duplicated block id: 233 size: 15 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (241:255) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (320:334) duplicated block id: 234 size: 15 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (26:45) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (26:45) duplicated block id: 235 size: 15 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (106:120) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (137:151) duplicated block id: 236 size: 15 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/vectorized/CHColumnarBatchSerializer.scala (100:121) - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerializer.scala (142:163) duplicated block id: 237 size: 15 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (257:277) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (336:356) duplicated block id: 238 size: 15 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (241:255) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (340:354) duplicated block id: 239 size: 15 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (257:277) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (356:376) duplicated block id: 240 size: 15 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/GraceMergingAggregatedStep.cpp (138:156) - cpp-ch/local-engine/Operator/StreamingAggregatingStep.cpp (68:86) duplicated block id: 241 size: 15 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/shuffle/CHColumnarShuffleWriter.scala (158:176) - gluten-data/src/main/scala/org/apache/spark/shuffle/ColumnarShuffleWriter.scala (209:231) duplicated block id: 242 size: 15 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (395:412) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1016:1033) duplicated block id: 243 size: 15 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Parameterized.scala (152:169) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/QueriesCompare.scala (57:74) duplicated block id: 244 size: 15 cleaned lines of code in 2 files: - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornColumnarBatchSerializer.scala (109:127) - gluten-data/src/main/scala/io/glutenproject/vectorized/ColumnarBatchSerializer.scala (126:144) duplicated block id: 245 size: 15 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (26:45) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/ExpandOutputPartitioningShim.scala (26:45) duplicated block id: 246 size: 14 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/utils/CHInputPartitionsUtil.scala (33:50) - gluten-core/src/main/scala/io/glutenproject/utils/InputPartitionsUtil.scala (29:46) duplicated block id: 247 size: 14 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ShuffleWriterJniWrapper.java (57:70) - gluten-data/src/main/java/io/glutenproject/vectorized/ShuffleWriterJniWrapper.java (143:156) duplicated block id: 248 size: 14 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (687:701) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (636:650) duplicated block id: 249 size: 14 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (52:69) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (47:64) duplicated block id: 250 size: 14 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/ds/TpcdsSuite.scala (91:104) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/h/TpchSuite.scala (90:103) duplicated block id: 251 size: 14 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/Serializations/ExcelNumberReader.h (91:110) - cpp-ch/local-engine/Storages/Serializations/ExcelNumberReader.h (119:138) duplicated block id: 252 size: 14 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (77:90) - gluten-core/src/main/scala/io/glutenproject/execution/SortMergeJoinExecTransformer.scala (85:98) duplicated block id: 253 size: 14 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/LowCopyFileSegmentShuffleInputStream.java (61:80) - gluten-data/src/main/java/io/glutenproject/vectorized/LowCopyFileSegmentJniByteInputStream.java (96:114) duplicated block id: 254 size: 14 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/GraceMergingAggregatedStep.cpp (174:187) - cpp-ch/local-engine/Operator/StreamingAggregatingStep.cpp (116:131) duplicated block id: 255 size: 14 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1394:1415) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1425:1446) duplicated block id: 256 size: 14 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (687:701) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (648:662) duplicated block id: 257 size: 14 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (223:237) - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (274:288) duplicated block id: 258 size: 14 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (293:313) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (392:412) duplicated block id: 259 size: 14 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (293:313) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (372:392) duplicated block id: 260 size: 14 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/expression/CHExpressionTransformer.scala (500:517) - gluten-core/src/main/scala/io/glutenproject/expression/UnaryExpressionTransformer.scala (114:131) duplicated block id: 261 size: 14 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (6:19) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (21:34) duplicated block id: 262 size: 13 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (261:293) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (256:288) duplicated block id: 263 size: 13 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (249:264) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (304:319) duplicated block id: 264 size: 13 cleaned lines of code in 2 files: - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornColumnarBatchSerializer.scala (72:85) - gluten-data/src/main/scala/io/glutenproject/vectorized/ColumnarBatchSerializer.scala (84:97) duplicated block id: 265 size: 13 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionArrayDistinct.cpp (177:196) - cpp-ch/local-engine/Functions/SparkFunctionArrayDistinct.cpp (288:307) duplicated block id: 266 size: 13 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionArrayDistinct.cpp (177:196) - cpp-ch/local-engine/Functions/SparkFunctionArrayDistinct.cpp (234:253) duplicated block id: 267 size: 13 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (51:63) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (81:93) duplicated block id: 268 size: 13 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (185:199) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (232:246) duplicated block id: 269 size: 13 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (237:251) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (150:164) duplicated block id: 270 size: 13 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/substrait/rel/ExtensionTableBuilder.java (26:38) - gluten-core/src/main/java/io/glutenproject/substrait/rel/ExtensionTableNode.java (52:64) duplicated block id: 271 size: 13 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (150:164) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (232:246) duplicated block id: 272 size: 13 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q12.sql (18:30) - tools/gluten-it/common/src/main/resources/tpcds-queries/q98.sql (18:30) duplicated block id: 273 size: 13 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (254:266) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (332:344) duplicated block id: 274 size: 13 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/CHParquetWriterInjects.scala (27:41) - backends-velox/src/main/scala/org/apache/spark/sql/execution/datasources/velox/VeloxParquetWriterInjects.scala (27:41) duplicated block id: 275 size: 13 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/TaskContextUtils.scala (26:38) - shims/spark34/src/main/scala/org/apache/spark/TaskContextUtils.scala (26:38) duplicated block id: 276 size: 13 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (255:274) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (275:294) duplicated block id: 277 size: 13 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/TaskContextUtils.scala (26:38) - shims/spark34/src/main/scala/org/apache/spark/TaskContextUtils.scala (26:38) duplicated block id: 278 size: 13 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/stat/RamStat.scala (44:60) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/stat/RamStat.scala (66:82) duplicated block id: 279 size: 13 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q34.sql (2:14) - tools/gluten-it/common/src/main/resources/tpcds-queries/q73.sql (2:14) duplicated block id: 280 size: 13 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/Spark32Shims.scala (82:97) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (102:117) duplicated block id: 281 size: 13 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/Spark32Shims.scala (82:97) - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/Spark33Shims.scala (101:116) duplicated block id: 282 size: 13 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (457:469) - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (497:509) duplicated block id: 283 size: 13 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (302:315) - cpp-ch/local-engine/local_engine_jni.cpp (859:872) duplicated block id: 284 size: 13 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (185:199) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (237:251) duplicated block id: 285 size: 13 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ArrowParquetBlockInputFormat.cpp (36:56) - cpp-ch/local-engine/Storages/ch_parquet/OptimizedParquetBlockInputFormat.cpp (99:119) duplicated block id: 286 size: 12 cleaned lines of code in 2 files: - gluten-celeborn/common/src/main/scala/org/apache/spark/shuffle/CelebornHashBasedColumnarShuffleWriter.scala (90:103) - gluten-data/src/main/scala/org/apache/spark/shuffle/ColumnarShuffleWriter.scala (245:258) duplicated block id: 287 size: 12 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (268:284) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (261:277) duplicated block id: 288 size: 12 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/JoinSelectionShim.scala (25:36) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/JoinSelectionShim.scala (25:36) duplicated block id: 289 size: 12 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (166:187) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (228:249) duplicated block id: 290 size: 12 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/Spark32Shims.scala (70:81) - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/Spark33Shims.scala (88:99) duplicated block id: 291 size: 12 cleaned lines of code in 2 files: - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerializer.scala (97:116) - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornColumnarBatchSerializer.scala (152:171) duplicated block id: 292 size: 12 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (278:308) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (261:291) duplicated block id: 293 size: 12 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/JoinSelectionShim.scala (25:36) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/JoinSelectionShim.scala (25:36) duplicated block id: 294 size: 12 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (61:72) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (109:120) duplicated block id: 295 size: 12 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (228:249) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (248:269) duplicated block id: 296 size: 12 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (458:469) - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (480:491) duplicated block id: 297 size: 12 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (61:72) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (81:92) duplicated block id: 298 size: 12 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/Spark32Shims.scala (70:81) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (89:100) duplicated block id: 299 size: 12 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (88:108) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (84:104) duplicated block id: 300 size: 12 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (218:229) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (311:322) duplicated block id: 301 size: 12 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (697:708) - cpp-ch/local-engine/local_engine_jni.cpp (758:769) duplicated block id: 302 size: 12 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (480:491) - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (498:509) duplicated block id: 303 size: 12 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (180:193) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (217:229) duplicated block id: 304 size: 12 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (315:329) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (293:307) duplicated block id: 305 size: 11 cleaned lines of code in 2 files: - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornHashBasedColumnarShuffleWriter.scala (40:50) - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornHashBasedColumnarShuffleWriter.scala (41:51) duplicated block id: 306 size: 11 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1475:1487) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1521:1533) duplicated block id: 307 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (285:295) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (364:374) duplicated block id: 308 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/shuffle/CHColumnarShuffleWriter.scala (85:96) - gluten-data/src/main/scala/org/apache/spark/shuffle/ColumnarShuffleWriter.scala (116:128) duplicated block id: 309 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (151:161) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (364:374) duplicated block id: 310 size: 11 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/SerializedPlanParser.cpp (912:923) - cpp-ch/local-engine/Parser/SerializedPlanParser.cpp (930:941) duplicated block id: 311 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (531:541) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (447:457) duplicated block id: 312 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/shuffle/CHColumnarShuffleWriter.scala (150:161) - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornHashBasedColumnarShuffleWriter.scala (120:131) duplicated block id: 313 size: 11 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q25.sql (10:20) - tools/gluten-it/common/src/main/resources/tpcds-queries/q29.sql (10:20) duplicated block id: 314 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHShuffleSplitterJniWrapper.java (26:36) - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHShuffleSplitterJniWrapper.java (89:99) duplicated block id: 315 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (135:145) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (269:279) duplicated block id: 316 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (135:145) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (285:295) duplicated block id: 317 size: 11 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q25.sql (36:46) - tools/gluten-it/common/src/main/resources/tpcds-queries/q29.sql (35:45) duplicated block id: 318 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (135:145) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (364:374) duplicated block id: 319 size: 11 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (56:66) - tools/gluten-it/common/src/main/resources/tpcds-queries/q74.sql (39:49) duplicated block id: 320 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (135:145) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (151:161) duplicated block id: 321 size: 11 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (192:207) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (212:227) duplicated block id: 322 size: 11 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (274:285) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (341:352) duplicated block id: 323 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (135:145) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (193:203) duplicated block id: 324 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (135:145) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (239:249) duplicated block id: 325 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (151:161) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (239:249) duplicated block id: 326 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (151:161) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (193:203) duplicated block id: 327 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (151:161) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (269:279) duplicated block id: 328 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (151:161) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (285:295) duplicated block id: 329 size: 11 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (37:55) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (47:64) duplicated block id: 330 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHFilterExecTransformer.scala (63:75) - backends-velox/src/main/scala/io/glutenproject/execution/FilterExecTransformer.scala (62:74) duplicated block id: 331 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (269:279) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (285:295) duplicated block id: 332 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (269:279) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (364:374) duplicated block id: 333 size: 11 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/metrics/Metrics.java (90:100) - gluten-data/src/main/java/io/glutenproject/metrics/OperatorMetrics.java (85:95) duplicated block id: 334 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (239:249) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (364:374) duplicated block id: 335 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (193:203) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (239:249) duplicated block id: 336 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (193:203) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (364:374) duplicated block id: 337 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (193:203) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (269:279) duplicated block id: 338 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (193:203) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (285:295) duplicated block id: 339 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (239:249) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (269:279) duplicated block id: 340 size: 11 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (239:249) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (285:295) duplicated block id: 341 size: 11 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (929:939) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2542:2552) duplicated block id: 342 size: 11 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2007:2019) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2197:2210) duplicated block id: 343 size: 11 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (41:51) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (39:49) duplicated block id: 344 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1644:1660) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1787:1803) duplicated block id: 345 size: 10 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/java/io/glutenproject/integration/tpc/command/Queries.java (29:42) - tools/gluten-it/common/src/main/java/io/glutenproject/integration/tpc/command/QueriesCompare.java (29:42) duplicated block id: 346 size: 10 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (275:285) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (511:521) duplicated block id: 347 size: 10 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleReader.cc (433:442) - cpp/velox/shuffle/VeloxShuffleReader.h (30:39) duplicated block id: 348 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:32) duplicated block id: 349 size: 10 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (417:429) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (502:514) duplicated block id: 350 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1398:1414) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1518:1534) duplicated block id: 351 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:32) duplicated block id: 352 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1398:1414) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1471:1487) duplicated block id: 353 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1398:1414) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1431:1447) duplicated block id: 354 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (206:224) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (747:765) duplicated block id: 355 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1398:1414) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1578:1594) duplicated block id: 356 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1431:1447) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1684:1700) duplicated block id: 357 size: 10 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/CHMergeTreeWriterInjects.scala (66:75) - shims/common/src/main/scala/io/glutenproject/execution/datasource/GlutenFormatWriterInjects.scala (41:50) duplicated block id: 358 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1431:1447) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1644:1660) duplicated block id: 359 size: 10 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1854:1868) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1887:1901) duplicated block id: 360 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1431:1447) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1471:1487) duplicated block id: 361 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1431:1447) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1518:1534) duplicated block id: 362 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1431:1447) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1578:1594) duplicated block id: 363 size: 10 cleaned lines of code in 2 files: - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerializer.scala (84:94) - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornColumnarBatchSerializer.scala (141:151) duplicated block id: 364 size: 10 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/SubstraitSource/ReadBufferBuilder.cpp (600:609) - cpp-ch/local-engine/Storages/SubstraitSource/ReadBufferBuilder.cpp (623:632) duplicated block id: 365 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (125:137) - shims/spark32/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (150:162) duplicated block id: 366 size: 10 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (102:118) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (114:130) duplicated block id: 367 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1773) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1791:1805) duplicated block id: 368 size: 10 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (121:130) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (324:333) duplicated block id: 369 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (300:317) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (441:458) duplicated block id: 370 size: 10 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (219:230) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (475:486) duplicated block id: 371 size: 10 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (645:654) - cpp-ch/local-engine/local_engine_jni.cpp (718:727) duplicated block id: 372 size: 10 cleaned lines of code in 2 files: - gluten-core/src/main/scala/org/apache/spark/sql/execution/python/EvalPythonExecTransformer.scala (70:80) - gluten-core/src/main/scala/org/apache/spark/sql/execution/python/EvalPythonExecTransformer.scala (89:98) duplicated block id: 373 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (113:122) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (129:138) duplicated block id: 374 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:32) duplicated block id: 375 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1684:1700) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1787:1803) duplicated block id: 376 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1431:1447) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1787:1803) duplicated block id: 377 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:32) duplicated block id: 378 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:32) duplicated block id: 379 size: 10 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (122:134) - shims/spark33/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (147:159) duplicated block id: 380 size: 10 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/catalyst/optimizer/CollapseProjectShim.scala (21:32) - shims/spark34/src/main/scala/org/apache/spark/sql/catalyst/optimizer/CollapseProjectShim.scala (21:32) duplicated block id: 381 size: 10 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/ExpandExecTransformer.scala (115:124) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (282:291) duplicated block id: 382 size: 10 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (152:163) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (214:225) duplicated block id: 383 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (71:86) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (54:69) duplicated block id: 384 size: 10 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (48:57) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (128:137) duplicated block id: 385 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (125:137) - shims/spark33/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (147:159) duplicated block id: 386 size: 10 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/FileSourceScanMetricsUpdater.scala (27:37) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/HiveTableScanMetricsUpdater.scala (23:33) duplicated block id: 387 size: 10 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/AggregateRelParser.cpp (309:318) - cpp-ch/local-engine/Parser/AggregateRelParser.cpp (336:345) duplicated block id: 388 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1644:1660) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1684:1700) duplicated block id: 389 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1518:1534) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1787:1803) duplicated block id: 390 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1471:1487) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1578:1594) duplicated block id: 391 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1471:1487) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1518:1534) duplicated block id: 392 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:32) duplicated block id: 393 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (150:162) - shims/spark33/src/main/scala/org/apache/spark/sql/hive/execution/HiveFileFormat.scala (122:134) duplicated block id: 394 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:32) duplicated block id: 395 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:32) duplicated block id: 396 size: 10 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Queries.scala (126:139) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/QueriesCompare.scala (147:160) duplicated block id: 397 size: 10 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/ExpandExecTransformer.scala (115:124) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (222:231) duplicated block id: 398 size: 10 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (229:238) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (256:266) duplicated block id: 399 size: 10 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q50.sql (3:12) - tools/gluten-it/common/src/main/resources/tpcds-queries/q50.sql (37:46) duplicated block id: 400 size: 10 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (447:456) - cpp/core/benchmarks/CompressionBenchmark.cc (459:468) duplicated block id: 401 size: 10 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/HashAggregateMetricsUpdater.scala (77:89) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/HashJoinMetricsUpdater.scala (97:109) duplicated block id: 402 size: 10 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (105:114) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (129:138) duplicated block id: 403 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:32) duplicated block id: 404 size: 10 cleaned lines of code in 2 files: - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornHashBasedColumnarShuffleWriter.scala (57:66) - gluten-data/src/main/scala/org/apache/spark/shuffle/ColumnarShuffleWriter.scala (64:73) duplicated block id: 405 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:32) duplicated block id: 406 size: 10 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/DefaultHashAggregateResult.cpp (72:86) - cpp-ch/local-engine/Operator/EmptyProjectStep.cpp (38:52) duplicated block id: 407 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (168:181) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (164:177) duplicated block id: 408 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:32) duplicated block id: 409 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:32) duplicated block id: 410 size: 10 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleWriter.cc (1002:1012) - cpp/velox/shuffle/VeloxShuffleWriter.cc (1053:1063) duplicated block id: 411 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (214:225) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (234:245) duplicated block id: 412 size: 10 cleaned lines of code in 2 files: - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornHashBasedColumnarShuffleWriter.scala (82:92) - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornHashBasedColumnarShuffleWriter.scala (100:110) duplicated block id: 413 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:32) duplicated block id: 414 size: 10 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/LowCopyFileSegmentShuffleInputStream.java (80:93) - backends-clickhouse/src/main/java/io/glutenproject/vectorized/OnHeapCopyShuffleInputStream.java (60:73) duplicated block id: 415 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1398:1414) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1787:1803) duplicated block id: 416 size: 10 cleaned lines of code in 2 files: - cpp-ch/local-engine/Shuffle/NativeSplitter.cpp (68:81) - cpp-ch/local-engine/Shuffle/ShuffleSplitter.cpp (107:120) duplicated block id: 417 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1578:1594) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1644:1660) duplicated block id: 418 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1578:1594) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1684:1700) duplicated block id: 419 size: 10 cleaned lines of code in 2 files: - cpp/core/memory/HbwAllocator.h (28:44) - cpp/core/memory/MemoryAllocator.h (74:90) duplicated block id: 420 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1024:1033) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1078:1087) duplicated block id: 421 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (137:146) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (105:114) duplicated block id: 422 size: 10 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (266:276) - gluten-core/src/main/scala/io/glutenproject/execution/SortMergeJoinExecTransformer.scala (272:282) duplicated block id: 423 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1471:1487) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1787:1803) duplicated block id: 424 size: 10 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:75) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:68) duplicated block id: 425 size: 10 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/substrait/rel/ReadRelNode.java (95:108) - gluten-core/src/main/java/io/glutenproject/substrait/rel/WriteRelNode.java (60:72) duplicated block id: 426 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1518:1534) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1578:1594) duplicated block id: 427 size: 10 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (113:122) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (137:146) duplicated block id: 428 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1398:1414) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1684:1700) duplicated block id: 429 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1398:1414) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1644:1660) duplicated block id: 430 size: 10 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/shuffle/CHColumnarShuffleWriter.scala (179:190) - gluten-data/src/main/scala/org/apache/spark/shuffle/ColumnarShuffleWriter.scala (244:255) duplicated block id: 431 size: 10 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (192:203) - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (215:226) duplicated block id: 432 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1518:1534) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1684:1700) duplicated block id: 433 size: 10 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/BatchScanMetricsUpdater.scala (23:33) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/HiveTableScanMetricsUpdater.scala (23:33) duplicated block id: 434 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:32) duplicated block id: 435 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1471:1487) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1684:1700) duplicated block id: 436 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1471:1487) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1644:1660) duplicated block id: 437 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:32) - gluten-data/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:32) duplicated block id: 438 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (403:412) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1078:1087) duplicated block id: 439 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1518:1534) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1644:1660) duplicated block id: 440 size: 10 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1578:1594) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1787:1803) duplicated block id: 441 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q50.sql (4:12) - tools/gluten-it/common/src/main/resources/tpcds-queries/q50.sql (48:56) duplicated block id: 442 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (325:333) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2544:2552) duplicated block id: 443 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/shuffle/CHColumnarShuffleWriter.scala (180:190) - gluten-celeborn/common/src/main/scala/org/apache/spark/shuffle/CelebornHashBasedColumnarShuffleWriter.scala (90:100) duplicated block id: 444 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionCheckDecimalOverflow.cpp (116:130) - cpp-ch/local-engine/Functions/SparkFunctionUnscaleValue.cpp (104:117) duplicated block id: 445 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/vectorized/CHColumnarBatchSerializer.scala (73:88) - gluten-data/src/main/scala/io/glutenproject/vectorized/ColumnarBatchSerializer.scala (156:171) duplicated block id: 446 size: 9 cleaned lines of code in 2 files: - substrait/substrait-spark/src/main/scala/io/substrait/spark/expression/ToSparkExpression.scala (130:139) - substrait/substrait-spark/src/main/scala/io/substrait/spark/logical/ToLogicalPlan.scala (67:76) duplicated block id: 447 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (285:293) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (325:333) duplicated block id: 448 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (203:214) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (261:272) duplicated block id: 449 size: 9 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (29:40) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (33:44) duplicated block id: 450 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (168:176) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (183:191) duplicated block id: 451 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (151:159) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (325:333) duplicated block id: 452 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q42.sql (6:14) - tools/gluten-it/common/src/main/resources/tpcds-queries/q52.sql (6:14) duplicated block id: 453 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (122:130) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (931:939) duplicated block id: 454 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (122:130) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2544:2552) duplicated block id: 455 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (168:176) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (269:277) duplicated block id: 456 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (168:176) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (364:372) duplicated block id: 457 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (168:176) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (325:333) duplicated block id: 458 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (168:176) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (285:293) duplicated block id: 459 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHIteratorApi.scala (133:141) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/IteratorApiImpl.scala (65:73) duplicated block id: 460 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (224:235) - gluten-core/src/main/scala/io/glutenproject/execution/SortMergeJoinExecTransformer.scala (238:249) duplicated block id: 461 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (168:176) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (239:247) duplicated block id: 462 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (168:176) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (193:201) duplicated block id: 463 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/vectorized/CHColumnarBatchSerializer.scala (51:61) - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerializer.scala (54:64) duplicated block id: 464 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q50.sql (38:46) - tools/gluten-it/common/src/main/resources/tpcds-queries/q50.sql (48:56) duplicated block id: 465 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (78:87) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (126:135) duplicated block id: 466 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (135:143) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (325:333) duplicated block id: 467 size: 9 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (90:102) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (71:83) duplicated block id: 468 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (28:36) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (49:57) duplicated block id: 469 size: 9 cleaned lines of code in 2 files: - gluten-celeborn/common/src/main/scala/org/apache/spark/shuffle/CelebornPartitionPusher.scala (41:49) - gluten-celeborn/common/src/main/scala/org/apache/spark/shuffle/CelebornPartitionPusher.scala (53:61) duplicated block id: 470 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (23:31) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:31) duplicated block id: 471 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (114:123) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (250:260) duplicated block id: 472 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (135:143) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (168:176) duplicated block id: 473 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q24a.sql (45:53) - tools/gluten-it/common/src/main/resources/tpcds-queries/q24b.sql (45:53) duplicated block id: 474 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (217:226) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (217:226) duplicated block id: 475 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2070:2079) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2168:2177) duplicated block id: 476 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (118:126) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (198:206) duplicated block id: 477 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/vectorized/CHColumnarBatchSerializer.scala (73:88) - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornColumnarBatchSerializer.scala (175:190) duplicated block id: 478 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (302:310) - gluten-core/src/main/scala/io/glutenproject/execution/SortMergeJoinExecTransformer.scala (188:196) duplicated block id: 479 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (23:31) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:31) duplicated block id: 480 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (325:333) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (364:372) duplicated block id: 481 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (109:117) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (56:64) duplicated block id: 482 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (151:159) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (168:176) duplicated block id: 483 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q47.sql (41:49) - tools/gluten-it/common/src/main/resources/tpcds-queries/q57.sql (38:46) duplicated block id: 484 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q30.sql (17:25) - tools/gluten-it/common/src/main/resources/tpcds-queries/q81.sql (17:25) duplicated block id: 485 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (2:10) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (2:10) duplicated block id: 486 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/Serializations/ExcelReadHelpers.cpp (247:257) - cpp-ch/local-engine/Storages/Serializations/ExcelReadHelpers.cpp (273:284) duplicated block id: 487 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (28:36) - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (49:57) duplicated block id: 488 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (216:226) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (306:316) duplicated block id: 489 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (217:226) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (277:286) duplicated block id: 490 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (216:226) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (286:296) duplicated block id: 491 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (7:15) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (49:57) duplicated block id: 492 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (7:15) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (28:36) duplicated block id: 493 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:31) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (23:31) duplicated block id: 494 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:31) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (23:31) duplicated block id: 495 size: 9 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (255:263) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (319:327) duplicated block id: 496 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/WholeStageTransformer.scala (163:171) - substrait/substrait-spark/src/main/scala/io/substrait/debug/TreePrinter.scala (70:78) duplicated block id: 497 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/shuffle/CHColumnarShuffleWriter.scala (125:137) - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornHashBasedColumnarShuffleWriter.scala (94:106) duplicated block id: 498 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/ExpandExecTransformer.scala (100:109) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (209:217) duplicated block id: 499 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/org/apache/spark/sql/execution/GlutenExplainUtils.scala (94:102) - gluten-core/src/main/scala/org/apache/spark/sql/execution/GlutenImplicits.scala (137:145) duplicated block id: 500 size: 9 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (248:264) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (817:833) duplicated block id: 501 size: 9 cleaned lines of code in 2 files: - cpp/core/memory/MemoryAllocator.h (51:66) - cpp/core/memory/MemoryAllocator.h (73:88) duplicated block id: 502 size: 9 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/Spark32Shims.scala (103:113) - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/Spark33Shims.scala (122:132) duplicated block id: 503 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (81:89) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (56:64) duplicated block id: 504 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/expression/CHExpressionTransformer.scala (519:528) - gluten-core/src/main/scala/io/glutenproject/expression/UnaryExpressionTransformer.scala (133:142) duplicated block id: 505 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/FileSourceScanExecTransformer.scala (36:44) - gluten-delta/src/main/scala/io/glutenproject/execution/DeltaScanTransformer.scala (29:37) duplicated block id: 506 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (269:277) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (325:333) duplicated block id: 507 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/SubstraitSource/OrcUtil.cpp (164:175) - cpp-ch/local-engine/Storages/ch_parquet/OptimizedParquetBlockInputFormat.cpp (135:146) duplicated block id: 508 size: 9 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (57:71) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (58:72) duplicated block id: 509 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/DefaultHashAggregateResult.cpp (35:47) - cpp-ch/local-engine/Operator/ExpandStep.cpp (32:44) duplicated block id: 510 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/memory/memtarget/TreeMemoryTargets.java (214:227) - gluten-core/src/main/java/io/glutenproject/memory/memtarget/spark/TreeMemoryConsumer.java (131:144) duplicated block id: 511 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (239:247) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (325:333) duplicated block id: 512 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (33:43) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (32:42) duplicated block id: 513 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/aggregate_function_parser/LeadLagParser.cpp (41:51) - cpp-ch/local-engine/Parser/aggregate_function_parser/LeadLagParser.cpp (84:94) duplicated block id: 514 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:31) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (23:31) duplicated block id: 515 size: 9 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (195:203) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (313:321) duplicated block id: 516 size: 9 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (169:181) - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (225:233) duplicated block id: 517 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHIteratorApi.scala (251:259) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/IteratorApiImpl.scala (188:196) duplicated block id: 518 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (193:201) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (325:333) duplicated block id: 519 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (78:87) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (134:143) duplicated block id: 520 size: 9 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (195:203) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (220:228) duplicated block id: 521 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/GenerateMetricsUpdater.scala (23:31) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (23:31) duplicated block id: 522 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (336:344) - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (355:363) duplicated block id: 523 size: 9 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (70:81) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (74:85) duplicated block id: 524 size: 9 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (325:333) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (931:939) duplicated block id: 525 size: 9 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/ExpandExecTransformer.scala (100:109) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (269:277) duplicated block id: 526 size: 9 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/shuffle/CHColumnarShuffleWriter.scala (36:47) - gluten-data/src/main/scala/org/apache/spark/shuffle/ColumnarShuffleWriter.scala (38:49) duplicated block id: 527 size: 9 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (169:181) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (284:292) duplicated block id: 528 size: 9 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (169:181) - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (168:176) duplicated block id: 529 size: 9 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (169:181) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (193:201) duplicated block id: 530 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (7:15) - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (49:57) duplicated block id: 531 size: 9 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (7:15) - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (28:36) duplicated block id: 532 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (132:139) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (169:176) duplicated block id: 533 size: 8 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/SparkShimProvider.scala (31:39) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/SparkShimProvider.scala (31:39) duplicated block id: 534 size: 8 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1522:1534) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1771) duplicated block id: 535 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/FileSourceScanExecTransformer.scala (46:53) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (43:50) duplicated block id: 536 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (24:31) - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (43:50) duplicated block id: 537 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (22:29) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (26:33) duplicated block id: 538 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/execution/VeloxColumnarWriteFilesExec.scala (285:292) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (170:177) duplicated block id: 539 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (428:436) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (277:285) duplicated block id: 540 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (827:835) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (860:868) duplicated block id: 541 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/MetricsUtil.scala (160:167) - gluten-data/src/main/scala/io/glutenproject/metrics/MetricsUtil.scala (246:253) duplicated block id: 542 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/ExpressionConverter.scala (69:76) - gluten-core/src/main/scala/io/glutenproject/expression/ExpressionConverter.scala (86:93) duplicated block id: 543 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/function.proto (64:74) - gluten-core/src/main/resources/substrait/proto/substrait/function.proto (46:56) duplicated block id: 544 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (428:436) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (217:225) duplicated block id: 545 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (75:82) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (157:164) duplicated block id: 546 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (18:25) - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (42:49) duplicated block id: 547 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q24a.sql (30:37) - tools/gluten-it/common/src/main/resources/tpcds-queries/q24b.sql (4:11) duplicated block id: 548 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScanBuilder.scala (56:63) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (311:318) duplicated block id: 549 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/memory/memtarget/NoopMemoryTarget.java (43:54) - gluten-core/src/main/java/io/glutenproject/memory/memtarget/TreeMemoryTargets.java (178:189) duplicated block id: 550 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/parameterized_types.proto (1:11) - gluten-core/src/main/resources/substrait/proto/substrait/type_expressions.proto (1:11) duplicated block id: 551 size: 8 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (224:231) - cpp/core/benchmarks/CompressionBenchmark.cc (353:360) duplicated block id: 552 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q47.sql (29:36) - tools/gluten-it/common/src/main/resources/tpcds-queries/q57.sql (28:35) duplicated block id: 553 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/MetricsUtil.scala (94:101) - gluten-data/src/main/scala/io/glutenproject/metrics/MetricsUtil.scala (283:290) duplicated block id: 554 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/execution/SparkWriteFilesCommitProtocol.scala (53:62) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (397:406) duplicated block id: 555 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/metrics/MetricsData.java (51:64) - backends-clickhouse/src/main/java/io/glutenproject/metrics/MetricsProcessor.java (42:55) duplicated block id: 556 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (88:95) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (102:109) duplicated block id: 557 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/function.proto (46:56) - cpp-ch/local-engine/proto/substrait/function.proto (64:74) duplicated block id: 558 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (827:835) - cpp-ch/local-engine/proto/substrait/algebra.proto (860:868) duplicated block id: 559 size: 8 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (193:200) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (144:151) duplicated block id: 560 size: 8 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleWriter.cc (281:290) - cpp/velox/shuffle/VeloxShuffleWriter.cc (1093:1101) duplicated block id: 561 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (97:104) - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (356:363) duplicated block id: 562 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (97:104) - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (337:344) duplicated block id: 563 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (3:10) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (51:58) duplicated block id: 564 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (201:208) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (297:304) duplicated block id: 565 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (163:173) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (147:157) duplicated block id: 566 size: 8 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (224:231) - cpp/core/benchmarks/CompressionBenchmark.cc (277:284) duplicated block id: 567 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q5.sql (95:102) - tools/gluten-it/common/src/main/resources/tpcds-queries/q80.sql (63:70) duplicated block id: 568 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (3:10) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (27:34) duplicated block id: 569 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:163) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:383) duplicated block id: 570 size: 8 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/TaskContextUtils.scala (38:45) - shims/spark34/src/main/scala/org/apache/spark/TaskContextUtils.scala (39:46) duplicated block id: 571 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (3:10) - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (27:34) duplicated block id: 572 size: 8 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (49:60) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala (53:64) duplicated block id: 573 size: 8 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1402:1414) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1771) duplicated block id: 574 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (129:138) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (166:175) duplicated block id: 575 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (21:28) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (66:73) duplicated block id: 576 size: 8 cleaned lines of code in 2 files: - gluten-delta/src/main/scala/io/glutenproject/execution/DeltaScanTransformer.scala (39:46) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (44:51) duplicated block id: 577 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (41:48) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (47:54) duplicated block id: 578 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (145:154) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (182:191) duplicated block id: 579 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/CHColumnToSparkRow.cpp (188:200) - cpp-ch/local-engine/Parser/CHColumnToSparkRow.cpp (243:255) duplicated block id: 580 size: 8 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/SparkShimProvider.scala (31:39) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/SparkShimProvider.scala (31:39) duplicated block id: 581 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (88:95) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (130:137) duplicated block id: 582 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (3:10) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (209:216) duplicated block id: 583 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (602:609) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (194:201) duplicated block id: 584 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1012:1022) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1039:1049) duplicated block id: 585 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (203:214) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (292:304) duplicated block id: 586 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (170:177) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (107:114) duplicated block id: 587 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q26.sql (12:19) - tools/gluten-it/common/src/main/resources/tpcds-queries/q7.sql (12:19) duplicated block id: 588 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (42:49) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (66:73) duplicated block id: 589 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/execution/SparkWriteFilesCommitProtocol.scala (53:62) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (377:386) duplicated block id: 590 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (860:868) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (827:835) duplicated block id: 591 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q49.sql (9:16) - tools/gluten-it/common/src/main/resources/tpcds-queries/q49.sql (94:101) duplicated block id: 592 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (194:201) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (602:609) duplicated block id: 593 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (150:157) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (96:103) duplicated block id: 594 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (575:582) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (194:201) duplicated block id: 595 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q49.sql (9:16) - tools/gluten-it/common/src/main/resources/tpcds-queries/q49.sql (51:58) duplicated block id: 596 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/LowCopyNettyShuffleInputStream.java (61:70) - backends-clickhouse/src/main/java/io/glutenproject/vectorized/OnHeapCopyShuffleInputStream.java (65:74) duplicated block id: 597 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (42:49) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (42:49) duplicated block id: 598 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q44.sql (5:12) - tools/gluten-it/common/src/main/resources/tpcds-queries/q44.sql (17:24) duplicated block id: 599 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (42:49) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (18:25) duplicated block id: 600 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (194:201) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (575:582) duplicated block id: 601 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/FileSourceScanExecTransformer.scala (46:53) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (40:47) duplicated block id: 602 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/IO/NativeReader.cpp (86:94) - cpp-ch/local-engine/Storages/IO/NativeReader.cpp (103:111) duplicated block id: 603 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/ValidatorApiImpl.scala (48:55) - backends-velox/src/main/scala/io/glutenproject/execution/VeloxColumnarToRowExec.scala (44:51) duplicated block id: 604 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q67.sql (4:11) - tools/gluten-it/common/src/main/resources/tpcds-queries/q67.sql (33:40) duplicated block id: 605 size: 8 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1475:1487) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1771) duplicated block id: 606 size: 8 cleaned lines of code in 2 files: - cpp/core/shuffle/Utils.cc (156:163) - cpp/core/shuffle/Utils.h (56:63) duplicated block id: 607 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q67.sql (3:10) - tools/gluten-it/common/src/main/resources/tpcds-queries/q67.sql (13:20) duplicated block id: 608 size: 8 cleaned lines of code in 2 files: - cpp/core/utils/macros.h (44:53) - cpp/core/utils/macros.h (80:89) duplicated block id: 609 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/function.proto (46:56) - gluten-core/src/main/resources/substrait/proto/substrait/function.proto (64:74) duplicated block id: 610 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/FileSourceScanExecTransformer.scala (46:53) - gluten-delta/src/main/scala/io/glutenproject/execution/DeltaScanTransformer.scala (39:46) duplicated block id: 611 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/memory/memtarget/spark/RegularMemoryConsumer.java (79:90) - gluten-core/src/main/java/io/glutenproject/memory/memtarget/spark/TreeMemoryConsumer.java (92:103) duplicated block id: 612 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/parameterized_types.proto (1:11) - cpp-ch/local-engine/proto/substrait/type_expressions.proto (1:11) duplicated block id: 613 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q14b.sql (75:82) - tools/gluten-it/common/src/main/resources/tpcds-queries/q14b.sql (90:97) duplicated block id: 614 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (298:307) - backends-velox/src/main/scala/org/apache/spark/sql/execution/SparkWriteFilesCommitProtocol.scala (53:62) duplicated block id: 615 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (3:10) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (51:58) duplicated block id: 616 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHShuffleSplitterJniWrapper.java (85:92) - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHShuffleSplitterJniWrapper.java (103:110) duplicated block id: 617 size: 8 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (225:232) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (144:151) duplicated block id: 618 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (3:10) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (27:34) duplicated block id: 619 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/NamedExpressionsTransformer.scala (31:39) - gluten-core/src/main/scala/io/glutenproject/expression/UnaryExpressionTransformer.scala (172:180) duplicated block id: 620 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (295:302) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (309:316) duplicated block id: 621 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/proto/substrait/function.proto (46:56) - gluten-core/src/main/resources/substrait/proto/substrait/function.proto (64:74) duplicated block id: 622 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/memory/memtarget/spark/RegularMemoryConsumer.java (115:125) - gluten-core/src/main/java/io/glutenproject/memory/memtarget/spark/TreeMemoryConsumer.java (72:82) duplicated block id: 623 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Shuffle/CachedShuffleWriter.cpp (123:135) - cpp-ch/local-engine/Shuffle/ShuffleSplitter.cpp (84:97) duplicated block id: 624 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/SerializedPlanParser.cpp (741:749) - cpp-ch/local-engine/Parser/SerializedPlanParser.cpp (1335:1343) duplicated block id: 625 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/delta/files/MergeTreeCommitProtocol.scala (149:156) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/delta/files/MergeTreeCommitProtocol.scala (149:156) duplicated block id: 626 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (27:34) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (3:10) duplicated block id: 627 size: 8 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/Spark32Shims.scala (57:66) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (68:77) duplicated block id: 628 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:35) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:358) duplicated block id: 629 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarQueryStagePrepOverrides.scala (64:71) - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (604:611) duplicated block id: 630 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:158) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:195) duplicated block id: 631 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:178) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:215) duplicated block id: 632 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (160:167) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (197:204) duplicated block id: 633 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (18:25) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (66:73) duplicated block id: 634 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (75:82) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (116:123) duplicated block id: 635 size: 8 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/Offset.scala (25:35) - shims/spark33/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/Offset.scala (25:35) duplicated block id: 636 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:68) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:383) duplicated block id: 637 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (428:436) - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (217:225) duplicated block id: 638 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHShuffleSplitterJniWrapper.java (59:66) - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHShuffleSplitterJniWrapper.java (107:114) duplicated block id: 639 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpch-queries/q11.sql (6:13) - tools/gluten-it/common/src/main/resources/tpch-queries/q11.sql (19:26) duplicated block id: 640 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (661:670) - cpp-ch/local-engine/local_engine_jni.cpp (731:740) duplicated block id: 641 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (65:72) - gluten-core/src/main/scala/io/glutenproject/execution/SortMergeJoinExecTransformer.scala (104:112) duplicated block id: 642 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q24a.sql (4:11) - tools/gluten-it/common/src/main/resources/tpcds-queries/q24b.sql (30:37) duplicated block id: 643 size: 8 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (284:291) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (144:151) duplicated block id: 644 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_datetime.yaml (33:40) - gluten-core/src/main/resources/substrait/extensions/functions_datetime.yaml (71:78) duplicated block id: 645 size: 8 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/Spark32Shims.scala (57:66) - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/Spark33Shims.scala (67:76) duplicated block id: 646 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScanBuilder.scala (56:63) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (319:326) duplicated block id: 647 size: 8 cleaned lines of code in 2 files: - gluten-delta/src/main/scala/io/glutenproject/execution/DeltaScanTransformer.scala (39:46) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (43:50) duplicated block id: 648 size: 8 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:35) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:358) duplicated block id: 649 size: 8 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/SparkShimProvider.scala (31:39) - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/SparkShimProvider.scala (31:39) duplicated block id: 650 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2072:2079) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2255:2262) duplicated block id: 651 size: 8 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (264:276) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (433:445) duplicated block id: 652 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/proto/substrait/parameterized_types.proto (1:11) - gluten-core/src/main/resources/substrait/proto/substrait/type_expressions.proto (1:11) duplicated block id: 653 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/EmptyProjectStep.cpp (90:100) - cpp-ch/local-engine/Operator/ExpandStep.cpp (34:44) duplicated block id: 654 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHShuffleSplitterJniWrapper.java (39:46) - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHShuffleSplitterJniWrapper.java (69:76) duplicated block id: 655 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (194:201) - cpp-ch/local-engine/proto/substrait/algebra.proto (575:582) duplicated block id: 656 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/SortExecTransformer.scala (68:76) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (136:143) duplicated block id: 657 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (88:95) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (131:138) duplicated block id: 658 size: 8 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1435:1447) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1771) duplicated block id: 659 size: 8 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1688:1700) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1771) duplicated block id: 660 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (43:50) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/TransformerApiImpl.scala (40:47) duplicated block id: 661 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/algebra.proto (194:201) - cpp-ch/local-engine/proto/substrait/algebra.proto (602:609) duplicated block id: 662 size: 8 cleaned lines of code in 2 files: - cpp/core/memory/HbwAllocator.h (28:42) - cpp/core/memory/MemoryAllocator.h (52:66) duplicated block id: 663 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/ArrayExpressionTransformer.scala (47:54) - gluten-core/src/main/scala/io/glutenproject/expression/MapExpressionTransformer.scala (49:56) duplicated block id: 664 size: 8 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (686:693) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (744:751) duplicated block id: 665 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (18:25) - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (66:73) duplicated block id: 666 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/GraceMergingAggregatedStep.h (48:56) - cpp-ch/local-engine/Operator/StreamingAggregatingStep.h (103:112) duplicated block id: 667 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (147:154) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (164:171) duplicated block id: 668 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (163:173) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (154:164) duplicated block id: 669 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/proto/substrait/type_expressions.proto (1:11) - gluten-core/src/main/resources/substrait/proto/substrait/parameterized_types.proto (1:11) duplicated block id: 670 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/GraceMergingAggregatedStep.cpp (36:50) - cpp-ch/local-engine/Operator/StreamingAggregatingStep.cpp (261:275) duplicated block id: 671 size: 8 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/sql/shims/SparkShims.scala (94:101) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/GlutenFileFormatWriter.scala (24:31) duplicated block id: 672 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q10.sql (2:9) - tools/gluten-it/common/src/main/resources/tpcds-queries/q69.sql (2:9) duplicated block id: 673 size: 8 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1582:1594) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1771) duplicated block id: 674 size: 8 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (168:175) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (144:151) duplicated block id: 675 size: 8 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1648:1660) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1771) duplicated block id: 676 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/aggregate_function_parser/LeadLagParser.cpp (55:65) - cpp-ch/local-engine/Parser/aggregate_function_parser/LeadLagParser.cpp (103:113) duplicated block id: 677 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/FileSourceScanExecTransformer.scala (46:53) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (44:51) duplicated block id: 678 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (6:13) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (66:73) duplicated block id: 679 size: 8 cleaned lines of code in 2 files: - gluten-delta/src/main/scala/io/glutenproject/execution/DeltaScanTransformer.scala (39:46) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (40:47) duplicated block id: 680 size: 8 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (169:179) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (144:151) duplicated block id: 681 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q24a.sql (4:11) - tools/gluten-it/common/src/main/resources/tpcds-queries/q24a.sql (30:37) duplicated block id: 682 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q49.sql (51:58) - tools/gluten-it/common/src/main/resources/tpcds-queries/q49.sql (94:101) duplicated block id: 683 size: 8 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/expression/CHExpressionTransformer.scala (125:134) - backends-clickhouse/src/main/scala/io/glutenproject/expression/CHExpressionTransformer.scala (440:449) duplicated block id: 684 size: 8 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q24b.sql (4:11) - tools/gluten-it/common/src/main/resources/tpcds-queries/q24b.sql (30:37) duplicated block id: 685 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (194:201) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (602:609) duplicated block id: 686 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (827:835) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (860:868) duplicated block id: 687 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/DefaultHashAggregateResult.cpp (37:47) - cpp-ch/local-engine/Operator/EmptyProjectStep.cpp (90:100) duplicated block id: 688 size: 8 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2170:2177) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2255:2262) duplicated block id: 689 size: 8 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (194:201) - gluten-core/src/main/resources/substrait/proto/substrait/algebra.proto (575:582) duplicated block id: 690 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1522:1530) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 691 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:52) - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:59) duplicated block id: 692 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1522:1530) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) duplicated block id: 693 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (130:140) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (212:222) duplicated block id: 694 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (454:460) - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (483:489) duplicated block id: 695 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1894:1901) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1918:1925) duplicated block id: 696 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/substrait/expression/ExpressionBuilder.java (262:268) - gluten-core/src/main/java/io/glutenproject/substrait/expression/WindowFunctionNode.java (43:49) duplicated block id: 697 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (912:921) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (966:975) duplicated block id: 698 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (81:87) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (113:119) duplicated block id: 699 size: 7 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (258:264) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (206:212) duplicated block id: 700 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:50) - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:50) duplicated block id: 701 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (155:162) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (217:224) duplicated block id: 702 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/expression/CHExpressionTransformer.scala (454:461) - gluten-core/src/main/scala/io/glutenproject/expression/UnaryExpressionTransformer.scala (77:84) duplicated block id: 703 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:162) duplicated block id: 704 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (171:177) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (80:86) duplicated block id: 705 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (48:60) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (53:65) duplicated block id: 706 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:70) - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:52) duplicated block id: 707 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/GlutenMergeTreeWriterInjects.scala (22:31) - shims/common/src/main/scala/io/glutenproject/execution/datasource/GlutenOrcWriterInjects.scala (21:30) duplicated block id: 708 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (141:147) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (178:184) duplicated block id: 709 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (289:298) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (869:878) duplicated block id: 710 size: 7 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleReader.cc (565:571) - cpp/velox/shuffle/VeloxShuffleWriter.cc (774:780) duplicated block id: 711 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (418:426) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (588:595) duplicated block id: 712 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/CHOrcWriterInjects.scala (23:32) - backends-velox/src/main/scala/org/apache/spark/sql/execution/datasources/velox/VeloxOrcWriterInjects.scala (23:32) duplicated block id: 713 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/SortMergeJoinExecTransformer.scala (227:233) - gluten-core/src/main/scala/io/glutenproject/execution/SortMergeJoinExecTransformer.scala (267:273) duplicated block id: 714 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (37:43) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (165:171) duplicated block id: 715 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (595:604) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (912:921) duplicated block id: 716 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (227:233) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (283:289) duplicated block id: 717 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:67) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:34) duplicated block id: 718 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (179:186) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (542:549) duplicated block id: 719 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (595:604) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (966:975) duplicated block id: 720 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:357) duplicated block id: 721 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:382) duplicated block id: 722 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (974:982) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (3100:3108) duplicated block id: 723 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (70:76) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (37:43) duplicated block id: 724 size: 7 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (600:606) - cpp/core/jni/JniWrapper.cc (1374:1380) duplicated block id: 725 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (467:473) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (278:284) duplicated block id: 726 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/org/apache/spark/sql/execution/GlutenExplainUtils.scala (104:112) - gluten-core/src/main/scala/org/apache/spark/sql/execution/GlutenImplicits.scala (148:157) duplicated block id: 727 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (160:166) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (165:171) duplicated block id: 728 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:162) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) duplicated block id: 729 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:162) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) duplicated block id: 730 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q77.sql (73:79) - tools/gluten-it/common/src/main/resources/tpcds-queries/q80.sql (64:70) duplicated block id: 731 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (51:57) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (644:650) duplicated block id: 732 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/EmptyProjectStep.cpp (40:49) - cpp-ch/local-engine/Operator/GraceMergingAggregatedStep.cpp (138:147) duplicated block id: 733 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (51:57) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (612:618) duplicated block id: 734 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/ExpandExecTransformer.scala (109:117) - gluten-core/src/main/scala/io/glutenproject/execution/ExpandExecTransformer.scala (161:170) duplicated block id: 735 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (160:166) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (37:43) duplicated block id: 736 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/AggregateFunctions/AggregateFunctionPartialMerge.cpp (52:58) - cpp-ch/local-engine/AggregateFunctions/AggregateFunctionPartialMerge.cpp (77:83) duplicated block id: 737 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1919:1926) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1936:1943) duplicated block id: 738 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Parameterized.scala (171:178) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Queries.scala (72:79) duplicated block id: 739 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:357) duplicated block id: 740 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:382) duplicated block id: 741 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (83:90) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (217:224) duplicated block id: 742 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (81:87) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (105:111) duplicated block id: 743 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (406:412) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1053:1059) duplicated block id: 744 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (70:76) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (217:223) duplicated block id: 745 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) duplicated block id: 746 size: 7 cleaned lines of code in 2 files: - dev/license-header.py (132:141) - dev/util.py (71:80) duplicated block id: 747 size: 7 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1042:1048) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1052:1058) duplicated block id: 748 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:67) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) duplicated block id: 749 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:50) - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:59) duplicated block id: 750 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:67) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) duplicated block id: 751 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:50) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:58) duplicated block id: 752 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:52) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:50) duplicated block id: 753 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (587:593) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (505:511) duplicated block id: 754 size: 7 cleaned lines of code in 2 files: - cpp/core/utils/qpl/qpl_codec.cc (44:50) - cpp/core/utils/qpl/qpl_codec.cc (113:119) duplicated block id: 755 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:157) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:34) duplicated block id: 756 size: 7 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (444:455) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (265:276) duplicated block id: 757 size: 7 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1042:1048) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1062:1068) duplicated block id: 758 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (48:54) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeDeltaTxnWriter.scala (223:229) duplicated block id: 759 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/TransformerApiImpl.scala (40:46) - gluten-core/src/main/scala/io/glutenproject/backendsapi/TransformerApi.scala (35:41) duplicated block id: 760 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:70) - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:50) duplicated block id: 761 size: 7 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (303:314) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (265:276) duplicated block id: 762 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (42:51) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (140:149) duplicated block id: 763 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/substrait/rel/LocalFilesBuilder.java (26:32) - gluten-iceberg/src/main/java/io/glutenproject/substrait/rel/IcebergLocalFilesBuilder.java (27:33) duplicated block id: 764 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:162) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:357) duplicated block id: 765 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (180:186) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (37:43) duplicated block id: 766 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (2:8) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (48:54) duplicated block id: 767 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (2:8) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (128:134) duplicated block id: 768 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/tool/parquet_to_mergetree.py (60:66) - cpp-ch/local-engine/tool/parquet_to_mergetree.py (86:92) duplicated block id: 769 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (37:43) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (197:203) duplicated block id: 770 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_datetime.yaml (46:52) - gluten-core/src/main/resources/substrait/extensions/functions_datetime.yaml (84:90) duplicated block id: 771 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:177) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:34) duplicated block id: 772 size: 7 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1264:1270) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1290:1296) duplicated block id: 773 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/SubstraitSource/ReadBufferBuilder.cpp (184:190) - cpp-ch/local-engine/Storages/SubstraitSource/ReadBufferBuilder.cpp (221:227) duplicated block id: 774 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (910:920) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1113:1123) duplicated block id: 775 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (180:186) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (165:171) duplicated block id: 776 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (51:57) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (105:111) duplicated block id: 777 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (55:61) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (604:610) duplicated block id: 778 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (55:61) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (636:642) duplicated block id: 779 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:34) duplicated block id: 780 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (604:610) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (45:51) duplicated block id: 781 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (51:57) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (129:135) duplicated block id: 782 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (21:27) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (36:42) duplicated block id: 783 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/substrait/rel/InputIteratorRelNode.java (44:52) - gluten-core/src/main/java/io/glutenproject/substrait/rel/ReadRelNode.java (95:103) duplicated block id: 784 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1402:1410) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) duplicated block id: 785 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1767) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) duplicated block id: 786 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:63) - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:52) duplicated block id: 787 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q17.sql (25:31) - tools/gluten-it/common/src/main/resources/tpcds-queries/q29.sql (22:28) duplicated block id: 788 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (21:27) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (51:57) duplicated block id: 789 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1402:1410) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 790 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1759:1767) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 791 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (51:57) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (113:119) duplicated block id: 792 size: 7 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (303:314) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (434:445) duplicated block id: 793 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q17.sql (17:23) - tools/gluten-it/common/src/main/resources/tpcds-queries/q25.sql (12:18) duplicated block id: 794 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (51:57) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (137:143) duplicated block id: 795 size: 7 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1254:1260) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1290:1296) duplicated block id: 796 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:59) - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:50) duplicated block id: 797 size: 7 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1254:1260) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1264:1270) duplicated block id: 798 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1027:1033) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1053:1059) duplicated block id: 799 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/utils/CHExecUtil.scala (347:354) - gluten-data/src/main/scala/org/apache/spark/sql/execution/utils/ExecUtil.scala (198:205) duplicated block id: 800 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (37:43) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (217:223) duplicated block id: 801 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1475:1483) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) duplicated block id: 802 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/SubstraitSource/ReadBufferBuilder.cpp (184:190) - cpp-ch/local-engine/Storages/SubstraitSource/ReadBufferBuilder.cpp (438:444) duplicated block id: 803 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (230:236) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (271:277) duplicated block id: 804 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/Serializations/ExcelBoolReader.cpp (146:152) - cpp-ch/local-engine/Storages/Serializations/ExcelBoolReader.cpp (158:164) duplicated block id: 805 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1475:1483) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 806 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (258:264) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (271:277) duplicated block id: 807 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/catalyst/FlushableHashAggregateRule.scala (41:47) - backends-velox/src/main/scala/org/apache/spark/sql/catalyst/FlushableHashAggregateRule.scala (52:58) duplicated block id: 808 size: 7 cleaned lines of code in 2 files: - gluten-celeborn/common/src/main/java/org/apache/spark/shuffle/gluten/celeborn/CelebornShuffleManager.java (141:148) - gluten-celeborn/common/src/main/java/org/apache/spark/shuffle/gluten/celeborn/CelebornShuffleManager.java (159:166) duplicated block id: 809 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (36:42) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (66:72) duplicated block id: 810 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (36:42) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (51:57) duplicated block id: 811 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:157) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:177) duplicated block id: 812 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q83.sql (25:31) - tools/gluten-it/common/src/main/resources/tpcds-queries/q83.sql (41:47) duplicated block id: 813 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (723:729) - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (732:738) duplicated block id: 814 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/SubstraitSource/ReadBufferBuilder.cpp (221:227) - cpp-ch/local-engine/Storages/SubstraitSource/ReadBufferBuilder.cpp (438:444) duplicated block id: 815 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (428:435) - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (83:90) duplicated block id: 816 size: 7 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (465:472) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (345:352) duplicated block id: 817 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1347:1358) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1961:1970) duplicated block id: 818 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/SerializedPlanParser.cpp (1695:1701) - cpp-ch/local-engine/Parser/SerializedPlanParser.cpp (1991:1997) duplicated block id: 819 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (143:149) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (225:231) duplicated block id: 820 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (160:166) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (37:43) duplicated block id: 821 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (349:358) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (945:954) duplicated block id: 822 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (160:166) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (70:76) duplicated block id: 823 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (474:484) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (838:848) duplicated block id: 824 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/DefaultHashAggregateResult.cpp (74:83) - cpp-ch/local-engine/Operator/StreamingAggregatingStep.cpp (68:77) duplicated block id: 825 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/vectorized/CHColumnarBatchSerializer.scala (78:88) - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerializer.scala (120:130) duplicated block id: 826 size: 7 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (414:420) - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (255:261) duplicated block id: 827 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (43:49) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (45:51) duplicated block id: 828 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Parameterized.scala (271:277) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Queries.scala (166:172) duplicated block id: 829 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/RewriteDateTimestampComparisonRule.scala (271:278) - gluten-core/src/main/scala/io/glutenproject/extension/RewriteDateTimestampComparisonRule.scala (283:290) duplicated block id: 830 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (474:484) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (790:800) duplicated block id: 831 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (428:435) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (155:162) duplicated block id: 832 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (44:54) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (373:387) duplicated block id: 833 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/RewriteDateTimestampComparisonRule.scala (259:266) - gluten-core/src/main/scala/io/glutenproject/extension/RewriteDateTimestampComparisonRule.scala (295:302) duplicated block id: 834 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1825:1832) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1861:1868) duplicated block id: 835 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (274:283) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (850:859) duplicated block id: 836 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1825:1832) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1894:1901) duplicated block id: 837 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1825:1832) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1918:1925) duplicated block id: 838 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:382) duplicated block id: 839 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:177) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:162) duplicated block id: 840 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (49:55) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeDeltaTxnWriter.scala (223:229) duplicated block id: 841 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (636:642) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (45:51) duplicated block id: 842 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1791:1799) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) duplicated block id: 843 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/GenericExpressionTransformer.scala (32:38) - gluten-core/src/main/scala/io/glutenproject/expression/PredicateExpressionTransformer.scala (114:120) duplicated block id: 844 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (180:186) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (70:76) duplicated block id: 845 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1791:1799) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 846 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHIteratorApi.scala (252:258) - gluten-core/src/main/scala/io/glutenproject/backendsapi/IteratorApi.scala (73:79) duplicated block id: 847 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (180:186) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (37:43) duplicated block id: 848 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/AggregateRelParser.cpp (298:304) - cpp-ch/local-engine/Parser/AggregateRelParser.cpp (325:331) duplicated block id: 849 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/GenericExpressionTransformer.scala (32:38) - gluten-core/src/main/scala/io/glutenproject/expression/PredicateExpressionTransformer.scala (85:91) duplicated block id: 850 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (595:604) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (732:741) duplicated block id: 851 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (365:371) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (414:420) duplicated block id: 852 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:357) duplicated block id: 853 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (55:61) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (45:51) duplicated block id: 854 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:177) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:382) duplicated block id: 855 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q23a.sql (2:8) - tools/gluten-it/common/src/main/resources/tpcds-queries/q23b.sql (2:8) duplicated block id: 856 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:177) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:357) duplicated block id: 857 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q67.sql (14:20) - tools/gluten-it/common/src/main/resources/tpcds-queries/q67.sql (33:39) duplicated block id: 858 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:34) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:162) duplicated block id: 859 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHNativeExpressionEvaluator.java (92:98) - backends-clickhouse/src/main/java/io/glutenproject/vectorized/CHNativeExpressionEvaluator.java (106:112) duplicated block id: 860 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (206:212) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (257:263) duplicated block id: 861 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (434:443) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1110:1119) duplicated block id: 862 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:34) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) duplicated block id: 863 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:34) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) duplicated block id: 864 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:34) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) duplicated block id: 865 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (37:43) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (70:76) duplicated block id: 866 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:34) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) duplicated block id: 867 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (70:76) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (197:203) duplicated block id: 868 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/PredicateExpressionTransformer.scala (85:91) - gluten-core/src/main/scala/io/glutenproject/expression/PredicateExpressionTransformer.scala (114:120) duplicated block id: 869 size: 7 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (628:634) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (640:646) duplicated block id: 870 size: 7 cleaned lines of code in 2 files: - cpp/core/utils/macros.h (36:42) - cpp/core/utils/macros.h (72:78) duplicated block id: 871 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (51:57) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (66:72) duplicated block id: 872 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:177) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) duplicated block id: 873 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHIteratorApi.scala (203:209) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/IteratorApiImpl.scala (161:167) duplicated block id: 874 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:52) - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:50) duplicated block id: 875 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/EmptyProjectStep.cpp (40:49) - cpp-ch/local-engine/Operator/StreamingAggregatingStep.cpp (68:77) duplicated block id: 876 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (218:224) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (249:255) duplicated block id: 877 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:177) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) duplicated block id: 878 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (37:43) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (165:171) duplicated block id: 879 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:177) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:34) duplicated block id: 880 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (180:186) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (197:203) duplicated block id: 881 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:34) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:382) duplicated block id: 882 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:157) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) duplicated block id: 883 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScan.scala (27:33) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScanBase.scala (38:44) duplicated block id: 884 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (63:69) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (612:618) duplicated block id: 885 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (187:193) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (203:209) duplicated block id: 886 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (63:69) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (644:650) duplicated block id: 887 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:157) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:34) duplicated block id: 888 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (28:34) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:67) duplicated block id: 889 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (534:541) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (588:595) duplicated block id: 890 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:67) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) duplicated block id: 891 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:67) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) duplicated block id: 892 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (187:193) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (195:201) duplicated block id: 893 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (43:49) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (604:610) duplicated block id: 894 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:67) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:357) duplicated block id: 895 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:70) - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:50) duplicated block id: 896 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (43:49) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (636:642) duplicated block id: 897 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:157) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:357) duplicated block id: 898 size: 7 cleaned lines of code in 2 files: - tools/workload/tpcds/gen_data/parquet_dataset/tpcds_datagen_parquet.scala (29:37) - tools/workload/tpch/gen_data/parquet_dataset/tpch_datagen_parquet.scala (29:37) duplicated block id: 899 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:50) - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:52) duplicated block id: 900 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScan.scala (28:34) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScanBase.scala (38:44) duplicated block id: 901 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:157) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:382) duplicated block id: 902 size: 7 cleaned lines of code in 2 files: - cpp/velox/compute/WholeStageResultIterator.cc (96:102) - cpp/velox/compute/WholeStageResultIterator.h (34:40) duplicated block id: 903 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (83:90) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (155:162) duplicated block id: 904 size: 7 cleaned lines of code in 2 files: - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerializer.scala (120:130) - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornColumnarBatchSerializer.scala (180:190) duplicated block id: 905 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:357) duplicated block id: 906 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:382) duplicated block id: 907 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/ds/TpcdsSuite.scala (107:113) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/h/TpchSuite.scala (105:111) duplicated block id: 908 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:34) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) duplicated block id: 909 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:34) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) duplicated block id: 910 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (217:223) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (37:43) duplicated block id: 911 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/LowCopyFileSegmentShuffleInputStream.java (85:93) - backends-clickhouse/src/main/java/io/glutenproject/vectorized/LowCopyNettyShuffleInputStream.java (61:69) duplicated block id: 912 size: 7 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (295:303) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (278:286) duplicated block id: 913 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (351:357) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:382) duplicated block id: 914 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (217:223) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (165:171) duplicated block id: 915 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q75.sql (21:27) - tools/gluten-it/common/src/main/resources/tpcds-queries/q75.sql (34:40) duplicated block id: 916 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1053:1059) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1081:1087) duplicated block id: 917 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q43.sql (3:9) - tools/gluten-it/common/src/main/resources/tpcds-queries/q59.sql (5:11) duplicated block id: 918 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:52) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:58) duplicated block id: 919 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:162) duplicated block id: 920 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (277:284) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (155:162) duplicated block id: 921 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (179:185) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (203:209) duplicated block id: 922 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (141:150) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (185:194) duplicated block id: 923 size: 7 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1052:1058) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1062:1068) duplicated block id: 924 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (83:90) - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (217:224) duplicated block id: 925 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/GlutenMergeTreeWriterInjects.scala (22:31) - shims/common/src/main/scala/io/glutenproject/execution/datasource/GlutenParquetWriterInjects.scala (36:45) duplicated block id: 926 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q25.sql (22:28) - tools/gluten-it/common/src/main/resources/tpcds-queries/q29.sql (22:28) duplicated block id: 927 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (160:166) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (180:186) duplicated block id: 928 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (174:180) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (225:231) duplicated block id: 929 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:50) - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:50) duplicated block id: 930 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (179:185) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (187:193) duplicated block id: 931 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (179:185) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (195:201) duplicated block id: 932 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (334:343) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (926:935) duplicated block id: 933 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:34) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (376:382) duplicated block id: 934 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (65:73) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (68:76) duplicated block id: 935 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (897:903) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (907:913) duplicated block id: 936 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/substrait/rel/LocalFilesNode.java (58:64) - gluten-iceberg/src/main/java/io/glutenproject/substrait/rel/IcebergLocalFilesNode.java (54:60) duplicated block id: 937 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (2019:2026) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (2092:2099) duplicated block id: 938 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (83:90) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (277:284) duplicated block id: 939 size: 7 cleaned lines of code in 2 files: - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerializer.scala (120:130) - gluten-data/src/main/scala/io/glutenproject/vectorized/ColumnarBatchSerializer.scala (161:171) duplicated block id: 940 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (209:215) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/TransformerApiImpl.scala (78:84) duplicated block id: 941 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:63) - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:50) duplicated block id: 942 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (226:239) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (323:338) duplicated block id: 943 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (462:469) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (534:541) duplicated block id: 944 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:157) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:162) duplicated block id: 945 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (195:201) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (203:209) duplicated block id: 946 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1861:1868) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1918:1925) duplicated block id: 947 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (790:800) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (838:848) duplicated block id: 948 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/MergeTreeRelParser.cpp (79:86) - cpp-ch/local-engine/Parser/MergeTreeRelParser.cpp (127:134) duplicated block id: 949 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1435:1443) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) duplicated block id: 950 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:34) duplicated block id: 951 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1355:1361) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2972:2978) duplicated block id: 952 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2040:2051) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2224:2235) duplicated block id: 953 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1435:1443) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 954 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (48:60) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (46:58) duplicated block id: 955 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:50) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:58) duplicated block id: 956 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q10.sql (42:48) - tools/gluten-it/common/src/main/resources/tpcds-queries/q10.sql (50:56) duplicated block id: 957 size: 7 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (465:472) - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (278:285) duplicated block id: 958 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1688:1696) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) duplicated block id: 959 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/FilterExecTransformer.scala (43:52) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (384:394) duplicated block id: 960 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/utils/SubstraitPlanPrinterUtil.scala (29:35) - gluten-core/src/main/scala/io/glutenproject/utils/SubstraitPlanPrinterUtil.scala (39:45) duplicated block id: 961 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (119:125) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (208:214) duplicated block id: 962 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1688:1696) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 963 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/vectorized/CHSplitResult.java (36:42) - gluten-data/src/main/java/io/glutenproject/vectorized/GlutenSplitResult.java (34:40) duplicated block id: 964 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (213:219) - gluten-core/src/main/scala/io/glutenproject/execution/HashJoinExecTransformer.scala (261:267) duplicated block id: 965 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:52) - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:50) duplicated block id: 966 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (160:166) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (217:223) duplicated block id: 967 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/StrategyOverrides.scala (65:71) - gluten-core/src/main/scala/io/glutenproject/extension/StrategyOverrides.scala (138:144) duplicated block id: 968 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/IteratorApiImpl.scala (189:195) - gluten-core/src/main/scala/io/glutenproject/backendsapi/IteratorApi.scala (73:79) duplicated block id: 969 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (244:251) - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (294:301) duplicated block id: 970 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (462:469) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (588:595) duplicated block id: 971 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (217:224) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (155:162) duplicated block id: 972 size: 7 cleaned lines of code in 2 files: - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (162:168) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/GlutenFileFormatWriter.scala (25:31) duplicated block id: 973 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1289:1298) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1313:1322) duplicated block id: 974 size: 7 cleaned lines of code in 2 files: - cpp/velox/jni/JniFileSystem.cc (76:82) - cpp/velox/jni/JniFileSystem.cc (153:159) duplicated block id: 975 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:63) - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:50) duplicated block id: 976 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/expression/CHExpressionTransformer.scala (263:270) - gluten-core/src/main/scala/io/glutenproject/expression/DateTimeExpressionsTransformer.scala (121:128) duplicated block id: 977 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (199:205) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (208:214) duplicated block id: 978 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) duplicated block id: 979 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1902:1909) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1936:1943) duplicated block id: 980 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (364:373) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (964:973) duplicated block id: 981 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1902:1909) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1919:1926) duplicated block id: 982 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (130:140) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (192:202) duplicated block id: 983 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (197:203) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (217:223) duplicated block id: 984 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/substrait/rel/InputIteratorRelNode.java (44:52) - gluten-core/src/main/java/io/glutenproject/substrait/rel/WriteRelNode.java (60:68) duplicated block id: 985 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (43:49) - gluten-core/src/main/scala/io/glutenproject/backendsapi/TransformerApi.scala (35:41) duplicated block id: 986 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) duplicated block id: 987 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:157) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:67) duplicated block id: 988 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (208:214) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) duplicated block id: 989 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1582:1590) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 990 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (304:313) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (888:897) duplicated block id: 991 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1648:1656) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 992 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/java/io/glutenproject/vectorized/LowCopyNettyShuffleInputStream.java (45:53) - gluten-data/src/main/java/io/glutenproject/vectorized/LowCopyNettyJniByteInputStream.java (67:75) duplicated block id: 993 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1887:1893) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1919:1926) duplicated block id: 994 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1887:1893) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1902:1909) duplicated block id: 995 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (418:426) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (534:541) duplicated block id: 996 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:59) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:58) duplicated block id: 997 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1648:1656) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) duplicated block id: 998 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (418:426) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (462:469) duplicated block id: 999 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1582:1590) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) duplicated block id: 1000 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1887:1893) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (1936:1943) duplicated block id: 1001 size: 7 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (444:455) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (434:445) duplicated block id: 1002 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (226:239) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (303:318) duplicated block id: 1003 size: 7 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (414:420) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (319:325) duplicated block id: 1004 size: 7 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/execution/datasource/GlutenOrcWriterInjects.scala (21:30) - shims/common/src/main/scala/io/glutenproject/execution/datasource/GlutenParquetWriterInjects.scala (36:45) duplicated block id: 1005 size: 7 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/sql/shims/SparkShims.scala (95:101) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (162:168) duplicated block id: 1006 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (6:12) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (51:57) duplicated block id: 1007 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1829:1837) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1862:1870) duplicated block id: 1008 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q17.sql (17:23) - tools/gluten-it/common/src/main/resources/tpcds-queries/q29.sql (12:18) duplicated block id: 1009 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (97:103) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) duplicated block id: 1010 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (6:12) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic_decimal.yaml (36:42) duplicated block id: 1011 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (151:157) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) duplicated block id: 1012 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (732:741) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (912:921) duplicated block id: 1013 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (65:73) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (61:69) duplicated block id: 1014 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/utils/CHExecUtil.scala (232:238) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (250:256) duplicated block id: 1015 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q5.sql (96:102) - tools/gluten-it/common/src/main/resources/tpcds-queries/q77.sql (73:79) duplicated block id: 1016 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/ds/TpcdsDataGen.scala (101:107) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/h/TpchDataGen.scala (332:338) duplicated block id: 1017 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q17.sql (25:31) - tools/gluten-it/common/src/main/resources/tpcds-queries/q25.sql (22:28) duplicated block id: 1018 size: 7 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (732:741) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (966:975) duplicated block id: 1019 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (197:203) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (37:43) duplicated block id: 1020 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (28:34) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (156:162) duplicated block id: 1021 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHFilterExecTransformer.scala (43:51) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (384:394) duplicated block id: 1022 size: 7 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (319:328) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (907:916) duplicated block id: 1023 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (171:177) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (61:67) duplicated block id: 1024 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q83.sql (9:15) - tools/gluten-it/common/src/main/resources/tpcds-queries/q83.sql (25:31) duplicated block id: 1025 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (197:203) - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (165:171) duplicated block id: 1026 size: 7 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/JsonTupleExpressionTransformer.scala (39:45) - gluten-core/src/main/scala/io/glutenproject/expression/StringExpressionTransformer.scala (36:42) duplicated block id: 1027 size: 7 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/DefaultHashAggregateResult.cpp (74:83) - cpp-ch/local-engine/Operator/GraceMergingAggregatedStep.cpp (138:147) duplicated block id: 1028 size: 7 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeDeltaTxnWriter.scala (270:276) - gluten-core/src/main/scala/org/apache/spark/sql/execution/datasources/GlutenWriterColumnarRules.scala (186:192) duplicated block id: 1029 size: 7 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q83.sql (9:15) - tools/gluten-it/common/src/main/resources/tpcds-queries/q83.sql (41:47) duplicated block id: 1030 size: 7 cleaned lines of code in 2 files: - gluten-celeborn/common/src/main/java/org/apache/spark/shuffle/gluten/celeborn/CelebornShuffleManager.java (150:156) - gluten-celeborn/common/src/main/java/org/apache/spark/shuffle/gluten/celeborn/CelebornShuffleManager.java (169:175) duplicated block id: 1031 size: 7 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (188:194) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (108:114) duplicated block id: 1032 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (439:445) - cpp-ch/local-engine/local_engine_jni.cpp (514:520) duplicated block id: 1033 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (439:445) - cpp-ch/local-engine/local_engine_jni.cpp (502:508) duplicated block id: 1034 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (695:702) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (753:760) duplicated block id: 1035 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1684:1693) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1724:1733) duplicated block id: 1036 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (39:46) - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (152:159) duplicated block id: 1037 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (36:41) - tools/gluten-it/common/src/main/resources/tpcds-queries/q74.sql (25:30) duplicated block id: 1038 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/WriteFilesExecTransformer.scala (49:54) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/WriteFiles.scala (70:75) duplicated block id: 1039 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/execution/VeloxColumnarWriteFilesExec.scala (232:237) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (425:432) duplicated block id: 1040 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Parameterized.scala (107:113) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Queries.scala (35:41) duplicated block id: 1041 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (280:285) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (404:409) duplicated block id: 1042 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (314:319) - gluten-data/src/main/scala/io/glutenproject/vectorized/ColumnarBatchSerializer.scala (59:64) duplicated block id: 1043 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (177:182) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (193:198) duplicated block id: 1044 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (26:31) - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (22:27) duplicated block id: 1045 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (177:182) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (256:261) duplicated block id: 1046 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (130:135) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/clickhouse/commands/WriteMergeTreeToDelta.scala (144:149) duplicated block id: 1047 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScanBuilder.scala (48:53) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (293:298) duplicated block id: 1048 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (266:271) - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (345:350) duplicated block id: 1049 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:62) - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) duplicated block id: 1050 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:51) - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:49) duplicated block id: 1051 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (104:109) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/TransformerApiImpl.scala (41:46) duplicated block id: 1052 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (48:54) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashJoinExecTransformer.scala (82:89) duplicated block id: 1053 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (141:146) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (644:649) duplicated block id: 1054 size: 6 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (57:65) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (70:78) duplicated block id: 1055 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1334:1346) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1949:1957) duplicated block id: 1056 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (141:146) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (612:617) duplicated block id: 1057 size: 6 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleReader.cc (183:188) - cpp/velox/shuffle/VeloxShuffleReader.cc (213:218) duplicated block id: 1058 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (24:29) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (49:54) duplicated block id: 1059 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/vectorized/CHSplitResult.java (25:30) - gluten-core/src/main/java/io/glutenproject/vectorized/SplitResult.java (32:37) duplicated block id: 1060 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (129:134) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (209:214) duplicated block id: 1061 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (439:445) - cpp-ch/local-engine/local_engine_jni.cpp (478:484) duplicated block id: 1062 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1063 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:51) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:62) duplicated block id: 1064 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/aggregate_function_parser/LeadLagParser.cpp (29:36) - cpp-ch/local-engine/Parser/aggregate_function_parser/LeadLagParser.cpp (72:79) duplicated block id: 1065 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (439:445) - cpp-ch/local-engine/local_engine_jni.cpp (463:469) duplicated block id: 1066 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (427:433) - cpp-ch/local-engine/local_engine_jni.cpp (439:445) duplicated block id: 1067 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (439:445) - cpp-ch/local-engine/local_engine_jni.cpp (451:457) duplicated block id: 1068 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Queries.scala (167:172) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/QueriesCompare.scala (222:227) duplicated block id: 1069 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (427:433) - cpp-ch/local-engine/local_engine_jni.cpp (463:469) duplicated block id: 1070 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (427:433) - cpp-ch/local-engine/local_engine_jni.cpp (451:457) duplicated block id: 1071 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/LowCopyNettyJniByteInputStream.java (99:104) - gluten-data/src/main/java/io/glutenproject/vectorized/OnHeapJniByteInputStream.java (58:63) duplicated block id: 1072 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (177:182) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (299:304) duplicated block id: 1073 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (65:70) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (124:129) duplicated block id: 1074 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (427:433) - cpp-ch/local-engine/local_engine_jni.cpp (490:496) duplicated block id: 1075 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (256:261) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (536:541) duplicated block id: 1076 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (420:427) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (732:739) duplicated block id: 1077 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (177:182) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (329:334) duplicated block id: 1078 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (427:433) - cpp-ch/local-engine/local_engine_jni.cpp (478:484) duplicated block id: 1079 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (439:445) - cpp-ch/local-engine/local_engine_jni.cpp (490:496) duplicated block id: 1080 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (299:304) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (329:334) duplicated block id: 1081 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (193:198) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (536:541) duplicated block id: 1082 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (104:109) - gluten-core/src/main/scala/io/glutenproject/backendsapi/TransformerApi.scala (36:41) duplicated block id: 1083 size: 6 cleaned lines of code in 2 files: - cpp/velox/benchmarks/common/OrcReaderIterator.h (65:72) - cpp/velox/benchmarks/common/ParquetReaderIterator.h (62:69) duplicated block id: 1084 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1431:1440) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1724:1733) duplicated block id: 1085 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/CHMergeTreeWriterInjects.scala (121:126) - shims/common/src/main/scala/io/glutenproject/execution/datasource/GlutenFormatWriterInjects.scala (41:46) duplicated block id: 1086 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (104:109) - gluten-core/src/main/scala/io/glutenproject/utils/InputPartitionsUtil.scala (29:34) duplicated block id: 1087 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q14a.sql (65:70) - tools/gluten-it/common/src/main/resources/tpcds-queries/q14b.sql (90:95) duplicated block id: 1088 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (26:31) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (26:31) duplicated block id: 1089 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (190:195) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (457:462) duplicated block id: 1090 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q14a.sql (65:70) - tools/gluten-it/common/src/main/resources/tpcds-queries/q14b.sql (75:80) duplicated block id: 1091 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (129:134) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (200:205) duplicated block id: 1092 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (170:175) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (636:641) duplicated block id: 1093 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/CartesianProductMetricsUpdater.scala (27:32) - gluten-data/src/main/scala/io/glutenproject/metrics/WindowMetricsUpdater.scala (27:32) duplicated block id: 1094 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayElement.h (44:57) - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (43:50) duplicated block id: 1095 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (170:175) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (604:609) duplicated block id: 1096 size: 6 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleReader.cc (147:152) - cpp/velox/shuffle/VeloxShuffleReader.cc (157:162) duplicated block id: 1097 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:51) - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:51) duplicated block id: 1098 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (102:109) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (125:132) duplicated block id: 1099 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (161:168) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (198:205) duplicated block id: 1100 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (62:67) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (254:259) duplicated block id: 1101 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (440:446) - cpp/core/jni/JniWrapper.cc (460:466) duplicated block id: 1102 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/column_reader.cc (1399:1411) - cpp-ch/local-engine/Storages/ch_parquet/arrow/column_reader.cc (1579:1591) duplicated block id: 1103 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1104 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/Constants.scala (34:39) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/Constants.scala (52:57) duplicated block id: 1105 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (427:433) - cpp-ch/local-engine/local_engine_jni.cpp (514:520) duplicated block id: 1106 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:69) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:49) duplicated block id: 1107 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (427:433) - cpp-ch/local-engine/local_engine_jni.cpp (502:508) duplicated block id: 1108 size: 6 cleaned lines of code in 2 files: - cpp/core/shuffle/Payload.cc (455:460) - cpp/core/shuffle/Payload.h (145:150) duplicated block id: 1109 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/example_udf/myMd5.cpp (38:47) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayDistinct.cpp (41:48) duplicated block id: 1110 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/VeloxBackend.scala (92:97) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/VeloxBackend.scala (114:119) duplicated block id: 1111 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/FileSourceScanExecTransformer.scala (37:42) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (31:36) duplicated block id: 1112 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (420:427) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (595:602) duplicated block id: 1113 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (170:175) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (43:48) duplicated block id: 1114 size: 6 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (523:529) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (599:605) duplicated block id: 1115 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (5:10) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (5:10) duplicated block id: 1116 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:51) - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) duplicated block id: 1117 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (177:182) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (536:541) duplicated block id: 1118 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/DeltaMergeTreeFileFormat.scala (97:102) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/CHMergeTreeWriterInjects.scala (80:85) duplicated block id: 1119 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q61.sql (8:13) - tools/gluten-it/common/src/main/resources/tpcds-queries/q61.sql (27:32) duplicated block id: 1120 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (431:436) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (404:409) duplicated block id: 1121 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (133:138) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (45:50) duplicated block id: 1122 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1123 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q5.sql (121:126) - tools/gluten-it/common/src/main/resources/tpcds-queries/q80.sql (89:94) duplicated block id: 1124 size: 6 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleReader.cc (89:94) - cpp/velox/shuffle/VeloxShuffleReader.cc (117:122) duplicated block id: 1125 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/TransformerApiImpl.scala (41:46) - gluten-core/src/main/scala/io/glutenproject/utils/InputPartitionsUtil.scala (29:34) duplicated block id: 1126 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/SubstraitSource/OrcUtil.cpp (128:136) - cpp-ch/local-engine/Storages/ch_parquet/OptimizedParquetBlockInputFormat.cpp (107:115) duplicated block id: 1127 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayElement.h (44:57) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (43:63) duplicated block id: 1128 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (183:188) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (234:239) duplicated block id: 1129 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (936:942) - cpp/core/jni/JniWrapper.cc (957:963) duplicated block id: 1130 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (71:76) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (75:80) duplicated block id: 1131 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (256:261) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (329:334) duplicated block id: 1132 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (256:261) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (299:304) duplicated block id: 1133 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Join/BroadCastJoinBuilder.cpp (79:84) - cpp-ch/local-engine/Join/BroadCastJoinBuilder.h (33:38) duplicated block id: 1134 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/CartesianProductMetricsUpdater.scala (27:32) - gluten-data/src/main/scala/io/glutenproject/metrics/SortMetricsUpdater.scala (27:32) duplicated block id: 1135 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (60:65) - tools/gluten-it/common/src/main/resources/tpcds-queries/q74.sql (25:30) duplicated block id: 1136 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/vectorized/ColumnarBatchSerializer.scala (47:52) - gluten-data/src/main/scala/io/glutenproject/vectorized/ColumnarBatchSerializer.scala (71:76) duplicated block id: 1137 size: 6 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1264:1269) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1274:1279) duplicated block id: 1138 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionHashingExtended.h (587:593) - cpp-ch/local-engine/Functions/SparkFunctionHashingExtended.h (604:610) duplicated block id: 1139 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/utils/CHInputPartitionsUtil.scala (33:38) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/TransformerApiImpl.scala (41:46) duplicated block id: 1140 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (299:304) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (536:541) duplicated block id: 1141 size: 6 cleaned lines of code in 2 files: - gluten-delta/src/main/scala/io/glutenproject/execution/DeltaScanTransformer.scala (30:35) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (34:39) duplicated block id: 1142 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/VeloxToSubstraitExpr.cc (423:432) - cpp/velox/substrait/VeloxToSubstraitExpr.cc (455:461) duplicated block id: 1143 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/BasicPhysicalOperatorTransformer.scala (220:225) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (404:409) duplicated block id: 1144 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (672:680) - cpp-ch/local-engine/local_engine_jni.cpp (741:749) duplicated block id: 1145 size: 6 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (56:64) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (58:66) duplicated block id: 1146 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHIteratorApi.scala (151:157) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/IteratorApiImpl.scala (79:86) duplicated block id: 1147 size: 6 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleReader.cc (70:75) - cpp/velox/shuffle/VeloxShuffleReader.cc (117:122) duplicated block id: 1148 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:51) duplicated block id: 1149 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/Serializations/ExcelReadHelpers.cpp (72:77) - cpp-ch/local-engine/Storages/Serializations/ExcelReadHelpers.h (86:91) duplicated block id: 1150 size: 6 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleReader.cc (70:75) - cpp/velox/shuffle/VeloxShuffleReader.cc (89:94) duplicated block id: 1151 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:69) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:51) duplicated block id: 1152 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (3:8) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (200:205) duplicated block id: 1153 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q23a.sql (35:40) - tools/gluten-it/common/src/main/resources/tpcds-queries/q23b.sql (35:40) duplicated block id: 1154 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/SortMergeJoinExecTransformer.scala (209:216) - gluten-core/src/main/scala/io/glutenproject/utils/SubstraitUtil.scala (34:40) duplicated block id: 1155 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Parameterized.scala (272:277) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/QueriesCompare.scala (222:227) duplicated block id: 1156 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (463:469) - cpp-ch/local-engine/local_engine_jni.cpp (502:508) duplicated block id: 1157 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (121:126) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (144:149) duplicated block id: 1158 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (463:469) - cpp-ch/local-engine/local_engine_jni.cpp (514:520) duplicated block id: 1159 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (203:209) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (589:595) duplicated block id: 1160 size: 6 cleaned lines of code in 2 files: - cpp/core/shuffle/FallbackRangePartitioner.h (28:33) - cpp/core/shuffle/SinglePartitioner.h (27:32) duplicated block id: 1161 size: 6 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/Spark33Shims.scala (124:129) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (125:130) duplicated block id: 1162 size: 6 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (78:85) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (74:81) duplicated block id: 1163 size: 6 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1254:1259) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1274:1279) duplicated block id: 1164 size: 6 cleaned lines of code in 2 files: - cpp/core/shuffle/FallbackRangePartitioner.h (28:33) - cpp/core/shuffle/HashPartitioner.h (28:33) duplicated block id: 1165 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:62) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:49) duplicated block id: 1166 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:51) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1167 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (208:213) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (457:462) duplicated block id: 1168 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:58) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1169 size: 6 cleaned lines of code in 2 files: - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (170:175) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/GlutenFileFormatWriter.scala (33:38) duplicated block id: 1170 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (47:52) - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (41:46) duplicated block id: 1171 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (680:685) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (629:634) duplicated block id: 1172 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/GenericExpressionTransformer.scala (32:37) - gluten-core/src/main/scala/io/glutenproject/expression/NamedExpressionsTransformer.scala (35:40) duplicated block id: 1173 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (191:196) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (242:247) duplicated block id: 1174 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (43:48) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (28:33) duplicated block id: 1175 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (203:209) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (535:541) duplicated block id: 1176 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (160:165) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (513:518) duplicated block id: 1177 size: 6 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (96:104) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (79:87) duplicated block id: 1178 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (56:61) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/TransformerApiImpl.scala (49:54) duplicated block id: 1179 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (214:220) - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (256:262) duplicated block id: 1180 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (203:209) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (463:469) duplicated block id: 1181 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScanBuilder.scala (48:53) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala (285:290) duplicated block id: 1182 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (203:209) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (420:426) duplicated block id: 1183 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (87:92) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (45:50) duplicated block id: 1184 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (3:8) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (120:125) duplicated block id: 1185 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/MergeTreeRelParser.cpp (353:358) - cpp-ch/local-engine/Parser/SerializedPlanParser.cpp (1742:1747) duplicated block id: 1186 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/ShuffledHashJoinExecTransformer.scala (175:183) - gluten-core/src/main/scala/io/glutenproject/utils/SubstraitUtil.scala (25:33) duplicated block id: 1187 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/utils/CHExecUtil.scala (232:237) - gluten-core/src/main/scala/org/apache/spark/sql/execution/ColumnarShuffleExchangeExec.scala (183:188) duplicated block id: 1188 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (93:98) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (103:108) duplicated block id: 1189 size: 6 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (73:81) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (92:100) duplicated block id: 1190 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/CartesianProductMetricsUpdater.scala (27:32) - gluten-data/src/main/scala/io/glutenproject/metrics/ProjectMetricsUpdater.scala (27:32) duplicated block id: 1191 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (382:387) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (502:507) duplicated block id: 1192 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q48.sql (47:52) - tools/gluten-it/common/src/main/resources/tpcds-queries/q48.sql (55:60) duplicated block id: 1193 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q11.sql (12:17) - tools/gluten-it/common/src/main/resources/tpcds-queries/q74.sql (8:13) duplicated block id: 1194 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (636:641) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (87:92) duplicated block id: 1195 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (174:182) - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (361:369) duplicated block id: 1196 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (170:175) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (87:92) duplicated block id: 1197 size: 6 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/sql/shims/SparkShims.scala (64:69) - shims/spark33/src/main/scala/io/glutenproject/sql/shims/spark33/Spark33Shims.scala (89:94) duplicated block id: 1198 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (74:81) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (84:91) duplicated block id: 1199 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (178:183) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (644:649) duplicated block id: 1200 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (361:366) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (516:521) duplicated block id: 1201 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (178:183) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (612:617) duplicated block id: 1202 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (199:204) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (250:255) duplicated block id: 1203 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (284:289) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (384:389) duplicated block id: 1204 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:49) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1205 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (120:125) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (129:134) duplicated block id: 1206 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (65:70) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (116:121) duplicated block id: 1207 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:62) - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:49) duplicated block id: 1208 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/FileSourceScanExecTransformer.scala (37:42) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (34:39) duplicated block id: 1209 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/vectorized/CHColumnarBatchSerializer.scala (36:43) - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerializer.scala (39:46) duplicated block id: 1210 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionNextDay.cpp (138:145) - cpp-ch/local-engine/Functions/SparkFunctionNextDay.cpp (170:177) duplicated block id: 1211 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q33.sql (47:52) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (47:52) duplicated block id: 1212 size: 6 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (51:59) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (70:78) duplicated block id: 1213 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHFilterExecTransformer.scala (83:88) - backends-velox/src/main/scala/io/glutenproject/execution/FilterExecTransformer.scala (82:87) duplicated block id: 1214 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (451:457) - cpp-ch/local-engine/local_engine_jni.cpp (514:520) duplicated block id: 1215 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (395:401) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (525:531) duplicated block id: 1216 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (133:138) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (87:92) duplicated block id: 1217 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (250:255) - gluten-core/src/main/scala/org/apache/spark/sql/execution/ColumnarShuffleExchangeExec.scala (183:188) duplicated block id: 1218 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (451:457) - cpp-ch/local-engine/local_engine_jni.cpp (502:508) duplicated block id: 1219 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (56:61) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (67:72) duplicated block id: 1220 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (501:507) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (588:594) duplicated block id: 1221 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q4.sql (12:17) - tools/gluten-it/common/src/main/resources/tpcds-queries/q74.sql (8:13) duplicated block id: 1222 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (329:334) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (536:541) duplicated block id: 1223 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/expression/CHExpressionTransformer.scala (466:472) - gluten-core/src/main/scala/io/glutenproject/expression/UnaryExpressionTransformer.scala (89:95) duplicated block id: 1224 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (62:67) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (332:337) duplicated block id: 1225 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/ColumnarOverrides.scala (453:458) - gluten-core/src/main/scala/io/glutenproject/extension/columnar/TransformHintRule.scala (569:574) duplicated block id: 1226 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (55:60) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (170:175) duplicated block id: 1227 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:51) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1228 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (451:457) - cpp-ch/local-engine/local_engine_jni.cpp (490:496) duplicated block id: 1229 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (451:457) - cpp-ch/local-engine/local_engine_jni.cpp (478:484) duplicated block id: 1230 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (451:457) - cpp-ch/local-engine/local_engine_jni.cpp (463:469) duplicated block id: 1231 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/table/ClickHouseTableV2.scala (192:198) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/table/ClickHouseTableV2.scala (203:209) duplicated block id: 1232 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (250:256) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (785:790) duplicated block id: 1233 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/execution/VeloxColumnarWriteFilesExec.scala (232:237) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (445:452) duplicated block id: 1234 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScan.scala (51:57) - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/ClickHouseScan.scala (45:51) duplicated block id: 1235 size: 6 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (87:92) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (186:191) duplicated block id: 1236 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) duplicated block id: 1237 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:62) - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:58) duplicated block id: 1238 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (133:141) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (119:127) duplicated block id: 1239 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (193:198) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (329:334) duplicated block id: 1240 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (190:195) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (208:213) duplicated block id: 1241 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionConv.h (41:49) - cpp-ch/local-engine/Functions/SparkFunctionMonthsBetween.h (39:47) duplicated block id: 1242 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (193:198) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (299:304) duplicated block id: 1243 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:69) - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) duplicated block id: 1244 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (193:198) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (256:261) duplicated block id: 1245 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (505:512) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (966:973) duplicated block id: 1246 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (43:48) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (87:92) duplicated block id: 1247 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpch-queries/q19.sql (15:20) - tools/gluten-it/common/src/main/resources/tpch-queries/q19.sql (25:30) duplicated block id: 1248 size: 6 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (89:94) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (77:82) duplicated block id: 1249 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (69:74) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (109:114) duplicated block id: 1250 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (93:98) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (144:149) duplicated block id: 1251 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (44:49) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (104:109) duplicated block id: 1252 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (215:220) - gluten-core/src/main/resources/substrait/extensions/functions_arithmetic.yaml (266:271) duplicated block id: 1253 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/memory/memtarget/TreeMemoryTargets.java (207:212) - gluten-core/src/main/java/io/glutenproject/memory/memtarget/spark/TreeMemoryConsumer.java (123:128) duplicated block id: 1254 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:58) - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) duplicated block id: 1255 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (463:469) - cpp-ch/local-engine/local_engine_jni.cpp (490:496) duplicated block id: 1256 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (463:469) - cpp-ch/local-engine/local_engine_jni.cpp (478:484) duplicated block id: 1257 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (501:507) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (534:540) duplicated block id: 1258 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (505:512) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (912:919) duplicated block id: 1259 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:69) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:57) duplicated block id: 1260 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/CartesianProductMetricsUpdater.scala (27:32) - gluten-data/src/main/scala/io/glutenproject/metrics/LimitMetricsUpdater.scala (27:32) duplicated block id: 1261 size: 6 cleaned lines of code in 2 files: - cpp/velox/jni/JniFileSystem.cc (128:135) - cpp/velox/jni/JniFileSystem.cc (188:195) duplicated block id: 1262 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q35.sql (25:30) - tools/gluten-it/common/src/main/resources/tpcds-queries/q69.sql (16:21) duplicated block id: 1263 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/extension/StrategyOverrides.scala (202:207) - gluten-core/src/main/scala/io/glutenproject/extension/StrategyOverrides.scala (210:215) duplicated block id: 1264 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q47.sql (19:24) - tools/gluten-it/common/src/main/resources/tpcds-queries/q57.sql (19:24) duplicated block id: 1265 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (914:920) - cpp/core/jni/JniWrapper.cc (936:942) duplicated block id: 1266 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/java/io/glutenproject/fs/OnHeapFileSystem.java (217:222) - gluten-data/src/main/java/io/glutenproject/vectorized/LowCopyFileSegmentJniByteInputStream.java (119:124) duplicated block id: 1267 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1644:1653) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1724:1733) duplicated block id: 1268 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/Serializations/ExcelReadHelpers.cpp (42:47) - cpp-ch/local-engine/Storages/Serializations/ExcelReadHelpers.h (93:98) duplicated block id: 1269 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (335:340) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (348:353) duplicated block id: 1270 size: 6 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1274:1279) - shims/common/src/main/scala/io/glutenproject/GlutenConfig.scala (1290:1295) duplicated block id: 1271 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/backendsapi/TransformerApi.scala (36:41) - gluten-core/src/main/scala/io/glutenproject/utils/InputPartitionsUtil.scala (29:34) duplicated block id: 1272 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/MetricsUtil.scala (34:40) - gluten-data/src/main/scala/io/glutenproject/metrics/MetricsUtil.scala (47:52) duplicated block id: 1273 size: 6 cleaned lines of code in 2 files: - substrait/substrait-spark/src/main/scala/io/substrait/debug/TreePrinter.scala (102:107) - substrait/substrait-spark/src/main/scala/io/substrait/debug/TreePrinter.scala (114:119) duplicated block id: 1274 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/DateTimeExpressionsTransformer.scala (35:41) - gluten-core/src/main/scala/io/glutenproject/expression/NamedExpressionsTransformer.scala (28:34) duplicated block id: 1275 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (55:60) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (87:92) duplicated block id: 1276 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (615:620) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (1107:1112) duplicated block id: 1277 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/SerializedPlanParser.h (344:349) - cpp-ch/local-engine/Parser/SerializedPlanParser.h (351:356) duplicated block id: 1278 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:49) duplicated block id: 1279 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/CHColumnToSparkRow.cpp (116:121) - cpp-ch/local-engine/Parser/CHColumnToSparkRow.cpp (209:214) duplicated block id: 1280 size: 6 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (96:104) - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (73:81) duplicated block id: 1281 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndex.scala (28:33) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (35:40) duplicated block id: 1282 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/CartesianProductMetricsUpdater.scala (27:32) - gluten-data/src/main/scala/io/glutenproject/metrics/ExpandMetricsUpdater.scala (27:32) duplicated block id: 1283 size: 6 cleaned lines of code in 2 files: - gluten-delta/src/main/scala/io/glutenproject/execution/DeltaScanTransformer.scala (30:35) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (31:36) duplicated block id: 1284 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:49) duplicated block id: 1285 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/org/apache/spark/util/SparkDirectoryUtil.scala (87:92) - gluten-core/src/main/scala/org/apache/spark/util/SparkDirectoryUtil.scala (108:113) duplicated block id: 1286 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (505:512) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (732:739) duplicated block id: 1287 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:62) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1288 size: 6 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (105:110) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (207:212) duplicated block id: 1289 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/CHMergeTreeWriterInjects.scala (66:71) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/CHMergeTreeWriterInjects.scala (121:126) duplicated block id: 1290 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (133:138) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (55:60) duplicated block id: 1291 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q36.sql (4:9) - tools/gluten-it/common/src/main/resources/tpcds-queries/q86.sql (4:9) duplicated block id: 1292 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:51) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:57) duplicated block id: 1293 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2029:2034) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2077:2082) duplicated block id: 1294 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (284:289) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala (364:369) duplicated block id: 1295 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/utils/CHInputPartitionsUtil.scala (33:38) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (104:109) duplicated block id: 1296 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/IteratorApiImpl.scala (227:232) - gluten-core/src/main/scala/io/glutenproject/backendsapi/IteratorApi.scala (85:90) duplicated block id: 1297 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/scala/io/glutenproject/metrics/CartesianProductMetricsUpdater.scala (27:32) - gluten-data/src/main/scala/io/glutenproject/metrics/FilterMetricsUpdater.scala (27:32) duplicated block id: 1298 size: 6 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (93:98) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (186:191) duplicated block id: 1299 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/column_reader.cc (394:399) - cpp-ch/local-engine/Storages/ch_parquet/arrow/column_reader.cc (415:420) duplicated block id: 1300 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (133:138) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (43:48) duplicated block id: 1301 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/ExpandTransform.cpp (73:82) - cpp-ch/local-engine/Operator/StreamingAggregatingStep.cpp (111:121) duplicated block id: 1302 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayElement.h (44:57) - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (45:52) duplicated block id: 1303 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (505:512) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (595:602) duplicated block id: 1304 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (236:242) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (291:297) duplicated block id: 1305 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (141:146) - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (63:68) duplicated block id: 1306 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (404:409) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (220:225) duplicated block id: 1307 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (141:146) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (51:56) duplicated block id: 1308 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayElement.h (44:57) - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (43:50) duplicated block id: 1309 size: 6 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/sql/shims/SparkShims.scala (64:69) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (90:95) duplicated block id: 1310 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:51) - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:58) duplicated block id: 1311 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/execution/VeloxColumnarWriteFilesExec.scala (258:263) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/WriteFiles.scala (70:75) duplicated block id: 1312 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (133:141) - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (126:134) duplicated block id: 1313 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowColumnVector.java (421:428) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1068:1075) duplicated block id: 1314 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (649:654) - cpp/core/jni/JniWrapper.cc (671:676) duplicated block id: 1315 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (649:654) - cpp/core/jni/JniWrapper.cc (682:687) duplicated block id: 1316 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:51) - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) duplicated block id: 1317 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Parameterized.scala (260:265) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Queries.scala (157:162) duplicated block id: 1318 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (649:654) - cpp/core/jni/JniWrapper.cc (660:665) duplicated block id: 1319 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/execution/VeloxColumnarWriteFilesExec.scala (285:290) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/GlutenFileFormatWriter.scala (33:38) duplicated block id: 1320 size: 6 cleaned lines of code in 2 files: - cpp/CMake/BuildGlog.cmake (8:13) - cpp/CMake/FindThrift.cmake (8:13) duplicated block id: 1321 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/metrics/HashAggregateMetricsUpdater.scala (25:31) - backends-clickhouse/src/main/scala/io/glutenproject/metrics/HashJoinMetricsUpdater.scala (23:28) duplicated block id: 1322 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (24:29) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (7:12) duplicated block id: 1323 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (660:665) - cpp/core/jni/JniWrapper.cc (682:687) duplicated block id: 1324 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (69:74) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (150:155) duplicated block id: 1325 size: 6 cleaned lines of code in 2 files: - gluten-celeborn/clickhouse/src/main/scala/org/apache/spark/shuffle/CHCelebornHashBasedColumnarShuffleWriterFactory.scala (31:36) - gluten-celeborn/velox/src/main/scala/org/apache/spark/shuffle/VeloxCelebornHashBasedColumnarShuffleWriterFactory.scala (31:36) duplicated block id: 1326 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:49) duplicated block id: 1327 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/utils/CHInputPartitionsUtil.scala (33:38) - gluten-core/src/main/scala/io/glutenproject/backendsapi/TransformerApi.scala (36:41) duplicated block id: 1328 size: 6 cleaned lines of code in 2 files: - cpp/core/shuffle/Spill.cc (48:53) - cpp/core/shuffle/Spill.h (45:50) duplicated block id: 1329 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/java/io/glutenproject/integration/tpc/command/Parameterized.java (43:50) - tools/gluten-it/common/src/main/java/io/glutenproject/integration/tpc/command/Queries.java (29:36) duplicated block id: 1330 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/utils/ScanMergeTreePartsUtils.scala (81:86) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v2/clickhouse/utils/ScanMergeTreePartsUtils.scala (91:97) duplicated block id: 1331 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (660:665) - cpp/core/jni/JniWrapper.cc (671:676) duplicated block id: 1332 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/NamedExpressionsTransformer.scala (35:40) - gluten-core/src/main/scala/io/glutenproject/expression/PredicateExpressionTransformer.scala (85:90) duplicated block id: 1333 size: 6 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (326:332) - cpp/core/benchmarks/CompressionBenchmark.cc (389:395) duplicated block id: 1334 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:69) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1335 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/expression/NamedExpressionsTransformer.scala (35:40) - gluten-core/src/main/scala/io/glutenproject/expression/PredicateExpressionTransformer.scala (114:119) duplicated block id: 1336 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1372:1381) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1992:1999) duplicated block id: 1337 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (67:72) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/TransformerApiImpl.scala (49:54) duplicated block id: 1338 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Operator/EmptyProjectStep.cpp (51:62) - cpp-ch/local-engine/Operator/StreamingAggregatingStep.cpp (89:98) duplicated block id: 1339 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:58) duplicated block id: 1340 size: 6 cleaned lines of code in 2 files: - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (90:95) - shims/spark34/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (77:82) duplicated block id: 1341 size: 6 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (215:220) - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (87:92) duplicated block id: 1342 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/vectorized/SplitResult.java (31:36) - gluten-data/src/main/java/io/glutenproject/vectorized/GlutenSplitResult.java (23:28) duplicated block id: 1343 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (502:508) - cpp-ch/local-engine/local_engine_jni.cpp (514:520) duplicated block id: 1344 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/AggregateFunctions/AggregateFunctionPartialMerge.cpp (53:58) - cpp-ch/local-engine/AggregateFunctions/AggregateFunctionPartialMerge.cpp (89:94) duplicated block id: 1345 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatWriter.scala (349:356) - backends-velox/src/main/scala/org/apache/spark/sql/execution/VeloxColumnarWriteFilesExec.scala (232:237) duplicated block id: 1346 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/execution/VeloxColumnarWriteFilesExec.scala (258:263) - gluten-core/src/main/scala/io/glutenproject/execution/WriteFilesExecTransformer.scala (49:54) duplicated block id: 1347 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) duplicated block id: 1348 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (47:70) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayElement.h (44:57) duplicated block id: 1349 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/FileSourceScanExecTransformer.scala (37:42) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (35:40) duplicated block id: 1350 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (490:496) - cpp-ch/local-engine/local_engine_jni.cpp (514:520) duplicated block id: 1351 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (490:496) - cpp-ch/local-engine/local_engine_jni.cpp (502:508) duplicated block id: 1352 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (462:468) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (501:507) duplicated block id: 1353 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (44:49) - gluten-core/src/main/scala/io/glutenproject/utils/InputPartitionsUtil.scala (29:34) duplicated block id: 1354 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (280:285) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (361:366) duplicated block id: 1355 size: 6 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (215:220) - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (93:98) duplicated block id: 1356 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (133:138) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (636:641) duplicated block id: 1357 size: 6 cleaned lines of code in 2 files: - cpp/core/utils/qpl/qpl_codec.cc (76:81) - cpp/core/utils/qpl/qpl_codec.cc (134:139) duplicated block id: 1358 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (204:209) - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (266:271) duplicated block id: 1359 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (280:285) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (489:494) duplicated block id: 1360 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:49) duplicated block id: 1361 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/divide.cpp (43:51) - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:49) duplicated block id: 1362 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (110:115) - gluten-core/src/main/scala/io/glutenproject/execution/WindowExecTransformer.scala (176:181) duplicated block id: 1363 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (178:183) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (51:56) duplicated block id: 1364 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:57) duplicated block id: 1365 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (43:48) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (7:12) duplicated block id: 1366 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:57) duplicated block id: 1367 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxExpr.cc (91:97) - cpp/velox/substrait/SubstraitToVeloxExpr.cc (160:166) duplicated block id: 1368 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/Parameterized.scala (107:113) - tools/gluten-it/common/src/main/scala/io/glutenproject/integration/tpc/action/QueriesCompare.scala (35:41) duplicated block id: 1369 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:69) - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:58) duplicated block id: 1370 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (133:138) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (604:609) duplicated block id: 1371 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:57) duplicated block id: 1372 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/VeloxColumnarToRowExec.scala (129:134) - gluten-data/src/main/scala/org/apache/spark/sql/execution/ColumnarBuildSideRelation.scala (142:147) duplicated block id: 1373 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ArrowParquetBlockInputFormat.cpp (44:52) - cpp-ch/local-engine/Storages/SubstraitSource/OrcUtil.cpp (128:136) duplicated block id: 1374 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/java/io/glutenproject/substrait/rel/RelBuilder.java (140:145) - gluten-core/src/main/java/io/glutenproject/substrait/rel/RelBuilder.java (153:158) duplicated block id: 1375 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/findInset.cpp (43:58) duplicated block id: 1376 size: 6 cleaned lines of code in 2 files: - gluten-delta/src/main/scala/io/glutenproject/execution/DeltaScanTransformer.scala (30:35) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/FileSourceScanExecShim.scala (35:40) duplicated block id: 1377 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (204:209) - cpp-ch/local-engine/Functions/SparkFunctionRegexpExtractAll.cpp (345:350) duplicated block id: 1378 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (671:676) - cpp/core/jni/JniWrapper.cc (682:687) duplicated block id: 1379 size: 6 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (174:179) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala (139:144) duplicated block id: 1380 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1578:1587) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1724:1733) duplicated block id: 1381 size: 6 cleaned lines of code in 2 files: - cpp/core/utils/macros.h (41:48) - cpp/core/utils/macros.h (52:59) duplicated block id: 1382 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (219:224) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/utils/CHExecUtil.scala (232:237) duplicated block id: 1383 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (49:54) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (200:205) duplicated block id: 1384 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) duplicated block id: 1385 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Shuffle/PartitionWriter.cpp (145:154) - cpp-ch/local-engine/Shuffle/PartitionWriter.cpp (352:365) duplicated block id: 1386 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (170:175) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (45:50) duplicated block id: 1387 size: 6 cleaned lines of code in 2 files: - cpp/core/shuffle/HashPartitioner.h (28:33) - cpp/core/shuffle/SinglePartitioner.h (27:32) duplicated block id: 1388 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (478:484) - cpp-ch/local-engine/local_engine_jni.cpp (490:496) duplicated block id: 1389 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (3270:3276) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (3321:3327) duplicated block id: 1390 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (489:494) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (516:521) duplicated block id: 1391 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHSparkPlanExecApi.scala (219:224) - gluten-core/src/main/scala/org/apache/spark/sql/execution/ColumnarShuffleExchangeExec.scala (183:188) duplicated block id: 1392 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (604:609) - gluten-core/src/main/scala/io/glutenproject/backendsapi/SparkPlanExecApi.scala (87:92) duplicated block id: 1393 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1724:1733) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1787:1796) duplicated block id: 1394 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (49:54) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (120:125) duplicated block id: 1395 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayUnion.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/decode.cpp (41:49) duplicated block id: 1396 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (704:714) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (762:772) duplicated block id: 1397 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/java/io/glutenproject/integration/tpc/command/DataGenOnly.java (28:35) - tools/gluten-it/common/src/main/java/io/glutenproject/integration/tpc/command/SparkShell.java (29:36) duplicated block id: 1398 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (121:126) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (103:108) duplicated block id: 1399 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/RowToVeloxColumnarExec.scala (143:148) - backends-velox/src/main/scala/io/glutenproject/execution/RowToVeloxColumnarExec.scala (167:172) duplicated block id: 1400 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHMetricsApi.scala (266:272) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala (321:327) duplicated block id: 1401 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/clickhouse/MergeTreeFileFormatDataWriter.scala (680:685) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatDataWriter.scala (641:646) duplicated block id: 1402 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (478:484) - cpp-ch/local-engine/local_engine_jni.cpp (502:508) duplicated block id: 1403 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/scala/org/apache/spark/sql/execution/ColumnarBuildSideRelation.scala (50:55) - gluten-data/src/main/scala/org/apache/spark/sql/execution/ColumnarBuildSideRelation.scala (102:107) duplicated block id: 1404 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/local_engine_jni.cpp (478:484) - cpp-ch/local-engine/local_engine_jni.cpp (514:520) duplicated block id: 1405 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1398:1407) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1724:1733) duplicated block id: 1406 size: 6 cleaned lines of code in 2 files: - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/Spark32Shims.scala (105:110) - shims/spark34/src/main/scala/io/glutenproject/sql/shims/spark34/Spark34Shims.scala (125:130) duplicated block id: 1407 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/softaffinity/SoftAffinityManager.scala (77:82) - gluten-core/src/main/scala/io/glutenproject/softaffinity/SoftAffinityManager.scala (111:116) duplicated block id: 1408 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/execution/CHHashAggregateExecTransformer.scala (63:68) - backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/SparkPlanExecApiImpl.scala (178:183) duplicated block id: 1409 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkFunctionPositionUTF8.cpp (162:169) - cpp-ch/local-engine/Functions/SparkFunctionPositionUTF8.cpp (226:233) duplicated block id: 1410 size: 6 cleaned lines of code in 2 files: - cpp/velox/benchmarks/ParquetWriteBenchmark.cc (79:87) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (92:100) duplicated block id: 1411 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (489:494) - gluten-core/src/main/scala/io/glutenproject/execution/HashAggregateExecBaseTransformer.scala (361:366) duplicated block id: 1412 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q56.sql (59:64) - tools/gluten-it/common/src/main/resources/tpcds-queries/q60.sql (68:73) duplicated block id: 1413 size: 6 cleaned lines of code in 2 files: - cpp/core/shuffle/LocalPartitionWriter.cc (515:520) - cpp/core/shuffle/rss/CelebornPartitionWriter.cc (53:58) duplicated block id: 1414 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q5.sql (114:119) - tools/gluten-it/common/src/main/resources/tpcds-queries/q80.sql (82:87) duplicated block id: 1415 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/AggregateFunctions/AggregateFunctionPartialMerge.cpp (78:83) - cpp-ch/local-engine/AggregateFunctions/AggregateFunctionPartialMerge.cpp (89:94) duplicated block id: 1416 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (49:54) - tools/gluten-it/common/src/main/resources/tpcds-queries/q66.sql (209:214) duplicated block id: 1417 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHIteratorApi.scala (165:173) - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHIteratorApi.scala (330:337) duplicated block id: 1418 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayDistinct.cpp (43:49) - cpp-ch/local-engine/Parser/scalar_function_parser/logarithm.h (44:57) duplicated block id: 1419 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Functions/SparkParseURL.cpp (418:425) - cpp-ch/local-engine/Functions/SparkParseURL.cpp (501:507) duplicated block id: 1420 size: 6 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (215:220) - cpp/velox/benchmarks/ShuffleSplitBenchmark.cc (186:191) duplicated block id: 1421 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlan.cc (2035:2040) - cpp/velox/substrait/SubstraitToVeloxPlan.cc (2091:2096) duplicated block id: 1422 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:49) - cpp-ch/local-engine/Parser/scalar_function_parser/trunc.cpp (44:51) duplicated block id: 1423 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/expression/CHExpressionTransformer.scala (317:324) - gluten-core/src/main/scala/io/glutenproject/expression/DateTimeExpressionsTransformer.scala (134:141) duplicated block id: 1424 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2025:2031) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (2212:2218) duplicated block id: 1425 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-22/org/apache/spark/sql/execution/datasources/v2/clickhouse/source/DeltaMergeTreeFileFormat.scala (96:101) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/CHMergeTreeWriterInjects.scala (80:85) duplicated block id: 1426 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1518:1527) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1724:1733) duplicated block id: 1427 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/VeloxColumnarToRowExec.scala (142:150) - gluten-data/src/main/scala/org/apache/spark/sql/execution/ColumnarBuildSideRelation.scala (194:202) duplicated block id: 1428 size: 6 cleaned lines of code in 2 files: - tools/gluten-it/common/src/main/java/io/glutenproject/integration/tpc/command/Parameterized.java (43:50) - tools/gluten-it/common/src/main/java/io/glutenproject/integration/tpc/command/QueriesCompare.java (29:36) duplicated block id: 1429 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/backendsapi/clickhouse/CHTransformerApi.scala (44:49) - backends-clickhouse/src/main/scala/io/glutenproject/utils/CHInputPartitionsUtil.scala (33:38) duplicated block id: 1430 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayContains.cpp (45:69) - cpp-ch/local-engine/Parser/scalar_function_parser/sortArray.cpp (42:49) duplicated block id: 1431 size: 6 cleaned lines of code in 2 files: - cpp/core/shuffle/LocalPartitionWriter.h (38:43) - cpp/core/shuffle/rss/CelebornPartitionWriter.h (40:45) duplicated block id: 1432 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/org/apache/spark/sql/execution/VeloxColumnarWriteFilesExec.scala (258:263) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/WriteFiles.scala (70:75) duplicated block id: 1433 size: 6 cleaned lines of code in 2 files: - shims/common/src/main/scala/io/glutenproject/sql/shims/SparkShims.scala (64:69) - shims/spark32/src/main/scala/io/glutenproject/sql/shims/spark32/Spark32Shims.scala (71:76) duplicated block id: 1434 size: 6 cleaned lines of code in 2 files: - tools/workload/tpcds/run_tpcds/tpcds_parquet.scala (35:40) - tools/workload/tpch/run_tpch/tpch_parquet.scala (27:32) duplicated block id: 1435 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/scala/io/glutenproject/utils/CHInputPartitionsUtil.scala (77:83) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/utils/MergeTreePartsPartitionsUtil.scala (84:90) duplicated block id: 1436 size: 6 cleaned lines of code in 2 files: - cpp/core/benchmarks/CompressionBenchmark.cc (259:264) - cpp/velox/benchmarks/ColumnarToRowBenchmark.cc (105:110) duplicated block id: 1437 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (424:430) - cpp/core/jni/JniWrapper.cc (460:466) duplicated block id: 1438 size: 6 cleaned lines of code in 2 files: - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1471:1480) - gluten-data/src/main/java/io/glutenproject/vectorized/ArrowWritableColumnVector.java (1724:1733) duplicated block id: 1439 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayIntersect.cpp (44:51) - cpp-ch/local-engine/Parser/scalar_function_parser/encode.cpp (41:49) duplicated block id: 1440 size: 6 cleaned lines of code in 2 files: - backends-clickhouse/src/main/delta-20/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndex.scala (28:33) - backends-clickhouse/src/main/scala/org/apache/spark/sql/execution/datasources/v1/ClickHouseFileIndexBase.scala (35:40) duplicated block id: 1441 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (424:430) - cpp/core/jni/JniWrapper.cc (440:446) duplicated block id: 1442 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1359:1366) - cpp-ch/local-engine/Storages/ch_parquet/arrow/encoding.cc (1974:1983) duplicated block id: 1443 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/ScanTransformerFactory.scala (58:63) - gluten-delta/src/main/scala/io/glutenproject/execution/DeltaScanTransformer.scala (60:65) duplicated block id: 1444 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (211:216) - backends-velox/src/main/scala/io/glutenproject/execution/HashAggregateExecTransformer.scala (244:250) duplicated block id: 1445 size: 6 cleaned lines of code in 2 files: - backends-velox/src/main/scala/io/glutenproject/utils/SharedLibraryLoaderUbuntu2004.scala (49:54) - backends-velox/src/main/scala/io/glutenproject/utils/SharedLibraryLoaderUbuntu2204.scala (36:41) duplicated block id: 1446 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (420:427) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (966:973) duplicated block id: 1447 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/arrayPosition.cpp (41:62) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:57) duplicated block id: 1448 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/example_udf/myMd5.cpp (40:48) - cpp-ch/local-engine/Parser/scalar_function_parser/arrayMaxAndMin.cpp (40:46) duplicated block id: 1449 size: 6 cleaned lines of code in 2 files: - cpp/velox/shuffle/VeloxShuffleReader.cc (324:329) - cpp/velox/shuffle/VeloxShuffleReader.cc (336:341) duplicated block id: 1450 size: 6 cleaned lines of code in 2 files: - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (420:427) - cpp/velox/substrait/SubstraitToVeloxPlanValidator.cc (912:919) duplicated block id: 1451 size: 6 cleaned lines of code in 2 files: - gluten-core/src/main/scala/io/glutenproject/execution/WriteFilesExecTransformer.scala (49:54) - shims/spark33/src/main/scala/org/apache/spark/sql/execution/datasources/WriteFiles.scala (70:75) duplicated block id: 1452 size: 6 cleaned lines of code in 2 files: - cpp/core/jni/JniWrapper.cc (914:920) - cpp/core/jni/JniWrapper.cc (957:963) duplicated block id: 1453 size: 6 cleaned lines of code in 2 files: - cpp-ch/local-engine/Parser/scalar_function_parser/mapFromArraysUnaligned.cpp (48:60) - cpp-ch/local-engine/Parser/scalar_function_parser/unixTimestamp.cpp (44:57)