aws-samples / pyflink-getting-started
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 71% duplication:
    • 426 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 304 duplicated lines
  • 27 duplicates
system71% (304 lines)
Duplication per Extension
py71% (304 lines)
Duplication per Component (primary)
pyflink-examples/TumblingWindows100% (77 lines)
pyflink-examples/UDF80% (63 lines)
pyflink-examples/GettingStarted75% (63 lines)
pyflink-examples/SlidingWindows65% (52 lines)
pyflink-examples/StreamingFileSink53% (49 lines)
datagen0% (0 lines)

Duplication Between Components (50+ lines)

G pyflink-examples/TumblingWindows pyflink-examples/TumblingWindows pyflink-examples/UDF pyflink-examples/UDF pyflink-examples/TumblingWindows--pyflink-examples/UDF 126 pyflink-examples/GettingStarted pyflink-examples/GettingStarted pyflink-examples/GettingStarted--pyflink-examples/TumblingWindows 126 pyflink-examples/GettingStarted--pyflink-examples/UDF 126 pyflink-examples/SlidingWindows pyflink-examples/SlidingWindows pyflink-examples/GettingStarted--pyflink-examples/SlidingWindows 104 pyflink-examples/StreamingFileSink pyflink-examples/StreamingFileSink pyflink-examples/GettingStarted--pyflink-examples/StreamingFileSink 66 pyflink-examples/SlidingWindows--pyflink-examples/TumblingWindows 104 pyflink-examples/SlidingWindows--pyflink-examples/UDF 104 pyflink-examples/StreamingFileSink--pyflink-examples/TumblingWindows 98 pyflink-examples/StreamingFileSink--pyflink-examples/UDF 66

Download: SVG DOT (open online Graphviz editor)

Open 3D force graph...

Show more details on duplication between components...
Longest Duplicates
The list of 20 longest duplicates.
See data for all 27 duplicates...
Size#FoldersFilesLinesCode
33 x 2 pyflink-examples/TumblingWindows
pyflink-examples/UDF
tumbling-windows.py
udf.py
23:82 (42%)
24:84 (42%)
view
33 x 2 pyflink-examples/GettingStarted
pyflink-examples/UDF
getting-started.py
udf.py
33:92 (39%)
24:84 (42%)
view
33 x 2 pyflink-examples/GettingStarted
pyflink-examples/TumblingWindows
getting-started.py
tumbling-windows.py
33:92 (39%)
23:82 (42%)
view
28 x 2 pyflink-examples/GettingStarted
pyflink-examples/SlidingWindows
getting-started.py
sliding-windows.py
39:92 (33%)
28:81 (35%)
view
28 x 2 pyflink-examples/SlidingWindows
pyflink-examples/TumblingWindows
sliding-windows.py
tumbling-windows.py
28:81 (35%)
29:82 (36%)
view
28 x 2 pyflink-examples/SlidingWindows
pyflink-examples/UDF
sliding-windows.py
udf.py
28:81 (35%)
30:84 (35%)
view
24 x 2 pyflink-examples/TumblingWindows
pyflink-examples/UDF
tumbling-windows.py
udf.py
100:137 (31%)
102:139 (30%)
view
24 x 2 pyflink-examples/GettingStarted
pyflink-examples/TumblingWindows
getting-started.py
tumbling-windows.py
109:146 (28%)
100:137 (31%)
view
24 x 2 pyflink-examples/SlidingWindows
pyflink-examples/TumblingWindows
sliding-windows.py
tumbling-windows.py
102:139 (30%)
100:137 (31%)
view
24 x 2 pyflink-examples/SlidingWindows
pyflink-examples/UDF
sliding-windows.py
udf.py
102:139 (30%)
102:139 (30%)
view
24 x 2 pyflink-examples/GettingStarted
pyflink-examples/UDF
getting-started.py
udf.py
109:146 (28%)
102:139 (30%)
view
24 x 2 pyflink-examples/GettingStarted
pyflink-examples/SlidingWindows
getting-started.py
sliding-windows.py
109:146 (28%)
102:139 (30%)
view
14 x 2 pyflink-examples/StreamingFileSink
pyflink-examples/TumblingWindows
streaming-file-sink.py
tumbling-windows.py
23:42 (15%)
23:41 (18%)
view
14 x 2 pyflink-examples/StreamingFileSink
pyflink-examples/UDF
streaming-file-sink.py
udf.py
23:42 (15%)
24:42 (17%)
view
14 x 2 pyflink-examples/GettingStarted
pyflink-examples/StreamingFileSink
getting-started.py
streaming-file-sink.py
33:51 (16%)
23:42 (15%)
view
13 x 2 pyflink-examples/StreamingFileSink
pyflink-examples/UDF
streaming-file-sink.py
udf.py
55:71 (14%)
44:61 (16%)
view
13 x 2 pyflink-examples/GettingStarted
pyflink-examples/StreamingFileSink
getting-started.py
streaming-file-sink.py
53:69 (15%)
55:71 (14%)
view
13 x 2 pyflink-examples/SlidingWindows
pyflink-examples/StreamingFileSink
sliding-windows.py
streaming-file-sink.py
42:58 (16%)
55:71 (14%)
view
13 x 2 pyflink-examples/StreamingFileSink
pyflink-examples/TumblingWindows
streaming-file-sink.py
tumbling-windows.py
55:71 (14%)
43:59 (16%)
view
12 x 2 pyflink-examples/StreamingFileSink
pyflink-examples/TumblingWindows
streaming-file-sink.py
tumbling-windows.py
114:131 (13%)
85:102 (15%)
view
Duplicated Units
The list of top 2 duplicated units.
See data for all 2 unit duplicates...
Size#FoldersFilesLinesCode
10 x 2 pyflink-examples/StreamingFileSink
pyflink-examples/TumblingWindows
streaming-file-sink.py
tumbling-windows.py
0:0 
0:0 
view
7 x 5 pyflink-examples/StreamingFileSink
pyflink-examples/TumblingWindows
pyflink-examples/GettingStarted
pyflink-examples/UDF
pyflink-examples/SlidingWindows
streaming-file-sink.py
tumbling-windows.py
getting-started.py
udf.py
sliding-windows.py
0:0 
0:0 
0:0 
0:0 
0:0 
view