The distribution of size of files (measured in lines of code).
File | # lines | # units |
---|---|---|
CloneDetector.cs in DuplicateCodeDetector |
137 | 9 |
Extractor.java in tokenizers/java/src/main/java/javatokenizer |
115 | 3 |
CloneGroups.cs in DuplicateCodeDetector |
84 | 4 |
parser.js in tokenizers/javascript |
77 | 3 |
Program.cs in tokenizers/CsharpTokenizer/CsharpTokenizer |
76 | 5 |
Program.fs in tokenizers/FSharpTokenizer/FSharpTokenizer |
60 | - |
CloneDetectorCli.cs in DuplicateCodeDetector |
58 | 1 |
SparseVector.cs in DuplicateCodeDetector/Utils |
57 | 3 |
tokenizepythoncorpus.py in tokenizers/python |
35 | 3 |
FeatureDictionary.cs in DuplicateCodeDetector/Utils |
28 | 3 |
baronetokenizer.py in tokenizers/python |
23 | 2 |
FSharpTokenizer.fsproj in tokenizers/FSharpTokenizer/FSharpTokenizer |
17 | - |
File | # lines | # units |
---|---|---|
CloneDetector.cs in DuplicateCodeDetector |
137 | 9 |
Program.cs in tokenizers/CsharpTokenizer/CsharpTokenizer |
76 | 5 |
CloneGroups.cs in DuplicateCodeDetector |
84 | 4 |
FeatureDictionary.cs in DuplicateCodeDetector/Utils |
28 | 3 |
SparseVector.cs in DuplicateCodeDetector/Utils |
57 | 3 |
Extractor.java in tokenizers/java/src/main/java/javatokenizer |
115 | 3 |
parser.js in tokenizers/javascript |
77 | 3 |
tokenizepythoncorpus.py in tokenizers/python |
35 | 3 |
baronetokenizer.py in tokenizers/python |
23 | 2 |
CloneDetectorCli.cs in DuplicateCodeDetector |
58 | 1 |
There are 5 files with lines longer than 120 characters. In total, there are 16 long lines.
File | # lines | # units | # long lines |
---|---|---|---|
CloneDetector.cs in DuplicateCodeDetector |
137 | 9 | 9 |
CloneDetectorCli.cs in DuplicateCodeDetector |
58 | 1 | 3 |
Extractor.java in tokenizers/java/src/main/java/javatokenizer |
115 | 3 | 2 |
Program.cs in tokenizers/CsharpTokenizer/CsharpTokenizer |
76 | 5 | 1 |
tokenizepythoncorpus.py in tokenizers/python |
35 | 3 | 1 |