facebookresearch / parcus
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 34 files with 4,125 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 10 medium size files (2,647 lines of codeclsfd_ftr_w_mp_ins)
    • 6 small files (696 lines of code)
    • 18 very small files (782 lines of code)
0% | 0% | 64% | 16% | 18%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 64% | 16% | 18%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
training0% | 0% | 94% | 0% | 5%
parsers/Spouse0% | 0% | 67% | 18% | 13%
parsers/MovieReview0% | 0% | 67% | 17% | 15%
models0% | 0% | 81% | 0% | 18%
parsers/Hatespeech0% | 0% | 35% | 48% | 15%
datasets0% | 0% | 0% | 33% | 66%
utils0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
Longest Files (Top 34)
File# lines# units
NeuralPatternMatchingTraining.py
in training
421 6
BertBaselineTraining.py
in training
312 4
Spouse_Preprocess.py
in parsers/Spouse
279 8
NPM.py
in models
260 34
BertFinetuneTraining.py
in training
250 4
Hatespeech_Preprocess.py
in parsers/Hatespeech
241 7
NGramLogRegTraining.py
in training
238 5
MovieReview_Preprocess.py
in parsers/MovieReview
220 7
Spouse_Finetune_Preprocess.py
in parsers/Spouse
213 7
MovieReview_Finetune_Preprocess.py
in parsers/MovieReview
213 7
Spouse_Dataset_Builder.py
in parsers/Spouse
133 4
NREDataset.py
in datasets
129 12
Hatespeech_Dataset_Builder.py
in parsers/Hatespeech
112 4
Hatespeech_Dataset_Fasttext_Builder.py
in parsers/Hatespeech
111 4
MovieReview_Dataset_Builder.py
in parsers/MovieReview
110 4
Hatespeech_Fasttext_Preprocess.py
in parsers/Hatespeech
101 4
Spouse_Finetune_Dataset_Builder.py
in parsers/Spouse
99 4
MovieReview_Finetune_Dataset_Builder.py
in parsers/MovieReview
99 4
SpouseBaselineDataset.py
in datasets
81 10
utils.py
in training
76 4
utils.py
in datasets
66 10
Hatespeech_Preprocess_Ngrams.py
in parsers/Hatespeech
62 3
BertBaselineDataset.py
in datasets
60 8
Hatespeech_Ngram_Builder.py
in parsers/Hatespeech
43 3
ResultsParser.py
in root
38 1
bert.py
in utils
34 2
BertFinetune.py
in models
34 3
BertFinetuneDataset.py
in datasets
29 4
NgramDataset.py
in datasets
24 3
LogisticRegression.py
in models
19 4
spacy.py
in utils
10 1
LabelModelNoSeed.py
in models
6 2
__init__.py
in datasets
1 -
__init__.py
in root
1 -
Files With Most Units (Top 20)
File# lines# units
NPM.py
in models
260 34
NREDataset.py
in datasets
129 12
SpouseBaselineDataset.py
in datasets
81 10
utils.py
in datasets
66 10
Spouse_Preprocess.py
in parsers/Spouse
279 8
BertBaselineDataset.py
in datasets
60 8
Hatespeech_Preprocess.py
in parsers/Hatespeech
241 7
Spouse_Finetune_Preprocess.py
in parsers/Spouse
213 7
MovieReview_Finetune_Preprocess.py
in parsers/MovieReview
213 7
MovieReview_Preprocess.py
in parsers/MovieReview
220 7
NeuralPatternMatchingTraining.py
in training
421 6
NGramLogRegTraining.py
in training
238 5
Hatespeech_Fasttext_Preprocess.py
in parsers/Hatespeech
101 4
Hatespeech_Dataset_Builder.py
in parsers/Hatespeech
112 4
Hatespeech_Dataset_Fasttext_Builder.py
in parsers/Hatespeech
111 4
Spouse_Dataset_Builder.py
in parsers/Spouse
133 4
Spouse_Finetune_Dataset_Builder.py
in parsers/Spouse
99 4
MovieReview_Finetune_Dataset_Builder.py
in parsers/MovieReview
99 4
MovieReview_Dataset_Builder.py
in parsers/MovieReview
110 4
BertBaselineTraining.py
in training
312 4
Files With Long Lines (Top 17)

There are 17 files with lines longer than 120 characters. In total, there are 63 long lines.

File# lines# units# long lines
NeuralPatternMatchingTraining.py
in training
421 6 20
BertBaselineTraining.py
in training
312 4 8
BertFinetuneTraining.py
in training
250 4 7
NGramLogRegTraining.py
in training
238 5 6
Hatespeech_Dataset_Fasttext_Builder.py
in parsers/Hatespeech
111 4 3
Hatespeech_Fasttext_Preprocess.py
in parsers/Hatespeech
101 4 2
Hatespeech_Dataset_Builder.py
in parsers/Hatespeech
112 4 2
Hatespeech_Preprocess.py
in parsers/Hatespeech
241 7 2
Spouse_Finetune_Preprocess.py
in parsers/Spouse
213 7 2
Spouse_Preprocess.py
in parsers/Spouse
279 8 2
Spouse_Dataset_Builder.py
in parsers/Spouse
133 4 2
Spouse_Finetune_Dataset_Builder.py
in parsers/Spouse
99 4 2
Hatespeech_Preprocess_Ngrams.py
in parsers/Hatespeech
62 3 1
Hatespeech_Ngram_Builder.py
in parsers/Hatespeech
43 3 1
MovieReview_Finetune_Preprocess.py
in parsers/MovieReview
213 7 1
MovieReview_Dataset_Builder.py
in parsers/MovieReview
110 4 1
MovieReview_Preprocess.py
in parsers/MovieReview
220 7 1