amazon-research / sentence-representations
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 33 files with 2,535 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 1 medium size files (259 lines of codeclsfd_ftr_w_mp_ins)
    • 7 small files (1,082 lines of code)
    • 25 very small files (1,194 lines of code)
0% | 0% | 10% | 42% | 47%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 10% | 42% | 47%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
DownstreamEval/SentEval/senteval0% | 0% | 14% | 45% | 40%
DownstreamEval/clustering0% | 0% | 0% | 73% | 26%
PairSupCon0% | 0% | 0% | 63% | 36%
PairSupCon/utils0% | 0% | 0% | 0% | 100%
DownstreamEval0% | 0% | 0% | 0% | 100%
DownstreamEval/configs0% | 0% | 0% | 0% | 100%
PairSupCon/models0% | 0% | 0% | 0% | 100%
DownstreamEval/stseval0% | 0% | 0% | 0% | 100%
PairSupCon/dataloader0% | 0% | 0% | 0% | 100%
DownstreamEval/SentEval0% | 0% | 0% | 0% | 100%
Longest Files (Top 33)
File# lines# units
ranking.py
in DownstreamEval/SentEval/senteval/tools
259 12
validation.py
in DownstreamEval/SentEval/senteval/tools
186 7
sts.py
in DownstreamEval/SentEval/senteval
185 14
sick.py
in DownstreamEval/SentEval/senteval
167 8
metric.py
in DownstreamEval/clustering
160 23
classifier.py
in DownstreamEval/SentEval/senteval/tools
145 8
training.py
in PairSupCon
124 7
probing.py
in DownstreamEval/SentEval/senteval
115 14
engine.py
in DownstreamEval/SentEval/senteval
100 2
relatedness.py
in DownstreamEval/SentEval/senteval/tools
95 5
snli.py
in DownstreamEval/SentEval/senteval
86 4
rank.py
in DownstreamEval/SentEval/senteval
82 4
mrpc.py
in DownstreamEval/SentEval/senteval
77 4
sst.py
in DownstreamEval/SentEval/senteval
71 4
main.py
in PairSupCon
70 2
trec.py
in DownstreamEval/SentEval/senteval
66 4
utils.py
in DownstreamEval/SentEval/senteval
66 3
binary.py
in DownstreamEval/SentEval/senteval
65 8
configure.py
in DownstreamEval/configs
55 4
Transformers.py
in PairSupCon/models
52 5
eval_sts.py
in DownstreamEval
39 1
clustering_eval.py
in DownstreamEval/clustering
38 2
sts_eval.py
in DownstreamEval/stseval
38 1
contrastive_utils.py
in PairSupCon/utils
36 2
utils.py
in PairSupCon/utils
34 3
eval_cluster.py
in DownstreamEval
29 -
optimizer.py
in PairSupCon/utils
26 2
dataloader.py
in PairSupCon/dataloader
22 4
dataloader.py
in DownstreamEval/clustering
21 4
setup.py
in DownstreamEval/SentEval
12 -
utils.py
in DownstreamEval/configs
11 1
__init__.py
in DownstreamEval/SentEval/senteval
2 -
__init__.py
in DownstreamEval/SentEval/senteval/tools
1 -
Files With Most Units (Top 20)
File# lines# units
metric.py
in DownstreamEval/clustering
160 23
probing.py
in DownstreamEval/SentEval/senteval
115 14
sts.py
in DownstreamEval/SentEval/senteval
185 14
ranking.py
in DownstreamEval/SentEval/senteval/tools
259 12
sick.py
in DownstreamEval/SentEval/senteval
167 8
classifier.py
in DownstreamEval/SentEval/senteval/tools
145 8
binary.py
in DownstreamEval/SentEval/senteval
65 8
validation.py
in DownstreamEval/SentEval/senteval/tools
186 7
training.py
in PairSupCon
124 7
relatedness.py
in DownstreamEval/SentEval/senteval/tools
95 5
Transformers.py
in PairSupCon/models
52 5
dataloader.py
in DownstreamEval/clustering
21 4
mrpc.py
in DownstreamEval/SentEval/senteval
77 4
trec.py
in DownstreamEval/SentEval/senteval
66 4
rank.py
in DownstreamEval/SentEval/senteval
82 4
sst.py
in DownstreamEval/SentEval/senteval
71 4
snli.py
in DownstreamEval/SentEval/senteval
86 4
configure.py
in DownstreamEval/configs
55 4
dataloader.py
in PairSupCon/dataloader
22 4
utils.py
in DownstreamEval/SentEval/senteval
66 3
Files With Long Lines (Top 6)

There are 6 files with lines longer than 120 characters. In total, there are 10 long lines.

File# lines# units# long lines
clustering_eval.py
in DownstreamEval/clustering
38 2 2
engine.py
in DownstreamEval/SentEval/senteval
100 2 2
main.py
in PairSupCon
70 2 2
training.py
in PairSupCon
124 7 2
rank.py
in DownstreamEval/SentEval/senteval
82 4 1
sts_eval.py
in DownstreamEval/stseval
38 1 1