alibaba / LucaProt
File Size

The distribution of file sizes (measured in lines of code).

File Size Overall
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
13% | 37% | 38% | 6% | 3%
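The bucketing behind these percentages can be sketched in a few lines of Python. This is a minimal illustration, not the reporting tool's actual code: the names `BUCKETS`, `bucket_for`, and `size_distribution` are hypothetical, and the sketch assumes each percentage is the share of all lines of code that sits in files of that size range (the per-file line counts listed further down appear consistent with that reading).

```python
# Bucket boundaries copied from the size ranges above: (label, inclusive upper bound).
# None marks the open-ended top bucket.
BUCKETS = [
    ("1-100", 100),
    ("101-200", 200),
    ("201-500", 500),
    ("501-1000", 1000),
    ("1001+", None),
]


def bucket_for(line_count):
    """Return the size-range label for a file with the given number of lines."""
    for label, upper in BUCKETS:
        if upper is None or line_count <= upper:
            return label


def size_distribution(line_counts):
    """Return the percentage of all lines that falls into each bucket.

    Assumption: shares are weighted by lines of code, not by file count.
    """
    totals = {label: 0 for label, _ in BUCKETS}
    for count in line_counts:
        totals[bucket_for(count)] += count
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {label: 0.0 for label in totals}
    return {label: 100.0 * loc / grand_total for label, loc in totals.items()}
```

For example, `bucket_for(1344)` returns `"1001+"` and `bucket_for(100)` returns `"1-100"`, since each upper bound is inclusive.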


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 13% | 37% | 39% | 6% | 3%
html | 0% | 0% | 0% | 100% | 0%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
src | 13% | 37% | 38% | 6% | 3%
Longest Files (Top 50)
File | Folder | # lines | # units
[missing] | [missing] | 1344 | 30
modeling_bert.py | src/SSFN | 1338 | 65
[missing] | [missing] | 889 | 9
dnn.py | src/baselines | 805 | 16
run.py | src/deep_baselines | 778 | 10
run.py | src | 735 | 1
data_preprocess_for_rdrp_v1.py | src/data_preprocess | 711 | 15
[missing] | [missing] | 708 | 10
[missing] | [missing] | 683 | 10
app.py | src/app | 620 | 12
lgbm.py | src/baselines | 602 | 16
xgb.py | src/baselines | 566 | 16
predict_deep_baselines.py | src/deep_baselines | 546 | 10
cheer.py | src/deep_baselines | 497 | 9
[missing] | [missing] | 492 | -
structure_from_esm_v1.py | src/protein_structure | 447 | 7
[missing] | [missing] | 428 | 17
metrics.py | src/common | 419 | 22
model.py | src/SSFN | 394 | 4
utils.py | src | 376 | 19
[missing] | [missing] | 369 | 2
embedding_from_esmfold.py | src/protein_structure | 337 | 3
[missing] | [missing] | 319 | 1
ncbi_id_2_uniprot.py | src/data_preprocess | 308 | 15
[missing] | [missing] | 304 | 1
[missing] | [missing] | 291 | 6
[missing] | [missing] | 285 | 1
pooling.py | src/SSFN | 259 | 25
data_preprocess_for_rdrp_v2.py | src/data_preprocess | 256 | 10
[missing] | [missing] | 252 | 9
[missing] | [missing] | 251 | 1
[missing] | [missing] | 232 | 2
contact_map_builder.py | src/biotoolbox | 229 | 20
virtifier.py | src/deep_baselines | 227 | 3
virhunter.py | src/deep_baselines | 212 | 4
virseeker.py | src/deep_baselines | 209 | 3
tf_records_generator.py | src/data_preprocess | 207 | 8
predict.py | src/baselines | 205 | 6
predict_structure.py | src/protein_structure | 203 | 5
subword.py | src/data_preprocess | 190 | 8
[missing] | [missing] | 184 | 1
merge_embedding_pdb_result.py | src/protein_structure | 177 | 4
process_predict_result.py | src/result_process | 162 | 3
[missing] | [missing] | 156 | 1
loss.py | src/common | 150 | 8
index.html | src/app/templates | 148 | -
layers.py | src/SSFN | 135 | 9
structure_file_reader.py | src/biotoolbox | 129 | 10
contact_map_generator.py | src/biotoolbox | 100 | 11
[missing] | [missing] | 87 | -
Files With Most Units (Top 50)
File | Folder | # lines | # units
modeling_bert.py | src/SSFN | 1338 | 65
[missing] | [missing] | 1344 | 30
pooling.py | src/SSFN | 259 | 25
metrics.py | src/common | 419 | 22
contact_map_builder.py | src/biotoolbox | 229 | 20
utils.py | src | 376 | 19
[missing] | [missing] | 428 | 17
lgbm.py | src/baselines | 602 | 16
xgb.py | src/baselines | 566 | 16
dnn.py | src/baselines | 805 | 16
data_preprocess_for_rdrp_v1.py | src/data_preprocess | 711 | 15
ncbi_id_2_uniprot.py | src/data_preprocess | 308 | 15
app.py | src/app | 620 | 12
contact_map_generator.py | src/biotoolbox | 100 | 11
run.py | src/deep_baselines | 778 | 10
predict_deep_baselines.py | src/deep_baselines | 546 | 10
data_preprocess_for_rdrp_v2.py | src/data_preprocess | 256 | 10
[missing] | [missing] | 683 | 10
[missing] | [missing] | 708 | 10
structure_file_reader.py | src/biotoolbox | 129 | 10
[missing] | [missing] | 889 | 9
cheer.py | src/deep_baselines | 497 | 9
[missing] | [missing] | 252 | 9
layers.py | src/SSFN | 135 | 9
loss.py | src/common | 150 | 8
tf_records_generator.py | src/data_preprocess | 207 | 8
subword.py | src/data_preprocess | 190 | 8
structure_from_esm_v1.py | src/protein_structure | 447 | 7
[missing] | [missing] | 291 | 6
predict.py | src/baselines | 205 | 6
predict_structure.py | src/protein_structure | 203 | 5
virhunter.py | src/deep_baselines | 212 | 4
merge_embedding_pdb_result.py | src/protein_structure | 177 | 4
gcn.py | src/SSFN | 66 | 4
model.py | src/SSFN | 394 | 4
virtifier.py | src/deep_baselines | 227 | 3
virseeker.py | src/deep_baselines | 209 | 3
process_predict_result.py | src/result_process | 162 | 3
[missing] | [missing] | 52 | 3
embedding_from_esmfold.py | src/protein_structure | 337 | 3
[missing] | [missing] | 369 | 2
[missing] | [missing] | 69 | 2
[missing] | [missing] | 232 | 2
run.py | src | 735 | 1
[missing] | [missing] | 251 | 1
[missing] | [missing] | 71 | 1
[missing] | [missing] | 156 | 1
[missing] | [missing] | 319 | 1
[missing] | [missing] | 285 | 1
[missing] | [missing] | 304 | 1
Files With Long Lines (Top 44)

There are 44 files with lines longer than 120 characters. In total, there are 460 long lines.

File | Folder | # lines | # units | # long lines
[missing] | [missing] | 1344 | 30 | 47
run.py | src/deep_baselines | 778 | 10 | 38
dnn.py | src/baselines | 805 | 16 | 38
cheer.py | src/deep_baselines | 497 | 9 | 22
[missing] | [missing] | 683 | 10 | 22
app.py | src/app | 620 | 12 | 21
[missing] | [missing] | 708 | 10 | 21
xgb.py | src/baselines | 566 | 16 | 21
lgbm.py | src/baselines | 602 | 16 | 20
[missing] | [missing] | 889 | 9 | 19
predict_deep_baselines.py | src/deep_baselines | 546 | 10 | 15
data_preprocess_for_rdrp_v1.py | src/data_preprocess | 711 | 15 | 14
model.py | src/SSFN | 394 | 4 | 14
[missing] | [missing] | 319 | 1 | 14
run.py | src | 735 | 1 | 10
[missing] | [missing] | 492 | - | 9
embedding_from_esmfold.py | src/protein_structure | 337 | 3 | 9
virtifier.py | src/deep_baselines | 227 | 3 | 8
virseeker.py | src/deep_baselines | 209 | 3 | 8
merge_embedding_pdb_result.py | src/protein_structure | 177 | 4 | 8
metrics.py | src/common | 419 | 22 | 7
virhunter.py | src/deep_baselines | 212 | 4 | 7
[missing] | [missing] | 304 | 1 | 7
statistics.py | src/deep_baselines | 79 | - | 6
contact_map_builder.py | src/biotoolbox | 229 | 20 | 6
predict.py | src/baselines | 205 | 6 | 6
[missing] | [missing] | 369 | 2 | 5
[missing] | [missing] | 252 | 9 | 5
structure_from_esm_v1.py | src/protein_structure | 447 | 7 | 5
subword.py | src/data_preprocess | 190 | 8 | 4
[missing] | [missing] | 232 | 2 | 3
predict_structure.py | src/protein_structure | 203 | 5 | 3
[missing] | [missing] | 285 | 1 | 3
[missing] | [missing] | 428 | 17 | 2
[missing] | [missing] | 291 | 6 | 2
contact_map_generator.py | src/biotoolbox | 100 | 11 | 2
gcn.py | src/SSFN | 66 | 4 | 2
loss.py | src/common | 150 | 8 | 1
process_predict_result.py | src/result_process | 162 | 3 | 1
data_preprocess_for_rdrp_v2.py | src/data_preprocess | 256 | 10 | 1
[missing] | [missing] | 86 | - | 1
[missing] | [missing] | 87 | - | 1
[missing] | [missing] | 77 | - | 1
utils.py | src | 376 | 19 | 1
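
The long-line counts above can be approximated with a short Python sketch. It is an illustration under stated assumptions, not the reporting tool's implementation: the names `count_long_lines` and `long_line_summary` are hypothetical, a line counts as long when it has strictly more than 120 characters, and tabs or trailing whitespace are counted as ordinary characters (the tool may treat them differently).

```python
LONG_LINE_THRESHOLD = 120  # characters, the limit stated in this section


def count_long_lines(text, threshold=LONG_LINE_THRESHOLD):
    """Count lines strictly longer than `threshold` characters."""
    return sum(1 for line in text.splitlines() if len(line) > threshold)


def long_line_summary(sources, threshold=LONG_LINE_THRESHOLD):
    """Summarise long lines for a mapping of file path -> file contents.

    Returns (number of affected files, total long lines, per-file counts
    sorted from most to fewest long lines, ties broken by path).
    """
    counts = {path: count_long_lines(text, threshold) for path, text in sources.items()}
    affected = sorted(
        ((path, n) for path, n in counts.items() if n > 0),
        key=lambda item: (-item[1], item[0]),
    )
    return len(affected), sum(n for _, n in affected), affected
```

The first two return values correspond to the two totals quoted at the top of this section (files affected and long lines overall); the third corresponds to the last column of the table.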