twitter / communitynotes
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
21% | 34% | 28% | 10% | 5%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py21% | 34% | 28% | 10% | 5%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
sourcecode21% | 34% | 28% | 10% | 5%
Longest Files (Top 44)
File# lines# units
run_scoring.py
in sourcecode/scoring
1579 27
pflip_plus_model.py
in sourcecode/scoring
1225 36
mf_base_scorer.py
in sourcecode/scoring
973 19
scoring_rules.py
in sourcecode/scoring
880 44
constants.py
in sourcecode/scoring
866 5
note_ratings.py
in sourcecode/scoring
598 10
process_data.py
in sourcecode/scoring
582 26
pflip_model.py
in sourcecode/scoring
535 22
pandas_utils.py
in sourcecode/scoring
481 21
matrix_factorization.py
in sourcecode/scoring/matrix_factorization
470 17
contributor_state.py
in sourcecode/scoring
442 17
reputation_matrix_factorization.py
in sourcecode/scoring/reputation_matrix_factorization
439 16
389 20
scorer.py
in sourcecode/scoring
339 20
pseudo_raters.py
in sourcecode/scoring/matrix_factorization
286 12
runner.py
in sourcecode/scoring
248 3
topic_model.py
in sourcecode/scoring
217 11
post_selection_similarity.py
in sourcecode/scoring
215 9
note_status_history.py
in sourcecode/scoring
213 7
mf_group_scorer.py
in sourcecode/scoring
184 13
helpfulness_scores.py
in sourcecode/scoring
173 4
mf_topic_scorer.py
in sourcecode/scoring
173 10
diligence_model.py
in sourcecode/scoring/reputation_matrix_factorization
167 4
reputation_scorer.py
in sourcecode/scoring
136 12
helpfulness_model.py
in sourcecode/scoring/reputation_matrix_factorization
129 3
incorrect_filter.py
in sourcecode/scoring
120 4
normalized_loss.py
in sourcecode/scoring/matrix_factorization
120 6
tag_consensus.py
in sourcecode/scoring
117 3
mf_expansion_scorer.py
in sourcecode/scoring
83 9
mf_core_with_topics_scorer.py
in sourcecode/scoring
83 9
mf_expansion_plus_scorer.py
in sourcecode/scoring
77 9
explanation_tags.py
in sourcecode/scoring
76 3
mf_core_scorer.py
in sourcecode/scoring
69 6
weighted_loss.py
in sourcecode/scoring/reputation_matrix_factorization
61 4
tag_filter.py
in sourcecode/scoring
59 5
model.py
in sourcecode/scoring/matrix_factorization
55 5
dataset.py
in sourcecode/scoring/reputation_matrix_factorization
41 1
mf_multi_group_scorer.py
in sourcecode/scoring
31 4
enums.py
in sourcecode/scoring
25 1
main.py
in sourcecode
5 -
__init__.py
in sourcecode
1 -
__init__.py
in sourcecode/scoring/matrix_factorization
1 -
__init__.py
in sourcecode/scoring
1 -
__init__.py
in sourcecode/scoring/reputation_matrix_factorization
1 -
Files With Most Units (Top 39)
File# lines# units
scoring_rules.py
in sourcecode/scoring
880 44
pflip_plus_model.py
in sourcecode/scoring
1225 36
run_scoring.py
in sourcecode/scoring
1579 27
process_data.py
in sourcecode/scoring
582 26
pflip_model.py
in sourcecode/scoring
535 22
pandas_utils.py
in sourcecode/scoring
481 21
389 20
scorer.py
in sourcecode/scoring
339 20
mf_base_scorer.py
in sourcecode/scoring
973 19
matrix_factorization.py
in sourcecode/scoring/matrix_factorization
470 17
contributor_state.py
in sourcecode/scoring
442 17
reputation_matrix_factorization.py
in sourcecode/scoring/reputation_matrix_factorization
439 16
mf_group_scorer.py
in sourcecode/scoring
184 13
pseudo_raters.py
in sourcecode/scoring/matrix_factorization
286 12
reputation_scorer.py
in sourcecode/scoring
136 12
topic_model.py
in sourcecode/scoring
217 11
note_ratings.py
in sourcecode/scoring
598 10
mf_topic_scorer.py
in sourcecode/scoring
173 10
post_selection_similarity.py
in sourcecode/scoring
215 9
mf_expansion_scorer.py
in sourcecode/scoring
83 9
mf_expansion_plus_scorer.py
in sourcecode/scoring
77 9
mf_core_with_topics_scorer.py
in sourcecode/scoring
83 9
note_status_history.py
in sourcecode/scoring
213 7
normalized_loss.py
in sourcecode/scoring/matrix_factorization
120 6
mf_core_scorer.py
in sourcecode/scoring
69 6
tag_filter.py
in sourcecode/scoring
59 5
model.py
in sourcecode/scoring/matrix_factorization
55 5
constants.py
in sourcecode/scoring
866 5
incorrect_filter.py
in sourcecode/scoring
120 4
mf_multi_group_scorer.py
in sourcecode/scoring
31 4
helpfulness_scores.py
in sourcecode/scoring
173 4
diligence_model.py
in sourcecode/scoring/reputation_matrix_factorization
167 4
weighted_loss.py
in sourcecode/scoring/reputation_matrix_factorization
61 4
tag_consensus.py
in sourcecode/scoring
117 3
explanation_tags.py
in sourcecode/scoring
76 3
helpfulness_model.py
in sourcecode/scoring/reputation_matrix_factorization
129 3
runner.py
in sourcecode/scoring
248 3
dataset.py
in sourcecode/scoring/reputation_matrix_factorization
41 1
enums.py
in sourcecode/scoring
25 1
Files With Long Lines (Top 7)

There are 7 files with lines longer than 120 characters. In total, there are 29 long lines.

File# lines# units# long lines
process_data.py
in sourcecode/scoring
582 26 15
contributor_state.py
in sourcecode/scoring
442 17 4
note_status_history.py
in sourcecode/scoring
213 7 3
run_scoring.py
in sourcecode/scoring
1579 27 2
note_ratings.py
in sourcecode/scoring
598 10 2
mf_base_scorer.py
in sourcecode/scoring
973 19 2
helpfulness_scores.py
in sourcecode/scoring
173 4 1
Correlations

File Size vs. Commits (all time): 44 points

sourcecode/scoring/mf_base_scorer.py x: 87 commits (all time) y: 973 lines of code sourcecode/scoring/mf_group_scorer.py x: 38 commits (all time) y: 184 lines of code sourcecode/scoring/run_scoring.py x: 91 commits (all time) y: 1579 lines of code sourcecode/scoring/scorer.py x: 39 commits (all time) y: 339 lines of code sourcecode/scoring/scoring_rules.py x: 76 commits (all time) y: 880 lines of code sourcecode/scoring/constants.py x: 86 commits (all time) y: 866 lines of code sourcecode/scoring/matrix_factorization/matrix_factorization.py x: 31 commits (all time) y: 470 lines of code sourcecode/scoring/matrix_factorization/model.py x: 12 commits (all time) y: 55 lines of code sourcecode/scoring/mf_core_scorer.py x: 40 commits (all time) y: 69 lines of code sourcecode/scoring/mf_core_with_topics_scorer.py x: 6 commits (all time) y: 83 lines of code sourcecode/scoring/mf_expansion_plus_scorer.py x: 14 commits (all time) y: 77 lines of code sourcecode/scoring/mf_expansion_scorer.py x: 35 commits (all time) y: 83 lines of code sourcecode/scoring/note_ratings.py x: 52 commits (all time) y: 598 lines of code sourcecode/scoring/note_status_history.py x: 31 commits (all time) y: 213 lines of code sourcecode/scoring/pflip_plus_model.py x: 4 commits (all time) y: 1225 lines of code sourcecode/scoring/topic_model.py x: 21 commits (all time) y: 217 lines of code sourcecode/scoring/enums.py x: 24 commits (all time) y: 25 lines of code sourcecode/scoring/process_data.py x: 68 commits (all time) y: 582 lines of code sourcecode/scoring/pflip_model.py x: 4 commits (all time) y: 535 lines of code sourcecode/scoring/helpfulness_scores.py x: 22 commits (all time) y: 173 lines of code sourcecode/scoring/pandas_utils.py x: 17 commits (all time) y: 481 lines of code sourcecode/scoring/runner.py x: 32 commits (all time) y: 248 lines of code sourcecode/scoring/tag_consensus.py x: 16 commits (all time) y: 117 lines of code sourcecode/main.py x: 23 commits (all time) y: 5 lines of code sourcecode/scoring/matrix_factorization/pseudo_raters.py x: 19 commits (all time) y: 286 lines of code sourcecode/scoring/post_selection_similarity.py x: 14 commits (all time) y: 215 lines of code sourcecode/scoring/post_selection_similarity_old.py x: 2 commits (all time) y: 389 lines of code sourcecode/scoring/contributor_state.py x: 18 commits (all time) y: 442 lines of code sourcecode/scoring/reputation_matrix_factorization/diligence_model.py x: 13 commits (all time) y: 167 lines of code sourcecode/scoring/reputation_matrix_factorization/helpfulness_model.py x: 6 commits (all time) y: 129 lines of code sourcecode/scoring/reputation_matrix_factorization/reputation_matrix_factorization.py x: 20 commits (all time) y: 439 lines of code sourcecode/scoring/reputation_scorer.py x: 16 commits (all time) y: 136 lines of code sourcecode/scoring/tag_filter.py x: 10 commits (all time) y: 59 lines of code sourcecode/scoring/mf_multi_group_scorer.py x: 2 commits (all time) y: 31 lines of code sourcecode/scoring/matrix_factorization/normalized_loss.py x: 4 commits (all time) y: 120 lines of code sourcecode/scoring/reputation_matrix_factorization/dataset.py x: 8 commits (all time) y: 41 lines of code sourcecode/scoring/mf_topic_scorer.py x: 12 commits (all time) y: 173 lines of code sourcecode/scoring/incorrect_filter.py x: 21 commits (all time) y: 120 lines of code sourcecode/scoring/explanation_tags.py x: 21 commits (all time) y: 76 lines of code sourcecode/scoring/matrix_factorization/__init__.py x: 4 commits (all time) y: 1 lines of code sourcecode/scoring/reputation_matrix_factorization/__init__.py x: 3 commits (all time) y: 1 lines of code sourcecode/scoring/reputation_matrix_factorization/weighted_loss.py x: 6 commits (all time) y: 61 lines of code sourcecode/__init__.py x: 1 commits (all time) y: 1 lines of code
1579.0
lines of code
  min: 1.0
  average: 294.66
  25th percentile: 63.0
  median: 170.0
  75th percentile: 441.25
  max: 1579.0
0 91.0
commits (all time)
min: 1.0 | average: 24.32 | 25th percentile: 6.0 | median: 17.5 | 75th percentile: 31.75 | max: 91.0

File Size vs. Contributors (all time): 44 points

sourcecode/scoring/mf_base_scorer.py x: 7 contributors (all time) y: 973 lines of code sourcecode/scoring/mf_group_scorer.py x: 4 contributors (all time) y: 184 lines of code sourcecode/scoring/run_scoring.py x: 6 contributors (all time) y: 1579 lines of code sourcecode/scoring/scorer.py x: 6 contributors (all time) y: 339 lines of code sourcecode/scoring/scoring_rules.py x: 7 contributors (all time) y: 880 lines of code sourcecode/scoring/constants.py x: 7 contributors (all time) y: 866 lines of code sourcecode/scoring/matrix_factorization/matrix_factorization.py x: 3 contributors (all time) y: 470 lines of code sourcecode/scoring/matrix_factorization/model.py x: 3 contributors (all time) y: 55 lines of code sourcecode/scoring/mf_core_scorer.py x: 5 contributors (all time) y: 69 lines of code sourcecode/scoring/mf_core_with_topics_scorer.py x: 3 contributors (all time) y: 83 lines of code sourcecode/scoring/mf_expansion_plus_scorer.py x: 3 contributors (all time) y: 77 lines of code sourcecode/scoring/mf_expansion_scorer.py x: 4 contributors (all time) y: 83 lines of code sourcecode/scoring/note_ratings.py x: 5 contributors (all time) y: 598 lines of code sourcecode/scoring/note_status_history.py x: 5 contributors (all time) y: 213 lines of code sourcecode/scoring/pflip_plus_model.py x: 2 contributors (all time) y: 1225 lines of code sourcecode/scoring/topic_model.py x: 3 contributors (all time) y: 217 lines of code sourcecode/scoring/enums.py x: 4 contributors (all time) y: 25 lines of code sourcecode/scoring/process_data.py x: 8 contributors (all time) y: 582 lines of code sourcecode/scoring/pflip_model.py x: 3 contributors (all time) y: 535 lines of code sourcecode/scoring/helpfulness_scores.py x: 4 contributors (all time) y: 173 lines of code sourcecode/scoring/pandas_utils.py x: 3 contributors (all time) y: 481 lines of code sourcecode/scoring/runner.py x: 3 contributors (all time) y: 248 lines of code sourcecode/scoring/tag_consensus.py x: 3 contributors (all time) y: 117 lines of code sourcecode/main.py x: 6 contributors (all time) y: 5 lines of code sourcecode/scoring/matrix_factorization/pseudo_raters.py x: 3 contributors (all time) y: 286 lines of code sourcecode/scoring/post_selection_similarity.py x: 1 contributors (all time) y: 215 lines of code sourcecode/scoring/post_selection_similarity_old.py x: 1 contributors (all time) y: 389 lines of code sourcecode/scoring/contributor_state.py x: 5 contributors (all time) y: 442 lines of code sourcecode/scoring/reputation_matrix_factorization/diligence_model.py x: 3 contributors (all time) y: 167 lines of code sourcecode/scoring/reputation_matrix_factorization/helpfulness_model.py x: 1 contributors (all time) y: 129 lines of code sourcecode/scoring/reputation_matrix_factorization/reputation_matrix_factorization.py x: 3 contributors (all time) y: 439 lines of code sourcecode/scoring/reputation_scorer.py x: 3 contributors (all time) y: 136 lines of code sourcecode/scoring/tag_filter.py x: 3 contributors (all time) y: 59 lines of code sourcecode/scoring/mf_multi_group_scorer.py x: 1 contributors (all time) y: 31 lines of code sourcecode/scoring/matrix_factorization/normalized_loss.py x: 3 contributors (all time) y: 120 lines of code sourcecode/scoring/reputation_matrix_factorization/dataset.py x: 3 contributors (all time) y: 41 lines of code sourcecode/scoring/mf_topic_scorer.py x: 3 contributors (all time) y: 173 lines of code sourcecode/scoring/incorrect_filter.py x: 4 contributors (all time) y: 120 lines of code sourcecode/scoring/explanation_tags.py x: 7 contributors (all time) y: 76 lines of code sourcecode/scoring/matrix_factorization/__init__.py x: 1 contributors (all time) y: 1 lines of code sourcecode/scoring/reputation_matrix_factorization/__init__.py x: 2 contributors (all time) y: 1 lines of code
1579.0
lines of code
  min: 1.0
  average: 294.66
  25th percentile: 63.0
  median: 170.0
  75th percentile: 441.25
  max: 1579.0
0 8.0
contributors (all time)
min: 1.0 | average: 3.61 | 25th percentile: 3.0 | median: 3.0 | 75th percentile: 5.0 | max: 8.0

File Size vs. Commits (30 days): 16 points

sourcecode/scoring/mf_base_scorer.py x: 4 commits (30d) y: 973 lines of code sourcecode/scoring/mf_group_scorer.py x: 4 commits (30d) y: 184 lines of code sourcecode/scoring/run_scoring.py x: 4 commits (30d) y: 1579 lines of code sourcecode/scoring/scorer.py x: 2 commits (30d) y: 339 lines of code sourcecode/scoring/scoring_rules.py x: 4 commits (30d) y: 880 lines of code sourcecode/scoring/constants.py x: 2 commits (30d) y: 866 lines of code sourcecode/scoring/matrix_factorization/matrix_factorization.py x: 2 commits (30d) y: 470 lines of code sourcecode/scoring/matrix_factorization/model.py x: 2 commits (30d) y: 55 lines of code sourcecode/scoring/mf_core_scorer.py x: 2 commits (30d) y: 69 lines of code sourcecode/scoring/mf_core_with_topics_scorer.py x: 2 commits (30d) y: 83 lines of code sourcecode/scoring/mf_expansion_plus_scorer.py x: 2 commits (30d) y: 77 lines of code sourcecode/scoring/note_ratings.py x: 2 commits (30d) y: 598 lines of code sourcecode/scoring/note_status_history.py x: 2 commits (30d) y: 213 lines of code sourcecode/scoring/pflip_plus_model.py x: 2 commits (30d) y: 1225 lines of code
1579.0
lines of code
  min: 55.0
  average: 494.44
  25th percentile: 83.0
  median: 278.0
  75th percentile: 876.5
  max: 1579.0
0 4.0
commits (30d)
min: 2.0 | average: 2.5 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 3.5 | max: 4.0

File Size vs. Contributors (30 days): 16 points

sourcecode/scoring/mf_base_scorer.py x: 3 contributors (30d) y: 973 lines of code sourcecode/scoring/mf_group_scorer.py x: 3 contributors (30d) y: 184 lines of code sourcecode/scoring/run_scoring.py x: 3 contributors (30d) y: 1579 lines of code sourcecode/scoring/scorer.py x: 1 contributors (30d) y: 339 lines of code sourcecode/scoring/scoring_rules.py x: 3 contributors (30d) y: 880 lines of code sourcecode/scoring/constants.py x: 2 contributors (30d) y: 866 lines of code sourcecode/scoring/matrix_factorization/matrix_factorization.py x: 2 contributors (30d) y: 470 lines of code sourcecode/scoring/matrix_factorization/model.py x: 2 contributors (30d) y: 55 lines of code sourcecode/scoring/mf_core_scorer.py x: 2 contributors (30d) y: 69 lines of code sourcecode/scoring/mf_core_with_topics_scorer.py x: 2 contributors (30d) y: 83 lines of code sourcecode/scoring/mf_expansion_plus_scorer.py x: 2 contributors (30d) y: 77 lines of code sourcecode/scoring/note_ratings.py x: 2 contributors (30d) y: 598 lines of code sourcecode/scoring/note_status_history.py x: 2 contributors (30d) y: 213 lines of code sourcecode/scoring/pflip_plus_model.py x: 2 contributors (30d) y: 1225 lines of code
1579.0
lines of code
  min: 55.0
  average: 494.44
  25th percentile: 83.0
  median: 278.0
  75th percentile: 876.5
  max: 1579.0
0 3.0
contributors (30d)
min: 1.0 | average: 2.19 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.75 | max: 3.0

File Size vs. Commits (90 days): 23 points

sourcecode/scoring/mf_base_scorer.py x: 18 commits (90d) y: 973 lines of code sourcecode/scoring/mf_group_scorer.py x: 8 commits (90d) y: 184 lines of code sourcecode/scoring/run_scoring.py x: 17 commits (90d) y: 1579 lines of code sourcecode/scoring/scorer.py x: 8 commits (90d) y: 339 lines of code sourcecode/scoring/scoring_rules.py x: 18 commits (90d) y: 880 lines of code sourcecode/scoring/constants.py x: 16 commits (90d) y: 866 lines of code sourcecode/scoring/matrix_factorization/matrix_factorization.py x: 2 commits (90d) y: 470 lines of code sourcecode/scoring/matrix_factorization/model.py x: 2 commits (90d) y: 55 lines of code sourcecode/scoring/mf_core_scorer.py x: 10 commits (90d) y: 69 lines of code sourcecode/scoring/mf_core_with_topics_scorer.py x: 6 commits (90d) y: 83 lines of code sourcecode/scoring/mf_expansion_plus_scorer.py x: 4 commits (90d) y: 77 lines of code sourcecode/scoring/mf_expansion_scorer.py x: 4 commits (90d) y: 83 lines of code sourcecode/scoring/note_ratings.py x: 10 commits (90d) y: 598 lines of code sourcecode/scoring/note_status_history.py x: 7 commits (90d) y: 213 lines of code sourcecode/scoring/pflip_plus_model.py x: 4 commits (90d) y: 1225 lines of code sourcecode/scoring/topic_model.py x: 11 commits (90d) y: 217 lines of code sourcecode/scoring/enums.py x: 8 commits (90d) y: 25 lines of code sourcecode/scoring/process_data.py x: 4 commits (90d) y: 582 lines of code sourcecode/scoring/pflip_model.py x: 2 commits (90d) y: 535 lines of code sourcecode/scoring/helpfulness_scores.py x: 3 commits (90d) y: 173 lines of code sourcecode/scoring/pandas_utils.py x: 3 commits (90d) y: 481 lines of code sourcecode/scoring/runner.py x: 3 commits (90d) y: 248 lines of code sourcecode/scoring/tag_consensus.py x: 3 commits (90d) y: 117 lines of code
1579.0
lines of code
  min: 25.0
  average: 437.91
  25th percentile: 83.0
  median: 248.0
  75th percentile: 598.0
  max: 1579.0
0 18.0
commits (90d)
min: 2.0 | average: 7.43 | 25th percentile: 3.0 | median: 6.0 | 75th percentile: 10.0 | max: 18.0

File Size vs. Contributors (90 days): 23 points

sourcecode/scoring/mf_base_scorer.py x: 3 contributors (90d) y: 973 lines of code sourcecode/scoring/mf_group_scorer.py x: 3 contributors (90d) y: 184 lines of code sourcecode/scoring/run_scoring.py x: 3 contributors (90d) y: 1579 lines of code sourcecode/scoring/scorer.py x: 3 contributors (90d) y: 339 lines of code sourcecode/scoring/scoring_rules.py x: 3 contributors (90d) y: 880 lines of code sourcecode/scoring/constants.py x: 3 contributors (90d) y: 866 lines of code sourcecode/scoring/matrix_factorization/matrix_factorization.py x: 2 contributors (90d) y: 470 lines of code sourcecode/scoring/matrix_factorization/model.py x: 2 contributors (90d) y: 55 lines of code sourcecode/scoring/mf_core_scorer.py x: 3 contributors (90d) y: 69 lines of code sourcecode/scoring/mf_core_with_topics_scorer.py x: 3 contributors (90d) y: 83 lines of code sourcecode/scoring/mf_expansion_plus_scorer.py x: 2 contributors (90d) y: 77 lines of code sourcecode/scoring/mf_expansion_scorer.py x: 2 contributors (90d) y: 83 lines of code sourcecode/scoring/note_ratings.py x: 3 contributors (90d) y: 598 lines of code sourcecode/scoring/note_status_history.py x: 3 contributors (90d) y: 213 lines of code sourcecode/scoring/pflip_plus_model.py x: 2 contributors (90d) y: 1225 lines of code sourcecode/scoring/enums.py x: 3 contributors (90d) y: 25 lines of code sourcecode/scoring/process_data.py x: 3 contributors (90d) y: 582 lines of code sourcecode/scoring/pflip_model.py x: 1 contributors (90d) y: 535 lines of code sourcecode/scoring/helpfulness_scores.py x: 2 contributors (90d) y: 173 lines of code sourcecode/scoring/pandas_utils.py x: 2 contributors (90d) y: 481 lines of code sourcecode/scoring/runner.py x: 2 contributors (90d) y: 248 lines of code sourcecode/scoring/tag_consensus.py x: 2 contributors (90d) y: 117 lines of code
1579.0
lines of code
  min: 25.0
  average: 437.91
  25th percentile: 83.0
  median: 248.0
  75th percentile: 598.0
  max: 1579.0
0 3.0
contributors (90d)
min: 1.0 | average: 2.52 | 25th percentile: 2.0 | median: 3.0 | 75th percentile: 3.0 | max: 3.0