facebookresearch / torcharrow
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 44 files with 10,777 lines of code.
    • 2 very long files (3,631 lines of code)
    • 5 long files (3,693 lines of code)
    • 5 medium size files (1,516 lines of codeclsfd_ftr_w_mp_ins)
    • 7 small files (1,032 lines of code)
    • 25 very small files (905 lines of code)
33% | 34% | 14% | 9% | 8%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
pyi100% | 0% | 0% | 0% | 0%
py26% | 35% | 11% | 15% | 11%
cpp0% | 57% | 36% | 0% | 6%
h0% | 63% | 29% | 0% | 6%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
csrc/velox50% | 35% | 11% | 0% | 2%
torcharrow/velox_rt53% | 22% | 8% | 10% | 4%
torcharrow0% | 53% | 16% | 14% | 15%
csrc/velox/functions0% | 0% | 85% | 0% | 14%
tutorial0% | 0% | 0% | 100% | 0%
ROOT0% | 0% | 0% | 100% | 0%
torcharrow/benchmark0% | 0% | 0% | 0% | 100%
tools/codegen0% | 0% | 0% | 0% | 100%
Longest Files (Top 44)
File# lines# units
pyi
_torcharrow.pyi
in csrc/velox
1904 1082
dataframe_cpu.py
in torcharrow/velox_rt
1727 123
icolumn.py
in torcharrow
946 119
numerical_column_cpu.py
in torcharrow/velox_rt
739 72
lib.cpp
in csrc/velox
685 8
column.h
in csrc/velox
680 60
dtypes.py
in torcharrow
643 77
column.cpp
in csrc/velox
434 22
string_functions.h
in csrc/velox/functions
317 15
scope.py
in torcharrow
280 17
string_column_cpu.py
in torcharrow/velox_rt
277 52
_pytorch.py
in torcharrow
208 10
idataframe.py
in torcharrow
193 40
list_column_cpu.py
in torcharrow/velox_rt
180 17
map_column_cpu.py
in torcharrow/velox_rt
159 18
tutorial.py
in tutorial
130 10
expression.py
in torcharrow
127 17
setup.py
in root
124 5
_interop.py
in torcharrow
119 10
istring_column.py
in torcharrow
86 26
trace.py
in torcharrow
76 11
ilist_column.py
in torcharrow
70 11
vector.cpp
in csrc/velox
69 2
benchmark_list_construction.py
in torcharrow/benchmark
61 12
functional.py
in torcharrow
55 6
functional.py
in torcharrow/velox_rt
55 4
interop_arrow.py
in torcharrow
52 2
typing.py
in torcharrow/velox_rt
48 1
benchmark_vmap.py
in torcharrow/benchmark
45 6
functions.h
in csrc/velox/functions
38 1
velox_binding_stubgen.py
in tools/codegen
38 1
column.py
in torcharrow/velox_rt
29 3
__init__.py
in torcharrow
28 -
interop.py
in torcharrow
25 3
imap_column.py
in torcharrow
24 5
inumerical_column.py
in torcharrow
20 3
vector.h
in csrc/velox
17 -
dispatcher.py
in torcharrow
16 2
pytorch.py
in torcharrow
12 1
numeric_functions.h
in csrc/velox/functions
11 1
__init__.py
in torcharrow/velox_rt
11 -
register_bindings.cpp
in csrc/velox
9 2
bindings.h
in csrc/velox
7 -
register_udf.cpp
in csrc/velox/functions
3 1
Files With Most Units (Top 20)
File# lines# units
pyi
_torcharrow.pyi
in csrc/velox
1904 1082
dataframe_cpu.py
in torcharrow/velox_rt
1727 123
icolumn.py
in torcharrow
946 119
dtypes.py
in torcharrow
643 77
numerical_column_cpu.py
in torcharrow/velox_rt
739 72
column.h
in csrc/velox
680 60
string_column_cpu.py
in torcharrow/velox_rt
277 52
idataframe.py
in torcharrow
193 40
istring_column.py
in torcharrow
86 26
column.cpp
in csrc/velox
434 22
map_column_cpu.py
in torcharrow/velox_rt
159 18
expression.py
in torcharrow
127 17
scope.py
in torcharrow
280 17
list_column_cpu.py
in torcharrow/velox_rt
180 17
string_functions.h
in csrc/velox/functions
317 15
benchmark_list_construction.py
in torcharrow/benchmark
61 12
trace.py
in torcharrow
76 11
ilist_column.py
in torcharrow
70 11
tutorial.py
in tutorial
130 10
_pytorch.py
in torcharrow
208 10
Files With Long Lines (Top 5)

There are 5 files with lines longer than 120 characters. In total, there are 5 long lines.

File# lines# units# long lines
trace.py
in torcharrow
76 11 1
dtypes.py
in torcharrow
643 77 1
icolumn.py
in torcharrow
946 119 1
scope.py
in torcharrow
280 17 1
dataframe_cpu.py
in torcharrow/velox_rt
1727 123 1