
pytorch / FBGEMM

File Size

The distribution of file sizes (measured in lines of code).

Intro

File size measurements show how file sizes are distributed across the codebase.
Files are classified into five categories based on their size in lines of code: 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), and 1001+ (very long files).
It is good practice to keep files small. Long files may become "bloaters": code that has grown so large that it is hard to work with.
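The bucketing described above can be sketched in a few lines of Python. This is a minimal illustration, not the report tool's own implementation: the names `BUCKETS`, `size_category`, and `bucket_totals` are hypothetical, and the sample line counts are copied from the tables in this report.

```python
from collections import defaultdict

# Inclusive upper bounds and labels for the bounded buckets, as listed in the intro.
BUCKETS = [(100, "very small"), (200, "small"), (500, "medium size"), (1000, "long")]


def size_category(lines_of_code):
    """Return the size bucket for a file with the given number of lines of code."""
    if lines_of_code < 1:
        raise ValueError("expected at least one line of code")
    for upper_bound, label in BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    return "very long"  # 1001+ lines


def bucket_totals(line_counts):
    """Map each bucket label to [number of files, total lines of code]."""
    totals = defaultdict(lambda: [0, 0])
    for loc in line_counts.values():
        entry = totals[size_category(loc)]
        entry[0] += 1
        entry[1] += loc
    return dict(totals)


# A few (file, lines of code) pairs copied from the tables in this report.
sample = {
    "FbgemmFP16UKernelsAvx512.cc": 3130,
    "Fbgemm.h": 804,
    "embedding_forward_quantized_cpu_template.cpp": 359,
    "FbgemmPackMatrixB.h": 200,
    "split_embedding_configs.py": 88,
}

for name, loc in sample.items():
    print(f"{name}: {size_category(loc)}")
```

Note that the upper bounds are inclusive, so a 200-line file such as FbgemmPackMatrixB.h still counts as small.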


File Size Overall

There are 234 files with 70,952 lines of code.

14 very long files (24,815 lines of code)
18 long files (13,278 lines of code)
70 medium size files (22,274 lines of code)
47 small files (6,957 lines of code)
85 very small files (3,628 lines of code)

Legend: 1001+ | 501-1000 | 201-500 | 101-200 | 1-100


File Size per Extension

[Chart: file size distribution per file extension. Buckets: 1001+ | 501-1000 | 201-500 | 101-200 | 1-100]

File Size per Logical Decomposition

[Chart: file size distribution for the "primary" logical decomposition. Buckets: 1001+ | 501-1000 | 201-500 | 101-200 | 1-100]

Longest Files (Top 50)

File	# lines	# units
FbgemmFP16UKernelsAvx512.cc in src	3130	14
FbgemmFP16UKernelsAvx512_256.cc in src	2247	8
split_table_batched_embeddings_ops.py in fbgemm_gpu/fbgemm_gpu	1963	48
sparse_ops.cu in fbgemm_gpu/src	1953	-
fbgemm_cuda_utils.cuh in fbgemm_gpu/include/fbgemm_gpu	1948	-
split_table_batched_embeddings_benchmark.py in fbgemm_gpu/bench	1916	21
split_embeddings_cache_cuda.cu in fbgemm_gpu/src	1803	-
QuantUtilsAvx2.cc in src	1761	15
RefImplementations.cc in src	1755	31
sparse_ops_cpu.cpp in fbgemm_gpu/src	1683	33
EmbeddingSpMDM.cc in src	1251	1
EmbeddingSpMDMNBit.cc in src	1170	1
jagged_tensor_ops.cu in fbgemm_gpu/src	1150	-
embedding_backward_split_template.cu in fbgemm_gpu/codegen	1085	-
FbgemmI8Depthwise3DAvx2.cc in src	989	5
jagged_tensor_ops_cpu.cpp in fbgemm_gpu/src	967	28
GroupwiseConv.cc in src	938	12
embedding_forward_quantized_split_template.cu in fbgemm_gpu/codegen	917	-
FbgemmFP16UKernelsAvx2.cc in src	886	6
UtilsAvx512.cc in src	845	9
SparseAdagrad.cc in src	842	4
RowWiseSparseAdagradFused.cc in src	823	1
Fbgemm.h in include/fbgemm	804	46
QuantUtils.cc in src	706	24
quantize_ops.cu in fbgemm_gpu/src	666	-
FbgemmI8Depthwise2DAvx2-inl.h in src	656	4
PackAWithIm2Col.cc in src	645	4
embedding_backward_code_generator.py in fbgemm_gpu/codegen	531	34
embedding_forward_split_cpu.cpp in fbgemm_gpu/codegen	526	5
EmbeddingSpMDMAvx512.cc in src	521	29
Fbgemm.cc in src	510	3
FbgemmSparseDenseInt8Avx512.cc in src	506	6
GenerateKernelDirectConvU8S8S32ACC32.cc in src	491	2
FbgemmI8DepthwiseAvx2-inl.h in src	487	2
ExecuteKernelU8S8.cc in src	486	-
GenerateI8Depthwise.cc in src	483	2
embedding_backward_split_host_template.cpp in fbgemm_gpu/codegen	467	2
codegen_fp16fp32.cc in src	461	4
embedding_forward_split_template.cu in fbgemm_gpu/codegen	450	-
ConvUnifiedBenchmark.cc in bench	449	2
split_embeddings_cache_benchmark.py in fbgemm_gpu/bench	448	15
merge_embeddings_benchmark.py in fbgemm_gpu/bench	440	10
FbgemmConv.cc in src	434	6
BenchUtils.h in bench	415	6
GroupwiseConvRequantizeBenchmark.cc in bench	413	2
PackWeightsForDirectConv.cc in src	412	3
FbgemmI64.cc in src	409	3
Utils.cc in src	397	19
histogram_binning_calibration_ops.cu in fbgemm_gpu/src	388	-
embedding_forward_quantized_host_cpu.cpp in fbgemm_gpu/codegen	380	19

Files With Most Units (Top 50)

File	# lines	# units
split_table_batched_embeddings_ops.py in fbgemm_gpu/fbgemm_gpu	1963	48
Fbgemm.h in include/fbgemm	804	46
PackingTraits-inl.h in include/fbgemm	298	35
embedding_backward_code_generator.py in fbgemm_gpu/codegen	531	34
sparse_ops_cpu.cpp in fbgemm_gpu/src	1683	33
RefImplementations.cc in src	1755	31
EmbeddingSpMDMAvx512.cc in src	521	29
jagged_tensor_ops_cpu.cpp in fbgemm_gpu/src	967	28
QuantUtils.cc in src	706	24
split_table_batched_embeddings_benchmark.py in fbgemm_gpu/bench	1916	21
embedding_forward_quantized_host_cpu.cpp in fbgemm_gpu/codegen	380	19
Utils.cc in src	397	19
FbgemmPackMatrixB.h in include/fbgemm	200	18
split_embeddings_cache_benchmark.py in fbgemm_gpu/bench	448	15
sparse_ops_utils.h in fbgemm_gpu/include/fbgemm_gpu	214	15
QuantUtilsAvx2.cc in src	1761	15
CodeGenHelpers.h in src	167	15
FbgemmFP16UKernelsIntrinsicAvx512.cc in src	111	15
quantize_ops_cpu.cpp in fbgemm_gpu/src	303	14
FbgemmFP16UKernelsAvx512.cc in src	3130	14
BenchUtils.cc in bench	182	13
GroupwiseConv.cc in src	938	12
merge_embeddings_benchmark.py in fbgemm_gpu/bench	440	10
QuantUtils.h in include/fbgemm	224	10
FbgemmFP16UKernelsIntrinsicAvx512_256.cc in src	92	9
UtilsAvx512.cc in src	845	9
histogram_binning_calibration_benchmark.py in fbgemm_gpu/bench	242	8
FbgemmFP16UKernelsAvx512_256.cc in src	2247	8
split_embedding_configs.py in fbgemm_gpu/fbgemm_gpu	88	7
merge_pooled_embeddings_gpu.cpp in fbgemm_gpu/src	341	7
TransposeUtilsAvx2.h in src	343	7
PackBMatrix.cc in src	264	7
FbgemmFP16UKernelsIntrinsicAvx2.cc in src	86	7
BenchUtils.h in bench	415	6
split_embedding_inference_converter.py in fbgemm_gpu/fbgemm_gpu	145	6
embedding_forward_quantized_cpu_template.cpp in fbgemm_gpu/codegen	359	6
setup.py in fbgemm_gpu	144	6
sparse_ops_gpu.cpp in fbgemm_gpu/src	157	6
FbgemmSparseDenseInt8Avx512.cc in src	506	6
FbgemmFP16UKernelsAvx2.cc in src	886	6
PackWeightMatrixForGConv.cc in src	189	6
FbgemmConv.cc in src	434	6
FbgemmSparseDense.cc in src	253	6
batched_unary_embeddings_ops.py in fbgemm_gpu/fbgemm_gpu	61	5
embedding_backward_dense_host.cpp in fbgemm_gpu/codegen	363	5
embedding_forward_split_cpu.cpp in fbgemm_gpu/codegen	526	5
input_combine_cpu.cpp in fbgemm_gpu/src	278	5
permute_pooled_embedding_ops_gpu.cpp in fbgemm_gpu/src	132	5
FbgemmI8Depthwise3DAvx2.cc in src	989	5
FbgemmBfloat16ConvertAvx2.cc in src	42	5

Files With Long Lines (Top 23)

There are 23 files with lines longer than 120 characters. In total, there are 121 long lines.

File	# lines	# units	# long lines
embedding_forward_quantized_split_template.cu in fbgemm_gpu/codegen	917	-	22
embedding_backward_split_template.cu in fbgemm_gpu/codegen	1085	-	16
sparse_ops_cpu.cpp in fbgemm_gpu/src	1683	33	12
split_table_batched_embeddings_benchmark.py in fbgemm_gpu/bench	1916	21	10
merge_embeddings_benchmark.py in fbgemm_gpu/bench	440	10	7
embedding_forward_quantized_cpu_template.cpp in fbgemm_gpu/codegen	359	6	7
embedding_backward_split_host_template.cpp in fbgemm_gpu/codegen	467	2	5
split_table_batched_embeddings.cpp in fbgemm_gpu/src	101	-	5
jagged_tensor_ops_cpu.cpp in fbgemm_gpu/src	967	28	5
quantize_ops_benchmark.py in fbgemm_gpu/bench	186	3	4
embedding_backward_split_indice_weights_template.cu in fbgemm_gpu/codegen	274	-	4
embedding_forward_split_template.cu in fbgemm_gpu/codegen	450	-	4
split_table_batched_embeddings_ops.py in fbgemm_gpu/fbgemm_gpu	1963	48	3
embedding_backward_split_host_cpu_template.cpp in fbgemm_gpu/codegen	200	2	3
embedding_forward_quantized_host_cpu.cpp in fbgemm_gpu/codegen	380	19	3
input_combine_cpu.cpp in fbgemm_gpu/src	278	5	3
permute_pooled_embedding_ops_gpu.cpp in fbgemm_gpu/src	132	5	2
split_embeddings_cache_benchmark.py in fbgemm_gpu/bench	448	15	1
embedding_bounds_check.cu in fbgemm_gpu/codegen	138	-	1
embedding_bounds_check_host_cpu.cpp in fbgemm_gpu/codegen	98	1	1
embedding_backward_dense_host_cpu.cpp in fbgemm_gpu/codegen	164	3	1
layout_transform_ops_cpu.cpp in fbgemm_gpu/src	63	1	1
merge_pooled_embeddings_gpu.cpp in fbgemm_gpu/src	341	7	1
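A long-line count like the one in this table can be approximated as follows. This is a sketch under assumptions: the helper name `long_line_numbers` is hypothetical, and the report does not say how the tool measures line length (for example, whether tabs or trailing whitespace are counted), so the result may differ from the numbers above.

```python
def long_line_numbers(text, limit=120):
    """Return the 1-based numbers of lines longer than `limit` characters."""
    return [
        number
        for number, line in enumerate(text.splitlines(), start=1)
        if len(line) > limit
    ]


# Three lines: exactly 120 characters, 121 characters, and a short one.
sample_text = "\n".join(["x" * 120, "y" * 121, "short"])
print(long_line_numbers(sample_text))  # prints [2]
```

Only the 121-character line is reported, because the threshold is "longer than 120 characters", not "120 or longer".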

Correlations

File Size vs. Commits (all time): 233 points

lines of code: min: 1.0 | average: 303.43 | 25th percentile: 54.5 | median: 162.0 | 75th percentile: 313.0 | max: 3130.0
commits (all time): min: 1.0 | average: 15.36 | 25th percentile: 6.0 | median: 11.0 | 75th percentile: 20.0 | max: 93.0

File Size vs. Contributors (all time): 233 points

lines of code: min: 1.0 | average: 303.43 | 25th percentile: 54.5 | median: 162.0 | 75th percentile: 313.0 | max: 3130.0
contributors (all time): min: 1.0 | average: 4.97 | 25th percentile: 3.0 | median: 4.0 | 75th percentile: 7.0 | max: 28.0

File Size vs. Commits (30 days): 61 points

lines of code: min: 1.0 | average: 415.98 | 25th percentile: 54.5 | median: 200.0 | 75th percentile: 445.0 | max: 1963.0
commits (30d): min: 1.0 | average: 2.38 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 16.0

File Size vs. Contributors (30 days): 61 points

lines of code: min: 1.0 | average: 415.98 | 25th percentile: 54.5 | median: 200.0 | 75th percentile: 445.0 | max: 1963.0
contributors (30d): min: 1.0 | average: 1.61 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 4.0

File Size vs. Commits (90 days): 231 points

lines of code: min: 1.0 | average: 305.65 | 25th percentile: 55.0 | median: 164.0 | 75th percentile: 317.0 | max: 3130.0
commits (90d): min: 1.0 | average: 3.69 | 25th percentile: 2.0 | median: 3.0 | 75th percentile: 4.0 | max: 21.0

File Size vs. Contributors (90 days): 231 points

lines of code: min: 1.0 | average: 305.65 | 25th percentile: 55.0 | median: 164.0 | 75th percentile: 317.0 | max: 3130.0
contributors (90d): min: 1.0 | average: 1.9 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 2.0 | max: 8.0
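The per-axis summaries in this section (min, average, quartiles, max) can be reproduced in spirit with Python's standard `statistics` module. This is a hedged sketch: the function name `axis_summary` is hypothetical, and the report does not state which percentile interpolation method it uses, so the "inclusive" method chosen here is an assumption and may give slightly different quartiles.

```python
import statistics


def axis_summary(values):
    """Summarize one axis of a scatter plot with the statistics used in this report."""
    ordered = sorted(values)
    # Quartiles via linear interpolation that treats the data as the full population.
    q1, median, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    return {
        "min": ordered[0],
        "average": statistics.fmean(ordered),
        "25th percentile": q1,
        "median": median,
        "75th percentile": q3,
        "max": ordered[-1],
    }


# Illustrative input only; these are not values taken from the report.
print(axis_summary([1, 2, 3, 4, 5]))
```

`statistics.quantiles` needs at least two data points, so each axis would be summarized over all files in the plot rather than per file.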