aws / amazon-sagemaker-examples
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 1,501 files with 188,951 lines of code.
    • 25 very long files (33,329 lines of code)
    • 15 long files (9,789 lines of code)
    • 193 medium size files (60,526 lines of codeclsfd_ftr_w_mp_ins)
    • 402 small files (52,672 lines of code)
    • 866 very small files (32,635 lines of code)
17% | 5% | 32% | 27% | 17%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py16% | 3% | 34% | 27% | 17%
yml62% | 21% | 7% | 0% | 7%
org90% | 0% | 0% | 9% | 0%
yaml0% | 22% | 0% | 67% | 10%
jsonl0% | 100% | 0% | 0% | 0%
cfg0% | 0% | 0% | 76% | 23%
html0% | 0% | 0% | 0% | 100%
R0% | 0% | 0% | 0% | 100%
Rmd0% | 0% | 0% | 0% | 100%
java0% | 0% | 0% | 0% | 100%
r0% | 0% | 0% | 0% | 100%
c0% | 0% | 0% | 0% | 100%
Dockerfile0% | 0% | 0% | 0% | 100%
jq0% | 0% | 0% | 0% | 100%
js0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
reinforcement_learning21% | 2% | 35% | 28% | 12%
ground_truth_labeling_jobs30% | 18% | 15% | 15% | 20%
training19% | 42% | 13% | 11% | 13%
sagemaker-python-sdk0% | 20% | 29% | 29% | 20%
sagemaker_processing0% | 76% | 0% | 9% | 14%
sagemaker_neo_compilation_jobs0% | 26% | 17% | 29% | 26%
advanced_functionality0% | 9% | 21% | 41% | 26%
sagemaker-training-compiler0% | 0% | 84% | 0% | 15%
aws_sagemaker_studio0% | 0% | 28% | 32% | 38%
introduction_to_amazon_algorithms0% | 0% | 49% | 34% | 16%
sagemaker-debugger0% | 0% | 15% | 36% | 47%
sagemaker_model_monitor0% | 0% | 69% | 27% | 3%
sagemaker-experiments0% | 0% | 57% | 0% | 42%
sagemaker_batch_transform0% | 0% | 28% | 18% | 53%
step-functions-data-science-sdk0% | 0% | 81% | 0% | 18%
use-cases0% | 0% | 25% | 22% | 52%
hyperparameter_tuning0% | 0% | 8% | 61% | 29%
sagemaker-pipelines0% | 0% | 23% | 22% | 53%
end_to_end0% | 0% | 0% | 52% | 47%
frameworks0% | 0% | 0% | 64% | 35%
contrib0% | 0% | 0% | 38% | 61%
introduction_to_applying_machine_learning0% | 0% | 0% | 100% | 0%
patterns0% | 0% | 0% | 60% | 39%
scientific_details_of_algorithms0% | 0% | 0% | 76% | 23%
aws_marketplace0% | 0% | 0% | 0% | 100%
sagemaker-script-mode0% | 0% | 0% | 0% | 100%
r_examples0% | 0% | 0% | 0% | 100%
sagemaker-jumpstart0% | 0% | 0% | 0% | 100%
sagemaker-clarify0% | 0% | 0% | 0% | 100%
sagemaker-fundamentals0% | 0% | 0% | 0% | 100%
prep_data0% | 0% | 0% | 0% | 100%
sagemaker-pipeline-parameterization0% | 0% | 0% | 0% | 100%
autopilot0% | 0% | 0% | 0% | 100%
sagemaker-pipeline-compare-model-versions0% | 0% | 0% | 0% | 100%
sagemaker_edge_manager0% | 0% | 0% | 0% | 100%
sagemaker-triton0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
sagemaker-inference-recommender0% | 0% | 0% | 0% | 100%
sagemaker-lineage0% | 0% | 0% | 0% | 100%
utils0% | 0% | 0% | 0% | 100%
_static0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
experiment_manager.py
in reinforcement_learning/bandits_statlog_vw_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_cartpole_coach/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_network_compression_ray_custom/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_hvac_coach_energyplus/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_portfolio_management_coach_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_cartpole_ray/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_game_server_autopilot/sagemaker/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_mountain_car_coach_gymEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_traveling_salesman_vehicle_routing_coach/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_managed_spot_cartpole_coach/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_stock_trading_coach_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_hvac_ray_energyplus/source/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_resource_allocation_ray_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_roboschool_ray/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_knapsack_coach_custom/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_predictive_autoscaling_coach_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_unity_ray/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_cartpole_batch_coach/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_deepracer_robomaker_coach_gazebo/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_roboschool_stable_baselines/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
main-packaged.yml
in ground_truth_labeling_jobs/multi_modal_parallel_sagemaker_labeling_workflows_with_step_functions/deploy
1145 -
org
1ZoneDataCenterCRAC_wPumpedDXCoolingCoil.idf.org
in reinforcement_learning/rl_hvac_coach_energyplus/src/eplus/envs/buildings/1ZoneDataCenter
1099 -
main-merged.yml
in ground_truth_labeling_jobs/multi_modal_parallel_sagemaker_labeling_workflows_with_step_functions/cloudformation
1030 -
train_gpt_simple.py
in training/distributed_training/pytorch/model_parallel/gpt2
1012 15
jsonl
data.jsonl
in sagemaker_processing/spark_distributed_data_processing/data
1000 -
sagemaker_smp_pretrain.py
in training/distributed_training/pytorch/model_parallel/bert/bert_example
883 20
workflow.yml
in ground_truth_labeling_jobs/multi_modal_parallel_sagemaker_labeling_workflows_with_step_functions/cloudformation
756 -
rollout_agent_ctrl.py
in reinforcement_learning/rl_deepracer_robomaker_coach_gazebo/src/markov/agent_ctrl
745 31
modeling.py
in training/distributed_training/pytorch/model_parallel/bert/bert_example
721 71
multi_agent_graph_manager.py
in reinforcement_learning/rl_deepracer_robomaker_coach_gazebo/src/markov/multi_agent_coach
649 43
train_mask_rcnn.py
in sagemaker-python-sdk/mxnet_horovod_maskrcnn/source
645 9
virtual_event_manager.py
in reinforcement_learning/rl_deepracer_robomaker_coach_gazebo/src/markov/virtual_event
591 20
template.yaml
in ground_truth_labeling_jobs/bring_your_own_model_for_sagemaker_labeling_workflows_with_active_learning/src
575 -
fp16.py
in training/distributed_training/pytorch/model_parallel/gpt2/fp16
560 40
train_faster_rcnn.py
in sagemaker-python-sdk/mxnet_horovod_fasterrcnn/source
548 8
train_yolo.py
in sagemaker_neo_compilation_jobs/gluoncv_yolo
543 8
cfn-sm.yaml
in advanced_functionality/distributed_tensorflow_mask_rcnn
538 -
rollout_worker.py
in reinforcement_learning/rl_deepracer_robomaker_coach_gazebo/src/markov
534 4
resnet.py
in reinforcement_learning/rl_network_compression_ray_custom/src/tensorflow_resnet/compressor
501 14
run_mlm.py
in sagemaker-training-compiler/huggingface/pytorch_multiple_gpu_multiple_node/scripts
464 4
run_mlm.py
in sagemaker-training-compiler/huggingface/pytorch_multiple_gpu_single_node/scripts
459 4
evaluation_worker.py
in reinforcement_learning/rl_deepracer_robomaker_coach_gazebo/src/markov
456 2
s3_metrics.py
in reinforcement_learning/rl_deepracer_robomaker_coach_gazebo/src/markov/metrics
456 21
run_clm.py
in sagemaker-training-compiler/huggingface/pytorch_multiple_gpu_multiple_node/scripts
444 4
knapsack_baseline.py
in reinforcement_learning/rl_knapsack_coach_custom/src
442 10
data.py
in sagemaker-python-sdk/dgl_gcmc
438 15
run_clm.py
in sagemaker-training-compiler/huggingface/pytorch_multiple_gpu_single_node/scripts
434 4
join_manager.py
in reinforcement_learning/bandits_statlog_vw_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
419 20
join_manager.py
in reinforcement_learning/rl_cartpole_coach/common/sagemaker_rl/orchestrator/workflow/manager
419 20
Files With Most Units (Top 20)
File# lines# units
modeling.py
in training/distributed_training/pytorch/model_parallel/bert/bert_example
721 71
multi_agent_graph_manager.py
in reinforcement_learning/rl_deepracer_robomaker_coach_gazebo/src/markov/multi_agent_coach
649 43
experiment_manager.py
in reinforcement_learning/bandits_statlog_vw_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_cartpole_coach/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_network_compression_ray_custom/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_hvac_coach_energyplus/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_portfolio_management_coach_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_cartpole_ray/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_game_server_autopilot/sagemaker/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_mountain_car_coach_gymEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_traveling_salesman_vehicle_routing_coach/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_managed_spot_cartpole_coach/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_stock_trading_coach_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_hvac_ray_energyplus/source/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_resource_allocation_ray_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_roboschool_ray/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_knapsack_coach_custom/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_predictive_autoscaling_coach_customEnv/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
experiment_manager.py
in reinforcement_learning/rl_unity_ray/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
Files With Long Lines (Top 20)

There are 107 files with lines longer than 120 characters. In total, there are 500 long lines.

File# lines# units# long lines
label_arn.py
in ground_truth_labeling_jobs/multi_modal_parallel_sagemaker_labeling_workflows_with_step_functions/src/lambda_src/shared
326 5 64
model_package_arns.py
in aws_marketplace/using_model_packages/improving_industrial_workplace_safety/src
73 4 52
model_package_arns.py
in aws_marketplace/using_model_packages/evaluating_aws_marketplace_models_for_person_counting_use_case/src
43 2 32
model_package_arns.py
in aws_marketplace/using_model_packages/auto_insurance/src
37 2 26
main-packaged.yml
in ground_truth_labeling_jobs/multi_modal_parallel_sagemaker_labeling_workflows_with_step_functions/deploy
1145 - 23
model_package_arns.py
in aws_marketplace/using_model_packages/data_quality_monitoring/src
22 1 16
model_package_arns.py
in aws_marketplace/using_model_packages/creative-writing-using-gpt-2-text-generation/src
22 1 16
scikit_product_arns.py
in aws_marketplace/using_model_packages/amazon_demo_product/src
19 1 13
algorithm_arns.py
in aws_marketplace/using_algorithms/automl/src
19 1 13
scikit_product_arns.py
in aws_marketplace/using_algorithms/amazon_demo_product/src
19 1 13
train_gpt_simple.py
in training/distributed_training/pytorch/model_parallel/gpt2
1012 15 8
fp16.py
in training/distributed_training/pytorch/model_parallel/gpt2/fp16
560 40 7
main-merged.yml
in ground_truth_labeling_jobs/multi_modal_parallel_sagemaker_labeling_workflows_with_step_functions/cloudformation
1030 - 5
create_datasets.py
in end_to_end/music_recommendation/code
112 1 5
demo_helpers.py
in end_to_end/music_recommendation/code
168 4 5
inference_specification.py
in end_to_end/music_recommendation/code
21 3 5
reporting.yml
in ground_truth_labeling_jobs/multi_modal_parallel_sagemaker_labeling_workflows_with_step_functions/cloudformation
255 - 4
Rmd
breast_cancer_eda.Rmd
in r_examples/rsconnect_rmarkdown
67 - 4
template.yaml
in ground_truth_labeling_jobs/bring_your_own_model_for_sagemaker_labeling_workflows_with_active_learning/src
575 - 3
cloudwatch_logger.py
in reinforcement_learning/bandits_statlog_vw_customEnv/common/sagemaker_rl/orchestrator/utils
228 8 3