aws / amazon-sagemaker-examples
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
59% | 25% | 8% | 3% | 3%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
ipynb66% | 27% | 5% | <1% | 0%
py3% | 10% | 26% | 28% | 31%
org90% | 0% | 0% | 9% | 0%
jsonl0% | 100% | 0% | 0% | 0%
yaml0% | 72% | 0% | 9% | 17%
js0% | 100% | 0% | 0% | 0%
java0% | 0% | 0% | 28% | 71%
cfg0% | 0% | 0% | 66% | 33%
html0% | 0% | 0% | 0% | 100%
toml0% | 0% | 0% | 0% | 100%
css0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
proto0% | 0% | 0% | 0% | 100%
c0% | 0% | 0% | 0% | 100%
jq0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
ROOT59% | 25% | 8% | 3% | 3%
_static0% | 92% | 0% | 0% | 7%
_templates0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
4205 -
xgboost_customer_churn_studio.ipynb
in archived/getting_started
3904 -
3701 -
sm-pipelines_preprocess_train_evaluate_batch_transform.ipynb
in ml_ops/sm-pipelines_preprocess_train_evaluate_batch_transform
3521 -
huggingface_sentiment_outputs.ipynb
in archived/huggingface_sentiment_classification
3518 -
deep_demand_forecast.ipynb
in archived/deep_demand_forecasting
3203 -
sm-finetuning_huggingface_with_your_own_scripts_and_data.ipynb
in generative_ai/sm-finetuning_huggingface_with_your_own_scripts_and_data
3070 -
sagemaker-huggingface-tgi-hosting-examples.ipynb
in archived/sagemaker-huggingface-tgi-hosting-examples
2888 -
huggingface-large-model-inference-santacoder.ipynb
in archived/huggingface-large-model-inference-santacoder
2573 -
sm-model_monitor_introduction.ipynb
in deploy_and_monitor/sm-model_monitor_introduction
2443 -
1_object_detection_preprocessing.ipynb
in archived/object_detection_with_tensorflow_and_tfrecords
2365 -
sentiment-analysis-tf-distributed-training-bringyourownscript.ipynb
in archived/sagemaker-debugger/tensorflow_nlp_sentiment_analysis
2267 -
churn_prediction_multimodality_of_text_and_tabular.ipynb
in archived/churn_prediction_multimodality_of_text_and_tabular
2217 -
sm-hyperparameter_tuning_pytorch.ipynb
in build_and_train_models/sm-hyperparameter_tuning_pytorch
2195 -
tensorflow2-california-housing-sagemaker-pipelines-deploy-endpoint.ipynb
in archived/tensorflow2-california-housing-sagemaker-pipelines-deploy-endpoint
2179 -
sm-pipelines_selective_execution.ipynb
in ml_ops/sm-pipelines_selective_execution
2177 -
credit_risk_explainability_inference_pipelines.ipynb
in archived/clarify-explainability-inference-pipelines
2122 -
hpo_huggingface_text_classification_20_newsgroups.ipynb
in archived/huggingface_multiclass_text_classification_20_newsgroups
2120 -
sm-ground_truth_object_detection_example.ipynb
in prepare_data/sm-ground_truth_object_detection_example
2114 -
sm-pipelines_callback_step.ipynb
in ml_ops/sm-pipelines_callback_step
2081 -
sm-pipelines_lambda_step.ipynb
in ml_ops/sm-pipelines_lambda_step
2047 -
2028 -
2025 -
sm-object_detection_birds.ipynb
in build_and_train_models/sm-object_detection_birds
1993 -
1991 -
sm-serverless_inference_huggingface_text_classification.ipynb
in deploy_and_monitor/sm-serverless_inference_huggingface_text_classification
1982 -
fraud_detection_using_deep_graph_neural_networks.ipynb
in archived/fraud_detection_using_graph_neural_networks
1971 -
huggingface_sentiment_parallel_batch.ipynb
in archived/sentiment_parallel_batch
1967 -
1964 -
1_cust_churn_dataprep.ipynb
in archived/customer_churn
1939 -
document_question_answering.ipynb
in archived/identify_key_insights_from_textual_document
1930 -
1924 -
sm-model_monitor_bias_and_explainability_monitoring.ipynb
in deploy_and_monitor/sm-model_monitor_bias_and_explainability_monitoring
1903 -
linear_learner_multi_model_endpoint_inf_pipeline.ipynb
in archived/multi_model_linear_learner_home_value
1902 -
document_entity_recognition.ipynb
in archived/identify_key_insights_from_textual_document
1896 -
pytorch_script_change_smdebug.ipynb
in archived/sagemaker-debugger/pytorch_model_debugging
1878 -
geospatial_pipeline_processing.ipynb
in archived/geospatial/geospatial_pipeline_processing
1874 -
1863 -
end_to_end_pipeline.ipynb
in archived/end_to_end_music_recommendation
1832 -
1826 -
sm-clarify_object_detection.ipynb
in responsible_ai/sm-clarify_object_detection
1824 -
sm-introduction_to_object2vec_sentence_similarity.ipynb
in build_and_train_models/sm-introduction_to_object2vec_sentence_similarity
1813 -
1801 -
1783 -
1761 -
huggingface-inference-recommender.ipynb
in archived/huggingface-inference-recommender
1749 -
sm-jumpstart_private_model_hub_import_llama3-8B.ipynb
in build_and_train_models/sm-jumpstart_private_model_hub_import
1740 -
1718 -
albert-base-v2.ipynb
in archived/albert-base-v2
1717 -
autopilot-models-serverless-inference.ipynb
in archived/autopilot-serverless-inference
1694 -
Files With Most Units (Top 50)
File# lines# units
experiment_manager.py
in archived/rl_gamerserver_ray/common/sagemaker_rl/orchestrator/workflow/manager
1383 41
modelling_RW.py
in archived/falcon
784 40
512 30
tokenizers.py
in archived/identify_key_insights_from_textual_document/containers/relationship_extraction/package/data
229 29
vit.py
in archived/single_gpu_single_node/scripts
323 26
vit.py
in archived/tensorflow_single_gpu_single_node/scripts
323 26
vit.py
in archived/vision-transformer/scripts
325 26
resource_manager.py
in archived/rl_gamerserver_ray/common/sagemaker_rl/orchestrator
415 25
checkpoints.py
in build_and_train_models/sm-distributed_model_parallel_v2/shared-scripts
684 24
model_db_client.py
in archived/rl_gamerserver_ray/common/sagemaker_rl/orchestrator/clients/ddb
140 24
create_vocab_proto.py
in archived/seq2seq_translation_en-de
361 23
train_bert.py
in archived/Text_Classification_BERT/scripts
207 22
entry_point.py
in archived/churn_prediction_multimodality_of_text_and_tabular/containers/huggingface_transformer_randomforest
401 21
join_manager.py
in archived/rl_gamerserver_ray/common/sagemaker_rl/orchestrator/workflow/manager
419 20
db.py
in archived/multi_modal_parallel_sagemaker_labeling_workflows_with_step_functions/src/lambda_src/shared
297 20
import-sagemaker-domain.py
in ml_ops/sm-datazone_import
533 20
experiment_db_client.py
in archived/rl_gamerserver_ray/common/sagemaker_rl/orchestrator/clients/ddb
118 19
coach_launcher.py
in archived/rl_gamerserver_ray/common/sagemaker_rl
230 18
utils.py
in archived/visualization
290 18
ground_truth_od.py
in prepare_data/sm-ground_truth_object_detection_example
191 18
mpi_launcher.py
in archived/rl_gamerserver_ray/common/sagemaker_rl
168 17
ray_launcher.py
in archived/rl_gamerserver_ray/common/sagemaker_rl
311 17
objects.py
in archived/identify_key_insights_from_textual_document/containers/relationship_extraction/package
115 17
main.py
in archived/multi_modal_parallel_sagemaker_labeling_workflows_with_step_functions/src/lambda_src/api_batch_create
192 16
environment.py
in archived/keras_bring_your_own/trainer
143 16
env_utils.py
in archived/rl_gamerserver_ray/common
152 15
model_record.py
in archived/rl_gamerserver_ray/common/sagemaker_rl/orchestrator/workflow/datatypes
121 15
join_db_client.py
in archived/rl_gamerserver_ray/common/sagemaker_rl/orchestrator/clients/ddb
102 15
ag_utils.py
in archived/autogluon-tabular/utils
199 14
vw_agent.py
in archived/bandits_recsys_movielens_testbed/src
234 14
mnist.py
in deploy_and_monitor/sm-batch_transform_pytorch/model-script
234 13
train.py
in archived/smp-train-gptj-sharded-data-parallel-tp
984 13
train.py
in archived/smp-train-gpt-neox-sharded-data-parallel
993 13
model_manager.py
in archived/rl_gamerserver_ray/common/sagemaker_rl/orchestrator/workflow/manager
340 13
train.py
in archived/smp-train-t5-sharded-data-parallel
1000 13
preprocessing_dataset.py
in archived/object_detection_with_tensorflow_and_tfrecords/preprocessing
227 13
train.py
in archived/smp-gpt-sharded-data-parallel
989 13
train.py
in archived/falcon
1004 13
trading_env.py
in archived/rl_stock_trading_coach_customEnv/src
137 13
train_data.py
in build_and_train_models/sm-heterogeneous_clusters_for_model_training/code
134 12
generate_data.py
in build_and_train_models/sm-introduction_to_ip_insights
152 12
VRP_abstract_env.py
in archived/rl_traveling_salesman_vehicle_routing_coach/src
285 12
VRP_env.py
in archived/rl_traveling_salesman_vehicle_routing_coach/src
269 12
sage_cluster_communicator.py
in archived/rl_gamerserver_ray/common/sagemaker_rl
120 12
docker_utils.py
in archived/rl_gamerserver_ray/common
112 12
runner.py
in archived/inference-benchmarking/benchmarking
371 12
env_setup.py
in archived/credit_card_fraud_detector
138 12
io_utils.py
in archived/bandits_statlog_vw_customEnv/src
112 12
mxnet.py
in archived/fraud_detection_using_graph_neural_networks/sagemaker_graph_fraud_detection/dgl_fraud_detection/model
142 12
pytorch.py
in archived/fraud_detection_using_graph_neural_networks/sagemaker_graph_fraud_detection/dgl_fraud_detection/model
135 12
Files With Long Lines (Top 50)

There are 692 files with lines longer than 120 characters. In total, there are 26745 long lines.

File# lines# units# long lines
4205 - 1297
3701 - 228
sm-pipelines_preprocess_train_evaluate_batch_transform.ipynb
in ml_ops/sm-pipelines_preprocess_train_evaluate_batch_transform
3521 - 158
sm-model_monitor_introduction.ipynb
in deploy_and_monitor/sm-model_monitor_introduction
2443 - 138
xgboost_customer_churn_studio.ipynb
in archived/getting_started
3904 - 129
1863 - 121
BERTtopic_extending_container.ipynb
in archived/pytorch_extend_container_train_deploy_bertopic
1395 - 119
sm-ground_truth_object_detection_example.ipynb
in prepare_data/sm-ground_truth_object_detection_example
2114 - 119
inference-pipeline.ipynb
in archived/inference_pipeline_custom_containers
1184 - 118
sm-marketplace_building_your_own_container_as_package.ipynb
in build_and_train_models/sm-marketplace_building_your_own_container_as_package
1436 - 115
1826 - 112
1404 - 100
sm-clarify_time_series_bring_your_own_model.ipynb
in responsible_ai/sm-clarify_time_series_bring_your_own_model
1465 - 97
1070 - 96
2025 - 95
SageMaker-Monitoring-Feature-Attribution-Drift-for-Endpoint.ipynb
in archived/fairness_and_explainability_jsonlines
1541 - 93
841 - 93
sm-introduction_to_ip_insights.ipynb
in build_and_train_models/sm-introduction_to_ip_insights
1148 - 92
huggingface_sentiment_outputs.ipynb
in archived/huggingface_sentiment_classification
3518 - 91
sm-pipelines_selective_execution.ipynb
in ml_ops/sm-pipelines_selective_execution
2177 - 89
2028 - 88
creative-writing-using-gpt-2-text-generation.ipynb
in archived/creative-writing-using-gpt-2-text-generation
1120 - 87
1783 - 87
sm-clarify_model_bias_monitor_batch_transform.ipynb
in deploy_and_monitor/sm-clarify_model_bias_monitor_batch_transform
1415 - 84
kohya-ss-fine-tuning.ipynb
in archived/text-to-image-fine-tuning
446 - 84
smp-train-gptj-sharded-data-parallel-tp.ipynb
in archived/smp-train-gptj-sharded-data-parallel-tp
1679 - 83
preprocessing-audio-data-using-a-machine-learning-model.ipynb
in archived/preprocessing-audio-data-using-a-machine-learning-model
965 - 83
evaluating_aws_marketplace_models_for_person_counting_use_case.ipynb
in archived/evaluating_aws_marketplace_models_for_person_counting_use_case
1593 - 83
deep_demand_forecast.ipynb
in archived/deep_demand_forecasting
3203 - 82
sm-ground_truth_video_quality_metrics.ipynb
in prepare_data/sm-ground_truth_video_quality_metrics
1468 - 82
sm-scikit_build_your_own_container.ipynb
in build_and_train_models/sm-scikit_build_your_own_container
718 - 79
rapids_sagemaker_hpo.ipynb
in archived/rapids_bring_your_own
1449 - 79
1531 - 79
LDA-Science.ipynb
in archived/scientific_details_of_algorithms/lda_topic_modeling
1154 - 78
Dashboard_SEC_Filings.ipynb
in archived/nlp_score_dashboard_sec
1131 - 78
sm-pipelines_lambda_step.ipynb
in ml_ops/sm-pipelines_lambda_step
2047 - 78
huggingface_sentiment_parallel_batch.ipynb
in archived/sentiment_parallel_batch
1967 - 77
sm-batch_transform_pca_dbscan_movie_clusters.ipynb
in deploy_and_monitor/sm-batch_transform_pca_dbscan_movie_clusters
1095 - 75
1_cust_churn_dataprep.ipynb
in archived/customer_churn
1939 - 75
OpenChat-streaming_tgi.ipynb
in archived/workshops
1159 - 75
859 - 75
1092 - 75
nlp_company_earnings_analysis_pipeline.ipynb
in archived/nlp_mlops_company_sentiment
1581 - 72
1029 - 72
gluon_recommender_system.ipynb
in archived/gluon_recommender_system
1046 - 72
sentiment-analysis-tf-distributed-training-bringyourownscript.ipynb
in archived/sagemaker-debugger/tensorflow_nlp_sentiment_analysis
2267 - 72
sm-jumpstart_private_model_hub_import_llama3-8B.ipynb
in build_and_train_models/sm-jumpstart_private_model_hub_import
1740 - 71
huggingface-inference-recommender.ipynb
in archived/huggingface-inference-recommender
1749 - 71
1098 - 71
tf-cloudwatch-inference-recommender.ipynb
in archived/tensorflow-cloudwatch
1635 - 70
Correlations

File Size vs. Commits (all time): 6 points

_static/kendrasearchtools.js x: 3 commits (all time) y: 512 lines of code _static/pagination.css x: 2 commits (all time) y: 14 lines of code _static/search_accessories.css x: 2 commits (all time) y: 27 lines of code _templates/search.html x: 2 commits (all time) y: 53 lines of code conf.py x: 12 commits (all time) y: 20 lines of code
512.0
lines of code
  min: 13.0
  average: 106.5
  25th percentile: 13.75
  median: 23.5
  75th percentile: 167.75
  max: 512.0
0 12.0
commits (all time)
min: 2.0 | average: 3.83 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 5.25 | max: 12.0

File Size vs. Contributors (all time): 6 points

_static/kendrasearchtools.js x: 2 contributors (all time) y: 512 lines of code _static/pagination.css x: 2 contributors (all time) y: 14 lines of code _static/search_accessories.css x: 2 contributors (all time) y: 27 lines of code _templates/search.html x: 2 contributors (all time) y: 53 lines of code conf.py x: 6 contributors (all time) y: 20 lines of code
512.0
lines of code
  min: 13.0
  average: 106.5
  25th percentile: 13.75
  median: 23.5
  75th percentile: 167.75
  max: 512.0
0 6.0
contributors (all time)
min: 2.0 | average: 2.67 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 3.0 | max: 6.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".