aws / aws-sdk-pandas
File Size

The distribution of file sizes (measured in lines of code).

File Size Overall
Lines per file | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
Share | 17% | 28% | 34% | 10% | 8%
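As a rough illustration of how such a size distribution could be derived, here is a minimal Python sketch. It assumes the percentages weight each bucket by lines of code rather than by file count (the report does not say which). The function names and the sample file names are hypothetical.

```python
# Size buckets taken from the report's legend: (label, lower bound, upper bound).
BUCKETS = [
    ("1001+", 1001, None),
    ("501-1000", 501, 1000),
    ("201-500", 201, 500),
    ("101-200", 101, 200),
    ("1-100", 1, 100),
]


def bucket_for(lines: int) -> str:
    """Return the legend label of the bucket a file of `lines` lines falls into."""
    for label, low, high in BUCKETS:
        if lines >= low and (high is None or lines <= high):
            return label
    raise ValueError(f"line count must be >= 1, got {lines}")


def size_distribution(line_counts: dict[str, int]) -> dict[str, float]:
    """Percentage of all lines of code that sit in each size bucket.

    Assumption: buckets are weighted by lines of code, not by number of files.
    """
    totals = {label: 0 for label, _, _ in BUCKETS}
    for lines in line_counts.values():
        totals[bucket_for(lines)] += lines
    grand_total = sum(totals.values())
    return {label: 100 * total / grand_total for label, total in totals.items()}


# Hypothetical input, not taken from the report.
example = {"a.py": 1200, "b.py": 600, "c.py": 150, "d.py": 50}
distribution = size_distribution(example)
for label, share in distribution.items():
    print(f"{label}: {share:.1f}%")
```

Weighting by file count instead would only require adding 1 per file rather than the file's line count.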


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
ipynb | 36% | 40% | 16% | 5% | <1%
py | 0% | 20% | 50% | 13% | 15%
pyi | 0% | 0% | 55% | 34% | 9%
toml | 0% | 0% | 0% | 100% | 0%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
tutorials | 36% | 40% | 16% | 5% | <1%
awswrangler | 0% | 19% | 50% | 14% | 15%
ROOT | 0% | 0% | 0% | 100% | 0%
Longest Files (Top 50)
File | # lines | # units
(name missing) | 1766 | -
(name missing) | 1637 | -
(name missing) | 1309 | -
(name missing) | 1170 | -
(name missing) | 1127 | -
(name missing) | 1098 | -
(name missing) | 998 | -
(name missing) | 900 | -
awswrangler/athena/_read.py | 848 | 18
(name missing) | 840 | -
awswrangler/catalog/_create.py | 765 | 17
awswrangler/_data_types.py | 737 | 33
(name missing) | 711 | -
awswrangler/athena/_utils.py | 693 | 23
(name missing) | 691 | -
(name missing) | 679 | -
(name missing) | 607 | -
awswrangler/_utils.py | 603 | 59
(name missing) | 596 | -
(name missing) | 587 | -
(name missing) | 575 | -
(name missing) | 546 | -
(name missing) | 543 | -
awswrangler/emr.py | 533 | 14
(name missing) | 529 | -
awswrangler/s3/_fs.py | 523 | 28
(name missing) | 500 | -
awswrangler/_config.py | 498 | 97
awswrangler/s3/_read_parquet.py | 489 | 10
awswrangler/athena/_write_iceberg.py | 487 | 9
awswrangler/s3/_write_text.py | 486 | 4
(name missing) | 482 | -
awswrangler/s3/_write_parquet.py | 481 | 10
(name missing) | 463 | -
awswrangler/redshift/_utils.py | 428 | 19
awswrangler/dynamodb/_read.py | 418 | 17
awswrangler/catalog/_get.py | 411 | 21
awswrangler/distributed/ray/datasources/arrow_parquet_datasource.py | 402 | 14
awswrangler/redshift/_write.py | 394 | 4
(name missing) | 394 | -
(name missing) | 382 | -
awswrangler/timestream/_write.py | 381 | 11
awswrangler/s3/_write.py | 380 | 12
awswrangler/s3/_write_orc.py | 378 | 9
(name missing) | 372 | -
awswrangler/postgresql.py | 363 | 17
awswrangler/oracle.py | 354 | 19
(name missing) | 349 | -
awswrangler/catalog/_definitions.py | 348 | 10
awswrangler/s3/_read.py | 336 | 17
Files With Most Units (Top 50)
File | # lines | # units
awswrangler/_config.py | 498 | 97
awswrangler/_utils.py | 603 | 59
awswrangler/_data_types.py | 737 | 33
awswrangler/_sql_formatter.py | 130 | 28
awswrangler/s3/_fs.py | 523 | 28
awswrangler/quicksight/_get_list.py | 229 | 24
awswrangler/athena/_utils.py | 693 | 23
awswrangler/catalog/_get.py | 411 | 21
awswrangler/data_api/rds.py | 319 | 20
awswrangler/oracle.py | 354 | 19
awswrangler/redshift/_utils.py | 428 | 19
awswrangler/athena/_read.py | 848 | 18
awswrangler/neptune/_client.py | 225 | 17
awswrangler/_databases.py | 326 | 17
awswrangler/dynamodb/_read.py | 418 | 17
awswrangler/dynamodb/_utils.py | 166 | 17
awswrangler/postgresql.py | 363 | 17
awswrangler/s3/_read.py | 336 | 17
awswrangler/catalog/_create.py | 765 | 17
awswrangler/neptune/_neptune.py | 296 | 16
awswrangler/athena/_read.pyi | 323 | 15
awswrangler/sqlserver.py | 304 | 15
awswrangler/mysql.py | 311 | 14
awswrangler/distributed/ray/datasources/arrow_parquet_datasource.py | 402 | 14
awswrangler/emr.py | 533 | 14
awswrangler/data_api/redshift.py | 165 | 14
awswrangler/opensearch/_utils.py | 257 | 14
awswrangler/opensearch/_write.py | 319 | 14
awswrangler/_distributed.py | 103 | 13
awswrangler/dynamodb/_read.pyi | 242 | 13
awswrangler/s3/_list.pyi | 111 | 13
awswrangler/s3/_read_parquet.pyi | 250 | 13
awswrangler/data_api/_connector.py | 99 | 12
awswrangler/s3/_write.py | 380 | 12
awswrangler/athena/_cache.py | 169 | 11
awswrangler/timestream/_write.py | 381 | 11
awswrangler/redshift/_read.pyi | 193 | 11
awswrangler/s3/_read_text.pyi | 199 | 10
awswrangler/s3/_write_parquet.py | 481 | 10
awswrangler/s3/_read_parquet.py | 489 | 10
awswrangler/catalog/_definitions.py | 348 | 10
awswrangler/catalog/_utils.py | 100 | 10
awswrangler/distributed/ray/modin/_utils.py | 93 | 9
awswrangler/quicksight/_create.py | 301 | 9
awswrangler/quicksight/_delete.py | 141 | 9
awswrangler/athena/_write_iceberg.py | 487 | 9
awswrangler/timestream/_read.py | 228 | 9
awswrangler/s3/_write_orc.py | 378 | 9
awswrangler/s3/_write_dataset.py | 231 | 9
awswrangler/s3/_list.py | 239 | 9
Files With Long Lines (Top 42)

There are 42 files with lines longer than 120 characters. In total, there are 191 long lines.

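File | # lines | # units | # long lines
(name missing) | 679 | - | 20
(name missing) | 231 | - | 12
(name missing) | 382 | - | 12
(name missing) | 1170 | - | 11
(name missing) | 998 | - | 10
(name missing) | 596 | - | 8
(name missing) | 482 | - | 8
(name missing) | 546 | - | 7
(name missing) | 1637 | - | 7
(name missing) | 122 | - | 7
(name missing) | 587 | - | 7
(name missing) | 607 | - | 6
(name missing) | 169 | - | 5
(name missing) | 106 | - | 5
(name missing) | 246 | - | 5
awswrangler/redshift/_utils.py | 428 | 19 | 4
(name missing) | 1127 | - | 4
(name missing) | 394 | - | 4
(name missing) | 178 | - | 4
(name missing) | 500 | - | 4
(name missing) | 529 | - | 4
awswrangler/mysql.py | 311 | 14 | 3
(name missing) | 900 | - | 3
(name missing) | 711 | - | 3
(name missing) | 1309 | - | 3
(name missing) | 1766 | - | 3
(name missing) | 543 | - | 3
(name missing) | 156 | - | 2
(name missing) | 72 | - | 2
(name missing) | 840 | - | 2
(name missing) | 159 | - | 2
(name missing) | 173 | - | 1
awswrangler/emr.py | 533 | 14 | 1
awswrangler/_data_types.py | 737 | 33 | 1
awswrangler/postgresql.py | 363 | 17 | 1
awswrangler/s3/_write.py | 380 | 12 | 1
(name missing) | 691 | - | 1
(name missing) | 463 | - | 1
(name missing) | 575 | - | 1
(name missing) | 186 | - | 1
(name missing) | 372 | - | 1
(name missing) | 284 | - | 1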
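The long-line figures above (files containing at least one line over 120 characters, and the total number of such lines) could be gathered along these lines. This is only a sketch: the helper names and the in-memory sample are hypothetical, and whether the report counts the trailing newline or expands tabs is not stated, so neither is done here.

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count lines whose length (excluding the line ending) exceeds `limit`."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def long_line_summary(sources: dict[str, str], limit: int = 120) -> tuple[int, int]:
    """Return (number of files with long lines, total number of long lines)."""
    per_file = {name: count_long_lines(text, limit) for name, text in sources.items()}
    files_with_long_lines = sum(1 for count in per_file.values() if count > 0)
    total_long_lines = sum(per_file.values())
    return files_with_long_lines, total_long_lines


# Hypothetical sample: one short file, one file with a 121-character line
# (counts as long) and a 120-character line (does not).
sample_sources = {
    "short.py": "x = 1\n",
    "long.py": "a" * 121 + "\n" + "b" * 120 + "\n",
}
files_hit, lines_hit = long_line_summary(sample_sources)
print(f"{files_hit} file(s) with long lines, {lines_hit} long line(s) in total")
```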
Correlations

File Size vs. Commits (all time): 192 points

File | Commits (all time) | Lines of code
awswrangler/athena/_cache.py | 14 | 169
awswrangler/s3/_read.py | 45 | 336
awswrangler/s3/_read_parquet.py | 171 | 489
awswrangler/s3/_read_text.py | 120 | 280
awswrangler/_data_types.py | 123 | 737
awswrangler/data_api/rds.py | 25 | 319
awswrangler/data_api/redshift.py | 21 | 165
awswrangler/redshift/_write.py | 9 | 394
awswrangler/oracle.py | 22 | 354
awswrangler/athena/_write_iceberg.py | 30 | 487
pyproject.toml | 175 | 173
awswrangler/timestream/_write.py | 5 | 381
awswrangler/__metadata__.py | 107 | 4
awswrangler/athena/_read.py | 165 | 848
awswrangler/athena/_utils.py | 92 | 693
awswrangler/catalog/_create.py | 70 | 765
awswrangler/s3/_read_orc.py | 21 | 295
awswrangler/s3/_write_orc.py | 23 | 378
awswrangler/s3/_write_parquet.py | 163 | 481
awswrangler/s3/_write_text.py | 157 | 486
tutorials/001 - Introduction.ipynb | 74 | 122
tutorials/007 - Redshift, MySQL, PostgreSQL, SQL Server, Oracle.ipynb | 54 | 231
tutorials/014 - Schema Evolution.ipynb | 72 | 482
tutorials/021 - Global Configurations.ipynb | 79 | 607
tutorials/022 - Writing Partitions Concurrently.ipynb | 74 | 186
tutorials/023 - Flexible Partitions Filter.ipynb | 70 | 543
tutorials/030 - Data Api.ipynb | 23 | 106
awswrangler/opensearch/_read.py | 16 | 103
awswrangler/opensearch/_write.py | 28 | 319
awswrangler/distributed/ray/datasources/arrow_parquet_datasource.py | 24 | 402
awswrangler/s3/_fs.py | 41 | 523
awswrangler/catalog/_get.py | 37 | 411
awswrangler/catalog/_utils.py | 25 | 100
awswrangler/_databases.py | 30 | 326
awswrangler/_config.py | 71 | 498
awswrangler/redshift/_utils.py | 7 | 428
awswrangler/_arrow.py | 13 | 97
awswrangler/_utils.py | 134 | 603
awswrangler/athena/_executions.py | 9 | 108
awswrangler/athena/_spark.py | 6 | 141
awswrangler/athena/_statements.py | 7 | 74
awswrangler/catalog/_add.py | 27 | 178
awswrangler/catalog/_delete.py | 20 | 86
awswrangler/chime.py | 6 | 19
awswrangler/cleanrooms/_read.py | 5 | 79
awswrangler/cleanrooms/_utils.py | 4 | 26
awswrangler/cloudwatch.py | 26 | 230
awswrangler/data_api/_connector.py | 8 | 99
awswrangler/data_quality/_get.py | 8 | 18
awswrangler/distributed/ray/_core.py | 14 | 97
awswrangler/dynamodb/_delete.py | 7 | 26
awswrangler/dynamodb/_read.py | 36 | 418
awswrangler/dynamodb/_write.py | 15 | 117
awswrangler/emr.py | 67 | 533
awswrangler/emr_serverless.py | 3 | 143
awswrangler/mysql.py | 34 | 311
awswrangler/neptune/_client.py | 8 | 225
awswrangler/neptune/_neptune.py | 15 | 296
awswrangler/opensearch/_utils.py | 19 | 257
awswrangler/postgresql.py | 36 | 363
awswrangler/quicksight/_cancel.py | 16 | 23
awswrangler/quicksight/_create.py | 28 | 301
awswrangler/quicksight/_delete.py | 19 | 141
awswrangler/quicksight/_describe.py | 17 | 88
awswrangler/quicksight/_get_list.py | 17 | 229
awswrangler/redshift/_connect.py | 3 | 92
awswrangler/redshift/_read.py | 7 | 230
awswrangler/s3/_copy.py | 34 | 137
awswrangler/s3/_delete.py | 35 | 83
awswrangler/s3/_describe.py | 32 | 94
awswrangler/s3/_download.py | 11 | 31
awswrangler/s3/_list.py | 41 | 239
awswrangler/s3/_read_deltalake.py | 14 | 52
awswrangler/s3/_read_excel.py | 17 | 33
awswrangler/s3/_select.py | 29 | 190
awswrangler/s3/_upload.py | 10 | 29
awswrangler/s3/_wait.py | 25 | 96
awswrangler/s3/_write_deltalake.py | 5 | 66
awswrangler/s3/_write_excel.py | 16 | 33
awswrangler/secretsmanager.py | 7 | 17
awswrangler/sqlserver.py | 29 | 304
awswrangler/sts.py | 10 | 13
awswrangler/timestream/_create.py | 3 | 46
awswrangler/timestream/_delete.py | 3 | 20
awswrangler/timestream/_read.py | 5 | 228
awswrangler/distributed/ray/modin/_utils.py | 21 | 93
awswrangler/distributed/ray/modin/s3/_write_dataset.py | 19 | 154
awswrangler/s3/_write_dataset.py | 48 | 231
awswrangler/distributed/ray/modin/s3/_read_orc.py | 5 | 43
awswrangler/distributed/ray/modin/s3/_read_parquet.py | 19 | 58
awswrangler/distributed/ray/modin/s3/_read_text.py | 20 | 145
awswrangler/s3/_read_parquet.pyi | 7 | 250
awswrangler/s3/_read_text.pyi | 5 | 199
awswrangler/typing.py | 16 | 58
awswrangler/s3/_write.py | 40 | 380
awswrangler/_sql_formatter.py | 6 | 130
tutorials/035 - Distributing Calls on Ray Remote Cluster.ipynb | 17 | 382
awswrangler/distributed/ray/datasources/arrow_csv_datasink.py | 3 | 33
awswrangler/distributed/ray/datasources/arrow_parquet_datasink.py | 3 | 66
awswrangler/distributed/ray/datasources/file_datasink.py | 3 | 73
awswrangler/distributed/ray/datasources/filename_provider.py | 1 | 37
awswrangler/distributed/ray/datasources/pandas_text_datasink.py | 4 | 77
awswrangler/distributed/ray/modin/s3/_write_orc.py | 6 | 61
awswrangler/distributed/ray/modin/s3/_write_parquet.py | 15 | 64
awswrangler/distributed/ray/modin/s3/_write_text.py | 16 | 109
awswrangler/__init__.py | 88 | 67
awswrangler/distributed/ray/_register.py | 27 | 86
awswrangler/exceptions.py | 72 | 38
awswrangler/distributed/ray/datasources/arrow_parquet_base_datasource.py | 7 | 50
awswrangler/athena/__init__.py | 23 | 60
tutorials/003 - Amazon S3.ipynb | 18 | 1766
tutorials/031 - OpenSearch.ipynb | 10 | 1637
awswrangler/_distributed.py | 17 | 103
awswrangler/_executor.py | 6 | 42
awswrangler/athena/_read.pyi | 8 | 323
awswrangler/distributed/ray/_executor.py | 8 | 33
awswrangler/distributed/ray/_utils.py | 9 | 27
awswrangler/distributed/ray/datasources/arrow_json_datasource.py | 5 | 36
awswrangler/distributed/ray/datasources/pandas_text_datasource.py | 14 | 135
awswrangler/distributed/ray/modin/_core.py | 8 | 36
awswrangler/distributed/ray/s3/_list.py | 4 | 56
awswrangler/s3/_list.pyi | 11 | 111
awswrangler/s3/_write_concurrent.py | 16 | 40
awswrangler/athena/_executions.pyi | 6 | 64
awswrangler/catalog/_definitions.py | 17 | 348
awswrangler/data_quality/_utils.py | 15 | 167
awswrangler/distributed/ray/_core.pyi | 6 | 23
awswrangler/distributed/ray/datasources/arrow_csv_datasource.py | 6 | 52
awswrangler/distributed/ray/s3/_read_orc.py | 2 | 24
awswrangler/dynamodb/_read.pyi | 4 | 242
awswrangler/neptune/_gremlin_parser.py | 5 | 53
awswrangler/quicksight/_utils.py | 15 | 29
awswrangler/redshift/_read.pyi | 4 | 193
awswrangler/s3/_read_text_core.py | 7 | 104
awswrangler/timestream/_read.pyi | 3 | 53
awswrangler/distributed/ray/__init__.py | 5 | 8
awswrangler/quicksight/__init__.py | 7 | 82
awswrangler/s3/__init__.py | 29 | 53
tutorials/017 - Partition Projection.ipynb | 16 | 900
tutorials/002 - Sessions.ipynb | 15 | 156
tutorials/004 - Parquet Datasets.ipynb | 13 | 546
tutorials/005 - Glue Catalog.ipynb | 16 | 711
tutorials/006 - Amazon Athena.ipynb | 25 | 529
tutorials/008 - Redshift - Copy & Unload.ipynb | 19 | 575
tutorials/009 - Redshift - Append, Overwrite, Upsert.ipynb | 13 | 394
tutorials/010 - Parquet Crawler.ipynb | 12 | 840
tutorials/011 - CSV Datasets.ipynb | 13 | 596
tutorials/012 - CSV Crawler.ipynb | 10 | 691
tutorials/013 - Merging Datasets on S3.ipynb | 10 | 500
tutorials/015 - EMR.ipynb | 10 | 184
tutorials/016 - EMR & Docker.ipynb | 11 | 349
tutorials/018 - QuickSight.ipynb | 12 | 1309
tutorials/019 - Athena Cache.ipynb | 17 | 998
tutorials/025 - Redshift - Loading Parquet files with Spectrum.ipynb | 11 | 463
tutorials/026 - Amazon Timestream.ipynb | 12 | 284
tutorials/027 - Amazon Timestream 2.ipynb | 11 | 1098
tutorials/028 - DynamoDB.ipynb | 10 | 372
tutorials/033 - Amazon Neptune.ipynb | 15 | 679
tutorials/037 - Glue Data Quality.ipynb | 3 | 1127
tutorials/038 - OpenSearch Serverless.ipynb | 3 | 587
tutorials/039 - Athena Iceberg.ipynb | 4 | 1170
awswrangler/_sql_utils.py | 1 | 18
tutorials/036 - Distributing Calls with Glue Interactive Sessions on Ray.ipynb | 5 | 169
awswrangler/cleanrooms/__init__.py | 1 | 6
tutorials/040 - EMR Serverless.ipynb | 2 | 246
tutorials/041 - Apache Spark on Amazon Athena.ipynb | 1 | 178
awswrangler/neptune/__init__.py | 5 | 26
awswrangler/distributed/__init__.py | 9 | 1
awswrangler/distributed/ray/modin/__init__.py | 2 | 4
awswrangler/data_quality/__init__.py | 2 | 14
tutorials/024 - Athena Query Metadata.ipynb | 6 | 159
Lines of code: min: 1.0 | average: 242.71 | 25th percentile: 46.0 | median: 132.5 | 75th percentile: 348.75 | max: 1766.0
Commits (all time): min: 1.0 | average: 23.16 | 25th percentile: 5.25 | median: 12.5 | 75th percentile: 23.75 | max: 175.0
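The report plots file size against commit count but does not state a correlation coefficient. One way a reader might quantify the relationship is Pearson's r, sketched below on the first ten (commits, lines of code) pairs listed for this plot. The `pearson` helper is illustrative and not part of the report's tooling.

```python
import math


def pearson(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation coefficient of two equally long sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return covariance / math.sqrt(var_x * var_y)


# First ten points of the "File Size vs. Commits (all time)" plot:
# (commits, lines of code).
points = [
    (14, 169), (45, 336), (171, 489), (120, 280), (123, 737),
    (25, 319), (21, 165), (9, 394), (22, 354), (30, 487),
]
commits = [float(c) for c, _ in points]
sizes = [float(s) for _, s in points]
r = pearson(commits, sizes)
print(f"Pearson r over {len(points)} sample points: {r:.3f}")
```

Ten points are only a sample; a meaningful figure would need the full data set.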

File Size vs. Contributors (all time): 192 points

File | Contributors (all time) | Lines of code
awswrangler/athena/_cache.py | 7 | 169
awswrangler/s3/_read.py | 11 | 336
awswrangler/s3/_read_parquet.py | 22 | 489
awswrangler/s3/_read_text.py | 12 | 280
awswrangler/_data_types.py | 25 | 737
awswrangler/data_api/rds.py | 6 | 319
awswrangler/data_api/redshift.py | 5 | 165
awswrangler/redshift/_write.py | 3 | 394
awswrangler/oracle.py | 7 | 354
awswrangler/athena/_write_iceberg.py | 9 | 487
pyproject.toml | 17 | 173
awswrangler/timestream/_write.py | 3 | 381
awswrangler/__metadata__.py | 8 | 4
awswrangler/athena/_read.py | 18 | 848
awswrangler/athena/_utils.py | 15 | 693
awswrangler/catalog/_create.py | 8 | 765
awswrangler/s3/_read_orc.py | 4 | 295
awswrangler/s3/_write_orc.py | 5 | 378
awswrangler/s3/_write_parquet.py | 17 | 481
awswrangler/s3/_write_text.py | 14 | 486
tutorials/001 - Introduction.ipynb | 10 | 122
tutorials/007 - Redshift, MySQL, PostgreSQL, SQL Server, Oracle.ipynb | 8 | 231
tutorials/014 - Schema Evolution.ipynb | 9 | 482
tutorials/021 - Global Configurations.ipynb | 11 | 607
tutorials/022 - Writing Partitions Concurrently.ipynb | 10 | 186
tutorials/023 - Flexible Partitions Filter.ipynb | 9 | 543
tutorials/030 - Data Api.ipynb | 8 | 106
awswrangler/opensearch/_read.py | 6 | 103
awswrangler/opensearch/_write.py | 8 | 319
awswrangler/distributed/ray/datasources/arrow_parquet_datasource.py | 5 | 402
awswrangler/s3/_fs.py | 11 | 523
awswrangler/catalog/_get.py | 12 | 411
awswrangler/catalog/_utils.py | 8 | 100
awswrangler/_databases.py | 8 | 326
awswrangler/_config.py | 9 | 498
awswrangler/redshift/_utils.py | 3 | 428
awswrangler/_arrow.py | 7 | 97
awswrangler/_utils.py | 15 | 603
awswrangler/athena/_executions.py | 3 | 108
awswrangler/athena/_spark.py | 4 | 141
awswrangler/athena/_statements.py | 3 | 74
awswrangler/catalog/_add.py | 8 | 178
awswrangler/catalog/_delete.py | 7 | 86
awswrangler/chime.py | 4 | 19
awswrangler/cleanrooms/_read.py | 2 | 79
awswrangler/cleanrooms/_utils.py | 3 | 26
awswrangler/data_api/_connector.py | 3 | 99
awswrangler/data_quality/_create.py | 5 | 168
awswrangler/data_quality/_get.py | 5 | 18
awswrangler/distributed/ray/_core.py | 3 | 97
awswrangler/dynamodb/_read.py | 6 | 418
awswrangler/dynamodb/_write.py | 6 | 117
awswrangler/emr.py | 12 | 533
awswrangler/emr_serverless.py | 2 | 143
awswrangler/mysql.py | 11 | 311
awswrangler/neptune/_client.py | 3 | 225
awswrangler/neptune/_neptune.py | 6 | 296
awswrangler/neptune/_utils.py | 4 | 71
awswrangler/opensearch/_utils.py | 7 | 257
awswrangler/postgresql.py | 10 | 363
awswrangler/quicksight/_cancel.py | 4 | 23
awswrangler/quicksight/_create.py | 8 | 301
awswrangler/quicksight/_delete.py | 5 | 141
awswrangler/quicksight/_describe.py | 4 | 88
awswrangler/quicksight/_get_list.py | 4 | 229
awswrangler/redshift/_connect.py | 1 | 92
awswrangler/redshift/_read.py | 2 | 230
awswrangler/s3/_copy.py | 9 | 137
awswrangler/s3/_delete.py | 10 | 83
awswrangler/s3/_describe.py | 9 | 94
awswrangler/s3/_download.py | 6 | 31
awswrangler/s3/_list.py | 11 | 239
awswrangler/s3/_read_deltalake.py | 6 | 52
awswrangler/s3/_read_excel.py | 8 | 33
awswrangler/s3/_select.py | 5 | 190
awswrangler/s3/_upload.py | 5 | 29
awswrangler/s3/_write_deltalake.py | 2 | 66
awswrangler/sqlserver.py | 11 | 304
awswrangler/sts.py | 4 | 13
awswrangler/timestream/_create.py | 1 | 46
awswrangler/timestream/_delete.py | 1 | 20
awswrangler/timestream/_list.py | 2 | 24
awswrangler/timestream/_read.py | 2 | 228
awswrangler/distributed/ray/modin/_data_types.py | 3 | 19
awswrangler/distributed/ray/modin/_utils.py | 6 | 93
awswrangler/distributed/ray/modin/s3/_write_dataset.py | 5 | 154
awswrangler/s3/_write_dataset.py | 9 | 231
awswrangler/distributed/ray/modin/s3/_read_orc.py | 2 | 43
awswrangler/distributed/ray/modin/s3/_read_parquet.py | 6 | 58
awswrangler/s3/_read_parquet.pyi | 3 | 250
awswrangler/s3/_read_text.pyi | 2 | 199
awswrangler/typing.py | 4 | 58
awswrangler/s3/_write.py | 11 | 380
awswrangler/_sql_formatter.py | 2 | 130
awswrangler/distributed/ray/datasources/__init__.py | 4 | 33
awswrangler/distributed/ray/datasources/arrow_csv_datasink.py | 2 | 33
awswrangler/distributed/ray/datasources/file_datasink.py | 2 | 73
awswrangler/distributed/ray/datasources/filename_provider.py | 1 | 37
awswrangler/distributed/ray/modin/s3/_write_orc.py | 3 | 61
awswrangler/distributed/ray/modin/s3/_write_parquet.py | 6 | 64
awswrangler/distributed/ray/modin/s3/_write_text.py | 6 | 109
awswrangler/__init__.py | 14 | 67
awswrangler/distributed/ray/_register.py | 5 | 86
awswrangler/exceptions.py | 10 | 38
awswrangler/distributed/ray/datasources/arrow_parquet_base_datasource.py | 4 | 50
awswrangler/athena/__init__.py | 9 | 60
tutorials/003 - Amazon S3.ipynb | 10 | 1766
tutorials/031 - OpenSearch.ipynb | 6 | 1637
awswrangler/_executor.py | 3 | 42
awswrangler/athena/_read.pyi | 2 | 323
awswrangler/distributed/ray/_executor.py | 3 | 33
awswrangler/distributed/ray/datasources/arrow_json_datasource.py | 3 | 36
awswrangler/distributed/ray/datasources/pandas_text_datasource.py | 5 | 135
awswrangler/distributed/ray/modin/_core.py | 4 | 36
awswrangler/distributed/ray/s3/_list.py | 2 | 56
awswrangler/s3/_list.pyi | 5 | 111
awswrangler/s3/_write_concurrent.py | 5 | 40
awswrangler/catalog/_definitions.py | 6 | 348
awswrangler/distributed/ray/datasources/arrow_csv_datasource.py | 3 | 52
awswrangler/distributed/ray/s3/_read_orc.py | 1 | 24
awswrangler/dynamodb/_read.pyi | 1 | 242
awswrangler/redshift/_read.pyi | 2 | 193
awswrangler/s3/_read_text_core.py | 4 | 104
awswrangler/timestream/_read.pyi | 2 | 53
awswrangler/distributed/ray/__init__.py | 2 | 8
awswrangler/s3/__init__.py | 8 | 53
tutorials/017 - Partition Projection.ipynb | 8 | 900
tutorials/002 - Sessions.ipynb | 7 | 156
tutorials/004 - Parquet Datasets.ipynb | 6 | 546
tutorials/005 - Glue Catalog.ipynb | 7 | 711
tutorials/006 - Amazon Athena.ipynb | 8 | 529
tutorials/008 - Redshift - Copy & Unload.ipynb | 6 | 575
tutorials/009 - Redshift - Append, Overwrite, Upsert.ipynb | 6 | 394
tutorials/010 - Parquet Crawler.ipynb | 6 | 840
tutorials/011 - CSV Datasets.ipynb | 6 | 596
tutorials/012 - CSV Crawler.ipynb | 6 | 691
tutorials/013 - Merging Datasets on S3.ipynb | 6 | 500
tutorials/015 - EMR.ipynb | 7 | 184
tutorials/018 - QuickSight.ipynb | 8 | 1309
tutorials/019 - Athena Cache.ipynb | 8 | 998
tutorials/025 - Redshift - Loading Parquet files with Spectrum.ipynb | 7 | 463
tutorials/026 - Amazon Timestream.ipynb | 9 | 284
tutorials/027 - Amazon Timestream 2.ipynb | 7 | 1098
tutorials/028 - DynamoDB.ipynb | 6 | 372
tutorials/033 - Amazon Neptune.ipynb | 7 | 679
tutorials/037 - Glue Data Quality.ipynb | 2 | 1127
tutorials/038 - OpenSearch Serverless.ipynb | 2 | 587
tutorials/039 - Athena Iceberg.ipynb | 3 | 1170
tutorials/036 - Distributing Calls with Glue Interactive Sessions on Ray.ipynb | 2 | 169
awswrangler/cleanrooms/__init__.py | 1 | 6
tutorials/040 - EMR Serverless.ipynb | 2 | 246
tutorials/041 - Apache Spark on Amazon Athena.ipynb | 1 | 178
awswrangler/redshift/__init__.py | 2 | 14
awswrangler/distributed/__init__.py | 3 | 1
awswrangler/distributed/ray/s3/__init__.py | 2 | 1
tutorials/020 - Spark Table Interoperability.ipynb | 6 | 72
Lines of code: min: 1.0 | average: 242.71 | 25th percentile: 46.0 | median: 132.5 | 75th percentile: 348.75 | max: 1766.0
Contributors (all time): min: 1.0 | average: 5.65 | 25th percentile: 3.0 | median: 5.0 | 75th percentile: 8.0 | max: 25.0

File Size vs. Commits (30 days): 5 points

File | Commits (30d) | Lines of code
awswrangler/athena/_cache.py | 1 | 169
awswrangler/s3/_read.py | 1 | 336
awswrangler/s3/_read_parquet.py | 1 | 489
awswrangler/s3/_read_text.py | 1 | 280
awswrangler/_data_types.py | 1 | 737
Lines of code: min: 169.0 | average: 402.2 | 25th percentile: 224.5 | median: 336.0 | 75th percentile: 613.0 | max: 737.0
Commits (30d): min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0
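The lines-of-code summary for this five-point plot can be reproduced with Python's standard library. The quartiles in the report appear to match the "exclusive" interpolation rule (position p·(n+1)); that is an inference from the numbers, not something the report documents. The sketch below uses the five file sizes listed for the 30-day plot.

```python
import statistics

# Lines of code of the five files changed in the last 30 days (from the list above).
sizes = [169, 336, 489, 280, 737]

# statistics.quantiles with n=4 returns the three quartile cut points.
# method="exclusive" is assumed here because it reproduces the report's figures.
q1, median, q3 = statistics.quantiles(sizes, n=4, method="exclusive")
average = statistics.mean(sizes)

print(f"min: {min(sizes)} | average: {average} | 25th percentile: {q1} | "
      f"median: {median} | 75th percentile: {q3} | max: {max(sizes)}")
```

With `method="inclusive"` the 25th and 75th percentiles would come out as 280 and 489 instead, which does not match the report.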

File Size vs. Contributors (30 days): 5 points

File | Contributors (30d) | Lines of code
awswrangler/athena/_cache.py | 1 | 169
awswrangler/s3/_read.py | 1 | 336
awswrangler/s3/_read_parquet.py | 1 | 489
awswrangler/s3/_read_text.py | 1 | 280
awswrangler/_data_types.py | 1 | 737
Lines of code: min: 169.0 | average: 402.2 | 25th percentile: 224.5 | median: 336.0 | 75th percentile: 613.0 | max: 737.0
Contributors (30d): min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

File Size vs. Commits (90 days): 10 points

File | Commits (90d) | Lines of code
awswrangler/athena/_cache.py | 1 | 169
awswrangler/s3/_read.py | 1 | 336
awswrangler/s3/_read_parquet.py | 2 | 489
awswrangler/s3/_read_text.py | 1 | 280
awswrangler/_data_types.py | 2 | 737
awswrangler/data_api/rds.py | 1 | 319
awswrangler/data_api/redshift.py | 1 | 165
awswrangler/redshift/_write.py | 1 | 394
awswrangler/oracle.py | 1 | 354
awswrangler/athena/_write_iceberg.py | 2 | 487
Lines of code: min: 165.0 | average: 373.0 | 25th percentile: 252.25 | median: 345.0 | 75th percentile: 487.5 | max: 737.0
Commits (90d): min: 1.0 | average: 1.3 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 2.0

File Size vs. Contributors (90 days): 10 points

File | Contributors (90d) | Lines of code
awswrangler/athena/_cache.py | 1 | 169
awswrangler/s3/_read.py | 1 | 336
awswrangler/s3/_read_parquet.py | 2 | 489
awswrangler/s3/_read_text.py | 1 | 280
awswrangler/_data_types.py | 2 | 737
awswrangler/data_api/rds.py | 1 | 319
awswrangler/data_api/redshift.py | 1 | 165
awswrangler/redshift/_write.py | 1 | 394
awswrangler/oracle.py | 1 | 354
awswrangler/athena/_write_iceberg.py | 1 | 487
Lines of code: min: 165.0 | average: 373.0 | 25th percentile: 252.25 | median: 345.0 | 75th percentile: 487.5 | max: 737.0
Contributors (90d): min: 1.0 | average: 1.2 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.25 | max: 2.0