The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions
Main Code: 6,408 LOC (34 files) = JAVA (100%) | |||
Duplication: 40% | |||
File Size: 63% long (>1000 LOC), 18% short (<= 200 LOC) | |||
Unit Size: 0% long (>100 LOC), 76% short (<= 10 LOC) | |||
Conditional Complexity: 0% complex (McCabe index > 50), 83% simple (McCabe index <= 5) | |||
|
Logical Component Decomposition: primary (9 components) | ||
|
3 years old
|
|
|
|
0% of code updated more than 50 times Also see temporal dependencies for files frequently changed in same commits. |
|
|
|
Goals: Keep the system simple and easy to change (4) |
|
|
Features of interest:
TODOs
4 files |
|
Latest commit date: 2021-11-29
0
commits
(30 days)
0
contributors
(30 days) |
|
generated by sokrates.dev (configuration) on 2022-01-31