73 results for “topic:hive-metastore”
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
Apache Hive Metastore as a Standalone server in Docker
Reference Architectures for Datalakes on AWS
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
A client for connecting and running DDLs on hive metastore.
Service for automatically managing and cleaning up unreferenced data
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka
Dockerizing an Apache Spark Standalone Cluster
Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.
Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize and recommend app.
Apiary provides modules which can be combined to create a federated cloud data lake
Sample code with integration between Data Catalog and Hive data source.
Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.
"hms-mirror" is a utility used to bridge the gap between two clusters and migrate hive metadata.
Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store
Go Client for Hive Metastore
Apache Hive Metastore in Standalone Mode With Docker
A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service
A Python Client for Hive Metastore
Stand alone Thrift HMS client and benchmarking tools
Este projeto é um laboratório prático que implementa uma Pilha de Dados Moderna (MDS) usando containers Docker, projetado para aprendizado e experimentação com ferramentas open-source como MinIO (armazenamento S3), PostgreSQL, Apache Hive, Spark, Kyuubi, JupyterLab e Dremio.
This Project explains how to use HiveMetaStoreClient, HiveJdbcDriver, HiveServer2
Kubernetes Hive Minio connection example
PoC: s3 + hive metastore + presto
An example of how Presto can be configured to run on a desktop machine with the Hive Connector configured for an Azure Blob Storage account to query blob data using SQL.
Efficient Iceberg table management and distributed querying with Trino, Hive Metastore, MySQL, and MinIO.
Open-source composable lakehouse platform with Spark 4 + Delta Lake 4. Complete local environment ready in minutes via Docker Compose.
Deploying an analytics platform on kubernetes cluster