"topic:hive-metastore" — Search

End-to-end data platform: A PoC data platform project using a modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)

Python · 48 · 13 · Updated 1 year ago

adventureworks, airflow, data-pipeline, data-platform, dbt, delta-lake, docker-compose, end-to-end, hive-metastore, lightdash, spark, trino

gmrqs/lasagna

A Docker Compose template that builds an interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka

Jupyter Notebook · 47 · 16 · Updated 1 year ago

docker, docker-compose, hive-metastore, jupyter, jupyterlab, minio, pyspark, spark, spark-streaming, trino

Wittline/apache-spark-docker

Dockerizing an Apache Spark Standalone Cluster

VBA · 42 · 27 · Updated 3 years ago

apache-spark, dataengineer, dataengineering, docker, docker-compose, hadoop-cluster, hadoop-docker, hdfs, hive, hive-metastore, hue, pyspark

san089/Cloudera_Material

Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.

41 · 31 · Updated 5 years ago

big-data, bigdata, cca, cca175, certification, cloudera, flume, hadoop, hive, hive-metastore, pyspark, spark, sqoop, sqoop-export, sqoop-import, sqoop-session

harrydevforlife/building-lakehouse

Building a data lakehouse with open-source technology. Supports an end-to-end data pipeline, from source data on AWS S3 to the lakehouse, visualization, and a recommendation app.

Python · 39 · 8 · Updated 3 months ago

airflow, dbt, delta-lake, flask-api, hive-metastore, lakehouse, metabase, minio, python, s3, spark

ExpediaGroup/apiary

Apiary provides modules which can be combined to create a federated cloud data lake

37 · 10 · Updated 1 year ago

aws, datalake, hive, hive-metastore

GoogleCloudPlatform/datacatalog-connectors-hive

Sample code with integration between Data Catalog and Hive data source.

Python · 24 · 14 · Updated 1 year ago

analytics, apache-atlas, data-warehouse, datacatalog, gcp, hive, hive-metastore, metadata-management, python

ExpediaGroup/shunting-yard (Archived)

Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.

Java · 20 · 3 · Updated 4 years ago

big-data, circus-train, hive, hive-metastore, hive-table, replicate-data, replication

cloudera-labs/hms-mirror

"hms-mirror" is a utility used to bridge the gap between two clusters and migrate hive metadata.

Java · 18 · 10 · Updated 4 months ago

hive, hive-metastore

UrbanOS-Public/kdp

Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store

Dockerfile · 17 · 3 · Updated 3 years ago

hive-metastore, kubernetes, minio, prestodb

akolb1/gometastore

Go Client for Hive Metastore

Go · 14 · 19 · Updated 3 years ago

go, golang, hive, hive-metastore, hive-metastore-client, hms, metastore, rest-api, rest-client, thrift

criccomini/hive-metastore-standalone

Apache Hive Metastore in Standalone Mode With Docker

Dockerfile · 14 · 3 · Updated 1 year ago

docker, github-workflow, github-workflows, hadoop, hcatalog, hive, hive-metastore, presto, prestodb, trino, trinodb

ExpediaGroup/drone-fly

A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service

Java · 13 · 5 · Updated 3 days ago

hive, hive-metastore

criccomini/pymetastore

A Python Client for Hive Metastore

Python · 12 · 3 · Updated 4 months ago

data-engineering, hcatalog, hive, hive-metastore, python, thrift

akolb1/hclient

Stand alone Thrift HMS client and benchmarking tools

Java · 8 · 12 · Updated 7 years ago

benchmark-framework, benchmarking, hive, hive-metastore, hive-metastore-client, microbenchmarks, thrift-client

guaradata/spark-minio-delta-jupyter-dremio-lab

This project is a hands-on lab that implements a Modern Data Stack (MDS) using Docker containers, designed for learning and experimenting with open-source tools such as MinIO (S3 storage), PostgreSQL, Apache Hive, Spark, Kyuubi, JupyterLab, and Dremio.

Shell · 7 · 4 · Updated 9 months ago

dbeaver, delta-lake, docker, docker-compose, dremio, hive-metastore, jupyter-notebook, kyuubi, minio, modern-data-stack, portainer, spark

spider-123-eng/HiveMetaStoreClient

This project explains how to use HiveMetaStoreClient, HiveJdbcDriver, and HiveServer2

Java · 7 · 6 · Updated 8 years ago

hive-jdbc, hive-jdbc-driver, hive-jdbc-example, hive-metastore, hive-metastore-api, hive-metastore-client, hive-metastore-example, hiveserver2

AhmetFurkanDEMIR/minio-hive-example

Kubernetes Hive Minio connection example

Shell · 6 · 0 · Updated 2 years ago

apache-hive, hadoop, hive, hive-metastore, hive-server, k8s, kubernetes, kubernetes-cluster, kubernetes-deployment, minio, postgresql, s3, s3-bucket

zhenik-poc/big-data-stack-practice

PoC: s3 + hive metastore + presto

Makefile · 5 · 5 · Updated 5 years ago

hive, hive-metastore, hive-server, minio, postgres, presto, s3

benoutram/prestodb-hive-azure-storage

An example of how Presto can be configured to run on a desktop machine with the Hive Connector configured for an Azure Blob Storage account to query blob data using SQL.

Shell · 5 · 3 · Updated 5 years ago

azure-storage, hive, hive-metastore, presto, prestodb

yuhexiong/deploy-trino-iceberg-hive-metastore-minio-guide

Efficient Iceberg table management and distributed querying with Trino, Hive Metastore, MySQL, and MinIO.

Shell · 4 · 1 · Updated 1 year ago

apache-iceberg, hive-metastore, minio, trino

lucianomauda/FlumenData

Open-source composable lakehouse platform with Spark 4 + Delta Lake 4. Complete local environment ready in minutes via Docker Compose.

Python · 4 · 1 · Updated 4 months ago

apache-spark, data-engineering, delta-lake, docker-compose, educational, hive-metastore, jupyter, lakehouse, learning, minio, postgresql, superset, trino

sbakiu/analytics-platform-diy

Deploying an analytics platform on a Kubernetes cluster

Dockerfile · 4 · 0 · Updated 4 years ago

analytics, hive-metastore, hive-server, jupyter, jupyterlab, kubernetes, spark, trino
