Carlos Costa
epilif1017a
Director Data & AI @ adidas & Invited Professor @ University of Minho
Top Repositories
The definitive open source big data operating system.
Code and Documents related to the SSB+ Benchmark
Framework and templates to build a data lakehouse from scratch.
Ansible playbook to install a Kerberized Hortonworks Hadoop cluster following some of the good practices from the documentation (e.g., Ambari as non-root, dedicated MySQL server, encrypted Ambari database)
The Lakehouse Engine is a configuration-driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows, and utilities for Data Products.
Repositories (8)
Framework and templates to build a data lakehouse from scratch.
The definitive open source big data operating system.
The Lakehouse Engine is a configuration-driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows, and utilities for Data Products.
Code and Documents related to the SSB+ Benchmark
Ansible playbook to install a Kerberized Hortonworks Hadoop cluster following some of the good practices from the documentation (e.g., Ambari as non-root, dedicated MySQL server, encrypted Ambari database)
No description provided.
No description provided.
O'Neil et al.'s Star Schema Benchmark