GitHunt
CH

chezou/molehill

Hivemall SQLs and digdag workflows generator

molehill

CircleCI

Combine Apache Hivemall(incubating) and digdag together.

Generate Hivemall queries and Digdag workflow for TreasureData from YAML file.

Installation

$ pip install git+https://github.com/chezou/molehill#egg=molehill

Usage

# will generate queries directory and titanic.dig
$ generate_workflow --overwrite -dest titanic.dig resources/titanic_pipeline.yml
$ td wf push proj-name
$ td wf start proj-name titanic --session now

Examples

Example YAML files can be found as follows:

The following YAMLs are example config files for Titanic survival prediction with Logistic Regression.

Languages

Python100.0%

Contributors

Apache License 2.0
Created February 7, 2019
Updated March 13, 2023