GitHunt

Holden Karau

holdenk

Holden Karau is a trans Canadian and an open source contributor. She is a Spark committer and co-author of Learning Spark, High Performance Spark, and Kubeflow for ML.

Open Source Big Data Dev
San Francisco, CA, USA

Languages

Scala 28%, Python 20%, Shell 16%, HTML 8%, Jupyter Notebook 8%, TypeScript 4%, Go 4%, Java 4%, JavaScript 4%, TeX 4%

Top Repositories

Repositories

293

holdenk/spark-testing-base

Base classes to use when writing tests with Spark

Scala · 1.5k · 355 · Updated 2 months ago

holdenk/spark-flowchart

Flowchart for debugging Spark applications

Shell · 10528 · Updated 1 year ago

holdenk/wyoming-moonshine-ext (Fork)

Wyoming protocol server for faster whisper speech to text system

Python · 0 · 0 · Updated 3 days ago

holdenk/points-plugin-expirement

Experiment of a unified shopping plugin because I'm lazy

TypeScript · 0 · 0 · Updated 3 days ago

holdenk/colo-scripts

No description provided.

Shell · 2 · 1 · Updated 4 days ago

holdenk/sparkling-pink-pandas-static

Static simpler version of the SPP website so I don't have to deal with it.

Python · 0 · 1 · Updated 6 days ago

holdenk/blahaj-church

No description provided.

HTML · 0 · 0 · Updated 6 days ago

holdenk/spp-event-bot

Event bot for publishing SPP events to matrix

Go · 0 · 0 · Updated 1 week ago

holdenk/high-performance-spark-2e-website

Marketing website for High Performance Spark 2e (ORM book)

0 · 0 · Updated 1 week ago

holdenk/spp-matrix

No description provided.

Shell · 0 · 0 · Updated 1 week ago

holdenk/learning-spark-examples

Examples for learning spark

Java · 333269 · Updated 10 years ago

holdenk/mydotfiles

My dotfiles. You probably don't care about this.

Shell · 2 · 0 · Updated 2 weeks ago

holdenk/elasticsearchspark

Elasticsearch on Spark

Scala · 11242 · Updated 11 years ago

holdenk/green-iot (Fork, Archived)

No description provided.

JavaScript · 0 · 0 · Updated 4 years ago

holdenk/spark (Fork)

Mirror of Apache Spark

Scala · 7 · 1 · Updated 3 days ago

holdenk/gearbrake

Gearbrake website

HTML · 0 · 0 · Updated 3 weeks ago

holdenk/remote-python-debugging-4-spark

Set up PDB on Spark

Jupyter Notebook · 9 · 0 · Updated 7 years ago

holdenk/intro-to-pyspark-demos

Examples from Holden's intro to PySpark workshop. This is an intro level workshop focused on using Spark with Python.

1510 · Updated 8 years ago

holdenk/sparkProjectTemplate.g8

Template for Spark Projects

Scala · 10341 · Updated 1 year ago
apachespark · g8 · spark

holdenk/VOXCOMM-intercom (Fork)

ESP based MESH intercom for motorcycle or general use

0 · 0 · Updated 3 months ago

holdenk/sdrangel (Fork)

SDR Rx/Tx software for Airspy, Airspy HF+, BladeRF, HackRF, LimeSDR, PlutoSDR, RTL-SDR, SDRplay and FunCube

0 · 0 · Updated 1 month ago

holdenk/firmware-investigate

Investigating Firmware

Python · 0 · 0 · Updated 5 months ago

holdenk/spark-upgrade

Magic to help Spark pipelines upgrade

Python · 3418 · Updated 1 year ago

holdenk/spark-validator

A library you can include in your Spark job to validate the counters and perform operations on success. The goal is Scala/Java/Python support.

Scala · 10823 · Updated 8 years ago

holdenk/spark-structured-streaming-ml

Structured Streaming Machine Learning example with Spark 2.0

Scala · 9451 · Updated 8 years ago

holdenk/resume

LaTeX resume

TeX · 4 · 2 · Updated 7 months ago

holdenk/distributedcomputing4kids

distributedcomputing4kids

Jupyter Notebook · 8 · 0 · Updated 2 years ago

holdenk/spark-expectations (Fork)

A Python library to support running data quality rules while the Spark job is running⚡

Python · 1 · 0 · Updated 2 years ago

holdenk/sparklens (Fork)

Qubole Sparklens tool for performance tuning Apache Spark

1 · 0 · Updated 1 year ago

holdenk/high-performance-spark-examples (Fork)

Examples for High Performance Spark

Scala · 166 · Updated 4 months ago
