Repos
54
Stars
4
Forks
147
Top Language
Scala
Top Repositories
Connectors for Delta Lake
A benchmark for LLMs on complicated tasks in the terminal
ChatLLaMA: Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
ShadowMask is a platform to analyze sensitive big data.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Repositories
54
A benchmark for LLMs on complicated tasks in the terminal
ChatLLaMA: Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
Connectors for Delta Lake
ShadowMask is a platform to analyze sensitive big data.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
An Open Source Machine Learning Framework for Everyone
No description provided.
12 Lessons, Get Started Building with Generative AI: https://microsoft.github.io/generative-ai-for-beginners/
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Welcome to the AWS Code Examples Repository. This repo contains code examples used in the AWS documentation, AWS SDK Developer Guides, and more. For more information, see the Readme.md file below.
The low-level, core functionality of boto3 and the AWS CLI.
S3 Filesystem
AWS SDK for C++
Code and documentation to train Stanford's Alpaca models, and generate the data.
No description provided.
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO
No description provided.
Kyuubi is an enhanced edition of Apache Spark's primordial Thrift JDBC/ODBC Server.
Debugging Presto myself to learn Presto
Framework to quickly build and maintain Smart Data Lakes
No description provided.
Big Data Toolkit for the JVM
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Connectors for Delta Lake
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Mirror of Apache Spark
Deep Learning Pipelines for Apache Spark
A code-completion & code-comprehension server
MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database.