GitHunt
AL

allenai/smashed

SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.

No README found.

Languages

Python99.7%Shell0.3%

Contributors

Apache License 2.0
Created July 21, 2022
Updated October 10, 2025
allenai/smashed | GitHunt