172 results for “topic:sparsity”
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
PyTorch native quantization and sparsity for training and inference
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
PaddleSlim is an open-source library for deep model compression and architecture search.
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Neural Network Compression Framework for enhanced OpenVINO™ inference
Network Slimming (PyTorch) (ICCV 2017)
A more readable and flexible YOLOv5 with additional backbones (GCN, ResNet, ShuffleNet, MobileNet, EfficientNet, HRNet, Swin Transformer, etc.), add-on modules (CBAM, DCN, and so on), and TensorRT support
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Caffe for Sparse and Low-rank Deep Neural Networks
An innovative library for efficient LLM inference via low-bit quantization
Reference ImageNet implementation of the SelecSLS CNN architecture proposed in the SIGGRAPH 2020 paper "XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera". The repository also includes code for pruning the model based on the implicit sparsity emerging from adaptive gradient descent methods, as detailed in the CVPR 2019 paper "On Implicit Filter Level Sparsity in Convolutional Neural Networks".
Sparse Optimisation Research Code
Always sparse. Never dense. But never say never. A sparse-training repository for the Adaptive Sparse Connectivity concept and its algorithmic instantiation, Sparse Evolutionary Training, to boost Deep Learning scalability in several respects (e.g., memory and computational-time efficiency, representation and generalization power).
FasterAI: Prune and Distill your models with FastAI and PyTorch
[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference
Sparse and structured neural attention mechanisms
Sparse inference for transformer-based LLMs
A Python library for gene–environment interaction analysis via deep learning
Learning both Weights and Connections for Efficient Neural Networks, https://arxiv.org/abs/1506.02626 (see the magnitude-pruning sketch after this list)
A research library for PyTorch-based neural network pruning, compression, and more.
Zero-label image classification via OpenCLIP knowledge distillation
🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
Soft Threshold Weight Reparameterization for Learnable Sparsity (see the soft-thresholding sketch after this list)
Official PyTorch implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
Sparse Recurrent Neural Networks -- Pruning Connections and Hidden Sizes (TensorFlow)
Fast operator-overloading Jacobian & Hessian sparsity pattern detection.
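
Two of the entries above name techniques concrete enough that a short sketch clarifies them. First, for "Learning both Weights and Connections for Efficient Neural Networks" (arXiv:1506.02626), here is a minimal one-shot magnitude-pruning sketch in PyTorch. The single global threshold, the restriction to Linear/Conv2d layers, and the omission of the paper's retrain-and-iterate loop are illustrative assumptions, not the paper's exact recipe.

```python
import torch
import torch.nn as nn

def magnitude_prune(model: nn.Module, sparsity: float = 0.9) -> nn.Module:
    """One-shot global magnitude pruning (sketch, not the full Han et al. pipeline)."""
    # Collect |w| from all prunable layers to pick one global threshold.
    prunable = [m for m in model.modules() if isinstance(m, (nn.Linear, nn.Conv2d))]
    all_w = torch.cat([m.weight.detach().abs().flatten() for m in prunable])
    # Threshold = k-th smallest |w|, so roughly `sparsity` of weights fall below it.
    k = max(1, int(sparsity * all_w.numel()))
    threshold = all_w.kthvalue(k).values
    with torch.no_grad():
        for m in prunable:
            mask = (m.weight.abs() > threshold).to(m.weight.dtype)
            m.weight.mul_(mask)  # zero out weights at or below the threshold
    return model
```

The paper follows pruning with retraining (and repeats the cycle); in practice the zeroed entries would also need a fixed mask during fine-tuning so they stay zero.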
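Second, for "Soft Threshold Weight Reparameterization for Learnable Sparsity" (STR), a minimal sketch of the core idea: the effective weight is sign(W) · relu(|W| − sigmoid(s)) with a learnable threshold s, so sparsity emerges during ordinary training rather than being imposed afterward. The per-layer scalar threshold, its initialization, and the plain linear host layer are assumptions for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SoftThresholdLinear(nn.Module):
    """Linear layer whose effective weight is soft-thresholded (STR-style sketch)."""

    def __init__(self, in_features: int, out_features: int, s_init: float = -5.0):
        super().__init__()
        self.weight = nn.Parameter(torch.empty(out_features, in_features))
        self.bias = nn.Parameter(torch.zeros(out_features))
        # Learnable threshold parameter; s_init is an illustrative assumption.
        self.s = nn.Parameter(torch.tensor(s_init))
        nn.init.kaiming_uniform_(self.weight, a=5 ** 0.5)

    def effective_weight(self) -> torch.Tensor:
        # Entries with |W| below sigmoid(s) become exactly zero;
        # the rest shrink toward zero by the same amount.
        return torch.sign(self.weight) * F.relu(self.weight.abs() - torch.sigmoid(self.s))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return F.linear(x, self.effective_weight(), self.bias)
```

Because the threshold is a parameter, weight decay on s pushes the layer toward higher sparsity during training, which is the mechanism that makes the sparsity level learnable rather than hand-tuned.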