"topic:model-compression" — Search

363 results for “topic:model-compression”

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Python14.3k1.9kUpdated 11 hours ago

automated-machine-learningautomlbayesian-optimizationdata-sciencedeep-learningdeep-neural-networkdistributedfeature-engineeringhyperparameter-optimizationhyperparameter-tuningmachine-learningmachine-learning-algorithmsmlopsmodel-compressionnasneural-architecture-searchneural-networkpythonpytorchtensorflow

huawei-noah/Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

Python4.4k735Updated 16 hours ago

convolutional-neural-networksefficient-inferenceghostnetimagenetmodel-compressionpretrained-modelspytorchtensorflowtransformervision-transformer

dkozlov/awesome-knowledge-distillation

Awesome Knowledge Distillation

3.8k513Updated 1 day ago

co-trainingdeep-learningdistillationdistillation-modelkdknowldge-distillationknowledge-distillationknowledge-transfermodel-compressionmodel-distillationteacher-student

VainF/Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

Python3.3k374Updated 1 day ago

efficient-deep-learningllmmodel-compressionpruningtransformersvision

huawei-noah/Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Python3.2k643Updated 1 week ago

knowledge-distillationlarge-scale-distributedmodel-compressionpretrained-modelsquantization

Tencent/PocketFlow

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

Python2.9k492Updated 1 week ago

automlcomputer-visiondeep-learningmobile-appmodel-compression

FLHonker/Awesome-Knowledge-Distillation

Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。

2.7k335Updated 1 day ago

deep-learningdistillationkdknowldge-distillationmodel-compressiontransfer-learning

he-y/Awesome-Pruning

A curated list of neural network pruning resources.

2.5k332Updated 7 hours ago

awesome-listmodel-accelerationmodel-compressionpruning

Efficient-ML/Awesome-Model-Quantization

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

2.3k231Updated 5 hours ago

awesomebinarized-neural-networksbinary-networkdeep-learningefficient-deep-learninglightweight-neural-networkmodel-accelerationmodel-compressionmodel-quantizationquantization

666DZY666/micronet

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

Python2.3k477Updated 3 days ago

batch-normalization-fusebnnconvolutional-networksdorefagroup-convolutioninteger-arithmetic-onlymodel-compressionnetwork-in-networknetwork-slimmingneuromorphic-computingonnxpost-training-quantizationpruningpytorchquantizationquantization-aware-trainingtensorrttensorrt-int8-pythontwnxnor-net

haitongli/knowledge-distillation-pytorch

A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility

Python2.0k352Updated 1 week ago

cifar10computer-visiondark-knowledgedeep-neural-networksknowledge-distillationmodel-compressionpytorch

AberHu/Knowledge-Distillation-Zoo

Pytorch implementation of various Knowledge Distillation (KD) methods.

Python1.7k269Updated 1 week ago

distillationkdkd-methodsknowledge-distillationknowledge-transfermodel-compressionteacher-student

tensorflow/model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Python1.6k346Updated 2 weeks ago

compressiondeep-learningkerasmachine-learningmlmodel-compressionoptimizationpruningquantizationquantized-networksquantized-neural-networksquantized-trainingsparsitytensorflow

microsoft/NeuronBlocks

NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego

Python1.5k192Updated 1 week ago

artificial-intelligencedeep-learningdnnknowledge-distillationmodel-compressionnatural-language-processingpytorchqnaquestion-answeringsequence-labelingtext-classificationtext-matching

huawei-noah/Efficient-Computing

Efficient computing methods developed by Huawei Noah's Ark Lab

Jupyter Notebook1.3k220Updated 4 days ago

binary-neural-networksknowledge-distillationmodel-compressionpruningquantizationself-supervised

ethanhe42/channel-pruning

Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

Python1.1k308Updated 2 weeks ago

accelerationchannel-pruningdeep-neural-networksimage-classificationimage-recognitionmodel-compressionobject-detection

horseee/DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python95850Updated 2 days ago

diffusion-modelsefficient-inferencemodel-compressionstable-diffusiontraining-free

MingSun-Tse/Efficient-Deep-Learning

Collection of recent methods on (deep) neural network compression and acceleration.

955132Updated 1 week ago

deep-learningdeep-neural-networksefficient-deep-learningknowledge-distillationmodel-compressionnetwork-pruning

alibaba/TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Python872131Updated 2 days ago

deep-learningdeep-neural-networksmodel-compressionmodel-converterpost-training-quantizationpruningpytorchquantization-aware-training

guan-yuan/Awesome-AutoML-and-Lightweight-Models

A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hyperparameter Optimization, 5.) Automated Feature Engineering.

857160Updated 1 month ago

architecture-searchautomated-feature-engineeringautomlawesome-listhyperparameter-optimizationmeta-learningmodel-accelerationmodel-compressionnasneural-architecture-searchpytorchquantizationquantized-neural-networkquantized-trainingtensorflow

Zhen-Dong/Awesome-Quantization-Papers

List of papers related to neural network quantization in recent AI conferences and journals.

80659Updated 1 day ago

awesome-listdiffusion-modelsedge-computingefficient-inferencelarge-language-modelsmodel-compressionneural-networkspapersquantization

lhyfst/knowledge-distillation-papers

knowledge distillation papers

76787Updated 1 day ago

dark-knowledgeknowledge-distillationmodel-compressionpaperreading-list

SqueezeAILab/SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Python71349Updated 6 days ago

efficient-inferencelarge-language-modelsllamallmlocalllmmodel-compressionnatural-language-processingpost-training-quantizationquantizationsmall-modelstext-generationtransformer

cnkuangshi/LightCTR

Lightweight and Scalable framework that combines mainstream algorithms of Click-Through-Rate prediction based computational DAG, philosophy of Parameter Server and Ring-AllReduce collective communication.

C++671139Updated 1 week ago

computational-graphsdeep-learningdistributed-systemsfactorization-machinesmachine-learningmodel-compressionparameter-server

SforAiDl/KD_Lib

A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

Python65261Updated 5 days ago

algorithm-implementationsbenchmarkingdata-sciencedeep-learning-libraryknowledge-distillationmachine-learningmodel-compressionpruningpytorchquantization

he-y/filter-pruning-geometric-median

Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration (CVPR 2019 Oral)

Python618115Updated 4 days ago

model-compressionpruningpytorch

cedrickchee/awesome-ml-model-compression

Awesome machine learning model compression research papers, quantization, tools, and learning material.

53961Updated 5 days ago

awesome-listmachine-learningmodel-compressionneural-networkspruningquantization

iamhankai/ghostnet.pytorchArchived

[CVPR2020] GhostNet: More Features from Cheap Operations

Python537116Updated 1 month ago

convolutional-neural-networksfbnetmobilenetv3model-compressionpytorch

microsoft/archai

Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.

Python48393Updated 2 weeks ago

automated-machine-learningautomldartsdeep-learninghyperparameter-optimizationmachine-learningmodel-compressionnasneural-architecture-searchpetridishpythonpytorch

Zhen-Dong/HAWQ

Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.

Python45484Updated 3 days ago

4-bit8-bitdistillationefficient-neural-networkshardware-awarehessianmixed-precisionmodel-compressionpytorchquantizationquantized-neural-networkstensorcoretvm

Page 1 of 13