53 results for “topic:trustworthy-machine-learning”
An open-source Python toolbox for backdoor attacks and defenses.
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
Open-source framework for uncertainty quantification in deep learning models in PyTorch 🌱
Neural Network Verification Software Tool
[ICML 2022 Long Talk] Official PyTorch implementation of "To Smooth or Not? When Label Smoothing Meets Noisy Labels"
A project that adds scalable, state-of-the-art out-of-distribution detection (open-set recognition) support by changing two lines of code! Performs efficient inference (i.e., no increase in inference time) and detection without a classification accuracy drop, hyperparameter tuning, or collecting additional data.
Papers and online resources related to machine learning fairness
[ICCV2021 Oral] Fooling LiDAR by Attacking GPS Trajectory
Papers related to Federated Learning in all top venues
PyTorch package to train and audit ML models for Individual Fairness
Welcome! 👋 This is the working draft of the Aalto Dictionary of Machine Learning (ADictML) — a growing collection of short, clear definitions for key terms in machine learning.
A project to improve out-of-distribution detection (open-set recognition) and uncertainty estimation by changing a few lines of code in your project! Performs efficient inference (i.e., no increase in inference time) without repetitive model training, hyperparameter tuning, or collecting additional data.
A list of research papers of explainable machine learning.
[NeurIPS 2025] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning"
Privacy-Preserving Machine Learning (PPML) Tutorial
SyReNN: Symbolic Representations for Neural Networks
Framework for Adversarial Malware Evaluation.
Trustworthy AI method based on Dempster-Shafer theory, applied to fetal brain 3D T2w MRI segmentation
A tool for comparing the predictions of any text classifiers.
Morphence: An implementation of a moving target defense against adversarial example attacks demonstrated for image classification models trained on MNIST and CIFAR-10.
MERLIN is a global, model-agnostic, contrastive explainer for any tabular or text classifier. It provides contrastive explanations of how the behaviour of two machine learning models differs.
[Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection
A project to train your model from scratch or fine-tune a pretrained model using the losses provided in this library to improve out-of-distribution detection and uncertainty estimation performance. Calibrate your model to produce better uncertainty estimates. Detect out-of-distribution data using the defined score type and threshold.
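The score-and-threshold recipe this entry describes can be sketched generically (this is not the library's actual API; the maximum-softmax-probability score and the threshold value below are illustrative assumptions):

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the class dimension.
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def msp_score(logits):
    """Maximum softmax probability: higher means more in-distribution."""
    return softmax(logits).max(axis=1)

def detect_ood(logits, threshold):
    """Flag inputs whose score falls below the threshold as out-of-distribution."""
    return msp_score(logits) < threshold

# Confident (in-distribution-like) vs. near-uniform (OOD-like) logits.
logits = np.array([[8.0, 0.5, 0.2],    # peaked -> high MSP score
                   [0.4, 0.5, 0.45]])  # flat -> low MSP score
flags = detect_ood(logits, threshold=0.6)
print(flags.tolist())  # [False, True]
```

In practice the threshold is chosen on held-out in-distribution data (e.g., to fix a false-positive rate), and libraries like the one above typically swap in stronger score functions than MSP.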
Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-tuning
Repository for the NeurIPS 2023 paper "Beyond Confidence: Reliable Models Should Also Consider Atypicality"
A School for All Seasons on Trustworthy Machine Learning
Initiating a paradigm shift in reporting and helping make ML advances more considerate of sustainability and trustworthiness.
TRIAGE: Characterizing and auditing training data for improved regression (NeurIPS 2023)
Implementation for the paper "Approximating full conformal prediction at scale via influence functions"
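For context on what this entry approximates: full conformal prediction is expensive, which is why the cheaper split-conformal variant is the common baseline. A minimal sketch of split conformal regression (generic textbook procedure, not this paper's influence-function method; the residuals are simulated):

```python
import numpy as np

def split_conformal_quantile(cal_residuals, alpha):
    """Quantile of calibration residuals giving >= 1 - alpha coverage.

    Uses the finite-sample-corrected level ceil((n + 1) * (1 - alpha)) / n.
    """
    n = len(cal_residuals)
    level = min(1.0, np.ceil((n + 1) * (1 - alpha)) / n)
    return np.quantile(cal_residuals, level)

rng = np.random.default_rng(0)
# Hypothetical calibration residuals |y - f(x)| from a held-out split.
residuals = np.abs(rng.normal(0.0, 1.0, size=500))
q = split_conformal_quantile(residuals, alpha=0.1)
# A prediction interval for a new input x is then [f(x) - q, f(x) + q].
print(q > 0)  # True
```

The appeal of the split approach is that the model is trained once; full conformal prediction instead refits (or, as in the paper above, approximates refitting) for every candidate label, which is what influence functions make tractable.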
Code from PLDI '21 paper "Provable Repair of Deep Neural Networks."