"topic:trustworthiness" — Search

In this paper, we introduce SAShA, a new attack strategy that leverages semantic features extracted from a knowledge graph in order to strengthen the efficacy of the attack to standard CF models. We performed an extensive experimental evaluation in order to investigate whether SAShA is more effective than baseline attacks against CF models by taking into account the impact of various semantic features.

Python52Updated 4 years ago

knowledge-graphrecommender-systemsecuritysemantic-webshilling-attacktrustworthiness

jaiprakash1824/VLM_Adv_Attack

In the dynamic landscape of medical artificial intelligence, this study explores the vulnerabilities of the Pathology Language-Image Pretraining (PLIP) model, a Vision Language Foundation model, under targeted attacks like PGD adversarial attack.

Jupyter Notebook41Updated 1 year ago

adversarial-attacksattention-mechanismattention-visualizationclipcontrastive-learninghistopathology-image-classficationhistopathology-imagespathologypathology-imagepgd-adversarial-attacksplip-modelpytorchtrustworthinesstrustworthy-aitrustworthy-machine-learningvision-language-modelvision-transformervulnerability-detection

nmsa/tma-framework

Trustworthiness Monitoring & Assessment Framework

JavaScript35Updated 3 years ago

assessmentmonitoringtrustworthiness

rajdeep345/MTLTS

Codes and Datasets for our WSDM 2022 Paper: "MTLTS: A Multi-Task Framework To Obtain Trustworthy Summaries From Crisis-Related Microblogs"

Python32Updated 4 years ago

rumor-detectionsummarizationtrustworthinesstrustworthy-aiverification

Smendowski/data-embedding-and-visualization

Visualization and embedding of large datasets using various Dimensionality Reduction (DR) techniques such as t-SNE, UMAP, PaCMAP & IVHD. Implementation of custom metrics to assess DR quality with complete explaination and workflow.

Jupyter Notebook20Updated 3 years ago

autoencodersco-rankingdimensionalitydimiensionalitydrdrqualityisomapivhdknngainmdsmetricspacmappcareductionsheppardt-snetrustworthinessvisualization

J-

j-m/faktnews

Independent continuation of a project from AstonHack 2017

JavaScript20Updated 6 years ago

browser-extensionchromefake-newsfirefoxoperatrustworthiness

danielebifolco/CodeGenLink

CodeGenLink is a Visual Studio Code extension that interacts with GitHub Copilot Chat to generate code, analyze its origin, and identify the associated license.

TypeScript20Updated 8 months ago

code-provenancelicensingllmtrustworthiness

worldbank/pcn

Proof-Carrying Numbers (PCN): Trust is earned only by proof — the absence of a verification mark communicates uncertainty.

TypeScript21Updated 2 weeks ago

aiaifordatadataforaillmpcnproof-carrying-numberstrustworthiness

dshealthkdd/dshealth-2021

Website for health data science at KDD 2021

HTML12Updated 4 years ago

data-sciencehealthcarepaperstrustworthinessxai

A-

a-neti-neti/goemotions-eda-annotation-diagnostics

Emotion architecture from Reddit comments: rater behavior, semantic clusters, and contradiction mapping in GoEmotions.

Jupyter Notebook10Updated 7 months ago

annotation-qualitybias-detectiondata-cleaningedaemotion-analysisemotion-recognitiongoemotionslabel-noiselabelingmachine-learningnlppsychologypythonrater-agreementrater-reliabilityreddittf-idftrustworthinessunsupervised-learningword2vec

sensible-ki/sensible-ki.github.io

Secure and trustworthy mobile AI.

HTML11Updated 1 year ago

artificial-intelligencemobile-securitysecuritytrustworthiness

rishi-banerjee1/blindbench

Which LLM do you actually trust? Blind-test 100+ AI models with truth scoring and reasoning failure classification. No branding, no marketing — just data.

JavaScript10Updated 2 days ago

aiai-safetybenchmarkblind-testingchatgptclaudeevaluationgeminigpthallucination-detectionleaderboardllamallmmachine-learningopen-sourcereactreasoningsupabasesycophancytrustworthiness

tpertner/squeeze

Squeeze your model with pressure prompts to see if its behavior leaks.

Python10Updated 2 weeks ago

ai-safetyalignmentcalibrationevaluationhallucinationsllm-evalllm-evalsmetamorphic-testingprompt-engineeringquality-assurancereliabilitytrustworthiness

merrafelice/TAaMR

Proposal of a novel adversarial attack approach, called Target Adversarial Attack against Multimedia Recommender Systems (TAaMR), to investigate the modification of MR behavior when the images of a category of low recommended products (e.g., socks) are perturbed to misclassify the deep neural classifier towards the class of more recommended products (e.g., running shoes) with human-level slight images alterations.

Python02Updated 5 years ago

adversarial-attacksrecommender-systemsecuritytrustworthiness

esote/pof

Proof of Freshness: collate proof of an authorship date.

Go00Updated 5 years ago

golangsecuritytrustworthiness

eclipse-aerios/iota-tangle-peerer

Initializes IOTA tangle peering between the K8s nodes of an aeriOS K8s domain

Go00Updated 3 months ago

aeriosiota-node-peeringiota-tanglek8s-nodetrustworthiness

nmsa/tma-framework-k

Component K - Trustworthiness Monitoring & Assessment Framework

Java05Updated 3 years ago

knowledgemeasurementstrustworthiness

merrafelice/Assessing-Perceptual-and-Recommendation-Mutation-of-Adversarially-Poisoned-Visual-Recommenders

In this work, we provide 24 combinations of attack/defense strategies, and visual-based recommenders to 1) access performance alteration on recommendation and 2) empirically verify the effect on final users through offline visual metrics.

Python00Updated 5 years ago

adversarial-attacksdeep-learninghuman-in-the-looprecommender-systemtrustworthiness

eclipse-aerios/iota-tangle

Required files and configurations to bootstrap and operate an IOTA Tangle for trustworthiness management in aeriOS

Go Template00Updated 1 month ago

aeriosiota-tangletrustworthiness

nmsa/tma-framework-m

Component M - Trustworthiness Monitoring & Assessment Framework

Java05Updated 2 weeks ago

measurementsmonitoringtrustworthiness

FactCheck-AI/FactCheck-Exploration

[Frontend] -- Web-based platform enabling users to inspect every step involved in the RAG methodology for KG fact-checking process

HTML00Updated 1 year ago

knowledge-graphllmstrustworthiness

SESARLab/big-data-trustworthinessArchived

An Assurance Process for Big Data Trustworthiness - Marco Anisetti, Claudio A. Ardagna, Filippo Berto

Python00Updated 3 years ago

assurancebig-datatrustworthiness

eclipse-aerios/iota-messages-api

REST API to insert messages into an IOTA Tangle

Python00Updated 3 months ago

aeriosiota-messages-apiiota-tanglerest-apitrustworthiness

SnehaShukla937/TrustMIS

This repository is an implementation of the paper "Trustworthy Medical Image Segmentation with improved performance for in-distribution samples" published in Neural Networks.

Jupyter Notebook01Updated 2 years ago

deep-learningexplainabilityin-distributionmedical-image-segmentationtrustworthiness

Page 1 of 2