[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
A curated list of awesome academic research, books, code of ethics, data sets, institutes, newsletters, principles, podcasts, reports, tools, regulations and standards related to Responsible AI, Trustworthy AI, and Human-Centered AI.
🐢 Open-Source Evaluation & Testing for LLMs and ML models
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
AI-HCI research project studying the key factors that affect trust in an AI system's recommendations.
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Evaluation & testing framework for computer vision models
An open-source Python toolbox for backdoor attacks and defenses.
Code from PLDI '23 paper "Architecture-Preserving Provable Repair of Deep Neural Networks."
Code and data for the PoisonedRAG paper.
Code for paper "FreezeAsGuard: Mitigating Illegal Adaptation of Diffusion Models via Selective Tensor Freezing"
We make Generative AI accessible to Federal agencies and businesses. Easy-to-use ezGPT™ platform eliminates the need for in-house expertise and delivers pre-built solutions for rapid innovation. With security and privacy at its core, we unlock the potential of AI. Our innovative chatbot guides users, ensuring a smooth and successful experience.
Birhanu Eshete is an Associate Professor of Computer Science at the University of Michigan, Dearborn. His main research focus is trustworthy machine learning, with emphasis on security, safety, privacy, interpretability, fairness, and their interplay. He also studies online cybercrime and advanced persistent threats (APTs).
Breaking the Trilemma of Privacy, Utility, Efficiency via Controllable Machine Unlearning
In the dynamic landscape of medical artificial intelligence, this study explores the vulnerabilities of the Pathology Language-Image Pretraining (PLIP) model, a vision-language foundation model, under targeted attacks such as the PGD (projected gradient descent) adversarial attack.
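The PGD attack mentioned above iterates gradient-sign ascent on the loss while projecting the perturbed input back into an L-infinity ball around the clean input. A minimal sketch of the idea on a toy logistic-regression model (not the PLIP model from the study; the weights, step size, and radius here are illustrative assumptions):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bce_loss(w, x, y):
    """Binary cross-entropy loss of a logistic model p = sigmoid(w.x)."""
    p = sigmoid(w @ x)
    return -(y * np.log(p) + (1 - y) * np.log(1 - p))

def pgd_attack(w, x0, y, eps=0.1, alpha=0.02, steps=10):
    """PGD: repeatedly step in the sign of the input gradient to
    increase the loss, then project back into the eps-ball around x0."""
    x = x0.copy()
    for _ in range(steps):
        # Analytic input gradient of the BCE loss: (p - y) * w
        grad = (sigmoid(w @ x) - y) * w
        x = x + alpha * np.sign(grad)        # ascend the loss
        x = np.clip(x, x0 - eps, x0 + eps)   # L-infinity projection
    return x
```

For a deep model the analytic gradient is replaced by autograd (e.g. backpropagation through the network), but the step-and-project loop is unchanged.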
Optimization-based deep learning models can provide explainability with output guarantees and certificates of trustworthiness.