Fit interpretable models. Explain black-box machine learning.
ir_explain: a Python Library of Explainable IR Methods
For calculating global feature importance using Shapley values.
Scikit-learn-friendly library to interpret and prompt-engineer text datasets using large language models.
Explain a black-box module in natural language.
For the OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research. Open-sourced and constantly updated.
A game theoretic approach to explain the output of any machine learning model.
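The game-theoretic idea behind such explainers is the Shapley value: a feature's attribution is its average marginal contribution over all coalitions of the other features. A minimal, self-contained sketch of the exact computation (the toy model `f`, the zero baseline, and all function names are illustrative assumptions, not the library's actual API):

```python
from itertools import combinations
from math import factorial

def model(x0, x1):
    # Hypothetical toy model with an interaction term, for illustration only.
    return 2 * x0 + 3 * x1 + x0 * x1

def value(coalition, x, baseline=(0.0, 0.0)):
    # v(S): model output when only features in S are "present";
    # absent features are replaced by an assumed baseline of 0.
    inputs = [x[i] if i in coalition else baseline[i] for i in range(len(x))]
    return model(*inputs)

def shapley_values(x):
    # Exact Shapley values by enumerating every coalition S of the
    # other features, weighted by |S|! * (n - |S| - 1)! / n!.
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(len(others) + 1):
            for subset in combinations(others, k):
                s = frozenset(subset)
                weight = factorial(len(s)) * factorial(n - len(s) - 1) / factorial(n)
                phi[i] += weight * (value(s | {i}, x) - value(s, x))
    return phi

phi = shapley_values((1.0, 1.0))
# Efficiency property: attributions sum to f(x) minus f(baseline).
assert abs(sum(phi) - (model(1.0, 1.0) - model(0.0, 0.0))) < 1e-9
```

The interaction term `x0 * x1` is split evenly between the two features, so here `phi = [2.5, 3.5]`; practical libraries approximate this sum because exact enumeration is exponential in the number of features.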
Model interpretability and understanding for PyTorch
A curated list of awesome responsible machine learning resources.
The SINr approach to train interpretable word and graph embeddings
Knowledge Circuits in Pretrained Transformers
moDel Agnostic Language for Exploration and eXplanation
A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.
ReFT: Representation Finetuning for Language Models
Creating a PyTorch LSTM and Transformer to classify movies by genre and visualizing the LSTM's reasoning process
Small projects and theoretical material I used to get into Computer Vision with TensorFlow in a practical, efficient way.
graphpatch is a library for activation patching on PyTorch neural network models.
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)
Code for the paper "Towards Concept-based Interpretability of Skin Lesion Diagnosis using Vision-Language Models", ISBI 2024 (Oral).
Robust multimodal brain registration via keypoints