selective-prediction

Here are 24 public repositories matching this topic...

goergen95 / seapig

Uncertainty based selection of compatible inputs

deep-learning pytorch remote-sensing uncertainty-estimation selective-prediction torchgeo geospatial-ai confidence-scoring

Updated Jun 12, 2026
Python

Dan23RR / snc-core

Star

Behavioral Trust Clustering a thermodynamic governance layer that reduces LLM hallucination by 52% on HumanEval. Drop-in wrapper for any decoder. MIT.

abstention openai-api selective-prediction humaneval llm ollama qwen hallucination-mitigation trust-calibration regulated-ai behavioral-clustering

Updated May 4, 2026
Python

Guardrails watch what AI says. REMORA governs what AI does. A pre-execution governance layer for AI agent tool calls: ACCEPT, VERIFY, ABSTAIN, ESCALATE, with policy, evidence, uncertainty, and an auditable DecisionEnvelope. Research-grade, open source.

Updated Jun 11, 2026
Python

cleverhans-lab / confidential-guardian

Star

We show that a model owner can artificially introduce uncertainty into their model and provide a corresponding detection mechanism.

machine-learning uncertainty calibration zero-knowledge rejection abstention selective-prediction

Updated Jun 2, 2025
Jupyter Notebook

JiajunChen223 / DegradeRisk-Seg

Star

DegradeRisk-Seg: risk-controlled semantic segmentation under degraded multi-modal remote-sensing observations

pytorch calibration remote-sensing semantic-segmentation multimodal-learning risk-control selective-prediction degradation-benchmark

Updated Jun 4, 2026
Python

Tharun2908 / mistral-medqa-abstention

Star

Reliable medical QA with Mistral-7B, QLoRA, selective prediction, and learned abstention via warm-start SFT + DPO.

mistral peft dpo huggingface abstention medical-qa reliable-ai selective-prediction llm medqa qlora llm-safety

Updated May 31, 2026
Python

HrxuAlbert / cherry-pick-override

Star

Code and data release for the paper 'Cherry-pick Override: Unsafe Directional Commitment in LLM Judges under Mixed Evidence'

nlp reproducibility fact-checking multi-agent-systems ai-safety conformal-prediction fact-verification abstention selective-prediction llm llm-evaluation llm-as-judge

Updated Jun 5, 2026
Python

cleverhans-lab / sc-gap

Star

Code for our paper analyzing the looseness of the upper bound on selective classification performance.

machine-learning uncertainty-quantification rejection abstention selective-classification selective-prediction

Updated Nov 18, 2025
Jupyter Notebook

AnwarDebes / RobTM

Star

Tsetlin Machines with a certificate on every answer: the exact number of feature flips a prediction survives, computed per sample, with predict-or-abstain when the radius is too small.

research adversarial-attacks interpretable-ml tsetlin-machine adversarial-robustness abstention certified-robustness selective-prediction

Updated Jun 12, 2026
Python

KonkovaElena / airi-summer-school-2026

Star

Reproducible MEDAI deferral simulation (AIRI 2026). Synthetic research code.

python machine-learning reproducible-research monte-carlo-simulation decision-support medical-informatics human-in-the-loop fair-principles expert-in-the-loop research-software uncertainty-calibration selective-prediction learning-to-defer bootstrap-statistics

Updated May 20, 2026
Python

Arutselvan / selective_prediction_mtl

Star

Investigation of how sampling strategies affect Selective Prediction performance in Multi Task Learning

nlp swag multi-task bert-fine-tuning snli-dataset selective-prediction

Updated Jan 4, 2022
Python

stelioszach03 / TrustQueryNet

Star

Trustworthy medical image classification: noise-robust ConvNeXt-Tiny with 83.5% accuracy, calibrated selective prediction, HAM10000 + ISIC 2019.

research computer-vision deep-learning pytorch medical-imaging calibration dermatology noisy-labels selective-prediction convnext

Updated Apr 17, 2026
Python

steverab / incerto

Star

A comprehensive library for uncertainty quantification in machine learning.

calibration uncertainty-quantification active-learning conformal-prediction out-of-distribution-detection distribution-shift selective-prediction llm

Updated May 17, 2026
Python

Estella-Hu / deepfake-detection-bayesian-uncertainty

Star

Deepfake detection with Bayesian uncertainty quantification, selective prediction, and an interactive Streamlit demo.

computer-vision pytorch uncertainty-quantification deepfake-detection streamlit selective-prediction bayesian-ml trustworth-ai

Updated Mar 18, 2026
Jupyter Notebook

musicofhel / confgate

Star

Free confidence gate for LLM correctness — logistic regression on (generation length, mean logprob), with cascade routing and split-conformal certificates. The pinned topo-confidence result.

machine-learning calibration cascade uncertainty-quantification confidence conformal-prediction selective-prediction llm