Updated 9 months ago
xfinder
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
Updated 9 months ago
CalibrationAnalysis
Multi-language suite for analyzing calibration of probabilistic predictive models.
Updated 9 months ago
pycalibration
Estimation and hypothesis tests of calibration in Python using CalibrationErrors.jl and CalibrationTests.jl.
Updated 9 months ago
SurvivalSignature
Computation and numerical approximation of survival signatures.
Updated 9 months ago
awesome-safety-critical-ai
When the stakes are high, intelligence is only half the equation - reliability is the other ⚠️