Recent Releases of grouse
grouse - 0.2.0
Added
- Register Fireworks Llama 3.1 8b and 70b prices with litellm to better support these models as evaluators.
Changed
- Remove
instructorpackage to better understand what is really sent to the LLM. All the LLM generations are simply done usinglitellm.acompletion. - Removed black from justfile and dev dependencies as ruff plays the same role.
- Python
Published by antonioloison over 1 year ago
grouse - 0.1.0
Added
- Created
GroundedQAEvaluatorthat evaluates four metrics per sample: answer relevancy, completeness, faithfulness, usefulness, negative rejection and positive acceptance. - Created
MetaEvaluatorto evaluate evaluators on GroUSE unit tests. - Added
plotsub-command to show the matrices
- Python
Published by antonioloison over 1 year ago