Recent Releases of judy

judy - 2.1.0

What's Changed

  • Updated eval prompts to be optionally configurable in eval config by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/40
  • Fixes judy web app running in debg mode by default by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/40
  • Adds new RAG evaluation scenario by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/40
  • Updated pagination bar in raw results page of web app by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/40

Full Changelog: https://github.com/TNT-Hoopsnake/judy/compare/2.0.0...2.1.0

- Python
Published by tnt-rahul-raghavan about 2 years ago

judy - [2.0.0] - 2024-01-31

What's Changed

  • Adds async concurrent evaluations by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/31
  • Adds evaluation of closed-source models by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/31
  • Adds cost/usage tracking by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/31
  • Changes cli config command to no longer be interactive by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/31
  • Adds cli config list command by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/31
  • Adds validation checks for common config issues by @tnt-rahul-raghavan
  • Fixes error when using openai key specified in run config by @tnt-rahul-raghavan
  • Updates pipeline to only attempt saving successful results by @tnt-rahul-raghavan
  • Updates README with screenshots by @tnt-rahul-raghavan
  • Updates cli to print tracebacks to errors.log file by @tnt-rahul-raghavan

Full Changelog: https://github.com/TNT-Hoopsnake/judy/compare/1.0.1...2.0.0

- Python
Published by tnt-rahul-raghavan about 2 years ago

judy - [1.0.1] - 2024-01-31

What's Changed

  • Bump aiohttp from 3.8.6 to 3.9.0 by @dependabot in https://github.com/TNT-Hoopsnake/judy/pull/30
  • Create LICENSE by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/32
  • Update issue templates by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/33
  • Update issue templates by @tnt-rahul-raghavan in https://github.com/TNT-Hoopsnake/judy/pull/34
  • Bump jinja2 from 3.1.2 to 3.1.3 by @dependabot in https://github.com/TNT-Hoopsnake/judy/pull/35

Full Changelog: https://github.com/TNT-Hoopsnake/judy/compare/1.0.0...1.0.1

- Python
Published by tnt-rahul-raghavan about 2 years ago

judy - [1.0.0] - 2023-12-12

[1.0.0] - 2023-12-12

Added

  • cli app to load data from datasets, construct prompts and run evaluations on model responses
  • web app to display results and view graphs for aggregated results
  • a set of common scenarios to use for evaluation (flask, instruction following, question answering, disinformation)

- Python
Published by tnt-rahul-raghavan about 2 years ago