Recent Releases of agentdojo

agentdojo - v0.1.34

v0.1.34

Features

  • Add support for max_tokens parameter for GoogleLLM by @dedeswim (#90)

Breaking Changes

None

Bug Fixes

None

- Python
Published by dedeswim 9 months ago

agentdojo - v0.1.33

Features

  • Add support for new Google SDK (#84) by @dedeswim

Breaking Changes

  • The use of GoogleLLM without passing a client as argument is now deprecated (but still supported)

Bug Fixes

None

- Python
Published by dedeswim 9 months ago

agentdojo - v0.1.32

Features

None

Breaking Changes

None

Bug Fixes

  • Fix issue with TogetherAI LLM (#82) by @dedeswim.

- Python
Published by dedeswim 10 months ago

agentdojo - v0.1.31

Features

None

Breaking Changes

None

Bug Fixes

  • Fix issue with OpenAI LLM (#77) by @thavens.

- Python
Published by dedeswim 10 months ago

agentdojo - v0.1.30

v0.1.30

Features

  • Add support for multi-block chat messages (#63) by @dedeswim.
  • Add support for reasoning models (Anthropic and OpenAI) and arguments (#60) by @dedeswim.
  • Add support for vLLM models (#68) by @normster and @thavens.

Breaking Changes

  • Single-block chat messages are not supported anymore. Old chat logs are still correctly loaded, but all ChatMessage's 'content' usage should be updated. A unique text can be extracted from ChatMessage['content'] by using the agentdojo.types.get_text_content_as_str utility function. Check #63 for more information.

Bug Fixes

  • Fix some issues in Benchmark v1.2 (#75) by @dedeswim and @EricWinsorDSIT

- Python
Published by dedeswim 10 months ago

agentdojo - v0.1.29

Features

None

Breaking Changes

None

Bug Fixes

  • Add thinking budget argument to Anthropic calls by @dedeswim (#61)

- Python
Published by dedeswim 12 months ago

agentdojo - v0.1.28

Features

None

Breaking Changes

None

Bug Fixes

  • Upgrade Anthropic to a version that supports thinking in pyproject.toml by @dedeswim

- Python
Published by dedeswim 12 months ago

agentdojo - v0.1.27

Features

Add support for reasoning parameters for Anthropic and OpenAI models by @dedeswim (#60)

Breaking Changes

None

Bug Fixes

None

- Python
Published by dedeswim 12 months ago

agentdojo - v0.1.26

Features

None

Breaking Changes

None

Bug Fixes

  • Fix issue with Claude 3.5 Haiku returning invalid function names, that can't be returned back to the API by @dedeswim

- Python
Published by dedeswim about 1 year ago

agentdojo - v0.1.25

Features

None

Breaking Changes

None

Bug Fixes

  • Fix issue with reschedule_event tool that was computing the wrong end time in case the end time was unspecified by @dedeswim

- Python
Published by dedeswim about 1 year ago

agentdojo - v0.1.24

Features

None

Breaking Changes

None

Bug Fixes

  • Fix issue with newer Anthropic models and with the Anthropic tool calling API by @dedeswim (#55)

- Python
Published by dedeswim about 1 year ago

agentdojo - v1.1.23

Features

  • Put all models-related information in only one file, making it easier to add new models by @maxwbuckley (#42).
  • Migration to uv by @dedeswim (#53)

Breaking Changes

  • None

Bug Fixes

  • Fix various tasks by @alexandrasouly-aisi (#45, #48, #50, #52)

- Python
Published by dedeswim about 1 year ago

agentdojo - v0.1.22

Features

  • Add Gemini 1.5 Flash and Pro 002 and Claude Sonnet 3.5 October release by @maxwbuckley (#39).

Breaking Changes

  • None

Bug Fixes

  • Fix some JSON Schema formats for Google models by @maxwbuckley (#39).
  • Fix benchmark version inconsistencies (spotted by @maxwbuckley, fixed by @dedeswim, #40).

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.22

Features

  • Add Gemini 1.5 Flash and Pro 002 and Claude Sonnet 3.5 October release by @maxwbuckley (#39).

Breaking Changes

  • None

Bug Fixes

  • Fix some JSON Schema formats for Google models by @maxwbuckley (#39).
  • Fix benchmark version inconsistencies (spotted by @maxwbuckley, fixed by @dedeswim, #40).

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.21

Features

  • None

Breaking Changes

  • None

Bug Fixes

  • Fix several bugs in tasks verifications, as spotted by @cw00h (#20, #21, #22, #23, #24, #25, #26), and put together by @dedeswim in #28.

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.20

Features

  • Save the type itself to [Function.return_type][agentdojo.functionsruntime.Function.returntype] instead of its string representation to avoid losing information by @dedeswim.

Breaking Changes

  • None

Bug Fixes

  • None.

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.19

Features

  • None.

Breaking Changes

  • None

Bug Fixes

  • Fix missing support for generic return types (e.g., list[str]) in Function.return_type by @dedeswim

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.18

Features

  • Now Function stores also the return type of the function and the full docstring by @dedeswim.

Breaking Changes

  • None

Bug Fixes

  • None

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.17

Features

  • None

Breaking Changes

  • None

Bug Fixes

  • Fix breaking changes due to updated APIs from some LLM providers by @dedeswim.

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.16

v0.1.16

Features

  • Add the possibility to pass a custom pipeline element object as an LLM to a PipelineConfig by @lbeurerkellner and @dedeswim.

Breaking Changes

  • None

Bug Fixes

  • Updated some checks in the banking suite by @lbeurerkellner.

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.15

Features

  • None

Breaking Changes

  • None

Bug Fixes

  • Fix the security checks for Workspace's injections tasks 4 and 5. The checks were not precise and injection tasks 4 would fail in some edge cases even if the model was executing it correctly.

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.14

Features

None

Breaking Changes

  • Ellipsis is not supported anymore as a type for function calls. This was done to simplify the code as any value can be passed as an argument to a function call, since is going to be discarded by the utility/security checks anyways.

Bug Fixes

  • Fix issue with FunctionCall not being serializable when saving to JSON. This issue was introuced in v0.1.13 when the FunctionCall object was changed from a TypedDict

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.13

Features

  • Add support for nested function calls in the [FunctionCall][agentdojo.functions_runtime.FunctionCall] object. This allows for more complex function calls: if any of the args in a [FunctionCall][agentdojo.functions_runtime.FunctionCall] object is another [FunctionCall][agentdojo.functions_runtime.FunctionCall], the function call will be executed recursively.

Breaking Changes

None

Bug Fixes

None

Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.12...v0.1.13

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.12

Features

None

Breaking Changes

  • Change [FunctionCall][agentdojo.functions_runtime.FunctionCall] from a [TypedDict][typing.TypedDict] to a Pydantic [BaseModel][pydantic.BaseModel]. This was done to allow for distinguishing function calls at runtime (in preparation) for supporting nested function calls. This affects users who are implementing new LLMs, new runtimes, and pipeline components that use function calls. To update your code, you will need to access the [function][agentdojo.functions_runtime.FunctionCall.function] and [args][agentdojo.functions_runtime.FunctionCall.args] attributes of the [FunctionCall][agentdojo.functions_runtime.FunctionCall] object instead of using dictionary access.

Bug Fixes

None

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.11

What's Changed

  • Add missing travel suite results by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/9
  • Add gpt 4o mini by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/10

Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.10...v0.1.11

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.10

What's Changed

  • Fix issue with Gemini models, re-run travel suite on them by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/8

Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.9...v0.1.10

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.9

What's Changed

  • Add register attack to all attacks by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/6
  • Fix travel suite by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/7: this introduces a new benchmark minor version

Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.8...v0.1.9

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.8

What's Changed

  • Add results table by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/2
  • Add list and dict parsing for Python-like tool calling syntax by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/5

Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.7...v0.1.8

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.7

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.6

Update docs

Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.5...v0.1.6

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.5

Add Claude 3.5 Sonnet

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.4

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.3

Improve information in pyproject.toml

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.2

Fix issue with dotenv dependency

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.1

Test for GH Action

- Python
Published by dedeswim over 1 year ago

agentdojo - v0.1.0

Initial release

- Python
Published by dedeswim over 1 year ago