Recent Releases of agentdojo
agentdojo - v0.1.30
v0.1.30
Features
- Add support for multi-block chat messages (#63) by @dedeswim.
- Add support for reasoning models (Anthropic and OpenAI) and arguments (#60) by @dedeswim.
- Add support for vLLM models (#68) by @normster and @thavens.
Breaking Changes
- Single-block chat messages are not supported anymore. Old chat logs are still correctly loaded, but all
ChatMessage's'content'usage should be updated. A unique text can be extracted fromChatMessage['content']by using theagentdojo.types.get_text_content_as_strutility function. Check #63 for more information.
Bug Fixes
- Fix some issues in Benchmark v1.2 (#75) by @dedeswim and @EricWinsorDSIT
- Python
Published by dedeswim 10 months ago
agentdojo - v1.1.23
Features
- Put all models-related information in only one file, making it easier to add new models by @maxwbuckley (#42).
- Migration to
uvby @dedeswim (#53)
Breaking Changes
- None
Bug Fixes
- Fix various tasks by @alexandrasouly-aisi (#45, #48, #50, #52)
- Python
Published by dedeswim about 1 year ago
agentdojo - v0.1.22
Features
- Add Gemini 1.5 Flash and Pro 002 and Claude Sonnet 3.5 October release by @maxwbuckley (#39).
Breaking Changes
- None
Bug Fixes
- Fix some JSON Schema formats for Google models by @maxwbuckley (#39).
- Fix benchmark version inconsistencies (spotted by @maxwbuckley, fixed by @dedeswim, #40).
- Python
Published by dedeswim over 1 year ago
agentdojo - v0.1.22
Features
- Add Gemini 1.5 Flash and Pro 002 and Claude Sonnet 3.5 October release by @maxwbuckley (#39).
Breaking Changes
- None
Bug Fixes
- Fix some JSON Schema formats for Google models by @maxwbuckley (#39).
- Fix benchmark version inconsistencies (spotted by @maxwbuckley, fixed by @dedeswim, #40).
- Python
Published by dedeswim over 1 year ago
agentdojo - v0.1.16
v0.1.16
Features
- Add the possibility to pass a custom pipeline element object as an LLM to a
PipelineConfigby @lbeurerkellner and @dedeswim.
Breaking Changes
- None
Bug Fixes
- Updated some checks in the banking suite by @lbeurerkellner.
- Python
Published by dedeswim over 1 year ago
agentdojo - v0.1.15
Features
- None
Breaking Changes
- None
Bug Fixes
- Fix the security checks for Workspace's injections tasks 4 and 5. The checks were not precise and injection tasks 4 would fail in some edge cases even if the model was executing it correctly.
- Python
Published by dedeswim over 1 year ago
agentdojo - v0.1.14
Features
None
Breaking Changes
Ellipsisis not supported anymore as a type for function calls. This was done to simplify the code as any value can be passed as an argument to a function call, since is going to be discarded by the utility/security checks anyways.
Bug Fixes
- Fix issue with
FunctionCallnot being serializable when saving to JSON. This issue was introuced in v0.1.13 when theFunctionCallobject was changed from aTypedDict
- Python
Published by dedeswim over 1 year ago
agentdojo - v0.1.13
Features
- Add support for nested function calls in the [
FunctionCall][agentdojo.functions_runtime.FunctionCall] object. This allows for more complex function calls: if any of theargsin a [FunctionCall][agentdojo.functions_runtime.FunctionCall] object is another [FunctionCall][agentdojo.functions_runtime.FunctionCall], the function call will be executed recursively.
Breaking Changes
None
Bug Fixes
None
Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.12...v0.1.13
- Python
Published by dedeswim over 1 year ago
agentdojo - v0.1.12
Features
None
Breaking Changes
- Change [
FunctionCall][agentdojo.functions_runtime.FunctionCall] from a [TypedDict][typing.TypedDict] to a Pydantic [BaseModel][pydantic.BaseModel]. This was done to allow for distinguishing function calls at runtime (in preparation) for supporting nested function calls. This affects users who are implementing new LLMs, new runtimes, and pipeline components that use function calls. To update your code, you will need to access the [function][agentdojo.functions_runtime.FunctionCall.function] and [args][agentdojo.functions_runtime.FunctionCall.args] attributes of the [FunctionCall][agentdojo.functions_runtime.FunctionCall] object instead of using dictionary access.
Bug Fixes
None
- Python
Published by dedeswim over 1 year ago
agentdojo - v0.1.11
What's Changed
- Add missing travel suite results by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/9
- Add gpt 4o mini by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/10
Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.10...v0.1.11
- Python
Published by dedeswim over 1 year ago
agentdojo - v0.1.9
What's Changed
- Add register attack to all attacks by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/6
- Fix travel suite by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/7: this introduces a new benchmark minor version
Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.8...v0.1.9
- Python
Published by dedeswim over 1 year ago
agentdojo - v0.1.8
What's Changed
- Add results table by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/2
- Add list and dict parsing for Python-like tool calling syntax by @dedeswim in https://github.com/ethz-spylab/agentdojo/pull/5
Full Changelog: https://github.com/ethz-spylab/agentdojo/compare/v0.1.7...v0.1.8
- Python
Published by dedeswim over 1 year ago