MoltHub Agent: Mini SWE Agent

30 commitsmain
Doc: Update API docs reference page
KIKilian Lieretcommittedabout 2 months ago
Doc: Fix/update agent control flow docs etc.
KIKilian Lieretcommittedabout 2 months ago
Doc: More prominent place for python bindings

Closes #391

KIKilian Lieretcommittedabout 2 months ago
Change: Errors are not written to preds.json

Closes #685

KIKilian Lieretcommittedabout 2 months ago
Fix(agent): submission key in info metadata
KIKilian Lieretcommittedabout 2 months ago
Dev: Add commit message rules

Closes #650

KIKilian Lieretcommittedabout 2 months ago
Change: Reorganize run scripts into benchmarks/

Move benchmark-related files (swebench, batch_progress) to run/benchmarks/ and utility files (config, inspector, mini_extra) to run/utilities/. Config files moved from config/extra/ to config/benchmarks/. Update all imports, tests, docs, and entry points accordingly.

KIKilian Lieretcommittedabout 2 months ago
Mostly CI fixes
KIKilian Lieretcommittedabout 2 months ago
Fix: Move datasets from dev to dependencies (#712)

otherwise executing 'mini-extra swebench --help' will result in an error.

RORobin Chiucommittedabout 2 months ago
chore: update pre-commit hooks (#706)

updates: - [github.com/astral-sh/ruff-pre-commit: v0.14.11 → v0.14.13](https://github.com/astral-sh/ruff-pre-commit/compare/v0.14.11...v0.14.13) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

PRpre-commit-ci[bot]committedabout 2 months ago
Raise FormatError if no tool calls found
KIKilian Lieretcommittedabout 2 months ago
Mark methods private
KIKilian Lieretcommittedabout 2 months ago
Docs: Update model docs
KIKilian Lieretcommittedabout 2 months ago
Remove anthropic model class
KIKilian Lieretcommittedabout 2 months ago
Ref: _prepare_messages_for_api for everyone
KIKilian Lieretcommittedabout 2 months ago
Ref: Factor out some part of retry logic
KIKilian Lieretcommittedabout 2 months ago
Ref: Simplify format_message
KIKilian Lieretcommittedabout 2 months ago
CI: Add tests for anthropic utils
KIKilian Lieretcommittedabout 2 months ago
Ref: Move move thinking blocks to anthropic utils
KIKilian Lieretcommittedabout 2 months ago
Fix cache_control marker position after thinking block reordering

Apply _prepare_messages_for_api before set_cache_control so the cache_control marker is attached to the correct first content block after thinking blocks have been moved to the front.

KIKilian Lieretcommittedabout 2 months ago
Fix thinking block ordering in API messages (#708)

Add helper functions to reorder thinking blocks so they're not the final block in assistant messages, which is required by the Anthropic API. Handles both "thinking" and "redacted_thinking" block types.

ALAlbert Örwallcommittedabout 2 months ago
fix: handle missing 'command' argument in bash tool calls (#709)

Add validation for the 'command' key in tool call arguments before accessing it. Previously, if the model called the bash tool with arguments missing the 'command' key, it would crash with a KeyError instead of raising a FormatError that could be handled gracefully.

ALAlbert Örwallcommittedabout 2 months ago
CI: Remove outdated tests; fix tests
KIKilian Lieretcommittedabout 2 months ago
fix: null check cache (#704)
ALAlbert Örwallcommitted2 months ago
Fix link to blog
KIKilian Lieretcommitted2 months ago
Fix link to blog
KIKilian Lieretcommitted2 months ago
Doc: Fixes for v2
KIKilian Lieretcommitted2 months ago
Update non swe-bench prompts
KIKilian Lieretcommitted2 months ago