Skip to content

Actions: getappmap/navie-benchmark

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
826 workflow runs
826 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

test_trouble 2,2,2,2 16k sonnet-20241022
Run the benchmark #409: Manually run by dividedmind
November 8, 2024 14:59 14m 13s wip/moar-fixed
November 8, 2024 14:59 14m 13s
test_trouble 2,2,2,2 16k sonnet-20241022
Run the benchmark #408: Manually run by dividedmind
November 8, 2024 14:44 12m 1s wip/moar-fixed
November 8, 2024 14:44 12m 1s
test_trouble 2,2,2,2 16k sonnet-20241022
Run the benchmark #407: Manually run by dividedmind
November 8, 2024 14:31 8m 58s swe-bench-2
November 8, 2024 14:31 8m 58s
test_trouble 2,2,2,2 16k sonnet-20241022
Run the benchmark #406: Manually run by dividedmind
November 8, 2024 14:28 1m 29s wip/moar-fixed
November 8, 2024 14:28 1m 29s
test_fixup 2,2,2,2 16k sonnet-20241022
Run the benchmark #405: Manually run by kgilpin
November 7, 2024 22:00 1h 23m 41s wip/moar-fixed
November 7, 2024 22:00 1h 23m 41s
test_fixup 2,2,2,2 16k sonnet-20241022
Run the benchmark #404: Manually run by kgilpin
November 7, 2024 21:58 45s wip/moar-fixed
November 7, 2024 21:58 45s
test_fixup 16k sonnet-20241022
Run the benchmark #403: Manually run by kgilpin
November 7, 2024 19:35 1h 45m 44s wip/moar-fixed
November 7, 2024 19:35 1h 45m 44s
33pct_3 3,3,3,3 64k +test sonnet-20241022
Run the benchmark #402: Manually run by kgilpin
November 5, 2024 22:01 1h 35m 35s fix/whitespace-adjuster
November 5, 2024 22:01 1h 35m 35s
33pct_2 3,3,3,3 64k +test sonnet-20241022
Run the benchmark #401: Manually run by kgilpin
November 5, 2024 20:01 1h 56m 28s fix/whitespace-adjuster
November 5, 2024 20:01 1h 56m 28s
fix: Whitespace adjuster
Run tests #200: Pull request #96 opened by kgilpin
November 5, 2024 19:53 47s fix/whitespace-adjuster
November 5, 2024 19:53 47s
fix: Whitespace adjuster
Run the benchmark #400: Pull request #96 opened by kgilpin
November 5, 2024 19:53 7m 0s fix/whitespace-adjuster
November 5, 2024 19:53 7m 0s
fix: Retry file choosing
Run the benchmark #399: Pull request #95 synchronize by dividedmind
November 5, 2024 13:21 19s fix/retry-file-choosing
November 5, 2024 13:21 19s
fix: Retry file choosing
Run tests #199: Pull request #95 synchronize by dividedmind
November 5, 2024 13:21 47s fix/retry-file-choosing
November 5, 2024 13:21 47s
33pct_3 choose-code-files 64k gemini
Run the benchmark #398: Manually run by dividedmind
November 4, 2024 20:59 1h 22m 7s fix/retry-file-choosing
November 4, 2024 20:59 1h 22m 7s
fix: Retry file choosing
Run the benchmark #397: Pull request #95 synchronize by dividedmind
November 4, 2024 20:57 21s fix/retry-file-choosing
November 4, 2024 20:57 21s
fix: Retry file choosing
Run tests #198: Pull request #95 synchronize by dividedmind
November 4, 2024 20:57 54s fix/retry-file-choosing
November 4, 2024 20:57 54s
33pct_3 3,3,3,3 16k +test haiku-20240307
Run the benchmark #396: Manually run by dividedmind
November 4, 2024 14:04 3h 6m 42s fix/retry-file-choosing
November 4, 2024 14:04 3h 6m 42s
33pct_3 3,3,3,3 64k +test haiku-20240307
Run the benchmark #395: Manually run by dividedmind
November 4, 2024 00:46 44m 26s fix/retry-file-choosing
November 4, 2024 00:46 44m 26s
fix: Retry file choosing
Run the benchmark #394: Pull request #95 synchronize by dividedmind
November 4, 2024 00:44 18s fix/retry-file-choosing
November 4, 2024 00:44 18s
fix: Retry file choosing
Run tests #197: Pull request #95 synchronize by dividedmind
November 4, 2024 00:44 57s fix/retry-file-choosing
November 4, 2024 00:44 57s
fix: Retry file choosing
Run the benchmark #393: Pull request #95 synchronize by dividedmind
November 3, 2024 21:54 22s fix/retry-file-choosing
November 3, 2024 21:54 22s
fix: Retry file choosing
Run tests #196: Pull request #95 synchronize by dividedmind
November 3, 2024 21:54 51s fix/retry-file-choosing
November 3, 2024 21:54 51s
fix: Retry file choosing
Run the benchmark #392: Pull request #95 opened by dividedmind
November 3, 2024 21:36 20s fix/retry-file-choosing
November 3, 2024 21:36 20s
fix: Retry file choosing
Run tests #195: Pull request #95 opened by dividedmind
November 3, 2024 21:36 45s fix/retry-file-choosing
November 3, 2024 21:36 45s
Claude produces mangled change output
Plan issue with Navie #132: Issue #94 labeled by kgilpin
October 31, 2024 17:17 29s
October 31, 2024 17:17 29s