Skip to content

Actions: getappmap/navie-benchmark

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
826 workflow runs
826 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

verified 3,3,choose3 16k sonnet +maps
Run the benchmark #430: Manually run by dividedmind
November 19, 2024 11:00 1h 24m 54s wip/observe
November 19, 2024 11:00 1h 24m 54s
verified 3,3,choose3 16k sonnet no-maps
Run the benchmark #429: Manually run by dividedmind
November 19, 2024 10:59 2h 35m 5s wip/observe
November 19, 2024 10:59 2h 35m 5s
wip_with_maps choose-code-files-no-appmaps 16k gemini
Run the benchmark #428: Manually run by dividedmind
November 19, 2024 10:46 9m 34s wip/observe
November 19, 2024 10:46 9m 34s
wip_with_maps choose-code-files-no-appmaps 16k gemini
Run the benchmark #427: Manually run by dividedmind
November 19, 2024 10:36 9m 31s wip/observe
November 19, 2024 10:36 9m 31s
wip_with_maps choose-code-files+appmaps 16k gemini
Run the benchmark #426: Manually run by dividedmind
November 19, 2024 10:35 11m 29s wip/observe
November 19, 2024 10:35 11m 29s
Add a use_appmaps flag in solve.yml
Run the benchmark #425: Commit a0c0d80 pushed by dividedmind
November 19, 2024 10:25 Failure wip/observe
November 19, 2024 10:25 Failure
Add a use_appmaps flag in solve.yml
Run the benchmark #424: Commit b2034cf pushed by dividedmind
November 19, 2024 10:22 Failure wip/observe
November 19, 2024 10:22 Failure
33pct_1 observe
Run the benchmark #423: Manually run by dividedmind
November 16, 2024 11:59 14m 35s wip/observe
November 16, 2024 11:59 14m 35s
33pct_1 3,3,3,3 64k +test gemini-exp-1114
Run the benchmark #422: Manually run by dividedmind
November 15, 2024 14:44 22m 9s wip/gemini-exp
November 15, 2024 14:44 22m 9s
lite-rest 3,3,3,3 16k +test sonnet-20241022
Run the benchmark #421: Manually run by kgilpin
November 14, 2024 15:13 1h 50m 50s wip/lite-rest-retry
November 14, 2024 15:13 1h 50m 50s
feat: Document architecture
Run the benchmark #420: Pull request #98 opened by kgilpin
November 13, 2024 22:24 20s feat/document-architecture
November 13, 2024 22:24 20s
feat: Document architecture
Run tests #203: Pull request #98 opened by kgilpin
November 13, 2024 22:24 52s feat/document-architecture
November 13, 2024 22:24 52s
lite-rest-smoke gemini
Official solver #33: Manually run by kgilpin
November 13, 2024 20:11 13h 40m 9s wip/lite-rest-retry
November 13, 2024 20:11 13h 40m 9s
lite-rest-smoke 8k gemini
Run the benchmark #419: Manually run by kgilpin
November 13, 2024 20:10 13h 29m 42s wip/lite-rest-retry
November 13, 2024 20:10 13h 29m 42s
lite-rest 3,3,3,3 64k +test sonnet-20241022
Run the benchmark #418: Manually run by dividedmind
November 13, 2024 14:34 3h 14m 22s wip/lite-run
November 13, 2024 14:34 3h 14m 22s
lite-rest-smoke 3,3,3,3 64k +test sonnet-20241022
Run the benchmark #417: Manually run by dividedmind
November 13, 2024 13:44 19m 53s wip/lite-run
November 13, 2024 13:44 19m 53s
lite-rest 3,3,3,3 64k +test sonnet-20241022
Run the benchmark #416: Manually run by dividedmind
November 13, 2024 08:58 3m 57s wip/moar-fixed
November 13, 2024 08:58 3m 57s
33pct_1 3,3,3,3 64k +test gemini-1.5-pro-002
Run the benchmark #415: Manually run by dividedmind
November 12, 2024 14:29 5h 51m 49s sonnet-nov-6-2024
November 12, 2024 14:29 5h 51m 49s
Fix test environments
Run the benchmark #414: Pull request #97 opened by dividedmind
November 12, 2024 13:41 20s fix/test-environments
November 12, 2024 13:41 20s
Fix test environments
Run tests #202: Pull request #97 opened by dividedmind
November 12, 2024 13:41 51s fix/test-environments
November 12, 2024 13:41 51s
fix: Retry file choosing
Run the benchmark #413: Pull request #95 synchronize by dividedmind
November 12, 2024 13:18 23s fix/retry-file-choosing
November 12, 2024 13:18 23s
fix: Retry file choosing
Run tests #201: Pull request #95 synchronize by dividedmind
November 12, 2024 13:18 1m 1s fix/retry-file-choosing
November 12, 2024 13:18 1m 1s
test_fixup 3,3,3,3 64k sonnet
Run the benchmark #412: Manually run by kgilpin
November 10, 2024 22:37 2h 6m 54s wip/moar-fixed
November 10, 2024 22:37 2h 6m 54s
test_fixup 3,3,3,3 64k sonnet
Run the benchmark #411: Manually run by kgilpin
November 8, 2024 20:02 2m 24s wip/moar-fixed
November 8, 2024 20:02 2m 24s
test_fixup 2,2,2,2 16k sonnet
Run the benchmark #410: Manually run by kgilpin
November 8, 2024 18:19 1h 21m 17s wip/moar-fixed
November 8, 2024 18:19 1h 21m 17s