JetBrains / teamcity-ai-agent-testing-demo
End-to-end TeamCity framework to run AI agents on SWE-Bench Lite. Spin up isolated Docker images per task, extract patches, score with the official harness, and aggregate success rates. As an example, we'll look at Junie and Google Gemini CLI
GitHub Repo 
0.9K
lines of main code
14 files
0
lines of test code
0 files
0.3K
lines of other code
5 files
<1y
age
175 days
100%
main code touched
1 year (949 LOC)
100%
new main code
1 year (949 LOC)
0.7K
kt
0.2K
py
0.08K
kts

0

18

0

3

2026 2025

generated by sokrates.dev (configuration) on 2026-01-18