Show HN: jj-benchmark – Evaluating AI agents on Jujutsu version control
4.2(10,000 条评价)
免费增值
2026年上线
Sobre
Hi HN, Meng from TabbyML here.<p>We decided to build this simply because we find Jujutsu (jj) really interesting, and many folks on our team have started trying it out recently. Since it introduces a very different workflow compared to traditional Git, we thought it would be a fun challenge to see how well current AI coding agents can actually use it.<p>To build this, we created a semi-automated pipeline. We used AI to research the official Jujutsu documentation and websites, which then helped u
优点
- +Evaluando agentes de IA em Jujutsu
- +Pipeline automatizada
- +Pesquisa em documentação oficial
缺点
- −Limitações em avaliação de agentes de IA
- −Dependência de documentação oficial