← Quay lại danh mục
Show HN: jj-benchmark – Evaluating AI agents on Jujutsu version control logo

Show HN: jj-benchmark – Evaluating AI agents on Jujutsu version control

4.2(10,000 đánh giá)
Freemium
Ra mắt năm 2026

Sobre

Hi HN, Meng from TabbyML here.<p>We decided to build this simply because we find Jujutsu (jj) really interesting, and many folks on our team have started trying it out recently. Since it introduces a very different workflow compared to traditional Git, we thought it would be a fun challenge to see how well current AI coding agents can actually use it.<p>To build this, we created a semi-automated pipeline. We used AI to research the official Jujutsu documentation and websites, which then helped u

Ưu điểm

  • +Evaluando agentes de IA em Jujutsu
  • +Pipeline automatizada
  • +Pesquisa em documentação oficial

Nhược điểm

  • Limitações em avaliação de agentes de IA
  • Dependência de documentação oficial

Você também pode gostar