← कैटलॉग पर वापस जाएं
Show HN: jj-benchmark – Evaluating AI agents on Jujutsu version control logo

Show HN: jj-benchmark – Evaluating AI agents on Jujutsu version control

4.2(10,000 समीक्षाएं)
फ्रीमियम
2026 में लॉन्च

Sobre

Hi HN, Meng from TabbyML here.<p>We decided to build this simply because we find Jujutsu (jj) really interesting, and many folks on our team have started trying it out recently. Since it introduces a very different workflow compared to traditional Git, we thought it would be a fun challenge to see how well current AI coding agents can actually use it.<p>To build this, we created a semi-automated pipeline. We used AI to research the official Jujutsu documentation and websites, which then helped u

फायदे

  • +Evaluando agentes de IA em Jujutsu
  • +Pipeline automatizada
  • +Pesquisa em documentação oficial

नुकसान

  • Limitações em avaliação de agentes de IA
  • Dependência de documentação oficial

Você também pode gostar