← カタログに戻る
SShow HN: Llmbuffer – Python library for cache-optimized LLM conversation history logo

Show HN: Llmbuffer – Python library for cache-optimized LLM conversation history

3.8editorial
無料
2026年にリリース

Sobre

I was not getting good cache utilization when including dynamic context in agent threads. After a lot of experimentation, I found a good pattern that minimizes how often long lived conversation history gets modified while still supporting dynamic context. It has flexible hooks for doing things like truncating or summarizing tool outputs when transitioning messages to the long term history. And I'm seeing >>90% of tokens hitting the cache for my agents despite including a lot of dynami

Casos de uso

  • Otimizar histórico de conversas de LLM
  • Reduzir consumo de tokens
  • Melhorar desempenho de cache

Como funciona

O usuário instala a biblioteca e a utiliza para gerenciar o histórico de conversas com cache otimizado.

Exemplo de uso

Qual a capital do Brasil?

長所

  • +Melhora significativamente a taxa de cache de tokens para históricos de conversas longas com contexto dinâmico
  • +Oferece hooks flexíveis para truncar ou resumir saídas de ferramentas ao transicionar para o histórico de longo prazo
  • +Atinge >90% de acertos no cache mesmo com contexto dinâmico em agentes

短所

  • Requer integração manual com o código existente do agente
  • Pode ter overhead de configuração inicial para casos de uso específicos

Perguntas frequentes

O que é o Show HN: Llmbuffer – Python library for cache-optimized LLM conversation history?
O Show HN: Llmbuffer – Python library for cache-optimized LLM conversation history é uma ferramenta de inteligência artificial especializada em chatbots e marketing. I was not getting good cache utilization when including dynamic context in agent threads. After a lot of experimentation, I found a good pattern that minimizes how often long lived conversation history gets modified while still supporting dynamic context. It has flexible hooks for doing things like truncating or summarizing tool outputs when transitioning messages to the long term history. And I'm seeing >>90% of tokens hitting the cache for my agents despite including a lot of dynami
O Show HN: Llmbuffer – Python library for cache-optimized LLM conversation history é gratuito?
Sim, o Show HN: Llmbuffer – Python library for cache-optimized LLM conversation history é totalmente gratuito para usar.

Você também pode gostar