• HN Mail
  • Subscribe

CPP

I patched llama.cpp to gain 20% prompt processing TPS. Help me make a PR
5 points | 2 comments

As; HN: I was curious why MTP affects PP TPS in llama.cpp. My PoC recovers it?
4 points | 1 comments

Llama.cpp flags auto-tuning tool
3 points | 0 comments

Show HN: Loqi, a "local-first" translation tool using Ollama/llama.cpp
2 points | 0 comments

Show HN: Sipp – Run small local LLMs in browser 3x faster
5 points | 3 comments

Show HN: role-model, a router for hybrid local/cloud AI
2 points | 1 comments

A C++ AirPlay 2 sender: the encrypted RAOP/RTSP recipe, written down
3 points | 0 comments