• HN Mail
  • Subscribe

CPP

Llama.cpp at 100k Stars
4 points | 0 comments

Fixed a llama.cpp bug silently disabling Vulkan GPU on all 32-bit ARM devices
3 points | 0 comments

Show HN: Running LLM on smartwatch – found llama.cpp loading model twice in RAM
1 points | 0 comments

Show HN: Clusterflock: An AI orchestrator for networked hardware
3 points | 0 comments

Show HN: Host any GGUF model in one command
3 points | 0 comments

Ask HN: Best LLM model for a RAG-based Android app across all smartphones?
1 points | 1 comments

Show HN: WayInfer – Native GGUF engine that runs models larger than your RAM
1 points | 0 comments