• HN Mail
  • Subscribe

CPP

Show HN: Turboquant.cpp – Quantize embeddings to 1-4 bits, no training (400 LoC)
2 points | 0 comments

Monitoring LLM Inference with Prometheus and Grafana (vLLM, TGI, Llama.cpp)
2 points | 0 comments

Llama.cpp – Run LLM Inference in C/C++
2 points | 0 comments

Show HN: Selora – local model for Home Assistant
6 points | 4 comments

Show HN: Peek – A Figma like DB GUI
6 points | 1 comments

Ask HN: What are some good/fast coding models for Apple Silicon?
2 points | 3 comments

How memory safety CVEs differ between Rust and C/C++
141 points | 244 comments