HN Mail
Subscribe
CPP
Show HN: Turboquant.cpp – Quantize embeddings to 1-4 bits, no training (400 LoC)
2 points
|
0 comments
Monitoring LLM Inference with Prometheus and Grafana (vLLM, TGI, Llama.cpp)
2 points
|
0 comments
Llama.cpp – Run LLM Inference in C/C++
2 points
|
0 comments
Show HN: Selora – local model for Home Assistant
6 points
|
4 comments
Show HN: Peek – A Figma like DB GUI
6 points
|
1 comments
Ask HN: What are some good/fast coding models for Apple Silicon?
2 points
|
3 comments
How memory safety CVEs differ between Rust and C/C++
141 points
|
244 comments