HN Mail
Subscribe
CPP
Whisper.cpp 1.8.3 Delivers a "12x Performance Boost" with Integrated Graphics
1 points
|
0 comments
VLLM or llama.cpp: Choosing the right LLM inference engine for your use case
1 points
|
0 comments
Ask HN: Who's running local AI workstations in 2026?
8 points
|
8 comments
Show HN: Revibing nanochat's inference model in C++ with ggml
5 points
|
0 comments
Confused between lowlevel and back end development – need guidance
3 points
|
4 comments
Show HN: Shimmytok – Pure Rust GGUF tokenizer (no C++, no extra files)
2 points
|
2 comments
Hermit-AI – An offline, privacy-first RAG chatbot for ZIM files
1 points
|
0 comments