HN Mail
Subscribe
CPP
I patched llama.cpp to gain 20% prompt processing TPS. Help me make a PR
6 points
|
2 comments
As; HN: I was curious why MTP affects PP TPS in llama.cpp. My PoC recovers it?
4 points
|
1 comments
Transcribe.cpp – ggml based transcription engine
4 points
|
0 comments
TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B
3 points
|
0 comments
Llama.cpp flags auto-tuning tool
3 points
|
0 comments
Transcribe.cpp – ggml speech-to-text inference engine
1 points
|
0 comments
Show HN: C++, Java and C# light-weight-logger
12 points
|
0 comments