• HN Mail
  • Subscribe

CPP

I patched llama.cpp to gain 20% prompt processing TPS. Help me make a PR
6 points | 2 comments

Transcribe.cpp – ggml based transcription engine
5 points | 0 comments

TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B
3 points | 0 comments

Transcribe.cpp – ggml speech-to-text inference engine
2 points | 0 comments

Embodied.cpp: A Portable Inference Runtime of Embodied AI Models
1 points | 0 comments

Transcribe.cpp
1 points | 0 comments

Show HN: C++, Java and C# light-weight-logger
12 points | 0 comments