whisper.cpp

mirror of https://github.com/ggml-org/whisper.cpp.git synced 2025-09-15 13:28:35 +08:00

Aman Gupta 93c7a08019 CUDA: add attention sinks for tile and wmma (llama/15178)	2025-08-18 20:30:45 +03:00
* CUDA: add attention sinks for tile and wmma
* Review: formatting changes + remove syncthreads from tile + remove warp_reduce_max from wmma
cmake	ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (llama/15094)	2025-08-18 20:30:45 +03:00
include	llama : add gpt-oss (llama/15091)	2025-08-18 20:30:45 +03:00
src	CUDA: add attention sinks for tile and wmma (llama/15178)	2025-08-18 20:30:45 +03:00
.gitignore	whisper : reorganize source code + improve CMake (#2256)	2024-06-26 19:34:09 +03:00
CMakeLists.txt	HIP: add cmake option to enable compiler output of kernel resource usage metrics (llama/15103)	2025-08-18 20:30:45 +03:00