CUDA/HIP: fix ssm_scan on devices where warp size is not 32 (llama/14196) adf6b4b uvos committed on Jun 15, 2025
HIP: Replace usage of deprecated preprocessor macro __AMDGCN_WAVEFRONT_SIZE__ (llama/14183) c3467c7 uvos committed on Jun 15, 2025
sycl: Adding additional cpy dbg print output (llama/14034) 6799437 Anton Mitkov committed on Jun 13, 2025
sycl: Remove not needed copy f16->f32 for dnnl mul mat (llama/14125) eed049f Anton Mitkov committed on Jun 12, 2025
cmake : handle whitespaces in path during metal build (llama/14126) 8076017 ggerganov (HF Staff), danbev committed on Jun 12, 2025
Implement GGML_CPU_ALL_VARIANTS for ARM (llama/14080) c9cec9d Christian Kastner committed on Jun 11, 2025
vulkan: Better thread-safety for command pools/buffers (llama/14116) fdc26e7 jeffbolznv committed on Jun 11, 2025
vulkan: Track descriptor pools/sets per-context (llama/14109) 855a3bf jeffbolznv committed on Jun 11, 2025
Vulkan: Don't default to CPU device (like llvmpipe), even if no other device is available, to allow fallback to CPU backend (llama/14099) dcb106f OccamRazor committed on Jun 10, 2025
rpc : nicer error messages for RPC server crash (llama/14076) 5d5056e mcfadyeni committed on Jun 10, 2025
ggml : disable warnings for tests when using MSVC (ggml/1273) 1669c07 danbev committed on Jun 13, 2025
whisper : clear result_all if vad_samples is empty (#3262) 60ecd84 unverified danbev committed on Jun 18, 2025
examples : set the C++ standard to C++17 for server (#3261) 5257653 unverified danbev committed on Jun 17, 2025
examples : update usage/help in yt-wsp.sh (#3251) 0931e4a unverified w1redch4d committed on Jun 16, 2025
server : graceful shutdown, atomic server state, and health endpoint Improvements (#3243) 170eb31 unverified sachaarbonel committed on Jun 16, 2025
whisper : fix VAD processing for skipped audio segments (#3230) a69c121 unverified danbev committed on Jun 13, 2025
server : add Voice Activity Detection (VAD) support (#3246) 58d6e4e unverified danbev committed on Jun 13, 2025
cli : fix short name conflict for vad options [no ci] (#3247) c5f7b7e unverified danbev committed on Jun 13, 2025
ruby : add .gitignore entries for ext directory (#3245) 984d583 unverified danbev committed on Jun 13, 2025
ci : update windows runner to windows-2022 (#3242) 7e96237 unverified danbev committed on Jun 11, 2025
ruby : add cleaning of library names in dependencies (#3241) f6dc2ad unverified danbev committed on Jun 10, 2025
metal : use less stack memory in FA kernel (llama/14088) 014afb6 ggerganov (HF Staff) committed on Jun 9, 2025
ggml-cpu : split arch-specific implementations (llama/13892) 8c833e9 xctan, ggerganov (HF Staff) committed on Jun 9, 2025
sycl: Add reorder to Q6_K mmvq implementation (llama/13885) 56f0e48 Nicolò Scipione committed on Jun 9, 2025
cuda : fix buffer type check with integrated GPUs (llama/14069) 747ad97 Diego Devesa committed on Jun 8, 2025
SYCL: Implement few same quantized type copy kernels (llama/13739) 4c88a27 qnixsynapse committed on Jun 7, 2025
vulkan: Enable VK_KHR_cooperative_matrix extension for Intel Xe2 GPUs (llama/14001) e5107fe rillomas committed on Jun 5, 2025
llama : allow using mmap without PrefetchVirtualMemory, apply GGML_WIN_VER to llama.cpp sources (llama/14013) f0a0ac8 Diego Devesa committed on Jun 5, 2025
vulkan: automatically deduce size of push constants (llama/13936) 00a9e2f jeffbolznv committed on Jun 5, 2025
ggml-vulkan: adds support for op CONV_TRANSPOSE_1D (llama/13813) 32985b0 etasnadi committed on Jun 4, 2025
releases : use dl backend for linux release, remove arm64 linux release (llama/13996) 9896625 Diego Devesa committed on Jun 4, 2025
vulkan: fix warnings in perf logger querypool code (llama/13937) 11bac96 jeffbolznv committed on Jun 3, 2025
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (llama/13840) 5ff8785 rmatif committed on Jun 2, 2025