Commit History

ggml-cpu : rework weak alias on apple targets (llama/14146)
de5e986

xctan commited on

CUDA/HIP: fix ssm_scan on devices where warp size is not 32 (llama/14196)
adf6b4b

uvos commited on

HIP: Replace usage of depricated preprocessor macro __AMDGCN_WAVEFRONT_SIZE__ (llama/14183)
c3467c7

uvos commited on

sycl: Adding additional cpy dbg print output (llama/14034)
6799437

Anton Mitkov commited on

SYCL: Bump oneMath commit (llama/14152)
4d12916

Ewan Crawford commited on

sycl: Remove not needed copy f16->f32 for dnnl mul mat (llama/14125)
eed049f

Anton Mitkov commited on

cmake : handle whitepsaces in path during metal build (llama/14126)
8076017

ggerganov HF Staff danbev commited on

Implement GGML_CPU_ALL_VARIANTS for ARM (llama/14080)
c9cec9d

Christian Kastner commited on

vulkan: Better thread-safety for command pools/buffers (llama/14116)
fdc26e7

jeffbolznv commited on

vulkan: Track descriptor pools/sets per-context (llama/14109)
855a3bf

jeffbolznv commited on

opencl: add `mul_mv_id_q4_0_f32_8x_flat` (llama/14003)
d0a458b

lhez commited on

Vulkan: Don't default to CPU device (like llvmpipe), even if no other device is available, to allow fallback to CPU backend (llama/14099)
dcb106f

OccamRazor commited on

rpc : nicer error messages for RPC server crash (llama/14076)
5d5056e

mcfadyeni commited on

ggml : disable warnings for tests when using MSVC (ggml/1273)
1669c07

danbev commited on

ggml : remove unused ggml_context_container (ggml/1272)
e6d6988

danbev commited on

examples : include examples in msvc disable warn (ggml/1270)
0c191be

danbev commited on

whisper : clear result_all if vad_samples is empty (#3262)
60ecd84
unverified

danbev commited on

examples : set the C++ standard to C++17 for server (#3261)
5257653
unverified

danbev commited on

examples : update usage/help in yt-wsp.sh (#3251)
0931e4a
unverified

w1redch4d commited on

server : graceful shutdown, atomic server state, and health endpoint Improvements (#3243)
170eb31
unverified

sachaarbonel commited on

whisper : fix VAD processing for skipped audio segments (#3230)
a69c121
unverified

danbev commited on

server : add Voice Activity Detection (VAD) support (#3246)
58d6e4e
unverified

danbev commited on

cli : fix short name conflict for vad options [no ci] (#3247)
c5f7b7e
unverified

danbev commited on

ruby : add .gitignore entries for ext directory (#3245)
984d583
unverified

danbev commited on

ci : update windows runner to windows-2022 (#3242)
7e96237
unverified

danbev commited on

ruby : add cleaning of library names in dependencies (#3241)
f6dc2ad
unverified

danbev commited on

ggml : fix weak alias win32 (#0)
d47070d

ggerganov HF Staff commited on

android : fix builds (#0)
4043835

ggerganov HF Staff commited on

sync : ggml
a890a8c

ggerganov HF Staff commited on

files : remove old sources (part 2)
c1c9908

ggerganov HF Staff commited on

sync : ggml
43cbdf7

ggerganov HF Staff commited on

files : remove old sources
e4ae8c6

ggerganov HF Staff commited on

talk-llama : sync llama.cpp
5ef1601

ggerganov HF Staff commited on

sync : ggml
6ac9e73

ggerganov HF Staff commited on

metal : use less stack memory in FA kernel (llama/14088)
014afb6

ggerganov HF Staff commited on

ggml-cpu : split arch-specific implementations (llama/13892)
8c833e9

xctan ggerganov HF Staff commited on

cuda : fix device sync on buffer clear (llama/14033)
8f2e8d6

Diego Devesa commited on

CANN: Simplify the environment variable setting(#13104)
f1535d7

dou112 commited on

sycl: Add reorder to Q6_K mmvq implementation (llama/13885)
56f0e48

Nicolò Scipione commited on

cuda : fix buffer type check with integrated GPUs (llama/14069)
747ad97

Diego Devesa commited on

SYCL: Implement few same quantized type copy kernels (llama/13739)
4c88a27

qnixsynapse commited on

vulkan: Enable VK_KHR_cooperative_matrix extension for Intel Xe2 GPUs (llama/14001)
e5107fe

rillomas commited on

llama : allow using mmap without PrefetchVirtualMemory, apply GGML_WIN_VER to llama.cpp sources (llama/14013)
f0a0ac8

Diego Devesa commited on

vulkan: automatically deduce size of push constants (llama/13936)
00a9e2f

jeffbolznv commited on

ggml-vulkan: adds support for op CONV_TRANSPOSE_1D (llama/13813)
32985b0

etasnadi commited on

releases : use dl backend for linux release, remove arm64 linux release (llama/13996)
9896625

Diego Devesa commited on

CUDA: fix FTZ in FA for Gemma 3 (llama/13991)
40fc316

JohannesGaessler commited on

vulkan: fix warnings in perf logger querypool code (llama/13937)
11bac96

jeffbolznv commited on

opencl: add `backend_synchronize` (llama/13939)
a9ce9a8

lhez commited on

OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (llama/13840)
5ff8785

rmatif commited on