Zoran (zokica)

AI & ML interests: None yet
Recent Activity

- New activity about 3 hours ago on UnstableLlama/Qwen3.5-4B-exl3-4.00bpw: "Does not work"
- New activity about 1 year ago on unsloth/gemma-3-4b-it-unsloth-bnb-4bit: "Does not work at all"
- New activity about 1 year ago on ISTA-DASLab/gemma-3-4b-it-GPTQ-4b-128g: "Does not work"

Organizations: None yet
- Does not work · #1 opened about 3 hours ago by zokica
- Does not work at all · 10 · #1 opened about 1 year ago by zokica
- Does not work · 2 · #1 opened about 1 year ago by zokica
- Gemma 2's Flash Attention 2 implementation is strange... · 61 · #23 opened almost 2 years ago by GPT007
- Problem with LoRA finetuning, out of memory · 3 · #13 opened over 1 year ago by zokica
- OOM when finetuning with LoRA · 5 · #1 opened over 1 year ago by zokica
- Model repeating information and "spitting out" random characters · 8 · #14 opened almost 2 years ago by brazilianslib
- Gemma2FlashAttention2 missing sliding_window variable · 7 · 2 · #8 opened almost 2 years ago by emozilla
- why UMT5 · 6 · #1 opened about 2 years ago by pszemraj
- Something broken on last update · 7 · 7 · #85 opened almost 2 years ago by Nayjest
- Can't get it to generate the EOS token and beam search is not supported · 2 · #3 opened about 2 years ago by miguelcarv
- How to fine-tune this? + Training code · ❤️ 13 · 44 · #19 opened over 2 years ago by cekal
- Generation after finetuning does not end at EOS token · 1 · #123 opened about 2 years ago by zokica
- Attention mask for generation function in the future? · 21 · #7 opened over 2 years ago by rchan26
- guanaco-65b · 6 · #1 opened almost 3 years ago by bodaay
- How do you run this? · 3 · #2 opened almost 3 years ago by zokica
- How to run this? · 3 · #13 opened almost 3 years ago by zokica
- Does not work at all, I tried to calculate CoLA · 11 · #2 opened about 3 years ago by zokica
- This works, but training does not work at all · 6 · #4 opened about 3 years ago by zokica