-
Notifications
You must be signed in to change notification settings - Fork 6.2k
Issues: hiyouga/LLaMA-Factory
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Qwen2.5VL-3B使用chat用于视频任务时报错
bug
Something isn't working
pending
This problem is yet to be addressed
#8177
opened May 27, 2025 by
zzg-1999
1 task done
当前最新版本代码使用内置数据集c4_demo,在华为NPU上对qwen3-0.6b做增量预训练报错argument of type "NoneType" is not iterable
bug
Something isn't working
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#8175
opened May 27, 2025 by
garyyang85
1 task done
RuntimeError: generator raised StopIteration
enhancement
New feature or request
pending
This problem is yet to be addressed
#8168
opened May 27, 2025 by
cs-mshah
1 task done
PPO ds3报错问题
bug
Something isn't working
pending
This problem is yet to be addressed
#8158
opened May 26, 2025 by
Mango17adjz
1 task done
视频DPO训练报错
bug
Something isn't working
pending
This problem is yet to be addressed
#8157
opened May 26, 2025 by
zhanghang-official
1 task done
[question] dataset 怎么增加自定义 key
bug
Something isn't working
pending
This problem is yet to be addressed
#8155
opened May 26, 2025 by
tpoisonooo
1 task done
Qwen-Omni在混合模态数据上dpo训练时,训练卡住
bug
Something isn't working
pending
This problem is yet to be addressed
#8151
opened May 25, 2025 by
wwfnb
1 task done
Qwen3-8b模型全参数预训练过程中,grad_norm突然增大,模型训练中止
bug
Something isn't working
pending
This problem is yet to be addressed
#8150
opened May 24, 2025 by
hummingbird2030
1 task done
Customizations for LLaMa-Factory
solved
This problem has been already solved
#8149
opened May 24, 2025 by
TanyaChutani
1 task done
Error while serving fine-tuned Qwen 2.5 VL model
bug
Something isn't working
help wanted
Extra attention is needed
pending
This problem is yet to be addressed
#8147
opened May 23, 2025 by
nishadsinghi
1 task done
The performance decreases seriously after finetuning on qwen2.5-Omni model with lora
bug
Something isn't working
pending
This problem is yet to be addressed
#8146
opened May 23, 2025 by
humble-gambler
1 task done
Expects torch.Size([525336576]) but got torch.Size([128256, 4096])
bug
Something isn't working
pending
This problem is yet to be addressed
#8142
opened May 23, 2025 by
Abhivadan
1 task done
dlopen: cannot load any more object with static TLS
bug
Something isn't working
pending
This problem is yet to be addressed
#8140
opened May 23, 2025 by
wangsikuan
1 task done
Loss becomes 0 when using DeepSpeed Zero2 with multi-node training (Zero3 works fine)
bug
Something isn't working
pending
This problem is yet to be addressed
#8137
opened May 22, 2025 by
JackLingjie
1 task done
how to train with vqa
bug
Something isn't working
pending
This problem is yet to be addressed
#8132
opened May 22, 2025 by
lleye
1 task done
BAdam算法finetune训练报错
bug
Something isn't working
pending
This problem is yet to be addressed
#8126
opened May 21, 2025 by
HenryBao91
1 task done
求大佬相助!单卡/多卡lora微调qwen都会卡住,但是无报错信息
bug
Something isn't working
pending
This problem is yet to be addressed
#8118
opened May 20, 2025 by
xinxinzi8
1 task done
Qwen3 MoE模型训练GPU使用率很低
bug
Something isn't working
pending
This problem is yet to be addressed
#8117
opened May 20, 2025 by
haichuan1221
[Convert HF] TypeError: Received a NoneType for argument video_processor, but a BaseVideoProcessor was expected.
bug
Something isn't working
pending
This problem is yet to be addressed
#8107
opened May 19, 2025 by
MathewCrespo
1 task done
Continue training error: No such file or directory zero_pp_rank_4_mp_rank_00_optim_states.pt
bug
Something isn't working
pending
This problem is yet to be addressed
#8098
opened May 19, 2025 by
lmc8133
1 task done
使用vllm推理InternVL3-8B-hf时返回ValueError: New feature or request
good first issue
Good for newcomers
pending
This problem is yet to be addressed
limit_mm_per_prompt
is only supported for multimodal models.
enhancement
#8086
opened May 16, 2025 by
zhaomeng1234456
1 task done
RuntimeError: CUDA error: operation not supported
bug
Something isn't working
pending
This problem is yet to be addressed
#8085
opened May 16, 2025 by
52fhy
1 task done
能否支持自定义loss function?
enhancement
New feature or request
pending
This problem is yet to be addressed
#8084
opened May 16, 2025 by
longXboy
1 task done
Training hangs after "Using auto half precision backend" with no error or progress
bug
Something isn't working
pending
This problem is yet to be addressed
#8079
opened May 15, 2025 by
SakirHussain
1 task done
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.