
Issues: hiyouga/LLaMA-Factory

🚨FAQs🚨
#4614 opened Jun 28, 2024 by hiyouga
Open

Issues list

Error when using Qwen2.5VL-3B in chat for video tasks bug Something isn't working pending This problem is yet to be addressed
#8177 opened May 27, 2025 by zzg-1999
1 task done
With the latest code and the built-in c4_demo dataset, continued pre-training of qwen3-0.6b on Huawei NPU fails with: argument of type "NoneType" is not iterable bug Something isn't working npu This problem is related to NPU devices pending This problem is yet to be addressed
#8175 opened May 27, 2025 by garyyang85
1 task done
RuntimeError: generator raised StopIteration enhancement New feature or request pending This problem is yet to be addressed
#8168 opened May 27, 2025 by cs-mshah
1 task done
Error with PPO ds3 bug Something isn't working pending This problem is yet to be addressed
#8158 opened May 26, 2025 by Mango17adjz
1 task done
Error during video DPO training bug Something isn't working pending This problem is yet to be addressed
#8157 opened May 26, 2025 by zhanghang-official
1 task done
[question] How to add a custom key to a dataset bug Something isn't working pending This problem is yet to be addressed
#8155 opened May 26, 2025 by tpoisonooo
1 task done
Training hangs during DPO training of Qwen-Omni on mixed-modality data bug Something isn't working pending This problem is yet to be addressed
#8151 opened May 25, 2025 by wwfnb
1 task done
During full-parameter pre-training of Qwen3-8b, grad_norm suddenly increases and training aborts bug Something isn't working pending This problem is yet to be addressed
#8150 opened May 24, 2025 by hummingbird2030
1 task done
Customizations for LLaMa-Factory solved This problem has been already solved
#8149 opened May 24, 2025 by TanyaChutani
1 task done
Error while serving fine-tuned Qwen 2.5 VL model bug Something isn't working help wanted Extra attention is needed pending This problem is yet to be addressed
#8147 opened May 23, 2025 by nishadsinghi
1 task done
The performance decreases seriously after finetuning on qwen2.5-Omni model with lora bug Something isn't working pending This problem is yet to be addressed
#8146 opened May 23, 2025 by humble-gambler
1 task done
Expects torch.Size([525336576]) but got torch.Size([128256, 4096]) bug Something isn't working pending This problem is yet to be addressed
#8142 opened May 23, 2025 by Abhivadan
1 task done
111 bug Something isn't working pending This problem is yet to be addressed
#8141 opened May 23, 2025 by XiaYifen
dlopen: cannot load any more object with static TLS bug Something isn't working pending This problem is yet to be addressed
#8140 opened May 23, 2025 by wangsikuan
1 task done
Loss becomes 0 when using DeepSpeed Zero2 with multi-node training (Zero3 works fine) bug Something isn't working pending This problem is yet to be addressed
#8137 opened May 22, 2025 by JackLingjie
1 task done
how to train with vqa bug Something isn't working pending This problem is yet to be addressed
#8132 opened May 22, 2025 by lleye
1 task done
Error when fine-tuning with the BAdam algorithm bug Something isn't working pending This problem is yet to be addressed
#8126 opened May 21, 2025 by HenryBao91
1 task done
Help needed! LoRA fine-tuning of qwen hangs on both single and multiple GPUs, with no error message bug Something isn't working pending This problem is yet to be addressed
#8118 opened May 20, 2025 by xinxinzi8
1 task done
Very low GPU utilization when training Qwen3 MoE models bug Something isn't working pending This problem is yet to be addressed
#8117 opened May 20, 2025 by haichuan1221
[Convert HF] TypeError: Received a NoneType for argument video_processor, but a BaseVideoProcessor was expected. bug Something isn't working pending This problem is yet to be addressed
#8107 opened May 19, 2025 by MathewCrespo
1 task done
Continue training error: No such file or directory zero_pp_rank_4_mp_rank_00_optim_states.pt bug Something isn't working pending This problem is yet to be addressed
#8098 opened May 19, 2025 by lmc8133
1 task done
Inference with InternVL3-8B-hf via vllm returns ValueError: limit_mm_per_prompt is only supported for multimodal models. enhancement New feature or request good first issue Good for newcomers pending This problem is yet to be addressed
#8086 opened May 16, 2025 by zhaomeng1234456
1 task done
RuntimeError: CUDA error: operation not supported bug Something isn't working pending This problem is yet to be addressed
#8085 opened May 16, 2025 by 52fhy
1 task done
Can custom loss functions be supported? enhancement New feature or request pending This problem is yet to be addressed
#8084 opened May 16, 2025 by longXboy
1 task done
Training hangs after "Using auto half precision backend" with no error or progress bug Something isn't working pending This problem is yet to be addressed
#8079 opened May 15, 2025 by SakirHussain
1 task done