Qwen2Moe zero3卡住的问题，已找到原因 #6669

MrLittleB · 2025-01-16T09:10:26Z

Reminder

I have read the above rules and searched the existing issues.

System Info

Reproduction

src/llamafactory/model/model_utils/moe.py

Others

No response

MrLittleB · 2025-01-16T09:11:13Z

src/llamafactory/model/model_utils/moe.py文件第64行 if model_type == "qwen2moe":
from transformers.models.qwen2_moe.modeling_qwen2_moe import Qwen2MoeSparseMoeBlock

qwen2moe改为qwen2_moe

hiyouga · 2025-01-17T05:46:25Z

fixed, thanks for pointing it out

MrLittleB added bug Something isn't working pending This problem is yet to be addressed labels Jan 16, 2025

hiyouga added a commit that referenced this issue Jan 17, 2025

fix #6669

61ff95a

hiyouga mentioned this issue Jan 17, 2025

[model] fix qwen2 moe #6684

Merged

2 tasks

hiyouga closed this as completed in #6684 Jan 17, 2025

hiyouga added solved This problem has been already solved and removed bug Something isn't working pending This problem is yet to be addressed labels Jan 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Qwen2Moe zero3卡住的问题，已找到原因 #6669

Qwen2Moe zero3卡住的问题，已找到原因 #6669

MrLittleB commented Jan 16, 2025

MrLittleB commented Jan 16, 2025

Uh oh!

hiyouga commented Jan 17, 2025

Uh oh!

Qwen2Moe zero3卡住的问题，已找到原因 #6669

Qwen2Moe zero3卡住的问题，已找到原因 #6669

Comments

MrLittleB commented Jan 16, 2025

Reminder

System Info

Reproduction

Others

MrLittleB commented Jan 16, 2025

Uh oh!

hiyouga commented Jan 17, 2025

Uh oh!