Skip to content

Customizations for LLaMa-Factory #8149

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
1 task done
TanyaChutani opened this issue May 24, 2025 · 1 comment
Closed
1 task done

Customizations for LLaMa-Factory #8149

TanyaChutani opened this issue May 24, 2025 · 1 comment
Labels
solved This problem has been already solved

Comments

@TanyaChutani
Copy link

Reminder

  • I have read the above rules and searched the existing issues.

Description

Overall, looks powerful! Just a few things I’m curious about:

  • Can we go beyond the UI for custom setups?

  • How well does it handle big models like LLaMA 3–70B on multi-GPU setups?

  • Is it smooth with large or custom datasets?

  • Does it track experiments well for reproducibility?

Pull Request

No response

@TanyaChutani TanyaChutani added enhancement New feature or request pending This problem is yet to be addressed labels May 24, 2025
@hiyouga
Copy link
Owner

hiyouga commented May 26, 2025

The first two questions can be answered by the examples and the FAQs.

Large datasets are supported as long as they meet the requirements described in the data README.

You can enable TensorBoard or Wandb for better reproducibility.

@hiyouga hiyouga added solved This problem has been already solved and removed enhancement New feature or request pending This problem is yet to be addressed labels May 26, 2025
@hiyouga hiyouga closed this as completed May 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
solved This problem has been already solved
Projects
None yet
Development

No branches or pull requests

2 participants