
Update tokenizer.model_max_length if necessary (Fix #6415) #6632


Merged · 1 commit · Jan 14, 2025

Conversation

@xiaosu-zhu (Contributor) commented on Jan 14, 2025

What does this PR do?

Fixes #6415

It modifies the logic of loader.py -> load_tokenizer(...) so that tokenizer.model_max_length is updated whenever model_args.model_max_length differs from the tokenizer's current value.
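For context, a minimal sketch of the kind of change described (not the exact patch; the rest of the real load_tokenizer body and its argument handling are omitted, and the attribute names follow the PR text):

```python
from transformers import AutoTokenizer

def load_tokenizer(model_args):
    # Sketch of load_tokenizer(...) in loader.py; only the part relevant
    # to this PR is shown.
    tokenizer = AutoTokenizer.from_pretrained(model_args.model_name_or_path)

    # If the user configures a different model_max_length, propagate it to the
    # tokenizer so that exported tokenizer configs carry the updated limit.
    if model_args.model_max_length is not None and tokenizer.model_max_length != model_args.model_max_length:
        tokenizer.model_max_length = model_args.model_max_length

    return tokenizer
```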


@hiyouga (Owner) commented on Jan 14, 2025

Hi Xiaosu, thanks for your attention to the tokenizer handling in LLaMA Factory. We believed this property affects neither training nor inference, so we preferred not to modify it in our framework.

@xiaosu-zhu (Contributor, Author) commented

Thank you for your comments! You are right that there are no issues for training and/or inference, except in one place:

vLLM inference reads model_max_length as a hard token limit for generation.

If we create a vLLM engine with LLM(max_model_len=...) set larger than model_max_length, it raises an error, and we need to set VLLM_ALLOW_LONG_MAX_MODEL_LEN=1 to bypass it.
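To illustrate, a minimal sketch of the failure mode and the workaround (the model path and the 4096/8192 lengths are hypothetical, and the exact error text depends on the vLLM version):

```python
import os
from vllm import LLM

# Without this PR: the exported model/tokenizer still reports the old
# model_max_length (e.g. 4096), so asking vLLM for a longer context fails.
# LLM(model="path/to/exported/model", max_model_len=8192)
#   -> ValueError complaining that max_model_len exceeds the derived limit.

# Workaround: explicitly allow the longer length before building the engine.
os.environ["VLLM_ALLOW_LONG_MAX_MODEL_LEN"] = "1"
llm = LLM(model="path/to/exported/model", max_model_len=8192)
```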

Therefore, I think this PR reduces confusion for further use cases.

@hiyouga (Owner) left a comment


LGTM

@xiaosu-zhu (Contributor, Author) commented

Thank you for your quick response ❤️

Labels: solved (This problem has been already solved)
Projects: None yet
Linked issue: Tokenizer does not derive the newer config (#6415)
2 participants