hiyouga / LlamaFactory Public

Pull requests: hiyouga/LlamaFactory

50 Open 1,272 Closed

Pull requests list

[fix] drop Qwen3-VL video metadata before model inputs

#10509 opened May 22, 2026 by luca-888 Contributor • Draft

1 of 2 tasks

[WIP] for kt full-fine-tune-support

#10508 opened May 22, 2026 by poryfly Contributor

2 tasks

[v1] Implement dynamic padding-free strategy for batching

#10507 opened May 21, 2026 by XuanyuChen-SEU

1 of 2 tasks

Patch GDN for NPU

#10504 opened May 21, 2026 by A1waysBeenHere

2 tasks

fix(api_example): guard against IndexError on empty choices list

#10498 opened May 17, 2026 by qizwiz

feat(v1): add system resource metrics collection

#10495 opened May 15, 2026 by singhalshubham03 • Draft

[Device-Metrics]: Add support to dump device metrics in log

#10492 opened May 14, 2026 by pankd

feat(v1): add comprehensive training metrics logging

#10488 opened May 14, 2026 by singhalshubham03

5 tasks done

[v1] Enhance LlamaFactory v1 and fix bugs in multi-node distributed training

#10483 opened May 12, 2026 by YouCanLX

1 of 2 tasks

fix: replace mutable default input_kwargs={} with None in HuggingfaceEngine

#10477 opened May 9, 2026 by kuishou68 Contributor

feat: support per-message loss control via inline loss field

#10471 opened May 8, 2026 by Swimakkkkkkkk

[data] qwen3_vl: relax default thought_words to handle single-newline </think>

#10464 opened May 7, 2026 by Schuture

3 tasks done

[train] fix loss aggregation bug in SFT and PT training

#10454 opened Apr 30, 2026 by 4teven

[xpu] extend runs_on test markers to include xpu

#10445 opened Apr 29, 2026 by singhalshubham03

2 tasks

[model] support DeepSeek V4 (label: pending)

#10434 opened Apr 25, 2026 by isLinXu Contributor

[model-test] Add test for Llama-3.1-8B-Instruct model

#10426 opened Apr 23, 2026 by pankd

2 tasks done

[chat] add thinking token injection for reasoning models in all engines

#10424 opened Apr 23, 2026 by kally788

4 tasks

[chat] fix enable_thinking=None overriding template defaults

#10423 opened Apr 23, 2026 by kally788

3 tasks

fix: add ignore_mismatched_sizes option to model loader

#10420 opened Apr 22, 2026 by octo-patch

avoid the EOFError issue when running chat with a sample prompt in a Jupyter notebook

#10409 opened Apr 19, 2026 by zhangnju

fix: materialize FSDP2 model on CPU when CPU offloading is enabled

#10403 opened Apr 17, 2026 by octo-patch

fix: handle mm_token_type_ids in collator and packing tests (label: pending)

#10397 opened Apr 16, 2026 by markmochi200

2 tasks done

fix: add None check for feature_extractor in Gemma4Plugin audio processing

#10388 opened Apr 13, 2026 by Ricardo-M-L

2 tasks done

fix: prevent training hang when WebUI client disconnects

#10383 opened Apr 11, 2026 by Ricardo-M-L

4 tasks done

fix: Ray placement group over-allocation and NCCL hang on GPU-less head node

#10380 opened Apr 10, 2026 by Ricardo-M-L

5 tasks
