-
Notifications
You must be signed in to change notification settings - Fork 8.7k
Pull requests: hiyouga/LlamaFactory
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP]for kt full-fine-tune-support
#10508
opened May 22, 2026 by
poryfly
Contributor
Loading…
2 tasks
[v1] Implement dynamic padding-free stretrgy for batching
#10507
opened May 21, 2026 by
XuanyuChen-SEU
Loading…
1 of 2 tasks
fix(api_example): guard against IndexError on empty choices list
#10498
opened May 17, 2026 by
qizwiz
Loading…
feat(v1): add system resource metrics collection
#10495
opened May 15, 2026 by
singhalshubham03
•
Draft
[Device-Metrics]: Add support to dump device metrics in log
#10492
opened May 14, 2026 by
pankd
Loading…
feat(v1): add comprehensive training metrics logging
#10488
opened May 14, 2026 by
singhalshubham03
Loading…
5 tasks done
[v1] Enhance LlamaFactory v1 and fix bugs in multi-node distributed training
#10483
opened May 12, 2026 by
YouCanLX
Loading…
1 of 2 tasks
fix: replace mutable default input_kwargs={} with None in HuggingfaceEngine
#10477
opened May 9, 2026 by
kuishou68
Contributor
Loading…
feat: support per-message loss control via inline loss field
#10471
opened May 8, 2026 by
Swimakkkkkkkk
Loading…
[data] qwen3_vl: relax default thought_words to handle single-newline </think>
#10464
opened May 7, 2026 by
Schuture
Loading…
3 tasks done
[train] fix loss aggregation bug in SFT and PT training
#10454
opened Apr 30, 2026 by
4teven
Loading…
[xpu] extend runs_on test markers to include xpu
#10445
opened Apr 29, 2026 by
singhalshubham03
Loading…
2 tasks
[model] support DeepSeek V4
pending
This problem is yet to be addressed
#10434
opened Apr 25, 2026 by
isLinXu
Contributor
Loading…
[model-test] Add test for Llama-3.1-8B-Instruct model
#10426
opened Apr 23, 2026 by
pankd
Loading…
2 tasks done
[chat] add thinking token injection for reasoning models in all engines
#10424
opened Apr 23, 2026 by
kally788
Loading…
4 tasks
[chat] fix enable_thinking=None overriding template defaults
#10423
opened Apr 23, 2026 by
kally788
Loading…
3 tasks
fix: add ignore_mismatched_sizes option to model loader
#10420
opened Apr 22, 2026 by
octo-patch
Loading…
avoid the EOFError issue when run chat with a sample prompt at a jupyter notebook
#10409
opened Apr 19, 2026 by
zhangnju
Loading…
fix: materialize FSDP2 model on CPU when CPU offloading is enabled
#10403
opened Apr 17, 2026 by
octo-patch
Loading…
fix: handle mm_token_type_ids in collator and packing tests
pending
This problem is yet to be addressed
#10397
opened Apr 16, 2026 by
markmochi200
Loading…
2 tasks done
fix: add None check for feature_extractor in Gemma4Plugin audio processing
#10388
opened Apr 13, 2026 by
Ricardo-M-L
Loading…
2 tasks done
fix: prevent training hang when WebUI client disconnects
#10383
opened Apr 11, 2026 by
Ricardo-M-L
Loading…
4 tasks done
fix: Ray placement group over-allocation and NCCL hang on GPU-less head node
#10380
opened Apr 10, 2026 by
Ricardo-M-L
Loading…
5 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.