* fix: manual edits
* Apply suggestions from code review
Apply suggestions from code review and make additional revisions
Co-authored-by: HyunSang Jang <tasker.dev103@gmail.com>
* Apply suggestions from code review
Apply suggestions from code review — updated inline links for related text
* Apply suggestions from code review
Apply suggestions from code review - final
* Update docs/source/ko/_toctree.yml
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: HyunSang Jang <tasker.dev103@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Removed unnecessary checks for `key` being a `torch.fx.Proxy` in GQA conditions, because fx tracing is no longer supported and `torch.export` supports `enable_gqa`.
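For context, a minimal sketch of what `enable_gqa` replaces: instead of manually repeating key/value heads for grouped-query attention (the workaround that needed the fx.Proxy checks), `torch.nn.functional.scaled_dot_product_attention` can broadcast them itself on PyTorch >= 2.5. Shapes and names below are illustrative, not the actual model code.

```python
import torch
import torch.nn.functional as F

# Illustrative shapes: 8 query heads sharing 2 key/value heads (GQA).
batch, q_heads, kv_heads, seq, head_dim = 1, 8, 2, 16, 64
query = torch.randn(batch, q_heads, seq, head_dim)
key = torch.randn(batch, kv_heads, seq, head_dim)
value = torch.randn(batch, kv_heads, seq, head_dim)

# Old-style workaround: repeat KV heads so every query head has a match.
key_rep = key.repeat_interleave(q_heads // kv_heads, dim=1)
value_rep = value.repeat_interleave(q_heads // kv_heads, dim=1)
out_manual = F.scaled_dot_product_attention(query, key_rep, value_rep, is_causal=True)

# With PyTorch >= 2.5, SDPA handles the head broadcasting itself,
# and this path also works under torch.export.
out_gqa = F.scaled_dot_product_attention(query, key, value, is_causal=True, enable_gqa=True)

torch.testing.assert_close(out_manual, out_gqa)
```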
* update
* batch update model code
* typos
* too many diffs, dump
* dump again
* another dump
* fix copies
* make `rope_scaling_dict` self attr
* fix a few more tests
* another update
* fix a few more tests, hopefully last ones
* fix copies
* fix copies again
* fix newly added models, I hate rebasing on main
* update config files
* modular files
* fix rope utils test
* docstring has to be indented more, why?
* oops, forgot to update some modular files
* copy from doesn't copy decorators?
* fix overridden test as well
* add a new test
* fix failing tests again
* update docstrings
* fix phi3
* fix two models
* fix copies
* forgot to add
* stupid bug from modular conversion
* fix slow tests
* update to call rotary emb once per model forward (see the sketch below)
* 3K tests failing?!
* update
* update more models
* fix copies
* fix the rest of tests hopefully
* fix after rebase
* fix the rope tests
* fix docs omni
* change a bit
* models with layer types
* why was it deleted?
* fix a few tests
* fix last test!
* delete extra empty lines
* add a test case
* more changes
* fix models
* typing hint for nested rope params
* missed when resolving conflicts
* delete layer types and fix typo
* fix copies
* fix copies
* update docs text
* docs
* huge update across all models
* fix copies
* rename attr to align with new format
* delete redundant rope tests
* trigger ci
* update the case
* this is why I hate rebasing
* maybe fixed?
* oops
* now fix?
* fix last tests and copies
* fix copies?
* fix minimax and gemma3n
* update typo
* deprecation end version
* final fix copies :fingers-crossed:
* oh my, add the docs in toctree
* okay, this is really the last fix
* fix copies and hope that tests won't start failing again
* use rope scaling if saved
* fix slow tests
* fix cwm and unrelated deepseek
* fix last
* update
* hope it works now, it took so long
* let's keep None for now, I will try to remove it after checking tests
* some more fixes; find-and-replace does not always catch all cases
* last fix of tests
* Arthur's comment for extra forward kwargs
* delete unused code
* fix slow qwen tests
* delete layer types from models
* faulty modular conversion
* fix qwen omni
* fix copies and style
* address my comment
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
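One of the larger changes in the list above is calling the rotary embedding once per model forward and handing the result to every layer, instead of recomputing it per layer. A rough, hedged sketch of that pattern follows; class and attribute names are illustrative, not the exact transformers code.

```python
import torch
from torch import nn

class ToyRotaryEmbedding(nn.Module):
    """Illustrative RoPE module that returns (cos, sin) for the given positions."""
    def __init__(self, head_dim: int, base: float = 10000.0):
        super().__init__()
        inv_freq = 1.0 / (base ** (torch.arange(0, head_dim, 2).float() / head_dim))
        self.register_buffer("inv_freq", inv_freq, persistent=False)

    def forward(self, position_ids: torch.Tensor):
        freqs = position_ids[..., None].float() * self.inv_freq
        emb = torch.cat((freqs, freqs), dim=-1)
        return emb.cos(), emb.sin()

class ToyDecoderModel(nn.Module):
    def __init__(self, num_layers: int = 2, head_dim: int = 64):
        super().__init__()
        self.rotary_emb = ToyRotaryEmbedding(head_dim)
        self.layers = nn.ModuleList(nn.Identity() for _ in range(num_layers))

    def forward(self, hidden_states: torch.Tensor, position_ids: torch.Tensor):
        # Compute (cos, sin) once here, then hand the same tuple to every layer,
        # rather than re-running the rotary embedding inside each attention layer.
        position_embeddings = self.rotary_emb(position_ids)
        for layer in self.layers:
            hidden_states = layer(hidden_states)  # a real layer would consume position_embeddings
        return hidden_states

model = ToyDecoderModel()
out = model(torch.randn(1, 16, 64), position_ids=torch.arange(16)[None])
```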
* Default implementation - no time improvement
* Improved implementation - apparently 2 times faster with only a simple function refactor
* elementary torch-first approach; further implementation of the torch-first method still needed
* torch-first approach finished (see the sketch below)
* refactor processor
* refactor test
* partial doc update
* EfficientLoFTRImageProcessorFast based implementation
* EfficientLoFTRImageProcessorFast based implementation
* Logic checked - Test Passed - Validated execution speed
* use modular for efficientloftr
* fix import
---------
Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co>
Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>
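The "torch-first" commits above refer to doing the image preprocessing with batched torch ops rather than a per-image PIL/NumPy pipeline, which is usually where the speedup comes from. A generic, hedged illustration of the idea (not the actual EfficientLoFTRImageProcessorFast code; the function and size below are placeholders):

```python
import torch
import torch.nn.functional as F

def preprocess_torch_first(images: torch.Tensor, size: int = 480) -> torch.Tensor:
    """Rescale and resize a batch of uint8 images entirely with torch ops.

    images: (batch, channels, height, width) uint8 tensor. The whole batch is
    handled in one vectorized call instead of looping over PIL images.
    """
    images = images.float() / 255.0  # rescale to [0, 1]
    images = F.interpolate(images, size=(size, size), mode="bilinear", align_corners=False)
    return images

# Usage: a batch of 4 RGB images
pixel_values = preprocess_torch_first(torch.randint(0, 256, (4, 3, 640, 640), dtype=torch.uint8))
```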
* Add a switch to CB in case of paged cache
* Added paged as a valid cache implementation
* Added a fallback on input_ids as a name
* Rookie mistake
* Removed paged from cache implementations
* Added warning about some beam search args
* Moved up CB warning
* Fix EncoderDecoder cache
* Add the option for the DDP data tuples to have 2 elements
* Modify the order of the KV and sliding
* Adapted RAG and Whisper to the new EncoderDecoderCache (see the sketch below)
* A single comma
* Remove kwargs in map
* Fixed order in manual injection cache test
* Slight changes to support legacy format
* Removed Nones
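For the EncoderDecoderCache changes above, a hedged usage sketch of how the cache wrapper is typically constructed; the exact call sites in RAG and Whisper differ, and the constructor details may vary across transformers versions.

```python
from transformers import DynamicCache, EncoderDecoderCache

# An EncoderDecoderCache wraps one cache for decoder self-attention and one
# for cross-attention; decoder layers read from both during generation.
past_key_values = EncoderDecoderCache(DynamicCache(), DynamicCache())

# model.generate(..., past_key_values=past_key_values)  # passed like any other cache
```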
This commit addresses a noisy warning and improves the robustness of the base pipeline implementation.
- The device placement message in the pipeline base class has been changed from a `warning` to a `debug` log. This reduces log noise for users who are aware of their device setup, while still providing the information for debugging purposes.
- Additionally, potential `UnboundLocalError` exceptions in the `_pad` and `check_model_type` functions have been prevented by initializing variables before their conditional assignment.
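A minimal illustration of the `UnboundLocalError` fix described above; the real `_pad` logic is more involved, and this only shows the initialize-before-branch pattern with a hypothetical helper.

```python
def _pad_example(items, pad_to_multiple_of=None):
    # Initialize up front so the name is always bound, even when no branch runs.
    padded = None
    if pad_to_multiple_of is not None:
        padded = [item + [0] * (-len(item) % pad_to_multiple_of) for item in items]
    # Without the initialization above, referencing `padded` here could raise
    # UnboundLocalError when pad_to_multiple_of is None.
    return padded if padded is not None else items
```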
* Add is_causal to KosmosTextAttention
* Move the target_dtype getter so it can be imported elsewhere
* Fix fp32 flash attention bug in bark (see the sketch below)
* Fix is_causal in mllama
* Fix fp32 issue on StableLM
* Fix repo-consistency
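The fp32 fixes above follow a common pattern in the attention code: FlashAttention kernels only support fp16/bf16, so inputs that were silently upcast to fp32 (for example by layer norms) have to be cast back down before the kernel call. A hedged sketch of that pattern; the function name and the float16 default are illustrative, and in the modeling code the target dtype usually comes from the model's configured dtype or its input embedding weights.

```python
import torch

def cast_for_flash_attention(query, key, value, target_dtype=torch.float16):
    """Cast fp32 attention inputs back to a FlashAttention-compatible dtype."""
    if query.dtype == torch.float32:
        # Layer norm upcasting (or training in fp32) can leave q/k/v in float32;
        # FlashAttention would reject them, so cast all three tensors down.
        query, key, value = (t.to(target_dtype) for t in (query, key, value))
    return query, key, value
```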