transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-10-21 01:23:56 +08:00

Author	SHA1	Message	Date
Yuanyuan Chen	12a50f294d	Enable FURB rules in ruff (#41395 ) * Apply ruff FURB rules Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> * Enable ruff FURB rules Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> * More fixes Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> * More fixes Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> * Revert changes Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> * More fixes Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> --------- Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>	2025-10-17 15:00:40 +00:00
Lucain	252d7cd952	Remove deprecated `use_auth_token` parameter (#41666 ) * Remove deprecated use_auth_token * code styl * fix test * Update examples/pytorch/speech-recognition/README.md	2025-10-17 09:57:46 +00:00
Rémi Ouazan	cf1e9834ec	Restore cuda graphs to continuous batching (#41421 ) * Type hints and small fixes * Remove unusued params * Made slice inputs the default * ruffed * Updated some var name and moved index slicing * Logging arg in example * Added some padding debug var and reformat out cg * First working CG, fixe size * Working flexible CG * CG are compatible with all implementations * Fixed CG API * Update example * Documentation * Fix padding tokens in FA * Review compliance * Better doc around weird bug * Style * Fix for sliding with CG	2025-10-13 11:57:56 +02:00
Marc Sun	feca4f3de7	remove `tpu_num_cores` (#41383 ) * remove-tpu-num-cores * fix * let's remove it * style * Update examples/legacy/seq2seq/finetune_tpu.sh Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> --------- Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>	2025-10-10 15:53:28 +02:00
Marc Sun	0419ff881d	Remove `local_rank` arg from `TrainingArguments` (#41382 )	2025-10-09 18:54:12 +02:00
Marc Sun	081391b20e	deprecate `jit_mode_eval` (#41376 )	2025-10-09 18:50:45 +02:00
Marc Sun	776eea8612	deprecate `overwrite_output_dir` (#41323 ) * dep * style * rm * wut * style	2025-10-09 18:36:19 +02:00
Arthur	8dfc8e8cfc	🤦 CB nit! (#41413 ) * 🤦 * updates * update cb simple * merge * up * update * fix * up * nit * rumble this is annoying * update * update * up * fix * .... * cleanup a bit * nit * typo * typing and typo * nit * updates * up * final fix! * update * fix more import issues * nuke is paged * up	2025-10-08 13:36:27 +02:00
Arthur	0395ed52ae	[`CB`] Refactors the way we access paged (#41370 ) * up * refactor the way we handle paged attention * affect serve as well * update * fix * cup	2025-10-06 17:55:31 +02:00
Yuanyuan Chen	fa36c973fc	Remove unnecessary list comprehension (#41305 ) Remove unnecessary comprehension Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>	2025-10-06 14:49:02 +00:00
Cyril Vallez	163601c619	Standardize `PretrainedConfig` to `PreTrainedConfig` (#41300 ) * replace * add metaclass for full BC * doc * consistency * update deprecation message * revert	2025-10-06 11:34:02 +02:00
Yuanyuan Chen	894a2bdd8c	Fix pylint generator warnings (#41258 ) Fix pylint generator warnings Signed-off-by: cyy <cyyever@outlook.com>	2025-10-02 12:35:42 +00:00
Marc Sun	103fa6d235	[v5] Remove deprecated prediction loop (#41123 ) * rem deprecated * more * rm all instances of legacy arg	2025-09-30 17:43:01 +02:00
Marc Sun	06c04e0851	Deprecate `half_precision_backend` (#41134 ) * deprecate * remove * rm apex * fix * fix * fix doc	2025-09-30 11:36:44 +02:00
OMOTAYO OMOYEMI	42c682514b	docs/examples(speech): pin CTC commands to Hub datasets; add Windows notes (#41027 ) * examples(speech): load Common Voice from Hub; remove deprecated dataset-script references (Windows-friendly notes) * docs/examples(speech): pin CTC streaming & other CTC commands to Hub datasets; add Windows notes * make style * examples(speech): align DataTrainingArguments help with datasets docs; minor wording fixes * docs/examples(speech): address review remove Hub subsection & Whisper tip; align dataset help text * style: apply ruff/black/usort/codespell on examples/speech-recognition * Apply style fixes * Update examples/pytorch/speech-recognition/README.md * update doc to match load_dataset --------- Co-authored-by: Eustache Le Bihan <eulebihan@gmail.com> Co-authored-by: eustlb <94853470+eustlb@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-09-30 08:38:31 +00:00
Lysandre Debut	10f6891fc5	Remove data from examples (#41168 ) Remove telemetry	2025-09-26 13:52:45 +02:00
Rémi Ouazan	97ca0b4712	Fix flash-attn for paged_attention when no kernels (#41078 ) * Fix non-kernels flash attention paged implementation * Cover all cases * Style * Update src/transformers/integrations/flash_paged.py Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> * Apply style fixes --------- Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-09-26 10:41:21 +02:00
Yuanyuan Chen	65dcd66cc8	🚨 [V5] Remove deprecated training arguments (#41017 ) * Remove deprecated training arguments from V5 Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> * Remove deprecated training arguments from V5 Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> * Fix comments Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> * Fix code Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> --------- Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>	2025-09-24 12:01:27 +02:00
Cyril Vallez	4df2529d79	🚨🚨🚨 Fully remove Tensorflow and Jax support library-wide (#40760 ) * setup * start the purge * continue the purge * more and more * more * continue the quest: remove loading tf/jax checkpoints * style * fix configs * oups forgot conflict * continue * still grinding * always more * in tje zone * never stop * should fix doc * fic * fix * fix * fix tests * still tests * fix non-deterministic * style * remove last rebase issues * onnx configs * still on the grind * always more references * nearly the end * could it really be the end? * small fix * add converters back * post rebase * latest qwen * add back all converters * explicitly add functions in converters * re-add	2025-09-18 18:27:39 +02:00
Rémi Ouazan	ef053939ca	Fixes for continuous batching (#40828 ) * Fix for CB attn mask and refactor * Tests for CB (not all passing) * Passing tests and a logger fix * Fixed the KV metrics that were broken when we moved to hybrid alloc * Fix circular import and style * Added tests for FA * Unfolded test to have device expectations * Fixes for H100 * more fixes for h100 * H100 are good * Style * Adding some comments from #40831 * Rename test * Avoid 1 letter variables * Dictonnary is only removed during kwargs * Test for supported sample * Fix a unvoluntary slice * Fixes for non-sliced inputs and small example improvments * Slice inputs is more understandabe * Style	2025-09-12 15:35:31 +02:00
Rémi Ouazan	1cdbbb3e9d	Support sliding window in CB (#40688 ) * CB example: better compare feature * Cache managers, still issue w/ effective length * WIP -- fix for effective length * Renames * Wroking, need better parity checks, we mind be missing 1 token * Small fixes * Fixed wrong attn mask and broke cache into pieces * Warmup is slowing down things, disabling it * Cache was too big, fixed * Simplified index objects * Added a profile option to the example * Avoid calls to memory reporing tools * Restore full attention read indices for better latency * Adressed some TODOS and style * Docstrings for cache managers * Docstrings for Schedulers * Refactor scheudlers * [Important] Cache fix for sliding window, check with small sw size * Updated doc for cache memory compute and cache as a whole * Moved a todo * Nits and style * Fix for when sliding window is smaller than max batch per token * Paged interface update * Support for FLash in new API * Fix example CB * Fix bug in CB for paged * Revert example * Style * Review compliance * Style * Styleeeee * Removed NO_SLIDING_WINDOW * Review #2 compliance * Better art * Turn cum_seqlens_k in a dict * Attn mask is now a dict * Update examples/pytorch/continuous_batching.py Co-authored-by: Luc Georges <McPatate@users.noreply.github.com> * Adressed McPatate pro review * Style and fix --------- Co-authored-by: Luc Georges <McPatate@users.noreply.github.com>	2025-09-09 15:51:11 +02:00
Kashif Rasul	3f7bda4209	[Continous Batching] fix do_Sample=True in continuous batching (#40692 ) * fix do_Sample=True in continous batching * added test * fix top_p * test * Update examples/pytorch/continuous_batching.py	2025-09-08 10:30:15 +02:00
Yuanyuan Chen	a470f21396	Enable more ruff UP rules (#40579 ) * Import Sequence from collections.abc Signed-off-by: cyy <cyyever@outlook.com> * Apply ruff UP rules Signed-off-by: cyy <cyyever@outlook.com> --------- Signed-off-by: cyy <cyyever@outlook.com>	2025-09-02 17:29:59 +02:00
Lysandre	ce48e9cac0	Dev version	2025-08-29 20:17:34 +02:00
Rémi Ouazan	34108a2230	Continuous batching refactor (#40426 ) * Rework of the CB example * Further rework of CB example * Refactor PA cache, slice on tokens, add debug prints -- WIP * Slice cache -- WIP * Added a mechanism to check batched outputs in CB script * Less logging, debug flag for slice, !better reset! -- WIP * QOL and safety margins * Refactor and style * Better saving of cb example * Fix * Fixes and QOL * Mor einformations about metrics * Further logging * Style * Licenses * Removed some comments * Add a slice input flag * Fix in example * Added back some open-telemetry deps * Removed some aux function * Added FA2 option to example script * Fixed math (all of it) * Added a simple example * Renamed core to classes * Made allocation of attention mask optionnal * Style	2025-08-26 13:01:42 +02:00
Manuel de Prada Corral	49e168ff08	🚨 Remove Contrastive Search decoding strategy (#40428 ) * delete go brrr * fix tests * review	2025-08-26 12:31:46 +02:00
Cyril Vallez	d8f6d3790a	⚠️⚠️ Use `dtype` instead of `torch_dtype` everywhere! (#39782 ) * update everywhere * style * pipelines * switch it everywhere in tests * switch it everywhere in docs * switch in converters everywhere * update in examples * update in model docstrings * style * warnings * style * Update configuration_utils.py * fix * Update configuration_utils.py * fixes and add first test * add pipeline tests * Update test_pipelines_common.py * add config test * Update test_modeling_common.py * add new ones * post rebase * add new * post rebase adds	2025-08-22 12:34:16 +02:00
Yuanyuan Chen	6333eb986a	Fix more typos (#40212 ) Signed-off-by: cyy <cyyever@outlook.com>	2025-08-18 12:52:12 +00:00
Yuanyuan Chen	9bfbdd2945	Fix default values of getenv (#39867 ) Signed-off-by: cyy <cyyever@outlook.com>	2025-08-07 17:25:40 +00:00
Lysandre	eb6e26acf3	Dev version	2025-08-05 18:09:30 +02:00
Arthur	6ea646a03a	Update ux cb (#39845 ) * clenaup * nits * updates * fix logging * push updates? * just passexception * update * nits * fix * add tokencount * style	2025-08-01 16:50:28 +02:00
Tommy Chiang	4fcf455517	Fix broken links (#39809 ) Replace links in the form of `[text]((url))` to `[text](url)`. This is the correct format of a url in the markdown.	2025-07-31 13:23:04 +00:00
Yuanyuan Chen	95faabf0a6	Apply several ruff SIM rules (#37283 ) * Apply ruff SIM118 fix Signed-off-by: cyy <cyyever@outlook.com> * Apply ruff SIM910 fix Signed-off-by: cyy <cyyever@outlook.com> * Apply ruff SIM101 fix Signed-off-by: cyy <cyyever@outlook.com> * Format code Signed-off-by: cyy <cyyever@outlook.com> * More fixes Signed-off-by: cyy <cyyever@outlook.com> --------- Signed-off-by: cyy <cyyever@outlook.com>	2025-07-29 11:40:34 +00:00
Arthur	c3401d6fad	dev version 4.55	2025-07-25 21:11:20 +02:00
Lysandre Debut	f90de364c2	Rename huggingface_cli to hf (#39630 ) * Rename huggingface_cli to hf * hfh	2025-07-25 14:10:04 +02:00
Quentin Lhoest	91f591f7bc	Make pytorch examples UV-compatible (#39635 ) * update release.py * add uv headers in some pytorch examples * rest of pytorch examples * style	2025-07-25 10:46:22 +02:00
Sai-Suraj-27	970d9a75ce	Raise `TypeError` instead of ValueError for invalid types (#38660 ) * Raise TypeError instead of ValueError for invalid types. * Removed un-necessary changes. * Resolved conflicts * Code quality * Fix failing tests. * Fix failing tests.	2025-07-21 12:42:00 +00:00
Yuanyuan Chen	60b5471da3	Enable some ruff checks for performance and readability (#39383 ) * Fix inefficient sequence tests Signed-off-by: cyy <cyyever@outlook.com> * Enable PERF102 Signed-off-by: cyy <cyyever@outlook.com> * Enable PLC1802 Signed-off-by: cyy <cyyever@outlook.com> * Enable PLC0208 Signed-off-by: cyy <cyyever@outlook.com> --------- Signed-off-by: cyy <cyyever@outlook.com>	2025-07-17 13:21:59 +00:00
Hosein Rezaei	d8e05951b8	Fix bugs in pytorch example run_clm when streaming is enabled (#39286 )	2025-07-15 15:37:28 +02:00
eromomon	903944a411	[examples] fix do_reduce_labels argument for run_semantic_segmentation_no_trainer (#39322 ) * no use do_reduce_labels argument in model * use do_reducer_labels in AutoImageProcessor	2025-07-14 10:16:49 +00:00
eromomon	af74ec65a7	Update Readme to Run Multiple Choice Script from Example Directory (#39323 ) * Update Readme to run in current place * Update Readme files to execute PyTorch examples from their respective folders	2025-07-11 10:58:26 -07:00
Quentin Lhoest	1ecd52e50a	Add torchcodec in docstrings/tests for `datasets` 4.0 (#39156 ) * fix dataset run_object_detection * bump version * keep same dataset actually * torchcodec in docstrings and testing utils * torchcodec in dockerfiles and requirements * remove duplicate * add torchocodec to all the remaining docker files * fix tests * support torchcodec in audio classification and ASR * [commit to revert] build ci-dev images * [commit to revert] trigger circleci * [commit to revert] build ci-dev images * fix * fix modeling_hubert * backward compatible run_object_detection * revert ci trigger commits * fix mono conversion and support torch tensor as input * revert map_to_array docs + fix it * revert mono * nit in docstring * style * fix modular --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-07-08 17:06:12 +02:00
Lysandre	5154497607	Dev version	2025-06-26 18:04:36 +02:00
Quentin Lhoest	858f9b71a8	Remove script datasets in tests (#38940 ) * remove trust_remote_code * again * Revert "Skip some tests for now (#38931)" This reverts commit 31d30b72245aacfdf70249165964b53790d9c4d8. * again * style * again * again * style * fix integration test * fix tests * style * fix * fix * fix the last ones * style * last one * fix last * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-25 14:31:20 +00:00
Mylon Jones	9f42c1f192	Added scikit-learn to the example image-classification requirements.txt (#37506 ) Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>	2025-06-24 15:24:02 +02:00
Dianana	4f650040a6	Removing extra space in large command for speech-pretraining example (#38705 ) Removing extra space in Large command	2025-06-24 12:24:56 +00:00
Yih-Dar	31d30b7224	Skip some tests for now (#38931 ) * try * [test all] --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-20 11:05:49 +02:00
Matt	508a704055	No more Tuple, List, Dict (#38797 ) * No more Tuple, List, Dict * make fixup * More style fixes * Docstring fixes with regex replacement * Trigger tests * Redo fixes after rebase * Fix copies * [test all] * update * [test all] * update * [test all] * make style after rebase * Patch the hf_argparser test * Patch the hf_argparser test * style fixes * style fixes * style fixes * Fix docstrings in Cohere test * [test all] --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-17 19:37:18 +01:00
Quentin Gallouédec	de24fb63ed	Use HF papers (#38184 ) * Use hf papers * Hugging Face papers * doi to hf papers * style	2025-06-13 11:07:09 +00:00
Cyril Vallez	4b8ec667e9	Remove all traces of `low_cpu_mem_usage` (#38792 ) * remove it from all py files * remove it from the doc * remove it from examples * style * remove traces of _fast_init * Update test_peft_integration.py * CIs	2025-06-12 16:39:33 +02:00

1 2 3 4 5 ...

579 Commits