# What are these scripts?
All scripts in this folder originate from the `nlp_example.py` file, as it is a very simplistic NLP training example using Accelerate with zero extra features.

From there, each further script adds in just one feature of Accelerate, showing how you can quickly modify your own scripts to implement these capabilities.

A full example with all of these parts integrated together can be found in the `complete_nlp_example.py` script and the `complete_cv_example.py` script.

Adjustments to each script from the base `nlp_example.py` file can be found quickly by searching for "# New Code #".
## Example Scripts by Feature and their Arguments
### Base Example (`../nlp_example.py`)
- Shows how to use `Accelerator` in an extremely simplistic PyTorch training loop (see the sketch below the launch command)
- Arguments available:
  - `mixed_precision`, whether to use mixed precision. ("no", "fp16", or "bf16")
  - `cpu`, whether to train using only the CPU. (yes/no/1/0)
All following scripts also accept these arguments in addition to their added ones.
These arguments should be added at the end of any method for starting the python script (such as `python`, `accelerate launch`, `python -m torch.distributed.run`), such as:

```bash
accelerate launch ../nlp_example.py --mixed_precision fp16 --cpu 0
```
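
If you are adapting your own training loop to this pattern, the core changes are small. A minimal sketch, using a toy model, optimizer, and dataloader as stand-ins for the real ones in `../nlp_example.py`:

```python
import torch
from accelerate import Accelerator

# Toy stand-ins for the real model/optimizer/dataloader used by the example.
model = torch.nn.Linear(128, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)
dataset = torch.utils.data.TensorDataset(torch.randn(64, 128), torch.randint(0, 2, (64,)))
dataloader = torch.utils.data.DataLoader(dataset, batch_size=8)

# mixed_precision mirrors the --mixed_precision flag ("no", "fp16", or "bf16").
accelerator = Accelerator(mixed_precision="no")
model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

model.train()
for inputs, labels in dataloader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(inputs), labels)
    accelerator.backward(loss)  # replaces loss.backward()
    optimizer.step()
```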
### Checkpointing and Resuming Training (`checkpointing.py`)
- Shows how to use `Accelerator.save_state` and `Accelerator.load_state` to save or continue training (see the sketch below the launch command)
- It is assumed you are continuing off the same training script
- Arguments available:
  - `checkpointing_steps`, after how many steps the various states should be saved. ("epoch", 1, 2, ...)
  - `output_dir`, where saved state folders should be saved to, default is current working directory
  - `resume_from_checkpoint`, what checkpoint folder to resume from. ("epoch_0", "step_22", ...)
These arguments should be added at the end of any method for starting the python script (such as `python`, `accelerate launch`, `python -m torch.distributed.run`), such as:

(Note: `resume_from_checkpoint` assumes that we've run the script for one epoch with the `--checkpointing_steps epoch` flag)

```bash
accelerate launch ./checkpointing.py --checkpointing_steps epoch --output_dir "checkpointing_tutorial" --resume_from_checkpoint "checkpointing_tutorial/epoch_0"
```
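
The checkpointing itself comes down to two `Accelerator` methods. A rough sketch of how they slot into a loop, with illustrative directory names matching the command above:

```python
import os
from accelerate import Accelerator

accelerator = Accelerator()
# ... build and accelerator.prepare(...) your model/optimizer/dataloader here ...

output_dir = "checkpointing_tutorial"           # mirrors --output_dir
resume_from = "checkpointing_tutorial/epoch_0"  # mirrors --resume_from_checkpoint
num_epochs = 3                                  # illustrative

if resume_from and os.path.isdir(resume_from):
    # Restores model, optimizer, scheduler, and RNG states from the folder.
    accelerator.load_state(resume_from)

for epoch in range(num_epochs):
    # ... run the training steps for this epoch ...
    # With --checkpointing_steps epoch, one state folder is written per epoch.
    accelerator.save_state(os.path.join(output_dir, f"epoch_{epoch}"))
```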
### Cross Validation (`cross_validation.py`)
- Shows how to use `Accelerator.free_memory` and run cross validation efficiently with `datasets` (see the sketch below the launch command)
- Arguments available:
  - `num_folds`, the number of folds the training dataset should be split into.
These arguments should be added at the end of any method for starting the python script (such as `python`, `accelerate launch`, `python -m torch.distributed.run`), such as:

```bash
accelerate launch ./cross_validation.py --num_folds 2
```
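
The Accelerate-specific piece is releasing everything between folds so each fold starts from a clean device. A hedged sketch (the fold splitting itself is done with `datasets` in the script and only hinted at here):

```python
from accelerate import Accelerator

accelerator = Accelerator()
num_folds = 2  # mirrors --num_folds

for fold in range(num_folds):
    # Build fresh model/optimizer/dataloaders for this fold and prepare them, e.g.:
    # model, optimizer, train_dl, eval_dl = accelerator.prepare(model, optimizer, train_dl, eval_dl)
    # ... train and evaluate on this fold ...

    # Drop the references Accelerate holds and empty the device cache
    # before the next fold's objects are prepared.
    accelerator.free_memory()
```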
### Experiment Tracking (`tracking.py`)
- Shows how to use `Accelerator.init_trackers` and `Accelerator.log` (see the sketch below the launch command)
- Can be used with Weights and Biases, TensorBoard, or CometML.
- Arguments available:
  - `with_tracking`, whether to load in all available experiment trackers from the environment.
These arguments should be added at the end of any method for starting the python script (such as `python`, `accelerate launch`, `python -m torch.distributed.run`), such as:

```bash
accelerate launch ./tracking.py --with_tracking
```
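
A minimal sketch of the logging calls, assuming an illustrative project name and config (the script wires these to its own hyperparameters):

```python
from accelerate import Accelerator

# log_with="all" picks up every tracker installed in the environment,
# which is what --with_tracking enables in the script.
accelerator = Accelerator(log_with="all")
accelerator.init_trackers("accelerate_tracking_example", config={"lr": 3e-4, "epochs": 3})

for step in range(10):          # stand-in for the real training loop
    loss = 1.0 / (step + 1)     # illustrative value
    accelerator.log({"train_loss": loss}, step=step)

accelerator.end_training()      # flushes and closes the trackers
```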
### Gradient Accumulation (`gradient_accumulation.py`)
- Shows how to use `Accelerator.no_sync` to prevent gradient averaging in a distributed setup (see the sketch below the launch command)
- Arguments available:
  - `gradient_accumulation_steps`, the number of steps to accumulate gradients over before the optimizer and scheduler are stepped and the gradients are zeroed.
These arguments should be added at the end of any method for starting the python script (such as `python`, `accelerate launch`, `python -m torch.distributed.run`), such as:

```bash
accelerate launch ./gradient_accumulation.py --gradient_accumulation_steps 5
```
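
A sketch of the accumulation pattern with `Accelerator.no_sync`, using toy stand-ins for the real model and data:

```python
import torch
from accelerate import Accelerator

accelerator = Accelerator()
gradient_accumulation_steps = 5  # mirrors --gradient_accumulation_steps

# Toy stand-ins for the script's real model/optimizer/dataloader.
model = torch.nn.Linear(16, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
dataset = torch.utils.data.TensorDataset(torch.randn(40, 16), torch.randint(0, 2, (40,)))
dataloader = torch.utils.data.DataLoader(dataset, batch_size=4)
model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

for step, (inputs, labels) in enumerate(dataloader):
    if (step + 1) % gradient_accumulation_steps != 0:
        # Intermediate step: skip the gradient all-reduce, just accumulate locally.
        with accelerator.no_sync(model):
            loss = torch.nn.functional.cross_entropy(model(inputs), labels)
            accelerator.backward(loss / gradient_accumulation_steps)
    else:
        # Sync step: gradients are averaged across workers, then we update.
        loss = torch.nn.functional.cross_entropy(model(inputs), labels)
        accelerator.backward(loss / gradient_accumulation_steps)
        optimizer.step()
        optimizer.zero_grad()
```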
### LocalSGD (`local_sgd.py`)
- Shows how to use `Accelerator.no_sync` to prevent gradient averaging in a distributed setup. However, unlike gradient accumulation, this method does not change the effective batch size. Local SGD can be combined with gradient accumulation. (See the sketch below the launch command.)
These arguments should be added at the end of any method for starting the python script (such as `python`, `accelerate launch`, `python -m torch.distributed.run`), such as:

```bash
accelerate launch ./local_sgd.py --local_sgd_steps 4
```
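
A hedged sketch using the `LocalSGD` helper from `accelerate.local_sgd` (the exact keyword arguments shown are an assumption; the toy model and data are stand-ins): each worker takes optimizer steps locally, and parameters are only averaged across workers every `local_sgd_steps` steps.

```python
import torch
from accelerate import Accelerator
from accelerate.local_sgd import LocalSGD

accelerator = Accelerator()
local_sgd_steps = 4  # mirrors --local_sgd_steps

# Toy stand-ins for the script's real model/optimizer/dataloader.
model = torch.nn.Linear(16, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
dataset = torch.utils.data.TensorDataset(torch.randn(32, 16), torch.randint(0, 2, (32,)))
dataloader = torch.utils.data.DataLoader(dataset, batch_size=4)
model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

with LocalSGD(accelerator=accelerator, model=model, local_sgd_steps=local_sgd_steps, enabled=True) as local_sgd:
    for inputs, labels in dataloader:
        loss = torch.nn.functional.cross_entropy(model(inputs), labels)
        accelerator.backward(loss)
        optimizer.step()
        optimizer.zero_grad()
        local_sgd.step()  # counts local steps and triggers the periodic parameter averaging
```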
### DDP Communication Hook (`ddp_comm_hook.py`)
- Shows how to use DDP Communication Hooks to control and optimize gradient communication across workers in a DistributedDataParallel setup (see the sketch below the launch command)
- Arguments available:
  - `ddp_comm_hook`, the type of DDP communication hook to use. Choose between `no`, `fp16`, `bf16`, `power_sgd`, and `batched_power_sgd`.
These arguments should be added at the end of any method for starting the python script (such as `accelerate launch`, `python -m torch.distributed.run`), such as:

```bash
accelerate launch ./ddp_comm_hook.py --mixed_precision fp16 --ddp_comm_hook power_sgd
```
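
The flag values map onto PyTorch's built-in DDP communication hooks. The sketch below shows that underlying PyTorch mechanism with a hypothetical helper function; in the script, Accelerate configures the hook for you rather than you registering it by hand.

```python
import torch
from torch.distributed.algorithms.ddp_comm_hooks import default_hooks, powerSGD_hook


def register_ddp_comm_hook(ddp_model: torch.nn.parallel.DistributedDataParallel, hook_name: str):
    """Hypothetical helper mapping the --ddp_comm_hook choices onto PyTorch's built-in hooks.

    `ddp_model` must already be wrapped in DistributedDataParallel (Accelerate does this
    for you in a multi-GPU run).
    """
    if hook_name == "fp16":
        # Compress gradients to fp16 before the all-reduce, decompress afterwards.
        ddp_model.register_comm_hook(state=None, hook=default_hooks.fp16_compress_hook)
    elif hook_name == "bf16":
        ddp_model.register_comm_hook(state=None, hook=default_hooks.bf16_compress_hook)
    elif hook_name in ("power_sgd", "batched_power_sgd"):
        # Low-rank PowerSGD gradient compression.
        state = powerSGD_hook.PowerSGDState(process_group=None, matrix_approximation_rank=1)
        hook = powerSGD_hook.powerSGD_hook if hook_name == "power_sgd" else powerSGD_hook.batched_powerSGD_hook
        ddp_model.register_comm_hook(state, hook)
    # "no": register nothing and keep DDP's default all-reduce.
```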
### Profiler (`profiler.py`)
- Shows how to use the profiling capabilities of Accelerate to profile PyTorch models during training (see the sketch below the launch command)
- Uses the `ProfileKwargs` handler to customize profiling, including which activities to record, the profiling schedule, and other options.
- Can generate and save profiling traces in JSON format for visualization in Chrome's tracing tool.

Arguments available:
- `--record_shapes`: If passed, records shapes for profiling.
- `--profile_memory`: If passed, profiles memory usage.
- `--with_stack`: If passed, profiles stack traces.
- `--with_flops`: If passed, profiles floating point operations (FLOPS).
- `--output_trace_dir`: If specified, saves the profiling trace to the given directory in JSON format.
- `--cpu`: If passed, trains on the CPU instead of the GPU.
These arguments should be added at the end of any method for starting the Python script (such as `python`, `accelerate launch`, `python -m torch.distributed.run`), such as:

```bash
accelerate launch ./profiler.py --record_shapes --profile_memory --with_flops --output_trace_dir "profiler"
```
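
A hedged sketch of how the flags translate into a `ProfileKwargs` handler (the exact field names shown are assumptions based on the handler's documented options; add `"cuda"` to `activities` when profiling GPU kernels):

```python
from accelerate import Accelerator, ProfileKwargs

# Assumed mapping of the CLI flags onto ProfileKwargs fields.
profile_kwargs = ProfileKwargs(
    activities=["cpu"],           # activities to record; add "cuda" on GPU runs
    record_shapes=True,           # --record_shapes
    profile_memory=True,          # --profile_memory
    with_flops=True,              # --with_flops
    output_trace_dir="profiler",  # --output_trace_dir; a Chrome trace JSON is written here
)
accelerator = Accelerator(kwargs_handlers=[profile_kwargs])

with accelerator.profile() as prof:
    # ... run the training (or inference) steps to be profiled ...
    pass

# Print a summary of the most expensive ops once profiling is done.
print(prof.key_averages().table(sort_by="self_cpu_time_total", row_limit=10))
```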