mirror of https://github.com/pytorch/pytorch.git synced 2025-10-20 21:14:14 +08:00

Files

Divyansh Khanna e6d8ed02cb PyTorch Data Sampler benchmark (#156974 )

## Motivation
Many PRs optimizing samplers (for eg https://github.com/pytorch/pytorch/pull/147706, https://github.com/pytorch/pytorch/pull/137423) are leveraging an adhoc script for benchmarking samplers. The script and outputs are often copied over in PRs. We want to begin centralizing benchmarks for torch.utils.data components.

## What ?
* This PR adds a new sub-folder in `benchmarks`  for `data`. This is aimed to cover benchmarking scripts for torch.utils.data components like dataloader and sampler.
* Specifically, this PR includes a simple script to time samplers. This is often "copy-pasted" in PRs optimizing samplers. Having it in a centralized location should prevent that, and allow a common standard.

## Output
```
Benchmark Results:
+--------------+-------------+----------------+-----------+-----------+
|   Batch Size | Drop Last   |   Original (s) |   New (s) | Speedup   |
+==============+=============+================+===========+===========+
|            4 | True        |         0.004  |    0.0088 | -119.62%  |
+--------------+-------------+----------------+-----------+-----------+
|            4 | False       |         0.0083 |    0.009  | -9.23%    |
+--------------+-------------+----------------+-----------+-----------+
|            8 | True        |         0.003  |    0.0074 | -147.64%  |
+--------------+-------------+----------------+-----------+-----------+
|            8 | False       |         0.0054 |    0.0075 | -38.72%   |
+--------------+-------------+----------------+-----------+-----------+
|           64 | True        |         0.0021 |    0.0056 | -161.92%  |
+--------------+-------------+----------------+-----------+-----------+
|           64 | False       |         0.0029 |    0.0055 | -92.50%   |
+--------------+-------------+----------------+-----------+-----------+
|          640 | True        |         0.002  |    0.0055 | -168.75%  |
+--------------+-------------+----------------+-----------+-----------+
|          640 | False       |         0.0024 |    0.0062 | -161.35%  |
+--------------+-------------+----------------+-----------+-----------+
|         6400 | True        |         0.0021 |    0.0055 | -160.13%  |
+--------------+-------------+----------------+-----------+-----------+
|         6400 | False       |         0.0021 |    0.0068 | -215.46%  |
+--------------+-------------+----------------+-----------+-----------+
|        64000 | True        |         0.0042 |    0.0065 | -55.29%   |
+--------------+-------------+----------------+-----------+-----------+
|        64000 | False       |         0.0029 |    0.0077 | -169.56%  |
+--------------+-------------+----------------+-----------+-----------+
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/156974
Approved by: https://github.com/ramanishsingh

2025-06-27 04:49:43 +00:00

data

PyTorch Data Sampler benchmark (#156974 )

2025-06-27 04:49:43 +00:00

distributed/ddp

[BE] Remove outdated RPC benchmark (#146716 )

2025-03-29 04:44:36 +00:00

dynamo

update expected results (#157010 )

2025-06-26 21:56:57 +00:00

fastrnns

[BE] fix typos in benchmarks/ (#156077 )

2025-06-17 13:12:18 +00:00

framework_overhead_benchmark

Fix unused Python variables outside torch/ and test/ (#136359 )

2024-12-11 17:10:23 +00:00

functional_autograd_benchmark

[BE] fix typos in benchmarks/ (#156077 )

2025-06-17 13:12:18 +00:00

fuser

Fix unused Python variables outside torch/ and test/ (#136359 )

2024-12-11 17:10:23 +00:00

gpt_fast

[BE] fix typos in benchmarks/ (#156077 )

2025-06-17 13:12:18 +00:00

inductor_backends

Rename inductor cache (#156128 )

2025-06-17 03:57:18 +00:00

inference

[BE] fix typos in benchmarks/ (#156077 )

2025-06-17 13:12:18 +00:00

instruction_counts

[BE] fix typos in benchmarks/ (#156077 )

2025-06-17 13:12:18 +00:00

nested

Fix unused Python variables outside torch/ and test/ (#136359 )

2024-12-11 17:10:23 +00:00

operator_benchmark

[BE] fix typos in benchmarks/ (#156077 )

2025-06-17 13:12:18 +00:00

overrides_benchmark

[BE][Easy][3/19] enforce style for empty lines in import segments in benchmarks/ (#129754 )

2024-07-17 14:34:42 +00:00

profiler_benchmark

Apply TorchFix TOR203 fixes (#143691 )

2024-12-23 18:21:03 +00:00

record_function_benchmark

[Caffe2]Remove Caffe2 scripts and benchmarks (#126747 )

2024-06-05 23:46:31 +00:00

serialization

Fix unused Python variables outside torch/ and test/ (#136359 )

2024-12-11 17:10:23 +00:00

sparse

[build] Change --cmake{,-only} arguments to envvars to support modern Python build frontend (#156045 )

2025-06-17 11:40:24 +00:00

static_runtime

[3/N] Use internal linkage in C++ files (#151297 )

2025-05-05 17:48:39 +00:00

tensorexpr

[BE] fix typos in benchmarks/ (#156077 )

2025-06-17 13:12:18 +00:00

transformer

[BE] fix typos in benchmarks/ (#156077 )

2025-06-17 13:12:18 +00:00

compare-fastrnn-results.py

[BE][Easy][3/19] enforce style for empty lines in import segments in benchmarks/ (#129754 )

2024-07-17 14:34:42 +00:00

compare.sh

Benchmarks: add scripts for FastRNNs results comparison. (#44134 )

2020-09-03 13:44:42 -07:00

README.md

PyTorch Data Sampler benchmark (#156974 )

2025-06-27 04:49:43 +00:00

upload_scribe.py

Fix broken URLs (#152237 )

2025-04-27 09:56:42 +00:00

README.md

PyTorch Benchmarks

This folder contains scripts that produce reproducible timings of various PyTorch features.

It also provides mechanisms to compare PyTorch with other frameworks.

Setup environment

Make sure you're on a machine with CUDA, torchvision, and pytorch installed. Install in the following order:

# Install torchvision. It comes with the pytorch stable release binary
pip3 install torch torchvision

# Install the latest pytorch master from source.
# It should supersede the installation from the release binary.
cd $PYTORCH_HOME
python setup.py build develop

# Check the pytorch installation version
python -c "import torch; print(torch.__version__)"

Benchmark List

Please refer to each subfolder to discover each benchmark suite. Links are provided where descriptions exist: