|
6fa3592dc6
|
Dataloader benchmark script (#159432)
This script adds a simple dataloading benchmark tracking throughput and memory.
The output looks like this
```
System Information:
PyTorch version: 2.9.0a0+gitf87d117
PyTorch location: /home/divyanshkhanna/pytorch/torch/__init__.py
Torchvision version: 0.24.0a0+f52c4f1
Torchvision location: /home/divyanshkhanna/pytorch/vision/torchvision/__init__.py
CUDA available: True
CUDA device: NVIDIA PG509-210
CPU count: 192
Physical CPU cores: 96
Total system memory: 1510.11 GB
Loading dataset from imagenet/val (1 copies)
Dataset size: 50000
--- Benchmarking DataLoader with worker_method=multiprocessing ---
Memory before DataLoader creation: 500.59 MB
Detailed memory information:
USS (Unique Set Size): 499.00 MB
PSS (Proportional Set Size): 500.74 MB
RSS (Resident Set Size): 497.39 MB
Memory after DataLoader creation: 1127.61 MB
Memory increase: 627.02 MB
Starting training loop with 1 epochs (max 100 batches per epoch)
Epoch 1, Batch 10, Time: 0.2910s, Memory: 12044.50 MB
Epoch 1, Batch 20, Time: 0.2909s, Memory: 12185.71 MB
Epoch 1, Batch 30, Time: 0.2909s, Memory: 10654.93 MB
Epoch 1, Batch 40, Time: 0.2909s, Memory: 12378.26 MB
Epoch 1, Batch 50, Time: 0.2907s, Memory: 12402.28 MB
Epoch 1, Batch 60, Time: 0.2909s, Memory: 10559.35 MB
Epoch 1, Batch 70, Time: 0.2907s, Memory: 12644.69 MB
Epoch 1, Batch 80, Time: 0.2909s, Memory: 12654.65 MB
Epoch 1, Batch 90, Time: 0.2909s, Memory: 12727.20 MB
Epoch 1, Batch 100, Time: 0.2908s, Memory: 12722.09 MB
Results:
Worker method: multiprocessing
DataLoader init time: 0.1553 seconds
Average batch time: 0.3408 seconds
Samples per second: 375.53
Peak memory usage: 12738.76 MB
Memory increase: 12238.17 MB
```
> TODO: This script right now is CPU-only friendly and GPU friendly. But it might be worth upgrading it to test against a canonical DistributedDataParallel setup on say a 1x8 node. Or maybe we can keep that as a separate script inside `benchmarks`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/159432
Approved by: https://github.com/ramanishsingh
|
2025-08-06 19:05:19 +00:00 |
|