mirror of
https://github.com/pytorch/pytorch.git
synced 2025-10-20 12:54:11 +08:00
Build flash attention separately in build using 2 jobs since it OOMs on more, then the rest of the job uses 6 Pull Request resolved: https://github.com/pytorch/pytorch/pull/156236 Approved by: https://github.com/malfet