Created Autograd Onboarding Lab (markdown)

Richard Zou
2021-06-29 16:38:37 -04:00
parent ed491b587e
commit c792d91298

@ -0,0 +1,49 @@
The goal of this lab is to give first-hand experience with deriving autograd formulas and adding them to PyTorch.
The deliverable will be a stack of PRs (that won't be merged into master) containing the code for each of the sections below.
You should add both your mentor and @albanD as reviewers on these PRs.
## Considered Function
For this lab, we are going to consider the following attention function:
```
import torch

def attn(q, k, v):
    # Similarity scores between queries and keys
    attn = torch.matmul(q, k.swapaxes(0, 1))
    # Squash the scores with tanh to get the attention weights
    attn = torch.tanh(attn)
    # Weighted combination of the values
    output = torch.matmul(attn, v)
    return output, attn

q = torch.rand(2, 3)
k = torch.rand(2, 3)
v = torch.rand(2, 4)
out, attn = attn(q, k, v)
```
The goal is to implement this function as a native operator in PyTorch and provide an efficient autograd formula for it.
You can assume that all inputs are 2D Tensors, that they all share the same first dimension, and that q and k have the same second dimension.
## I) Derive gradient formula
For the function above, you should derive the gradient formula on paper.
You can write the final formula in a markdown file.
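As a hint on how the derivation can be organized (double-check everything yourself rather than treating this as the reference answer), write S = q k^T, A = tanh(S) and O = A v for the intermediates, and let the incoming gradients for the two outputs be written with bars. Applying the chain rule through these intermediates, the result can be written roughly as:
```
\bar{v} = A^\top \bar{O}
\bar{A}_{\mathrm{total}} = \bar{O} v^\top + \bar{A}        % A is both an intermediate and a returned output
\bar{S} = \bar{A}_{\mathrm{total}} \odot (1 - A \odot A)   % derivative of tanh
\bar{q} = \bar{S}\, k
\bar{k} = \bar{S}^\top q
```
The gradcheck in the next section is the authoritative check on whatever you derive here.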
## II) Write custom Function and gradcheck
Based on the formula above, you should implement the forward/backward inside a custom autograd Function so that you can verify that the computed gradient is correct.
You can write this in a small standalone script that defines the Function and runs gradcheck/gradgradcheck on it.
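For orientation, a minimal sketch of what such a script could look like is shown below; the class name `AttnFn` and the backward formulas are illustrative (they assume the derivation sketched in section I), so treat your own derivation and the gradcheck output as the source of truth.
```
import torch
from torch.autograd import gradcheck, gradgradcheck

class AttnFn(torch.autograd.Function):
    @staticmethod
    def forward(ctx, q, k, v):
        attn = torch.tanh(torch.matmul(q, k.swapaxes(0, 1)))
        output = torch.matmul(attn, v)
        # attn is an output of forward, so it is safe to save it for backward
        ctx.save_for_backward(attn, q, k, v)
        return output, attn

    @staticmethod
    def backward(ctx, grad_output, grad_attn):
        attn, q, k, v = ctx.saved_tensors
        # attn receives gradient both directly (it is returned) and through output
        grad_attn_total = torch.matmul(grad_output, v.swapaxes(0, 1)) + grad_attn
        # Chain through tanh: d tanh(s)/ds = 1 - tanh(s)^2
        grad_scores = grad_attn_total * (1 - attn * attn)
        grad_q = torch.matmul(grad_scores, k)
        grad_k = torch.matmul(grad_scores.swapaxes(0, 1), q)
        grad_v = torch.matmul(attn.swapaxes(0, 1), grad_output)
        return grad_q, grad_k, grad_v

# gradcheck wants double precision inputs that require grad
q = torch.rand(2, 3, dtype=torch.double, requires_grad=True)
k = torch.rand(2, 3, dtype=torch.double, requires_grad=True)
v = torch.rand(2, 4, dtype=torch.double, requires_grad=True)
print(gradcheck(AttnFn.apply, (q, k, v)))
print(gradgradcheck(AttnFn.apply, (q, k, v)))
```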
## III) Write native composite function and OpInfo
Now that you know that the formula is correct, you can implement a brand new native function in "native_functions.yaml" as a CompositeImplicitAutograd function.
You can then add an OpInfo entry (in common_methods_invocations.py) to be able to test the new operator.
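As a rough, hypothetical illustration of the OpInfo part (the exact keyword arguments and helpers evolve over time, so mirror a recent entry rather than copying this verbatim), an entry could look something like the sketch below, written as if it lived inside common_methods_invocations.py where OpInfo, SampleInput, make_tensor and floating_types are already in scope.
```
# Hypothetical sketch of an OpInfo entry for the new operator; the sample shapes
# follow the constraints above (shared first dim, q and k share their second dim).
def sample_inputs_attn(op_info, device, dtype, requires_grad, **kwargs):
    def make_arg(shape):
        return make_tensor(shape, device=device, dtype=dtype,
                           requires_grad=requires_grad)
    return [SampleInput(make_arg((2, 3)),
                        args=(make_arg((2, 3)), make_arg((2, 4))))]

# Added to the op_db list alongside the existing entries:
OpInfo('attn',
       dtypes=floating_types(),
       supports_out=False,
       sample_inputs_func=sample_inputs_attn),
```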
## IV) Write custom formula in derivatives.yaml
Once testing passes with the composite version, you can replace it with an explicit version (one with its own dispatch entry) and add an entry in "derivatives.yaml" to specify this gradient.
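For orientation only, a multi-output entry in derivatives.yaml has roughly the shape sketched below; the schema, the grads[i]/result1 references and the attn_backward helper are all assumptions here, so check them against existing multi-output entries in the file.
```
- name: attn(Tensor q, Tensor k, Tensor v) -> (Tensor, Tensor)
  # grads[0] / grads[1]: incoming gradients for the two outputs;
  # result1 refers to the second forward output (the attention weights);
  # attn_backward is a hypothetical helper implementing the formula from section I.
  q, k, v: attn_backward(grads[0], grads[1], q, k, v, result1)
```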