Compare commits

...

3 Commits

SHA1        Message                                        Date
6909770bcc  Add auto-translation with GPT-4                2023-04-24 18:31:52 +01:00
41103c6405  Fix model.fit() and the dataset section too    2023-04-24 16:58:05 +01:00
c5b5d4b8c7  Fix TF example in quicktour                    2023-04-24 13:52:05 +01:00
3 changed files with 1291 additions and 3 deletions


@@ -528,7 +528,7 @@ All models are a standard [`tf.keras.Model`](https://www.tensorflow.org/api_docs
```py
>>> dataset = dataset.map(tokenize_dataset) # doctest: +SKIP
>>> tf_dataset = model.prepare_tf_dataset(
-... dataset, batch_size=16, shuffle=True, tokenizer=tokenizer
+... dataset["train"], batch_size=16, shuffle=True, tokenizer=tokenizer
... ) # doctest: +SKIP
```
@@ -538,7 +538,7 @@ All models are a standard [`tf.keras.Model`](https://www.tensorflow.org/api_docs
>>> from tensorflow.keras.optimizers import Adam
>>> model.compile(optimizer=Adam(3e-5))
->>> model.fit(dataset) # doctest: +SKIP
+>>> model.fit(tf_dataset) # doctest: +SKIP
```
## What's next?
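
Taken together, the two hunks above make the example index the `"train"` split (since `prepare_tf_dataset` expects a single `Dataset`, not a `DatasetDict`) and fit on the `tf.data.Dataset` that `prepare_tf_dataset` returns rather than on the raw dataset. A minimal end-to-end sketch of the corrected flow; the dataset and checkpoint names here are illustrative placeholders, not taken from the patch itself:

```py
>>> import tensorflow as tf
>>> from datasets import load_dataset
>>> from transformers import AutoTokenizer, TFAutoModelForSequenceClassification

>>> # Illustrative setup: a DatasetDict with a "train" split and a matching
>>> # tokenizer/model pair (names here are placeholder choices).
>>> dataset = load_dataset("rotten_tomatoes")
>>> tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
>>> model = TFAutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased")

>>> def tokenize_dataset(examples):
...     return tokenizer(examples["text"], truncation=True)

>>> dataset = dataset.map(tokenize_dataset, batched=True)

>>> # Index the "train" split: prepare_tf_dataset takes a Dataset, not a DatasetDict.
>>> tf_dataset = model.prepare_tf_dataset(
...     dataset["train"], batch_size=16, shuffle=True, tokenizer=tokenizer
... )

>>> # Fit on the prepared tf.data.Dataset, not the raw datasets object.
>>> model.compile(optimizer=tf.keras.optimizers.Adam(3e-5))
>>> model.fit(tf_dataset)
```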


@@ -247,7 +247,7 @@ reduces the number of padding tokens compared to padding the entire dataset.
```py
->>> tf_dataset = model.prepare_tf_dataset(dataset, batch_size=16, shuffle=True, tokenizer=tokenizer)
+>>> tf_dataset = model.prepare_tf_dataset(dataset["train"], batch_size=16, shuffle=True, tokenizer=tokenizer)
```
Note that in the code sample above, you need to pass the tokenizer to `prepare_tf_dataset` so it can correctly pad batches as they're loaded.
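
The `tokenizer` argument works here because `prepare_tf_dataset` uses it to build a padding collator for each batch; an equivalent, more explicit form passes a collator via `collate_fn` instead. A sketch under that assumption, reusing `model`, `dataset`, and `tokenizer` from the example above:

```py
>>> from transformers import DataCollatorWithPadding

>>> # Pads each batch to the longest sequence in that batch as it is loaded.
>>> collator = DataCollatorWithPadding(tokenizer=tokenizer, return_tensors="np")
>>> tf_dataset = model.prepare_tf_dataset(
...     dataset["train"], batch_size=16, shuffle=True, collate_fn=collator
... )
```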

File diff suppressed because it is too large