Manuel de Prada Corral
1aa7256f01
Refactor MambaCache
to modeling_mamba.py
(#38086)
* Refactor MambaCache to modeling_mamba.py (parity with Zamba)
* ruff
* fix dummies
* update
* update
* remove mamba ref in cache tests
* remove cache_implementation from tests
* update
* ruff
* ruff
* sneaky regression
* model consistency
* fix test_multi_gpu_data_parallel_forward
* fix falcon slow tests
* ruff
* ruff
* add sample false
* try to fix slow tests
* Revert "fix test_multi_gpu_data_parallel_forward"
This reverts commit 66b7162c7c5c5ce8a73ccf48cffc8a96343ebb33.
* fix tests on nvidia t4, remove dataparallel tests from mamba
* ruff
* remove DDP tests from mamba and falcon_mamba
* add explicit error for MambaCache
* mamba2 also needs to init cache in prepare_inputs_for_generation
* ruff
* ruff
* move MambaCache to its own file
* ruff
* unprotected import fix
* another attempt to fix unprotected imports
* Revert "another attempt to fix unprotected imports"
This reverts commit 2338354fcab630de5899321f5daced5fb312c2a2.
* fixing unprotected import, attempt 3
* Update src/transformers/cache_utils.py
* ruff's fault
* fix arthur review
* modular falcon mamba
* found a hack
* fix config docs
* fix docs
* add export info
* merge modular falcon branch
* oopsie
* fix fast path failing
* new approach
* oopsie
* fix types
* Revert new pragma in modular
This reverts commit 80b1cf160ee251536f07c40b8a0857d499e70db6.
* trying another modular workaround
* review & fix ci
* oopsie
* clear prepare_inputs on mamba/mamba2/falcon_mamba
2025-07-21 14:59:36 +02:00
..
2025-07-18 13:41:54 +02:00
2021-02-15 07:55:10 -05:00
2025-07-17 13:21:59 +00:00
2025-07-21 12:38:05 +00:00
2025-03-21 13:08:47 +01:00
2025-07-15 10:40:41 +02:00
2025-05-06 06:47:43 +02:00
2025-06-18 14:38:08 +01:00
2025-06-17 19:37:18 +01:00
2025-07-18 18:00:34 +00:00
2024-05-22 06:40:15 +02:00
2025-06-17 19:37:18 +01:00
2025-06-17 19:37:18 +01:00
2023-03-13 19:11:19 +01:00
2025-07-10 19:07:59 +01:00
2025-06-13 13:44:07 +01:00
2025-07-17 13:21:59 +00:00
2025-07-21 12:38:05 +00:00
2021-02-15 07:55:10 -05:00
2025-07-16 04:20:02 +02:00
2025-07-10 19:07:59 +01:00
2024-08-27 11:58:27 +01:00
2025-06-17 19:37:18 +01:00
2025-06-26 11:56:31 +00:00
2025-03-25 16:00:11 +01:00
2025-05-09 11:45:03 +02:00
2024-04-15 15:08:09 +02:00
2025-04-02 14:39:57 +02:00
2024-01-31 15:58:17 +01:00
2025-03-25 16:00:11 +01:00
2023-02-03 12:57:02 -05:00
2025-07-01 20:19:06 +02:00
2025-07-16 04:20:02 +02:00
2025-06-20 16:10:35 +00:00
2024-08-27 11:58:27 +01:00
2024-04-12 10:01:28 +02:00
2024-05-22 06:40:15 +02:00
2025-07-21 14:59:36 +02:00
2025-07-03 17:04:16 +01:00
2025-07-17 13:21:59 +00:00
2025-07-18 11:45:47 +00:00
2025-03-25 16:00:11 +01:00
2025-07-21 12:38:05 +00:00
2025-06-17 19:37:18 +01:00
2025-07-08 17:06:12 +02:00
2025-05-24 19:15:02 +02:00
2025-03-25 16:00:11 +01:00
2024-09-03 16:53:21 +02:00
2025-03-11 13:47:38 +00:00
2025-07-16 12:45:46 +02:00
2024-06-10 15:16:58 +02:00
2025-06-13 12:02:27 -07:00
2024-05-22 06:40:15 +02:00
2025-07-09 19:36:48 +02:00
2025-07-03 11:17:27 +02:00
2025-07-11 16:26:58 +00:00
2025-07-08 10:20:52 +02:00
2024-07-22 14:14:47 +01:00