[UX] Change kv-cache-memory log level to debug (#25479)

Signed-off-by: Michael Goin <mgoin64@gmail.com>
This commit is contained in:
Michael Goin
2025-09-23 11:01:24 -04:00
committed by GitHub
parent 4c966e440e
commit 2c58742dff

View File

@ -389,7 +389,7 @@ class Worker(WorkerBase):
f"utilize gpu memory. Current kv cache memory in use is "
f"{int(self.available_kv_cache_memory_bytes)} bytes.")
logger.info(msg)
logger.debug(msg)
# Warm up sampler and preallocate memory buffer for logits and other
# sampling related tensors of max possible shape to avoid memory