https://developers.openai.com/api/docs/guides/prompt-caching GPT-5.5 + 不是 24 小时吗? Extended prompt cache retention keeps cached prefixes active for longer, up to a maximum of 24 hours. Extended Prompt Caching works by offloading the key/value tensors to GPU-local storage when memory is full, significantly increasing the storage capacity available for caching.