===== There is prior art around: =====
* Compressed RAM on CPUs (zram, zswap).
* Research and some vendor work on GPU compressed memory and “buddy compression”, where parts of VRAM or the backing store are transparently compressed to increase effective capacity.

These are:
* Very similar in philosophy: “present a larger virtual memory pool via transparent compression.”
* Usually implemented at the driver/hardware/OS level, not as a userland LLM-specific hook.
* Not generally exposed as: “drop this user .so in and your LLM fits now.”
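To make the “larger virtual pool via transparent compression” idea concrete, here is a toy Python sketch. It is not how zram, zswap, or any GPU scheme is actually implemented; the class name <code>CompressedPool</code>, its methods, and the 4096-byte page size are all made up for illustration, and zlib stands in for whatever compressor a real system would use.

```python
import zlib


class CompressedPool:
    """Toy page store (hypothetical): pages are held compressed and
    decompressed on read, so callers only ever see full-size pages."""

    def __init__(self, page_size=4096):
        self.page_size = page_size
        self._pages = {}  # page number -> compressed bytes

    def write(self, page_no, data):
        if len(data) != self.page_size:
            raise ValueError("data must be exactly one page")
        self._pages[page_no] = zlib.compress(data)

    def read(self, page_no):
        # The caller gets the original bytes back; compression stays hidden.
        return zlib.decompress(self._pages[page_no])

    def logical_bytes(self):
        # Size the caller believes it has stored.
        return len(self._pages) * self.page_size

    def physical_bytes(self):
        # Size actually held after compression.
        return sum(len(c) for c in self._pages.values())


pool = CompressedPool()
for n in range(8):
    # Each page repeats a single byte value, so it compresses very well.
    pool.write(n, bytes([n]) * pool.page_size)
```

The gap between <code>logical_bytes()</code> and <code>physical_bytes()</code> is the “extra” capacity. With real data it depends entirely on how compressible the pages are, and incompressible pages can come out slightly larger than the original.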

So: closest conceptual cousin, but:
* Not LoreToken-aware.
* Not specialized for LLM tensors.
* Not something you casually LD_PRELOAD into an existing stack.