
Flash Compact: 33,000 tok/sec Context Compaction
Flash Compact drops 50-70% of an agent's context at 33,000+ tokens/second while keeping every surviving line verbatim. Two modes: objective compaction strips filler with no guidance, query-based compaction weights keep/drop decisions against what the agent needs next.


















![Everything is Model[s]](/_next/image?url=%2Fimages%2Fhands.jpg&w=3840&q=75)

