There's a lot to go through in this update, including adding agent sessions to chat and delegating work to them. However, ...
Learn the right VRAM for coding models, why an RTX 5090 is optional, and how to cut context cost with K-cache quantization.