vLLM v0.20.0 Release
vLLM v0.20.0 focuses on memory and MoE serving efficiency with TurboQuant 2-bit KV cache.
Mistral launched Workflows for durable, fault-tolerant AI process orchestration.
Microsoft's TRELLIS.2 is a 4B image-to-3D model producing up to 1536³ PBR textured assets.
Hermes is gaining traction, outperforming OpenClaw in instruction-following and practical workflows.