Qwen3-Omni: first multimodal model with SoTA text, image, audio, and video perf | Not Hacker News!