NVIDIA Launches Gamma-World, a Multi-Agent Model Supporting 4-Player Collaboration at 24 FPS

iconKuCoinFlash
Share
Share IconShare IconShare IconShare IconShare IconShare IconCopy
AI summary iconSummary

expand icon
NVIDIA announced a project with Gamma-World, a multi-agent model developed in collaboration with Tsinghua University, the University of Toronto, and the Vector Institute. The system enables 4-player collaboration at 24 FPS, utilizing high-dimensional rotary position encoding and sparse hub attention. The team has shared the project page and paper, with code and weights to follow. This announcement aligns with the growing momentum around real-world assets (RWA) in AI-driven environments.
ME AI News: According to monitoring by Beating, researchers from NVIDIA, Tsinghua University, the University of Toronto, and the Vector Institute have introduced Gamma-World, a multi-agent generative world model that breaks the long-standing bottleneck in virtual environment simulation, which was previously limited to single or dual-player interactions. The team has now released the project page and paper, with code and weights planned for open-sourcing soon. The model introduces two key mechanisms—high-dimensional generalization of rotary positional encoding and informational mediator tokens—enabling independent control of multiple players while achieving zero-shot generalization from two-player to four-player collaboration without retraining. The primary challenge in multi-player world models lies in ensuring each player maintains independent control without conflicting actions. The research team designed Simplex Rotary Agent Encoding, extending the classic Rotary Positional Encoding (RoPE) into a high-dimensional angular space. This new encoding grants all players full physical symmetry, eliminating reliance on fixed player IDs and enabling more natural individual referencing and control. To prevent computational costs from quadratically increasing with more players, the solution introduces Sparse Hub Attention, which uses learnable hub tokens to mediate interaction information, reducing the attention computation cost between players to linear scale. In terms of generation speed, the team distilled a high-latency diffusion model teacher into a causal student model, combined with KV Cache, achieving real-time action response output at 24 FPS. Evaluations in multi-player game environments show that the new model significantly outperforms traditional slot-based and dense attention networks in video realism, action controllability, and inter-player consistency. (Source: BlockBeats)
Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of KuCoin. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. KuCoin shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information. Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. For more information, please refer to our Terms of Use and Risk Disclosure.