Projects
ROCKET-1
ROCKET- 1: Master Open-World Interaction with Visual-Temporal Context Prompting
OmniJARVIS
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
JARVIS-1
JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
GROOT
GROOT: Learning to Follow Instructions by Watching Gameplay Videos