LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning

Related