VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model

Chen Feng
Associate Professor at NYU