Open-H-Embodiment

A Large-Scale Dataset for Enabling Foundation Models in Medical Robotics

Open-H Consortium (hover to view full authorship)
Nigel Nelson1,†, Juo-Tung Chen2,†, Jesse Haworth2,†, Xinhao Chen2,†, Lukas Zbinden1,†, Dianye Huang3,†, Alaa Eldin Abdelaal4, Alberto Arezzo5, Ayberk Acar6, Farshid Alambeigi7, Carlo Alberto Ammirati5, Yunke Ao8,9,10, Pablo David Aranda Rodriguez11, Soofiyan Atar12, Mattia Ballo13, Noah Barnes2, Federica Barontini5, Filip Binkiewicz14, Peter Black15,16, Sebastian Bodenstedt17,18, Leonardo Borgioli19, Nikola Budjak11, Benjamin Calmé20, Fabio Carrillo8, Nicola Cavalcanti8, Changwei Chen12, Haoxin Chen21, Sihang Chen22, Qihan Chen23, Zhongyu Chen24,25, Ziyang Chen26, Shing Shin Cheng24, Meiqing Cheng27, Min Cheng28,22, Zih-Yun Sarah Chiu2, Xiangyu Chu24,25, Camilo Correa-Gallego29, Giulio Dagnino5,30, Anton Deguet2, Jacob Delgado2, Jonathan C. DeLong31, Kaizhong Deng32, Alexander Dimitrakakis29, Qingpeng Ding24, Hao Ding2,33, Giovanni Distefano5, Daniel Donoho34, Anqing Duan35, Marco Esposito11, Shane Farritor36, Jad Fayad37, Zahi Fayad29, Mario Ferradosa38, Filippo Filicori39, Chelsea Finn4,40, Philipp Fürnstahl8,41, Jiawei Ge2, Stamatia Giannarou32, Xavier Giralt Ludevid38, Frederic Giraud8, Aditya Amit Godbole42, Ken Goldberg26, Antony Goldenberg2, Diego Granero Marana14, Xiaoqing Guo21, Tamás Haidegger43,44, Evan Hailey36, Pascal Hansen18, Ziyi Hao24, Kush Hari26, Kengo Hayashi5, Jonathon Hawkins14, Shelby Haworth2, Ortrun Hellig17, S. Duke Herrell6, Zhouyang Hong24, Andrew Howe14, Junlei Hu20, Zhaoyang Jacopo Hu32, Ria Jain26, Mohammad Rafiee Javazm7, Howard Ji4, Rui Ji45, Jianmin Ji22, Zhongliang Jiang46,3, Dominic Jones20, Jeffrey Jopling2, Britton Jordan6, Ran Ju46,28, Michael Kam2, Luoyao Kang24, Fausto Kang2, Siddhartha Kapuria7, Peter Kazanzides2, Aimal Khan47, Sonika Kiehler7, Ethan Kilmer2, Ji Woong (Brian) Kim2,4, Przemysław Korzeniowski1,13, Chandra Kuchi40, Nithesh Kumar6, Alan Kuntz6, Federico Lavagno5, Yu Chung Lee15, Hao-Chih Lee29, Hang Li24, Zhen Li45, Xiao Liang12, Xinxin Lin27, Jinsong Lin24, Chang Liu2, Fei Liu31, Pei Liu3, Yun-hui Liu24, Wanli Liuchen23, Eszter Lukács43,44, Sareena Mann26, Miles Mannas15,16, Brett Marinelli29, Sabina Martyniak13, Francesco Marzola5, Lorenzo Mazza17, Xueyan Mei29, Maria Clara Morais42, Luigi Muratore5, Chetan Reddy Narayanaswamy4, Michał Naskręt13, David Navarro-Alarcon23, Cyrus Neary15, Chi Kit Ng24, Christopher Nguan15,16, David Noonan37, Ki Hwan Oh19, Tom Christian Olesch31, Allison M. Okamura4, Justin Opfermann2, Matteo Pescio5, Doan Xuan Viet Pham2, Tito Porras33, Hongliang Ren24, Ariel Rodriguez Jimenez17, Ferdinando Rodriguez y Baena32, Septimiu E. Salcudean15, Asmitha Sathya2, Preethi Satish26, Lalithkumar Seenivasan2, Jiaqi Shao4, Yiqing Shen2,33, Yu Sheng22, Lucy XiaoYang Shi4,40, Zoe Soulé17, Stefanie Speidel17,18, Mingwu Su24, Jianhao Su32, Idris Sunmola2, Kristóf Takács43, Yunxi Tang24,25, Patrick Thornycroft14, Yu Tian24, Jordan Thompson6, Mehmet K. Turkcan48, Mathias Unberath2,33, Pietro Valdastri20, Carlos Vives38, Quan Vuong40, Martin Wagner17, Farong Wang31, Wei Wang27, Lidian Wang22, Chung-Pang Wang12, Guankun Wang24, Junyi Wang46, Erqi Wang24, Ziyi Wang24, Tanner Watts6, Wolfgang Wein11, Yimeng Wu2, Zijian Wu15, Hongjun Wu2, Luohong Wu8, Jie Ying Wu6, Junlin Wu2, Victoria Wu37, Kaixuan Wu24, Mateusz Wójcikowski13, Yunye Xiao11, Nan Xiao31, Wenxuan Xie24, Hao Yang6, Tianqi Yang24,25, Yinuo Yang12, Menglong Ye37, Ryan S. Yeung15, Nural Yilmaz2, Chim Ho Yin24, Michael Yip12, Rayan Younis17, Chenhao Yu33, Sayem Nazmuz Zaman15, Milos Zefran19, Han Zhang2, Yuelin Zhang24, Yidong Zhang24, Yanyong Zhang22, Xuyang Zhang22, Yameng Zhang46,25, Joyce Zhang14, Ning Zhong45, Peng Zhou49, Haoying Zhou2,50, Xiuli Zuo45, Nassir Navab3,‡, Mahdi Azizian1,‡, Sean D. Huver1,‡, Axel Krieger2,33,‡
1NVIDIA, 2Johns Hopkins University, 3Technical University of Munich, 4Stanford University, 5University of Turin, 6Vanderbilt University, 7The University of Texas at Austin, 8Balgrist University Hospital, 9ETH Zurich, 10ETH AI Center, 11ImFusion GmbH, 12University of California San Diego, 13Sano Centre for Computational Medicine, 14CMR Surgical, 15University of British Columbia, 16Vancouver General Hospital, 17CeTI/TU Dresden, 18German Cancer Research Center, 19University of Illinois Chicago, 20University of Leeds, 21Hong Kong Baptist University, 22University of Science and Technology of China, 23The Hong Kong Polytechnic University, 24The Chinese University of Hong Kong, 25Multi-scale Medical Robotics Center, 26University of California Berkeley, 27Sun Yat-Sen University, 28Tuodao Medical Technology Co., Ltd, 29Icahn School of Medicine at Mount Sinai, 30University of Twente, 31University of Tennessee Knoxville, 32Imperial College London, 33Semaphor Surgical, 34Surgical Data Science Collective, 35Mohamed bin Zayed University of Artificial Intelligence, 36Virtual Incision, 37Moon Surgical, 38Rob Surgical, 39Hofstra/Northwell School of Medicine, 40Physical Intelligence, 41University of Zurich, 42Northwell Health, 43Óbuda University, 44Austrian Center for Medical Innovation and Technology, 45Qilu Hospital of Shandong University, 46The University of Hong Kong, 47Vanderbilt University Medical Center, 48Columbia University, 49Great Bay University, 50Worcester Polytechnic Institute
Co-first authors. Co-senior authors.

A snapshot of the Open-H-Embodiment dataset: 780 hours of synchronized video and kinematics across 20 robotic platforms and 50 institutions worldwide.

Abstract

Autonomous medical robots hold promise in improving patient outcomes by reducing provider fatigue and workload, democratizing access to surgical care, and enabling super-human precision. However, progress in autonomous medical robotics has been limited by a fundamental data problem: existing robot demonstration datasets are small, collected on single platforms, and rarely shared openly, restricting not just policy learning but the broader ecosystem of foundation models, simulation tools, and benchmarks that the field needs to advance.

We introduce Open-H-Embodiment, the first large-scale, multi-institution, multi-robot open dataset for medical robot learning, comprising synchronized video and kinematics collected across 50 institutions worldwide and multiple robotic platforms including the CMR Versius, Intuitive Surgical's da Vinci, da Vinci Research Kit (dVRK), Rob Surgical BiTrack, Virtual Incision's MIRA, Moon Surgical Maestro, and a variety of custom systems, spanning surgical manipulation, robotic ultrasound, and endoscopy procedures, for a total of 780 hours of data.

We demonstrate the breadth of research enabled by this dataset through two foundation models. We train GR00T-H, the first open foundation vision-language-action model for medical robotics, which is the only evaluated model to achieve full end-to-end task completion on a structured suturing benchmark (25% of trials vs. 0% for all baselines) and achieves 65% average success across a 29-step ex vivo suturing sequence on skin-on pork belly. We also train Cosmos-H-Surgical-Simulator, the first kinematic action-conditioned world model to enable multi-embodiment surgical simulation from a single checkpoint, spanning nine robotic platforms and supporting in-silico policy evaluation and synthetic data generation for the surgical domain.

Open-H-Embodiment Overview

Open-H-Embodiment overview showing geographic distribution, robotic platforms, representative frames, and data composition

Figure 1: (A) Geographic distribution of the 50 participating institutions across North America, Europe, the Middle East, and Asia. (B) The 20 healthcare robotic platforms represented in the dataset, spanning surgical systems (da Vinci Si, da Vinci Xi, dVRK, dVRK-Si, MIRA, Versius, BiTrack, Maestro, Torin), general-purpose manipulators adapted for clinical use (Franka Panda, UR5e, Kuka Med 14), and emerging platforms. (C) Representative frames from the dataset illustrating the diversity of tasks, viewpoints, and tissue types covered, including robotic surgery, robotic ultrasound, and related healthcare manipulation tasks. (D) The dataset comprises 780 hours of synchronized multimodal demonstrations spanning language annotations, video observations, and kinematic trajectories. This corpus supports two downstream directions: training GR00T-H, a healthcare-focused vision-language-action model targeting surgical autonomy, and training Cosmos-H-Surgical-Simulator, a multi-embodiment, action-conditioned world model for surgical scene synthesis.

Dataset Composition

Composition of the Open-H-Embodiment dataset showing hours by platform, environment, and task family

Figure 2: (a) Dataset hours by robot platform. (b) Distribution of dataset hours by environment type. (c) Distribution of dataset hours across task families. Together, these panels summarize the current distribution of contributed data across embodiments, collection environments, and task families in Open-H-Embodiment.

Video Demonstrations

Experiment Results

BibTeX

@article{openh2026,
  title={Open-H-Embodiment: A Large-Scale Dataset for Enabling Foundation Models in Medical Robotics},
  author={Nelson, Nigel and Chen, Juo-Tung and Haworth, Jesse and Chen, Xinhao and Zbinden, Lukas and Huang, Dianye and Abdelaal, Alaa Eldin and Arezzo, Alberto and Acar, Ayberk and Alambeigi, Farshid and Ammirati, Carlo Alberto and Ao, Yunke and Aranda Rodriguez, Pablo David and Atar, Soofiyan and Ballo, Mattia and Barnes, Noah and Barontini, Federica and Binkiewicz, Filip and Black, Peter and Bodenstedt, Sebastian and Borgioli, Leonardo and Budjak, Nikola and Calm{\'e}, Benjamin and Carrillo, Fabio and Cavalcanti, Nicola and Chen, Changwei and Chen, Haoxin and Chen, Sihang and Chen, Qihan and Chen, Zhongyu and Chen, Ziyang and Cheng, Shing Shin and Cheng, Meiqing and Cheng, Min and Chiu, Zih-Yun Sarah and Chu, Xiangyu and Correa-Gallego, Camilo and Dagnino, Giulio and Deguet, Anton and Delgado, Jacob and DeLong, Jonathan C. and Deng, Kaizhong and Dimitrakakis, Alexander and Ding, Qingpeng and Ding, Hao and Distefano, Giovanni and Donoho, Daniel and Duan, Anqing and Esposito, Marco and Farritor, Shane and Fayad, Jad and Fayad, Zahi and Ferradosa, Mario and Filicori, Filippo and Finn, Chelsea and F{\"u}rnstahl, Philipp and Ge, Jiawei and Giannarou, Stamatia and Giralt Ludevid, Xavier and Giraud, Frederic and Godbole, Aditya Amit and Goldberg, Ken and Goldenberg, Antony and Granero Marana, Diego and Guo, Xiaoqing and Haidegger, Tam{\'a}s and Hailey, Evan and Hansen, Pascal and Hao, Ziyi and Hari, Kush and Hayashi, Kengo and Hawkins, Jonathon and Haworth, Shelby and Hellig, Ortrun and Herrell, S. Duke and Hong, Zhouyang and Howe, Andrew and Hu, Junlei and Hu, Zhaoyang Jacopo and Jain, Ria and Rafiee Javazm, Mohammad and Ji, Howard and Ji, Rui and Ji, Jianmin and Jiang, Zhongliang and Jones, Dominic and Jopling, Jeffrey and Jordan, Britton and Ju, Ran and Kam, Michael and Kang, Luoyao and Kang, Fausto and Kapuria, Siddhartha and Kazanzides, Peter and Khan, Aimal and Kiehler, Sonika and Kilmer, Ethan and Kim, Ji Woong (Brian) and Korzeniowski, Przemys{\l}aw and Kuchi, Chandra and Kumar, Nithesh and Kuntz, Alan and Lavagno, Federico and Lee, Yu Chung and Lee, Hao-Chih and Li, Hang and Li, Zhen and Liang, Xiao and Lin, Xinxin and Lin, Jinsong and Liu, Chang and Liu, Fei and Liu, Pei and Liu, Yun-hui and Liuchen, Wanli and Luk{\'a}cs, Eszter and Mann, Sareena and Mannas, Miles and Marinelli, Brett and Martyniak, Sabina and Marzola, Francesco and Mazza, Lorenzo and Mei, Xueyan and Morais, Maria Clara and Muratore, Luigi and Narayanaswamy, Chetan Reddy and Naskr{\k{e}}t, Micha{\l} and Navarro-Alarcon, David and Neary, Cyrus and Ng, Chi Kit and Nguan, Christopher and Noonan, David and Oh, Ki Hwan and Olesch, Tom Christian and Okamura, Allison M. and Opfermann, Justin and Pescio, Matteo and Pham, Doan Xuan Viet and Porras, Tito and Ren, Hongliang and Rodriguez Jimenez, Ariel and Rodriguez y Baena, Ferdinando and Salcudean, Septimiu E. and Sathya, Asmitha and Satish, Preethi and Seenivasan, Lalithkumar and Shao, Jiaqi and Shen, Yiqing and Sheng, Yu and Shi, Lucy XiaoYang and Soul{\'e}, Zoe and Speidel, Stefanie and Su, Mingwu and Su, Jianhao and Sunmola, Idris and Tak{\'a}cs, Krist{\'o}f and Tang, Yunxi and Thornycroft, Patrick and Tian, Yu and Thompson, Jordan and Turkcan, Mehmet K. and Unberath, Mathias and Valdastri, Pietro and Vives, Carlos and Vuong, Quan and Wagner, Martin and Wang, Farong and Wang, Wei and Wang, Lidian and Wang, Chung-Pang and Wang, Guankun and Wang, Junyi and Wang, Erqi and Wang, Ziyi and Watts, Tanner and Wein, Wolfgang and Wu, Yimeng and Wu, Zijian and Wu, Hongjun and Wu, Luohong and Wu, Jie Ying and Wu, Junlin and Wu, Victoria and Wu, Kaixuan and W{\'o}jcikowski, Mateusz and Xiao, Yunye and Xiao, Nan and Xie, Wenxuan and Yang, Hao and Yang, Tianqi and Yang, Yinuo and Ye, Menglong and Yeung, Ryan S. and Yilmaz, Nural and Yin, Chim Ho and Yip, Michael and Younis, Rayan and Yu, Chenhao and Zaman, Sayem Nazmuz and Zefran, Milos and Zhang, Han and Zhang, Yuelin and Zhang, Yidong and Zhang, Yanyong and Zhang, Xuyang and Zhang, Yameng and Zhang, Joyce and Zhong, Ning and Zhou, Peng and Zhou, Haoying and Zuo, Xiuli and Navab, Nassir and Azizian, Mahdi and Huver, Sean D. and Krieger, Axel},
  year={2026},
  url={https://open-h.github.io}
}