Learning the Data Acquisition
The prevailing approach to AI in healthcare primarily relies on retrospectively collected datasets—such as medical images and electronic health records (EHRs)—to train machine learning models that mimic clinical inference. While this method seems reasonable, it is fundamentally suboptimal, as evidenced by the limited adoption of AI in real-world clinical practice.
One major limitation stems from the nature of healthcare datasets: they are designed for human interpretability, meaning they are structured to ensure clinicians can detect abnormalities. However, machine learning models are not bound by this constraint—why limit them to the same data types that humans rely on? Moreover, human perception itself is inherently limited. What hidden patterns or signals exist within these datasets that we are currently overlooking?
With these challenges in mind, my research lab focuses on two key themes:
1. Can we determine the most informative datasets—whether human-interpretable or not—that provide the richest signals for AI-driven insights?
2. Can we detect hidden signals within existing datasets to reveal previously unnoticed patterns and observations?
Using AI to enable MR-based diagnostics (a highly accurate but expensive and inaccessible technology) for early detection of disease at the population level, thereby democratizing access to this advanced diagnostic modality. We accomplish this by learning disease signatures directly in the raw frequency space (a.k.a. k-space), without the need to reconstruct high-fidelity images.
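To make the idea concrete, here is a minimal PyTorch sketch of a classifier that operates on raw complex-valued k-space instead of reconstructed images. The architecture, tensor shapes, and the `KSpaceClassifier` name are illustrative assumptions, not our actual model.

```python
# Minimal sketch: predicting disease directly from raw k-space, skipping
# image reconstruction entirely. Architecture and shapes are illustrative.
import torch
import torch.nn as nn

class KSpaceClassifier(nn.Module):
    def __init__(self, num_classes: int = 2):
        super().__init__()
        # Complex k-space is split into real/imaginary channels (2 channels).
        self.features = nn.Sequential(
            nn.Conv2d(2, 32, kernel_size=5, stride=2, padding=2),
            nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=5, stride=2, padding=2),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.head = nn.Linear(64, num_classes)

    def forward(self, kspace: torch.Tensor) -> torch.Tensor:
        # kspace: (batch, height, width) complex tensor of raw measurements.
        x = torch.stack([kspace.real, kspace.imag], dim=1)  # (B, 2, H, W)
        return self.head(self.features(x).flatten(1))

# Usage: classify straight from (possibly undersampled) measurements.
model = KSpaceClassifier()
logits = model(torch.randn(4, 320, 320, dtype=torch.complex64))
```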
Envision MRI scanners equipped with “patient-specific memory,” capable of recalling and leveraging multiple sources of prior data (e.g., prior imaging, EHR) from the same individual—rather than relying solely on the current scan. Freed from the requirement to collect measurements near the Nyquist rate, these scanners can dramatically reduce scan times without sacrificing image quality, even on lower-cost machines, making advanced imaging far more accessible.
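A toy sketch of how such conditioning could look: a reconstruction network that takes the undersampled current acquisition together with a registered prior image of the same patient. The three-channel layout and the tiny two-layer network are assumptions for illustration only.

```python
# Minimal sketch of "patient-specific memory": reconstruct the current scan
# while conditioning on a prior image of the same patient. Illustrative only.
import torch
import torch.nn as nn

class PriorConditionedRecon(nn.Module):
    def __init__(self):
        super().__init__()
        # 3 input channels: zero-filled recon (real, imag) + registered prior.
        self.net = nn.Sequential(
            nn.Conv2d(3, 64, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv2d(64, 1, kernel_size=3, padding=1),
        )

    def forward(self, undersampled_kspace, prior_image):
        # Zero-filled inverse FFT of the heavily undersampled current scan.
        zero_filled = torch.fft.ifft2(undersampled_kspace)
        x = torch.stack(
            [zero_filled.real, zero_filled.imag, prior_image], dim=1
        )
        return self.net(x).squeeze(1)  # reconstructed image

# Usage: the prior image supplies patient-specific context the undersampled
# measurements alone cannot.
recon = PriorConditionedRecon()
kspace = torch.randn(2, 128, 128, dtype=torch.complex64)  # undersampled scan
prior = torch.randn(2, 128, 128)  # prior image, registered to current scan
image = recon(kspace, prior)
```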
Traditional approaches to multi-modal learning are sub-optimal because they predominantly capture either the inter-modality dependencies or the intra-modality dependencies in isolation. Viewing this problem through the lens of generative models, we treat the target as a source of the multiple modalities and of the interactions between them, and propose the I2M2 framework, which naturally captures both inter- and intra-modality dependencies, leading to more accurate predictions.
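The sketch below conveys the spirit of I2M2 under simplifying assumptions: one predictor per modality captures intra-modality dependencies, a joint predictor over all modalities captures inter-modality dependencies, and their outputs are fused. The summed-logits fusion rule here is an illustrative stand-in, not the exact formulation in our work.

```python
# Minimal sketch in the spirit of I2M2: per-modality (intra) predictors plus
# a joint (inter) predictor, fused at the logit level. Illustrative only.
import torch
import torch.nn as nn

class I2M2Sketch(nn.Module):
    def __init__(self, dims: list[int], num_classes: int):
        super().__init__()
        # Intra-modality: one classifier per modality.
        self.unimodal = nn.ModuleList(nn.Linear(d, num_classes) for d in dims)
        # Inter-modality: one classifier over the concatenated modalities.
        self.multimodal = nn.Linear(sum(dims), num_classes)

    def forward(self, modalities: list[torch.Tensor]) -> torch.Tensor:
        intra = sum(head(x) for head, x in zip(self.unimodal, modalities))
        inter = self.multimodal(torch.cat(modalities, dim=-1))
        return intra + inter  # fused logits

# Usage with two modalities (e.g., image features and EHR features).
model = I2M2Sketch(dims=[128, 32], num_classes=2)
logits = model([torch.randn(4, 128), torch.randn(4, 32)])
```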
An MR scanner captures a vast array of high-quality k-space measurements to generate detailed cross-sectional images. However, this process is inherently slow and expensive, as the amount of data collected remains constant regardless of patient characteristics or the suspected disease. We propose a method that learns an adaptive policy to selectively acquire k-space measurements, optimizing for disease detection without the need for image reconstruction.
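A minimal sketch of one way such an adaptive policy could operate, assuming greedy line-by-line selection (an assumption for illustration, not our exact method): a small network scores the unacquired k-space lines given the current sampling mask and picks the most informative one next.

```python
# Minimal sketch of adaptive k-space acquisition: score unacquired lines from
# the current sampling mask, acquire the best one, repeat until the budget is
# spent. Greedy selection and the architecture are illustrative assumptions.
import torch
import torch.nn as nn

class AcquisitionPolicy(nn.Module):
    def __init__(self, num_lines: int):
        super().__init__()
        # Scores every phase-encoding line from the current sampling mask.
        self.scorer = nn.Sequential(
            nn.Linear(num_lines, 256),
            nn.ReLU(),
            nn.Linear(256, num_lines),
        )

    def next_line(self, mask: torch.Tensor) -> torch.Tensor:
        # mask: (B, num_lines) binary vector of lines acquired so far.
        scores = self.scorer(mask)
        scores = scores.masked_fill(mask.bool(), float("-inf"))  # skip acquired
        return scores.argmax(dim=-1)

# Greedy acquisition loop: spend a fixed budget of lines, then hand the
# partial measurements to a downstream disease classifier (not shown).
num_lines, budget = 128, 16
policy = AcquisitionPolicy(num_lines)
mask = torch.zeros(1, num_lines)
for _ in range(budget):
    line = policy.next_line(mask)
    mask[0, line] = 1.0  # "measure" the chosen k-space line
```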
AI in radiology is revolutionizing more than just medical image analysis—it’s transforming the entire radiological ecosystem, from workflow optimization to resident training. We are pioneering the world’s first AI-driven platform designed to deliver a truly personalized educational experience for radiology residents. Powered by advanced Large Language Models (LLMs), our adaptive system tailors learning to each resident’s unique journey, analyzing their past case exposure, strengths, and areas for improvement. By leveraging these AI-driven insights, we are redefining how radiologists learn, grow, and excel in their field.
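As a toy illustration of the adaptive idea (the scoring rule and category granularity are assumptions, not the platform's actual logic): surface case categories that a resident has seen rarely and answered poorly before well-covered ones.

```python
# Minimal sketch: rank teaching-case categories so that under-exposed,
# low-accuracy areas come first. Scoring weights are illustrative.
from collections import Counter

def rank_case_categories(exposure: Counter, accuracy: dict) -> list[str]:
    # Lower priority score = shown sooner: few prior cases, low accuracy.
    def priority(cat: str) -> float:
        return exposure[cat] + 10 * accuracy.get(cat, 0.0)
    return sorted(accuracy, key=priority)

exposure = Counter({"chest": 40, "neuro": 5, "msk": 12})
accuracy = {"chest": 0.9, "neuro": 0.6, "msk": 0.7}
print(rank_case_categories(exposure, accuracy))  # neuro surfaces first
```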