Seema Singh Saharan, PhD
Postdoctoral Scholar
Radiology
School of Medicine
Seema Saharan is a postdoctoral scholar in the Department of Radiology and Biomedical Imaging at UCSF, specializing in radiology image analysis, large language models (LLMs), and multimodal data integration. Her research combines deep learning, causal inference, and agentic AI frameworks to advance diagnostic accuracy, outcome prediction, and equitable implementation of precision medicine.
Show full bio (210 words) Hide full bio
Her expertise spans data science, biostatistics, and AI-driven methodologies, with a focus on integrating radiology imaging, biomedical signals, and high-dimensional biomolecular data. She develops AI-powered diagnostic tools that leverage computer vision, neural networks, and multimodal fusion techniques for early disease detection, risk stratification, and personalized treatment strategies.
Seema also contributes to NIH-funded projects addressing Alzheimer’s disease, chronic pain, and disparities in molecular diagnostics access. Her work involves scalable pipelines for large claims datasets, NLP-driven extraction of unstructured EHR data, and transformer-based approaches (BERT, BioBERT, ClinicalBERT) for analyzing ctDNA testing pathways.
She holds a Ph.D. in Statistics with a Data Science Algorithmic focus, where she optimized statistical models of cytokine cascades transported by HDL/Plasma to inform cardiovascular and Alzheimer’s disease research. In collaboration with UCSF’s Cardiovascular Research Institute, she has developed LLM-enabled diagnostic frameworks that integrate cytokine biomarkers, clinical data, and biomedical literature, incorporating retrieval-augmented generation (RAG), SHAP-based explainability, and causal inference approaches.
Beyond research, Seema is passionate about building standardized AI ecosystems for healthcare that are interpretable, secure, and clinically impactful. She is also an experienced educator, serving as a lecturer at UC Berkeley Extension and California State University, East Bay, where she teaches courses in data science, AI, and biostatistics. .
Education & Training
Show all (1) Hide
- Ph.D. Statistics Focused on Data Science algorithms . Ph.D. Thesis Research with Dr John Kane ,CVRI, UCSF University of Rajasthan 5/2023
Publications (7)
-
Machine Learning-Based Model for Predicting Coronary Heart Disease Using Preβ HDL and Cytokines as Plasma Biomarkers.
Proceedings. International Conference on Computational Science and Computational Intelligence 2025 Saharan SS, Creasy KT, Birnbaum L, Stock EO, Mustra Rakic J, Tian X, Prakash A, Malloy M, Kane J -
Logistic Regression and Statistical Regularization Techniques for Risk Classification of Coronary Artery Disease using Cytokines transported by high density lipoproteins.
Proceedings. International Conference on Computational Science and Computational Intelligence 2024 Saharan SS, Nagar P, Creasy KT, Stock EO, Feng J, Malloy MJ, Kane JP -
Optimization of Smoking Classification by Applying Neural Network with Variable Importance Using Cytokine Biomarkers.
Proceedings. International Conference on Computational Science and Computational Intelligence 2024 Saharan SS, Nagar P, Creasy KT, Stock EO, Feng J, Malloy MJ, Kane JP -
Smoking Classification Using Novel Plasma Cytokines by implementing Machine Learning and Statistical Methods.
Proceedings. International Conference on Computational Science and Computational Intelligence 2024 Saharan SS, Nagar P, Creasy KT, Stock EO, Feng J, Malloy MJ, Kane JP -
Application of Machine Learning Ensemble Super Learner for analysis of the cytokines transported by high density lipoproteins (HDL) of smokers and nonsmokers.
Proceedings. International Conference on Computational Science and Computational Intelligence 2022 Saharan SS, Nagar P, Creasy KT, Stock EO, Feng J, Malloy MJ, Kane JP -
Implementation of PCA enabled Support Vector Machine using cytokines to differentiate smokers versus nonsmokers.
Proceedings. International Conference on Computational Science and Computational Intelligence 2022 Saharan SS, Nagar P, Creasy KT, Stock EO, Feng J, Malloy MJ, Kane JP -
Machine learning and statistical approaches for classification of risk of coronary artery disease using plasma cytokines.
BioData mining 2021 Saharan SS, Nagar P, Creasy KT, Stock EO, Feng J, Malloy MJ, Kane JP