Jeremy Rubin

PCA Structured laSSO (PCASSO)

PCA Structured lasSO (PCASSO)
Click to View


Flash Talk Presenter
Photo of Jeremy Rubin
Jeremy Rubin, Biostatistics


J Rubin1, J Zee1

  1. University of Pennsylvania Department of Biostatistics, Epidemiology, and Informatics


Nephrotic syndrome (NS) characterizes a group of rare diseases that can cause chronic kidney disease and kidney failure. Whole slide images (WSIs) of kidney biopsies from NS patients were collected through the Nephrotic Syndrome Study Network (NEPTUNE). These images offer greater insight into disease prognosis through pathomic feature extraction for biomarker discovery. Pathomic features are computer-generated quantitative measurements that are calculated from segmented histological objects which quantify the objects’ heterogeneity. For each subject, we can construct a matrix whose entries are a common set of pathomic features that are measured per segmented histological object from that subject’s WSI. We propose the principal component analysis (PCA) Structured laSSO (PCASSO), a novel scalar-on-matrix regression technique that allows for varying numbers of segmented histological objects across subjects, to predict scalar clinical outcomes from the pathomic feature matrices. Specifically, we consider the setting in which there is a large number of segmented histological objects per subject relative to the number of pathomic features. Simulation study results indicate that PCASSO best identifies the pathomic features which truly affect clinical outcomes relative to naive regression modelling strategies. The application of PCASSO for pathomic feature-based prediction of clinical outcomes contributes to the ultimate goal of personalized care for NS patients based on individual characteristics.


Kidney disease, high-dimensional data analysis, image analysis, computational pathology, scalar-on-matrix regression

About Us

To understand health and disease today, we need new thinking and novel science —the kind  we create when multiple disciplines work together from the ground up. That is why this department has put forward a bold vision in population-health science: a single academic home for biostatistics, epidemiology and informatics. 

© 2023 Trustees of the University of Pennsylvania. All rights reserved.. | Disclaimer

Follow Us