Designed especially for neurobiologists, FluoRender is an interactive tool for multi-channel fluorescence microscopy data visualization and analysis.

BrainStimulator is a set of networks that are used in SCIRun to perform simulations of brain stimulation such as transcranial direct current stimulation (tDCS) and magnetic transcranial stimulation (TMS).

Developing software tools for science has always been a central vision of the SCI Institute.

Image Analysis

SCI's imaging work addresses fundamental questions in 2D and 3D image processing, including filtering, segmentation, surface reconstruction, and shape analysis. In low-level image processing, this effort has produce new nonparametric methods for modeling image statistics, which have resulted in better algorithms for denoising and reconstruction. Work with particle systems has led to new methods for visualizing and analyzing 3D surfaces. Our work in image processing also includes applications of advanced computing to 3D images, which has resulted in new parallel algorithms and real-time implementations on graphics processing units (GPUs). Application areas include medical image analysis, biological image processing, defense, environmental monitoring, and oil and gas.

Ross Whitaker

Segmentation

Sarang Joshi

Shape Statistics
Segmentation
Brain Atlasing

Tolga Tasdizen

Image Processing
Machine Learning

Chris Johnson

Diffusion Tensor Analysis

Shireen Elhabian

Image Analysis
Computer Vision

Funded Research Projects:

Neighborhood Looking Glass: 360 Degree Automated Characterization of the Built Environment for Neighborhood Effects Research

Tolga Tasdizen
This proposal represents a vertical advancement in neighborhood effects research, producing for the first time, national neighborhood indicators of the built environment. Thus far, only local studies have been conducted due to the resource-intensive nature of site visits to conduct assessments of community features and also manual annotations of street images. With the recent advancement of computer vision and the emergence of massive sources of image data, we will leverage our team’s abilities to develop a data collection strategy utilizing geographic information systems to assemble a national collection of Google Street View images of all road intersections and street segments in the United States. We will utilize this data bank, and develop informatics algorithms to produce neighborhood summaries of built environment that have been theoretically and empirically identified to be important for health outcomes. After the creation of Neighborhood Looking Glass, we will conduct investigations into the impact of neighborhood environments on health utilizing medical records from hundreds of thousands of patients and accounting for predisposing characteristics in analyses. Our investigative team (comprised of experts in the field of epidemiology, computer vision, bioinformatics, and computer science) is uniquely suited to implement the study aims.
Our Specific Aims are: 1) Develop informatics techniques to produce neighborhood quality indicators; 2) Measure the accuracy of data algorithms and construct an interactive geoportal for neighborhood data visualization and data sharing, 3) Utilize Neighborhood Looking Glass and a large collection of medical records from Intermountain Healthcare to investigate neighborhood influences on the risk of obesity and substance abuse. The epidemic rise in chronic health conditions is recent and as such suggests its cause is social, cultural, and constructed rather than purely biological. Thus, we have the possibility of intervening on the environment to better support health. Recent studies suggest that the current cohort of young adults may face historically high cardiovascular disease risk and chronic disease burden. Our substantive investigation of the impact of neighborhood factors on chronic conditions will contribute further to the understanding of contextual influences on the health of this cohort at the forefront of a chronic disease epidemic. Moreover, the dramatic rise in overdoses, accidental poisonings, and mental health issues contributing to premature mortality warrants further investigation into risk-inducing environmental factors for substance abuse. Neighborhood Looking Glass will be a significant benefit to neighborhood effects researchers, harnessing the largely untapped potential of street image data to capture built environment characteristics. Results can be utilized to inform population-based strategies to reduce health disparities and improve health.

Public Health Relevance

The epidemic rise in obesity, related chronic diseases, and substance abuse in recent decades signal the importance of structural forces and social processes, but the dearth of data on contextual factors limits the investigation of multilevel effects on health. The development of the Neighborhood Looking Glass will be a significant benefit to neighborhood effects researchers, harnessing the largely untapped potential of street image data to capture built environment characteristics with potential impact on health. Results from our project can be utilized to inform system-wide and local strategies to improve community health.

Shapeworksstudio: An Integrative, User-Friendly, and Scalable Suite for Shape Representation and Analysis

Shireen Youssef Elhabian
The morphology (or shape) of anatomical structures forms the common language among clinicians, where abnormalities in anatomical shapes are often tied to deleterious function. While these observations are often qualitative, finding subtle, quantitative shape effects requires the application of mathematics, statistics, and computing to parse the anatomy into a numerical representation that will facilitate testing of biologically relevant hypotheses. Particle-based shape modeling (PSM) and its associated suite of software tools, ShapeWorks, enable learning population-level shape representation via automatic dense placement of homologous landmarks on image segmentations of general anatomy with arbitrary topology. The utility of ShapeWorks has been demonstrated in a range of biomedical applications. Despite its obvious utility for the research enterprise and highly permissive open-source license, ShapeWorks does not have a viable commercialization path due to the inherent trade-off between development and maintenance costs, and a specialized scientific and clinical market. ShapeWorks has the potential to transform the way researchers approach studies of anatomical forms, but its widespread applicability to medicine and biology is hindered by several barriers that most existing shape modeling packages face. The most important roadblocks are (1) the complexity and steep learning curve of existing shape modeling pipelines and their increased computational and computer memory requirements; (2) the considerable expertise, time, and effort required to segment anatomies of interest for statistical analyses; and (3) the lack of interoperable implementations that can be readily incorporated into biomedical research laboratories. In this project, we propose ShapeWorksStudio, a software suite that leverages ShapeWorks for the automated population-/patient-level modeling of anatomical shapes, and Seg3D--a widely used open-source tool to visualize and process volumetric images--for flexible manual/semiautomatic segmentation and interactive manual correction of segmented anatomy.

In Aim 1, we will integrate ShapeWorks and Seg3D in a framework that supports big data cohorts to enable users to transparently proceed from image data to shape models in a straightforward manner.

In Aim 2, we will endow Seg3D with a machine learning approach that provides automated segmentations within a statistical framework that combines image data with population-specific shape priors provided by ShapeWorks.

In Aim 3, we will support interoperability with existing open-source software packages and toolkits, and provide bindings to commonly used programming languages in the biomedical research community. To promote reproducibility, we will develop and disseminate standard workflows and domain-specific test cases. This project combines an interdisciplinary research and development team with decades of experience in statistical analysis and image understanding, and application scientists to confirm that the proposed developments have a real impact on the biomedical and clinical research communities. Our long-term goal is to make ShapeWorks a standard tool for shape analyses in medicine, and the work proposed herein will establish the groundwork for achieving this goal.

Public Health Relevance
ShapeWorks is a free, open-source software tool that uses a flexible method for automated construction of statistical landmark-based shape models of ensembles of anatomical shapes. ShapeWorks has been effective in a range of applications, including psychology, biological phenotyping, cardiology, and orthopedics. If funded, this application will ensure the viability of ShapeWorks in the face of the ever-increasing complexity of shape datasets and support its availability to biomedical researchers in the future, as well as provide opportunities for use in a wide spectrum of new biological and clinical applications, including anatomy reconstruction from sparse/low-dimensional imaging data, large-scale clinical trials, surgical planning, optimal designs of medical implants, and reconstructive surgery.

Data-Driven Shape Analysis for Quantitative Severity Stratification in Patients with Metopic Craniosynostosis

Ross Whitaker
Craniosynostosis affects close to one in 2000 newborns and causes growth restriction perpendicular to the affected suture. Metopic craniosynostosis is the second most common form of craniosynostosis. The metopic suture is an important sight of cranial growth as the brain rapidly expands in the first year of life. Patients affected by metopic craniosynostosis will present in the first few months of life with varying degrees of narrowing of the forehead and brow, a triangular shaped head, and an abnormal eye position. Surgery is recommended early in childhood to normalize the head shape and expand the restricted skull to prevent complications such as headaches, cognitive impairment, and visual disturbances including blindness. Imaging with computed tomography (CT) is employed to confirm new diagnoses of metopic craniosynostosis and, together with the physical exam, is used in a descriptive and qualitative manner to assess the degree of head shape abnormality. Several methods have been employed to interpret the information provided in the CT scans to allow surgeons to utilize data for surgical decision making. However, these indices reduce the complex three-dimensional skull dysmorphology into isolated measurements of angles or proportions, require detailed calculations to perform, and no universally accepted standard has emerged so far despite significant research efforts and clinical motivation. In this grant proposal, we aim to increase our understanding of the cranial dysmorphology in patients with metopic craniosynostosis by employing latest results from statistical shape modeling and deep learning. Specifically, we will build a statistical shape model of pediatric skulls from CT images of patients with metopic craniosynostosis as well as a group of normal controls capturing normal phenotypical shape variations. The distance of a new shape from the normative shape space will represent the proposed Shape Normality Metric (SNM). The SNM will be validated against ratings from experts in the surgical community (current standard of care) who will be asked to assess the dysmorphology of the skulls in our database. To avoid surgeons' subjective bias, we will aggregate their response using statistical methods that compensate for potential individual bias. Finally, to streamline data collection for future research we will develop a head-shape portal that will allow users to upload CT scans of their patients and the system will automatically calculate the SNM. By developing a severity metric that encompasses the entire extent of dysmorphology in metopic craniosynostosis and establishing a head-shape portal, we will improve our understanding of the spectrum of metopic craniosynostosis, aid in pre-operative and surgical decision making, enable future research, and help facilitate longitudinal outcomes assessments and multi-center communication and collaboration.

Public Health Relevance
This grant proposal aims to improve our understanding of the head shape anomaly associated with metopic craniosynostosis by using recent results from statistical shape analysis and deep learning, with the goal of developing an objective metopic cranioynostosis severity scale. Different from previously proposed metrics, our approach evaluates the entire shape as a whole. With this information, surgeons will be able to objectively determine how severely affected their patients are and will be better able to tailor their interventions to the needs of their individual patients. Additionally, surgeons will be able to better communicate with each other and study the effects of surgical intervention on their patients which will improve patient care in the long run.

A Scalable Non-Intrusive Image Annotation Method Using Eye Tracking for Training Deep Learning Models in Radiology

Tolga Tasdizen
Machine learning (ML) and artificial intelligence have recently emerged as powerful techniques that can augment radiology interpretations and show promise for improving patient outcomes. One of the ways for ML to make a significant impact on health care is in improving the evaluation of high-volume, low-cost exams for early signs of a wide variety of diseases. The routine chest x-ray is an opportunity for screening for diseases, including cancer, chronic obstructive pulmonary disease (COPD), pneumonia and congestive heart failure. For instance, lung cancer is the most common cause of cancer death in the US, and is typically diagnosed at a higher stage than most other cancers leading to low survival rates. The National Lung Screening Trial reported that low dose computed tomography (LDCT) screening resulted in a 20% reduction in lung cancer mortality; however, few eligible people actually undergo LDCT screening. Meanwhile, chest x-rays continue to be the most common form of imaging worldwide. Improved detection from x-rays can direct patients to LDCT. COPD is another important disease that is often under-diagnosed. People with COPD are at increased risk of lung cancer and respiratory infections, or exacerbations, which are associated with higher morbidity and mortality. Furthermore, a chest x-ray may show poorly-defined regions of consolidation that are concerning for pneumonia. Medical attention is required to treat an infection or evaluate for other cause. More generally, methods to detect disease on chest x-rays can be extended to cardiomegaly, pulmonary edema and pleural effusions which are seen in congestive heart failure. Improved detection can direct patients to medical care. Convolutional neural networks (CNN), a highly successful ML model, can be applied to chest x-ray images. However, few annotated medical datasets exist that are sufficiently large to train CNNs. Furthermore, it has been shown that bounding boxes used to localize disease can be incorporated into the training of CNNs and significantly increase their accuracy. Unfortunately, medical datasets with such localized annotations are even rarer and are very limited in the number of cases due to the time-consuming process of creating bounding boxes by radiologists. We propose an innovative integrated approach using eye tracking, speech recording and novel vision and language models to create localized annotations in a manner that is non-intrusive to the workflow of the radiologist. The novelty of our approach is in the use of eye tracking during routine radiological reading. The challenge is to overcome the relatively ambiguous nature of eye tracking information compared to bounding boxes which provide definitive information about abnormalities. To address this challenge, we will also design new CNN architectures and learning algorithms that can use eye tracking and additional information such as pupil dilation and fixation duration. The proposed methodology can easily scale up to create very large datasets without generating additional workload for radiologists. Furthermore, deployed in the reading room, it could provide a continuous stream of annotated images to expand training sets.

Public Health Relevance
Machine learning and artificial intelligence hold the potential to improve patient outcomes by enhancing the evaluation of low cost, routine chest x-rays for early imaging abnormalities associated with diseases such as lung cancer, chronic obstructive pulmonary disease and pneumonia and making recommendations for follow-up care. To realize their full potential, machine learning algorithms require training on very large datasets with localized annotations for abnormalities. We propose a novel data collection methodology using eye tracking that is scalable, non-intrusive to the workflow of the radiologist, and can generate very large datasets with localized annotations for the development of novel machine learning algorithms to address significant medical problems.

The Space of Riemannian Metrics for the Statistical Analysis of the Human Connectome

Sarang Joshi
The human brain is one of the most complex biological geometrical objects. The Human Connectome Project aims to make available an unparalleled compilation of neural functional and structural imaging data from healthy adults. Data from over 900 subjects has already been released. The principal data provided by the Human Connectome Project are diffusion-weighted MRI and functional MRI. Due to the amount and complexity of the data generated in this project, new techniques to analyze, compare and represent these data are needed, which is the motivation and driving force for the research outlined in this proposal. This collaborative project has three fundamental goals: (1) to further develop the mathematical theory of geometrical statistics, in particular the role of the infinite-dimensional manifold of all Riemannian metrics; (2) to develop practical tools for the statistical study of the connectivity of the human brain; and (3) to demonstrate the utility of the developed techniques for the segmentation and parcellation of the thalamus and other subareas of the subcortical gray matter that are not visible in structural MRI.

This project will develop for the first time statistical techniques on the infinite-dimensional manifold of Riemannian metrics. The project team believes that the space of Riemannian metrics is the natural framework for analyzing the variability of the architecture of the human brain. Diffusion-weighted MRI allows the investigators to model an individual human brain as a Riemannian manifold with axonal connections that are geodesic curves of an appropriate metric. The team will study the space of all Riemannian metrics and develop methods based on geometrical statistics for the analysis of the whole population. An immediate practical application of the techniques developed will be the parcellation of the thalamus based on thalamocortical connectivity. The internal architecture of the thalamus is not visible in standard structural MRI but rather is defined via the connections to the different areas of the cortex. In this project, the investigators will partition the thalamus by projecting the functional partition of the cortex onto the thalamus via the connectomics. The aim is to use geometric statistical mapping methods to produce a statistically informed partition of an individual patient's thalamus. The primary driving motivation is to eventually improve outcomes of deep brain stimulation as a therapy for essential tremor, in which the thalamus is the primary target. The subcortical white matter is also implicated in many neurological disorders, such as ischemic vascular disease, Huntington's, Multiple Sclerosis, and HIV/AIDS dementia. The PIs envision that the statistical techniques developed for qualifying the detailed architecture of the white matter in the normal population will have implications for all these diseases. This project will provide novel analytical tools to unravel the mysteries of the human brain.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Validation and Translation of a Non-Invasive, Mr-Guided Breast Cancer Therapy

Sarang Joshi
The treatment of early stage, localized breast cancer has evolved over several decades from highly invasive techniques to minimally invasive, breast-conserving therapies. Our breast-specific magnetic resonance-guided focused ultrasound system (the Muse MRgFUS System) non-invasively delivers focused energy deep inside the body under high-resolution image guidance. The technical advances we have made poise MRgFUS as a nearly ready non-invasive ablation treatment for localized breast disease. The final steps for its clinical translation include studies to compare, optimize and carefully validate treatment planning, real-time monitoring and assessment techniques against the best available standard metrics, which include invasive temperature probes, hydrophone measurements and histopathology. This proposal pairs experienced academic investigators at the University of Utah with industry investigators at Image Guided Therapy. Combining the academic partner's scientific expertise and technological knowledge with the system integration and regulatory expertise of the industry partner will facilitate translation of this exciting technology. This proposal will develop and validate the remaining elements necessary to translate the Muse System into clinical care including: 1) accurate MRgFUS treatment planning based on a rapid ultrasound beam modeling method, 2) a comprehensive breast-focused volumetric MR thermometry method, and 3) a novel MRI-registered whole mount histology technique that compares MRI metrics to histopathological analysis. We will integrate each of these validated elements in a clinic-ready, treatment software control environment and will evaluate the entire process in a Phase I treat-and-resect clinical trial targeting unifocal invasive breast cancer tumors. The validation and translation of the Muse System in this proposal will provide an exciting new image-guided noninvasive treatment option for breast cancer patients.

Public Health Relevance
The validation of breast-specific treatment planning, monitoring and assessment techniques will accelerate translation of a magnetic resonance-guided focused ultrasound system towards the efficient, non-invasive treatment of localized breast cancer. Incorporating gold standard validated techniques in a clinic-ready planning and monitoring software package will enable a Phase I clinical trial to verify this non-invasive therapy can achieve efficacy within acceptable safety limits.

Anatomy Directly from Imagery: General-Purpose, Scalable, and Open-Source Machine Learning Approaches

Shireen Youssef Elhabian
The form (or shape) and function relationship of anatomical structures is a central theme in biology where abnormal shape changes are closely tied to pathological functions. Morphometrics has been an indispensable quantitative tool in medical and biological sciences to study anatomical forms for more than 100 years. Recently, the increased availability of high-resolution in-vivo images of anatomy has led to the development of a new generation of morphometric approaches, called statistical shape modeling (SSM), that take advantage of modern computational techniques to model anatomical shapes and their variability within populations with unprecedented detail. SSM stands to revolutionize morphometric analysis, but its widespread adoption is hindered by a number of significant challenges, including the complexity of the approaches and their increased computational requirements, relative to traditional morphometrics. Arguably, however, the most important roadblock to more widespread adoption is the lack of user-friendly and scalable software tools for a variety of anatomical surfaces that can be readily incorporated into biomedical research labs. The goal of this proposal is thus to address these challenges in the context of a flexible and general SSM approach termed particle-based shape modeling (PSM), which automatically constructs optimal statistical landmark-based shape models of ensembles of anatomical shapes without relying on any specific surface parameterization. The proposed research will provide an automated, general-purpose, and scalable computational solution for constructing shape models of general anatomy.

In Aim 1, we will build computational and machine learning algorithms to model anatomies with complex surface topologies (e.g., surface openings and shared boundaries) and highly variable anatomical populations.

In Aim 2, we will introduce an end-to-end machine learning approach to extract statistical shape representation directly from images, requiring no parameter tuning, image pre-processing, or user assistance.

In Aim 3, we will provide intuitive graphical user interfaces and visualization tools to incorporate user-defined modeling preferences and promote the visual interpretation of shape models. We will also make use of recent advances in cloud computing to enable researchers with limited computational resources and/or large cohorts to build and execute custom SSM workflows using remote scalable computational resources. Algorithmic developments will be thoroughly evaluated and validated using existing, fully funded, large-scale, and constantly growing databases of CT and MRI images located on-site. Furthermore, we will develop and disseminate standard workflows and domain-specific use cases for complex anatomies to promote reproducibility. Efforts to develop the proposed technology are aligned with the mission of the National Institute of General Medical Sciences (NIGMS), and its third strategic goal: to bridge biology and quantitative science for better global health through supporting the development of and access to computational research tools for biomedical research. Our long-term goal is to increase the clinical utility and widespread adoption of SSM, and the proposed research will establish the groundwork for achieving this goal.

Public Health Relevance
This project will develop general-purpose, scalable, and open-source statistical shape modeling (SSM) tools, which will present unique capabilities for automated anatomy modeling with less user input. The proposed technology will introduce a number of significant improvements to current SSM approaches and tools, including the support for challenging modeling problems, inferring shapes directly from images (and hence bypassing the segmentation step), parallel optimizations for speed, and new user interfaces that will be much easier and scalable than the current tools. The proposed technology will constitute an indispensable resource for the biomedical and clinical communities that will enable new avenues for biomedical research and clinical investigations, provide new ways to answer biologically related questions, allow new types of questions to be asked, and open the door for the integration of SSM with clinical care.

Machine Learning and Signature Analysis of Nuclear Forensic Data

Tolga Tasdizen
The development of uranium oxide physical and chemical signatures is critical to the field of nuclear forensic analysis. Qualitative morphological parameters provide supplementary information in nuclear forensic investigations, but tell a limited portion of an unknown sample’s story. There is a major need for quantitative parameters that can rapidly determine whether differences between an unknown sample and a standard are statistically significant. It would be even more beneficial if these parameters could elucidate not just the starting material speciation, but the processing conditions experienced by the sample. Accounting for storage and temporal effects further accounts for the total process history of an unknown sample. In the future, one could see a quantitative morphological database that expedites the attribution process. We propose to develop a data processing pipeline that streamlines the analysis of complex nuclear forensics data and statistically correlates data across multiple techniques. This pipeline will build upon existing computational tools in the nuclear forensics community, such as the Morphological Analysis for Material Attribution (MAMA) software developed by Los Alamos National Laboratory (LANL), to characterize particle morphologies. The proposed fundamental science in an academic setting coupled with technical guidance from Pacific Northwest National Laboratory (PNNL) will instruct and inform the next generation of nuclear scientists and engineers.

Multi-Tiered Carbon Monitoring System

Sarang Joshi
Reducing CH4 and fossil fuel CO2 emissions remains a top climate mitigation priority for stakeholders around the world. States such as California have committed to ambitious GHG stabilization targets. State agencies such as the California Air Resources Board (CARB) are strongly motivated to verify emissions and inform policy formulation at the scale of major emitting regions, air basins and cities. Additionally, private companies such as Chevron have expressed interest in better facility-scale emissions data to reduce their greenhouse gas footprints and product loss. Meanwhile several foundations such as the Rocky Mountain Institute (RMI) are working to establish trusted climate data initiatives through public-private partnerships. A common feature of these interests is a focus on facility-scale point source emitters and their contribution to local emission budgets to prioritize mitigation efforts. A common challenge is that CH4 and CO2 emissions data at those spatial scales is currently sparse, inaccurate or non-existent. There is an urgent need to provide CH4 and CO2 data and analytics that are trusted, timely and at spatial scales relevant to decision making. A tiered observational strategy and integrated data analysis framework have the potential to leverage emerging and planned airborne and satellite remote sensing capabilities to address these challenges and stakeholder needs (ultimately in key regions globally).

We propose to build on the success of our Prototype Methane Monitoring System for California (CMS-2015-Duren) and Megacities Carbon Project to develop and test a Multi-tiered CH4 (and as a secondary goal: CO2) monitoring system for a broader set of high emitting regions and priority emission sectors in the US. In year 1 of the project we plan to conduct CH4 and CO2 point source surveys of key regions and sectors in California, the Permian basin in Texas and New Mexico, and major oil and gas infrastructure centers along the Gulf Coast with NASA's Next Generation Airborne Visible/Infrared Imaging Spectrometer (AVIRIS-ng) together with coordinated snap-shot mode CO2 observations from the Orbiting Carbon Observatory-3 (OCO-3) and routine CH4 observations from the Sentinel-5 Precursor/TROPOMI satellite. In years 2 and 3 we will generate CH4 (goal: CO2) regional and point source emission estimates in those regions that leverage and extend multi-scale estimation techniques previously prototyped in California. This will allow stakeholders to place facility scale emissions into context with regional emissions. If selected, this project will benefit from additional funding from RMI to support more airborne surveys and data product development. It also benefits from in-kind contributions from other collaborators. Our stakeholders including RMI, CARB and Chevron have indicated an interest in evaluating and potentially adopting the methods, tools and data products developed by this project for infusion into their decision frameworks. Finally, we also plan to leverage existing surface measurements from our own Megacities Carbon Project and collaborators in the southern San Joaquin Valley and potentially the Permian basin, Salt Lake City and the Uintah Basin to help validate emission estimates.

Time-Dependent Analysis of SPT Microscopy Data

Publications in Image Analysis:

Page 2 of 22

Start
Prev
1
2
3
4
5
6
7
8
9
10
Next
End

Refining Skewed Perceptions in Vision-Language Models through Visual Representations
Subtitled “arXiv preprint arXiv:2405.14030,” H. Dai, S. Joshi. 2024.

Large vision-language models (VLMs), such as CLIP, have become foundational, demonstrating remarkable success across a variety of downstream tasks. Despite their advantages, these models, akin to other foundational systems, inherit biases from the disproportionate distribution of real-world data, leading to misconceptions about the actual environment. Prevalent datasets like ImageNet are often riddled with non-causal, spurious correlations that can diminish VLM performance in scenarios where these contextual elements are absent. This study presents an investigation into how a simple linear probe can effectively distill task-specific core features from CLIP’s embedding for downstream applications. Our analysis reveals that the CLIP text representations are often tainted by spurious correlations, inherited in the biased pre-training dataset. Empirical evidence suggests that relying on visual representations from CLIP, as opposed to text embedding, is more practical to refine the skewed perceptions in VLMs, emphasizing the superior utility of visual representations in overcoming embedded biases

Grand Challenges at the Interface of Engineering and Medicine
S. Subramaniam, M. Miller, several co-authors, Chris R. Johnson, et al.. In IEEE Open Journal of Engineering in Medicine and Biology, Vol. 5, IEEE, pp. 1--13. 2024.
DOI: 10.1109/OJEMB.2024.3351717

Over the past two decades Biomedical Engineering has emerged as a major discipline that bridges societal needs of human health care with the development of novel technologies. Every medical institution is now equipped at varying degrees of sophistication with the ability to monitor human health in both non-invasive and invasive modes. The multiple scales at which human physiology can be interrogated provide a profound perspective on health and disease. We are at the nexus of creating “avatars” (herein defined as an extension of “digital twins”) of human patho/physiology to serve as paradigms for interrogation and potential intervention. Motivated by the emergence of these new capabilities, the IEEE Engineering in Medicine and Biology Society, the Departments of Biomedical Engineering at Johns Hopkins University and Bioengineering at University of California at San Diego sponsored an interdisciplinary workshop to define the grand challenges that face biomedical engineering and the mechanisms to address these challenges. The Workshop identified five grand challenges with cross-cutting themes and provided a roadmap for new technologies, identified new training needs, and defined the types of interdisciplinary teams needed for addressing these challenges. The themes presented in this paper include: 1) accumedicine through creation of avatars of cells, tissues, organs and whole human; 2) development of smart and responsive devices for human function augmentation; 3) exocortical technologies to understand brain function and treat neuropathologies; 4) the development of approaches to harness the human immune system for health and wellness; and 5) new strategies to engineer genomes and cells.

Matching aggregate posteriors in the variational autoencoder
Subtitled “arXiv preprint arXiv:2311.07693,” S. Saha, S. Joshi, R. Whitaker. 2023.

The variational autoencoder (VAE) [1] is a well-studied, deep, latent-variable model (DLVM) that efficiently optimizes the variational lower bound of the log marginal data likelihood and has a strong theoretical foundation. However, the VAE’s known failure to match the aggregate posterior often results in pockets/holes in the latent distribution (i.e., a failure to match the prior) and/or posterior collapse, which is associated with a loss of information in the latent space. This paper addresses these shortcomings in VAEs by reformulating the objective function associated with VAEs in order to match the aggregate/marginal posterior distribution to the prior. We use kernel density estimate (KDE) to model the aggregate posterior in high dimensions. The proposed method is named the aggregate variational autoencoder (AVAE) and is built on the theoretical framework of the VAE. Empirical evaluation of the proposed method on multiple benchmark data sets demonstrates the effectiveness of the AVAE relative to state-of-the-art (SOTA) methods.

Two-Stage Deep Learning Framework for Quality Assessment of Left Atrial Late Gadolinium Enhanced MRI Images
Subtitled “arXiv:2310.08805,” K.M.A.Sultan, B. Orkild, A. Morris, E. Kholmovski, E. Bieging, E. Kwan, R. Ranjan, E. DiBella, S. Elhabian. 2023.

Accurate assessment of left atrial fibrosis in patients with atrial fibrillation relies on high-quality 3D late gadolinium enhancement (LGE) MRI images. However, obtaining such images is challenging due to patient motion, changing breathing patterns, or sub-optimal choice of pulse sequence parameters. Automated assessment of LGE-MRI image diagnostic quality is clinically significant as it would enhance diagnostic accuracy, improve efficiency, ensure standardization, and contributes to better patient outcomes by providing reliable and high-quality LGE-MRI scans for fibrosis quantification and treatment planning. To address this, we propose a two-stage deep-learning approach for automated LGE-MRI image diagnostic quality assessment. The method includes a left atrium detector to focus on relevant regions and a deep network to evaluate diagnostic quality. We explore two training strategies, multi-task learning, and pretraining using contrastive learning, to overcome limited annotated data in medical imaging. Contrastive Learning result shows about $4 %$ , and $9 %$ improvement in F1-Score and Specificity compared to Multi-Task learning when there's limited data.

Image2SSM: Localization-aware Deep Learning Framework for Statistical Shape Modeling Directly from Images
J Ukey, S Elhabian. In Medical Imaging with Deep Learning, In Proceedings of Machine Learning Research , 2023.

Statistical Shape Modelling (SSM) is an effective tool for quantitatively analyzing anatomical populations. SSM has benefitted largely from advances in deep learning where statistical representations of anatomies (e.g., point distribution models or PDMs) are inferred directly from images, alleviating the need for a time-consuming and expensive workflow of anatomy segmentation, shape registration, and model optimization. Nonetheless, to date, existing deep learning methods do not consider the rigid pose transformation of shapes or anatomy of interest. They also require a tight bounding box to be defined over the image of anatomy-of-interest before feeding the image to the deep network for network training and inference. In this paper, we propose a deep learning framework that simultaneously detects and segments the anatomy of interest, estimate the rigid transformation with respect to the population mean (average) using a spatial transformer, and estimates the corresponding statistical representation of that anatomy, all directly from unsegmented 3D image without the need for any additional supervision. Furthermore, we leverage the segmentation task to provide an attention model for the sub-network that estimates shape representation, giving more accurate shape statistics for shape analysis.

Multi-task Training as Regularization Strategy for Seismic Image Segmentation
S. Saha, W. Gazi, R. Mohammed, T. Rapstine, H. Powers, R. Whitaker. In IEEE Geoscience and Remote Sensing Letters, Vol. 20, IEEE, pp. 1--5. 2023.
DOI: 10.1109/LGRS.2023.3328837

This letter proposes multitask learning as a regularization method for segmentation tasks in seismic images. We examine application-specific auxiliary tasks, such as the estimation/detection of horizons, dip angle, and amplitude that geophysicists consider relevant for identification of channels (a geological feature), which is currently done through painstaking outlining by qualified experts. We show that multitask training helps in better generalization on test datasets with very similar and different structure/statistics. In such settings, we also show that multitask learning performs better on unseen datasets relative to the baseline.

CLASSMix: Adaptive stain separation-based contrastive learning with pseudo labeling for histopathological image classification
Subtitled “arXiv:2312.06978v2,” B. Zhang, H. Manoochehri, M.M. Ho, F. Fooladgar, Y. Chong, B. Knudsen, D. Sirohi, T. Tasdizen. 2023.

Histopathological image classification is one of the critical aspects in medical image analysis. Due to the high expense associated with the labeled data in model training, semi-supervised learning methods have been proposed to alleviate the need of extensively labeled datasets. In this work, we propose a model for semi-supervised classification tasks on digital histopathological Hematoxylin and Eosin (H&E) images. We call the new model Contrastive Learning with Adaptive Stain Separation and MixUp (CLASS-M). Our model is formed by two main parts: contrastive learning between adaptively stain separated Hematoxylin images and Eosin images, and pseudo-labeling using MixUp. We compare our model with other state-of-the-art models on clear cell renal cell carcinoma (ccRCC) datasets from our institution and The Cancer Genome Atlas Program (TCGA). We demonstrate that our CLASS-M model has the best performance on both datasets. The contributions of different parts in our model are also analyzed.

High-Fidelity CT on Rails-Based Characterization of Delivered Dose Variation in Conformal Head and Neck Treatments
H. Dai, V. Sarkar, C. Dial, M.D. Foote, Y. Hitchcock, S. Joshi, B. Salter. In Applied Radiation Oncology, 2023.
DOI: 10.1101/2023.04.07.23288305

Objective: This study aims to characterize dose variations from the original plan for a cohort of patients with head-and-neck cancer (HNC) using high-quality CT on rails (CTOR) datasets and evaluate a predictive model for identifying patients needing replanning.

Materials and Methods: In total, 74 patients with HNC treated on our CTOR-equipped machine were evaluated in this retrospective study. Patients were treated at our facility using in-room, CTOR image guidance—acquiring CTOR kV fan beam CT images on a weekly to near-daily basis. For each patient, a particular day’s delivered treatment dose was calculated by applying the approved, planned beam set to the post image-guided alignment CT image of the day. Total accumulated delivered dose distributions were calculated and compared with the planned dose distribution, and differences were characterized by comparison of dose and biological response statistics.

Results: The majority of patients in the study saw excellent agreement between planned and delivered dose distribution in targets—the mean deviations of dose received by 95% and 98% of the planning target volumes of the cohort are −0.7% and −1.3%, respectively. In critical organs, we saw a +6.5% mean deviation of mean dose in the parotid glands, −2.3% mean deviation of maximum dose in the brainstem, and +0.7% mean deviation of maximum dose in the spinal cord. Of 74 patients, 10 experienced nontrivial variation of delivered parotid dose, which resulted in a normal tissue complication probability (NTCP) increase compared with the anticipated NTCP in the original plan, ranging from 11% to 44%.

Conclusion: We determined that a midcourse evaluation of dose deviation was not effective in predicting the need for replanning for our patient cohorts. The observed nontrivial dose difference to parotid gland delivered dose suggests that even when rigorous, high-quality image guidance is performed, clinically concerning variations to predicted dose delivery can still occur.

Particle-Based Shape Modeling for Arbitrary Regions-of-Interest,
H. Xu, A. Morris, S.Y. Elhabian. In Shape in Medical Imaging, Lecture Notes in Computer Science, vol 14350, 2023.

Statistical Shape Modeling (SSM) is a quantitative method for analyzing morphological variations in anatomical structures. These analyses often necessitate building models on targeted anatomical regions of interest to focus on specific morphological features. We propose an extension to particle-based shape modeling (PSM), a widely used SSM framework, to allow shape modeling to arbitrary regions of interest. Existing methods to define regions of interest are computationally expensive and have topological limitations. To address these shortcomings, we use mesh fields to define free-form constraints, which allow for delimiting arbitrary regions of interest on shape surfaces. Furthermore, we add a quadratic penalty method to the model optimization to enable computationally efficient enforcement of any combination of cutting-plane and free-form constraints. We demonstrate the effectiveness of this method on a challenging synthetic dataset and two medical datasets.

Two-Stage Deep Learning Framework for Quality Assessment of Left Atrial Late Gadolinium Enhanced MRI Images
Subtitled “arXiv:2310.08805v1,” K.M.A. Sultan, B. Orkild, A. Morris, E. Kholmovski, E. Bieging, E. Kwan, R. Ranjan, E. DiBella, s. Elhabian. 2023.

Accurate assessment of left atrial fibrosis in patients with atrial fibrillation relies on high-quality 3D late gadolinium enhancement (LGE) MRI images. However, obtaining such images is challenging due to patient motion, changing breathing patterns, or sub-optimal choice of pulse sequence parameters. Automated assessment of LGE-MRI image diagnostic quality is clinically significant as it would enhance diagnostic accuracy, improve efficiency, ensure standardization, and contributes to better patient outcomes by providing reliable and high-quality LGE-MRI scans for fibrosis quantification and treatment planning. To address this, we propose a two-stage deep-learning approach for automated LGE-MRI image diagnostic quality assessment. The method includes a left atrium detector to focus on relevant regions and a deep network to evaluate diagnostic quality. We explore two training strategies, multi-task learning, and pretraining using contrastive learning, to overcome limited annotated data in medical imaging. Contrastive Learning result shows about 4%, and 9% improvement in F1-Score and Specificity compared to Multi-Task learning when there’s limited data.

Review of Multi-Faceted Morphologic Signatures of Actinide Process Materials for Nuclear Forensic Science
L.W. McDonald IV, K. Sentz, A. Hagen, B.W. Chung, T. Tasdizen, et. al.. In Journal of Nuclear Materials, Elsevier, 2023.

Particle morphology is an emerging signature that has the potential to identify the processing history of unknown nuclear materials. Using readily available scanning electron microscopes (SEM), the morphology of nearly any solid material can be measured within hours. Coupled with robust image analysis and classification methods, the morphological features can be quantified and support identification of the processing history of unknown nuclear materials. The viability of this signature depends on developing databases of morphological features, coupled with a rapid data analysis and accurate classification process. With developed reference methods, datasets, and throughputs, morphological analysis can be applied within days to (i) interdicted bulk nuclear materials (gram to kilogram quantities), and (ii) trace amounts of nuclear materials detected on swipes or environmental samples. This review aims to develop validated and verified analytical strategies for morphological analysis relevant to nuclear forensics.

Progressive DeepSSM: Training Methodology for Image-To-Shape Deep Models
Subtitled “arXiv:2310.01529,” A.Z.B. Aziz, J. Adams, S. Elhabian. 2023.

Statistical shape modeling (SSM) is an enabling quantitative tool to study anatomical shapes in various medical applications. However, directly using 3D images in these applications still has a long way to go. Recent deep learning methods have paved the way for reducing the substantial preprocessing steps to construct SSMs directly from unsegmented images. Nevertheless, the performance of these models is not up to the mark. Inspired by multiscale/multiresolution learning, we propose a new training strategy, progressive DeepSSM, to train image-to-shape deep learning models. The training is performed in multiple scales, and each scale utilizes the output from the previous scale. This strategy enables the model to learn coarse shape features in the first scales and gradually learn detailed fine shape features in the later scales. We leverage shape priors via segmentation-guided multi-task learning and employ deep supervision loss to ensure learning at each scale. Experiments show the superiority of models trained by the proposed strategy from both quantitative and qualitative perspectives. This training methodology can be employed to improve the stability and accuracy of any deep learning method for inferring statistical representations of anatomies from medical images and can be adopted by existing deep learning methods to improve model accuracy and training stability.

Improving Robustness for Model Discerning Synthesis Process of Uranium Oxide with Unsupervised Domain Adaptation,
C. Ly, C. Nizinski, A. Hagen, L. McDonald IV, T. Tasdizen. In Frontiers in Nuclear Engineering, 2023.

The quantitative characterization of surface structures captured in scanning electron microscopy (SEM) images has proven to be effective for discerning provenance of an unknown nuclear material. Recently, many works have taken advantage of the powerful performance of convolutional neural networks (CNNs) to provide faster and more consistent characterization of surface structures. However, one inherent limitation of CNNs is their degradation in performance when encountering discrepancy between training and test datasets, which limits their use widely.The common discrepancy in an SEM image dataset occurs at low-level image information due to user-bias in selecting acquisition parameters and microscopes from different manufacturers.Therefore, in this study, we present a domain adaptation framework to improve robustness of CNNs against the discrepancy in low-level image information. Furthermore, our proposed approach makes use of only unlabeled test samples to adapt a pretrained model, which is more suitable for nuclear forensics application for which obtaining both training and test datasets simultaneously is a challenge due to data sensitivity. Through extensive experiments, we demonstrate that our proposed approach effectively improves the performance of a model by at least 18% when encountering domain discrepancy, and can be deployed in many CNN architectures.

MedShapeNet - A Large-Scale Dataset of 3D Medical Shapes for Computer Vision
Subtitled “arXiv:2308.16139v3,” J. Li, A. Pepe, C. Gsaxner, G. Luijten, Y. Jin, S. Elhabian, et. al.. 2023.

We present MedShapeNet, a large collection of anatomical shapes (e.g., bones, organs, vessels) and 3D surgical instrument models. Prior to the deep learning era, the broad application of statistical shape models (SSMs) in medical image analysis is evidence that shapes have been commonly used to describe medical data. Nowadays, however, state-of-the-art (SOTA) deep learning algorithms in medical imaging are predominantly voxel-based. In computer vision, on the contrary, shapes (including, voxel occupancy grids, meshes, point clouds and implicit surface models) are preferred data representations in 3D, as seen from the numerous shape-related publications in premier vision conferences, such as the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), as well as the increasing popularity of ShapeNet (about 51,300 models) and Princeton ModelNet (127,915 models) in computer vision research. MedShapeNet is created as an alternative to these commonly used shape benchmarks to facilitate the translation of data-driven vision algorithms to medical applications, and it extends the opportunities to adapt SOTA vision algorithms to solve critical medical problems. Besides, the majority of the medical shapes in MedShapeNet are modeled directly on the imaging data of real patients, and therefore it complements well existing shape benchmarks consisting of computer-aided design (CAD) models. MedShapeNet currently includes more than 100,000 medical shapes, and provides annotations in the form of paired data. It is therefore also a freely available repository of 3D models for extended reality (virtual reality - VR, augmented reality - AR, mixed reality - MR) and medical 3D printing. This white paper describes in detail the motivations behind MedShapeNet, the shape acquisition procedures, the use cases, as well as the usage of the online shape search portal: https://medshapenet.ikim.nrw/

Structural Cycle GAN for Virtual Immunohistochemistry Staining of Gland Markers in the Colon
Subtitled “arXiv:2308.13182,” S. Dubey, T. Kataria, B. Knudsen, S.Y. Elhabian. 2023.

With the advent of digital scanners and deep learning, diagnostic operations may move from a microscope to a desktop. Hematoxylin and Eosin (H&E) staining is one of the most frequently used stains for disease analysis, diagnosis, and grading, but pathologists do need different immunohistochemical (IHC) stains to analyze specific structures or cells. Obtaining all of these stains (H&E and different IHCs) on a single specimen is a tedious and time-consuming task. Consequently, virtual staining has emerged as an essential research direction. Here, we propose a novel generative model, Structural Cycle-GAN (SC-GAN), for synthesizing IHC stains from H&E images, and vice versa. Our method expressly incorporates structural information in the form of edges (in addition to color data) and employs attention modules exclusively in the decoder of the proposed generator model. This integration enhances feature localization and preserves contextual information during the generation process. In addition, a structural loss is incorporated to ensure accurate structure alignment between the generated and input markers. To demonstrate the efficacy of the proposed model, experiments are conducted with two IHC markers emphasizing distinct structures of glands in the colon: the nucleus of epithelial cells (CDX2) and the cytoplasm (CK818). Quantitative metrics such as FID and SSIM are frequently used for the analysis of generative models, but they do not correlate explicitly with higher-quality virtual staining results. Therefore, we propose two new quantitative metrics that correlate directly with the virtual staining specificity of IHC markers.

Benchmarking Scalable Epistemic Uncertainty Quantification in Organ Segmentation
Subtitled “arXiv:2308.07506,” J. Adams, S.Y. Elhabian. 2023.

Deep learning based methods for automatic organ segmentation have shown promise in aiding diagnosis and treatment planning. However, quantifying and understanding the uncertainty associated with model predictions is crucial in critical clinical applications. While many techniques have been proposed for epistemic or model-based uncertainty estimation, it is unclear which method is preferred in the medical image analysis setting. This paper presents a comprehensive benchmarking study that evaluates epistemic uncertainty quantification methods in organ segmentation in terms of accuracy, uncertainty calibration, and scalability. We provide a comprehensive discussion of the strengths, weaknesses, and out-of-distribution detection capabilities of each method as well as recommendations for future improvements. These findings contribute to the development of reliable and robust models that yield accurate segmentations while effectively quantifying epistemic uncertainty.

A Non-Contrast Multi-Parametric MRI Biomarker for Assessment of MR-Guided Focused Ultrasound Thermal Therapies
S. Johnson, B. Zimmerman, H. Odéen, J. Shea, N. Winkler, R. Factor, S. Joshi, A. Payne. In IEEE Transactions on Biomedical Engineering, IEEE, pp. 1--12. 2023.
DOI: 10.1109/TBME.2023.3303445

Objective: We present the development of a non-contrast multi-parametric magnetic resonance (MPMR) imaging biomarker to assess treatment outcomes for magnetic resonance-guided focused ultrasound (MRgFUS) ablations of localized tumors. Images obtained immediately following MRgFUS ablation were inputs for voxel- wise supervised learning classifiers, trained using registered histology as a label for thermal necrosis. Methods: VX2 tumors in New Zealand white rabbits quadriceps were thermally ablated using an MRgFUS system under 3 T MRI guidance. Animals were re-imaged three days post-ablation and euthanized. Histological necrosis labels were created by 3D registration between MR images and digitized H&E segmentations of thermal necrosis to enable voxel- wise classification of necrosis. Supervised MPMR classifier inputs included maximum temperature rise, cumulative thermal dose (CTD), post-FUS differences in T2-weighted images, and apparent diffusion coefficient, or ADC, maps. A logistic regression, support vector machine, and random forest classifier were trained in red a leave-one-out strategy in test data from four subjects. Results: In the validation dataset, the MPMR classifiers achieved higher recall and Dice than than a clinically adopted 240 cumulative equivalent minutes at 43^∘ C (CEM ₄₃ ) threshold (0.43) in all subjects.redThe average Dice scores of overlap with the registered histological label for the logistic regression (0.63) and support vector machine (0.63) MPMR classifiers were within 6% of the acute contrast-enhanced non-perfused volume (0.67). Conclusions: Voxel- wise registration of MPMR data to histological outcomes facilitated supervised learning of an accurate non-contrast MR biomarker for MRgFUS ablations in a rabbit VX2 tumor model.

To pretrain or not to pretrain? A case study of domain-specific pretraining for semantic segmentation in histopathology
Subtitled “arXiv:2307.03275,” T. Kataria, B. Knudsen, S. Elhabian. 2023.

Annotating medical imaging datasets is costly, so fine-tuning (or transfer learning) is the most effective method for digital pathology vision applications such as disease classification and semantic segmentation. However, due to texture bias in models trained on real-world images, transfer learning for histopathology applications might result in underperforming models, which necessitates the need for using unlabeled histopathology data and self-supervised methods to discover domain-specific characteristics. Here, we tested the premise that histopathology-specific pretrained models provide better initializations for pathology vision tasks, i.e., gland and cell segmentation. In this study, we compare the performance of gland and cell segmentation tasks with domain-specific and non-domain-specific pretrained weights. Moreover, we investigate the data size at which domain-specific pretraining produces a statistically significant difference in performance. In addition, we investigated whether domain-specific initialization improves the effectiveness of out-of-domain testing on distinct datasets but the same task. The results indicate that performance gain using domain-specific pretraining depends on both the task and the size of the training dataset. In instances with limited dataset sizes, a significant improvement in gland segmentation performance was also observed, whereas models trained on cell segmentation datasets exhibit no improvement.

ADASSM: Adversarial Data Augmentation in Statistical Shape Models From Images
Subtitled “arXiv:2307.03273v2,” M.S.T. Karanam, T. Kataria, S. Elhabian. 2023.

Statistical shape models (SSM) have been well-established as an excellent tool for identifying variations in the morphology of anatomy across the underlying population. Shape models use consistent shape representation across all the samples in a given cohort, which helps to compare shapes and identify the variations that can detect pathologies and help in formulating treatment plans. In medical imaging, computing these shape representations from CT/MRI scans requires time-intensive preprocessing operations, including but not limited to anatomy segmentation annotations, registration, and texture denoising. Deep learning models have demonstrated exceptional capabilities in learning shape representations directly from volumetric images, giving rise to highly effective and efficient Image-to-SSM. Nevertheless, these models are data-hungry and due to the limited availability of medical data, deep learning models tend to overfit. Offline data augmentation techniques, that use kernel density estimation based (KDE) methods for generating shape-augmented samples, have successfully aided Image-to-SSM networks in achieving comparable accuracy to traditional SSM methods. However, these augmentation methods focus on shape augmentation, whereas deep learning models exhibit image-based texture bias results in sub-optimal models. This paper introduces a novel strategy for on-the-fly data augmentation for the Image-to-SSM framework by leveraging data-dependent noise generation or texture augmentation. The proposed framework is trained as an adversary to the Image-to-SSM network, augmenting diverse and challenging noisy samples. Our approach achieves improved accuracy by encouraging the model to focus on the underlying geometry rather than relying solely on pixel values.

Editorial: Image-based computational approaches for personalized cardiovascular medicine: improving clinical applicability and reliability through medical imaging and experimental data
S. Pirola, A. Arzani, C. Chiastra, F. Sturla. In Frontiers in Medical Technology, Vol. 5, 2023.
DOI: 10.3389/fmedt.2023.1222837

Page 2 of 22

Start
Prev
1
2
3
4
5
6
7
8
9
10
Next
End

SCI