Designed especially for neurobiologists, FluoRender is an interactive tool for multi-channel fluorescence microscopy data visualization and analysis.

BrainStimulator is a set of networks that are used in SCIRun to perform simulations of brain stimulation such as transcranial direct current stimulation (tDCS) and magnetic transcranial stimulation (TMS).

Developing software tools for science has always been a central vision of the SCI Institute.

Visualization

Visualization, sometimes referred to as visual data analysis, uses the graphical representation of data as a means of gaining understanding and insight into the data. Visualization research at SCI has focused on applications spanning computational fluid dynamics, medical imaging and analysis, biomedical data analysis, healthcare data analysis, weather data analysis, poetry, network and graph analysis, financial data analysis, etc.

Research involves novel algorithm and technique development to building tools and systems that assist in the comprehension of massive amounts of (scientific) data. We also research the process of creating successful visualizations.

We strongly believe in the role of interactivity in visual data analysis. Therefore, much of our research is concerned with creating visualizations that are intuitive to interact with and also render at interactive rates.

Visualization at SCI includes the academic subfields of Scientific Visualization, Information Visualization and Visual Analytics.

Charles Hansen

Volume Rendering
Ray Tracing
Graphics

Valerio Pascucci

Topological Methods
Data Streaming
Big Data

Chris Johnson

Scalar, Vector, and
Tensor Field Visualization,
Uncertainty Visualization

Mike Kirby

Uncertainty Visualization

Ross Whitaker

Topological Methods
Uncertainty Visualization

Alex Lex

Information Visualization

Bei Wang

Information Visualization
Scientific Visualization
Topological Data Analysis

Centers and Labs:

Funded Research Projects:

SCALE MoDL: Advancing Theoretical Minimax Deep Learning: Optimization, Resilience, and Interpretability

Bei Wang
The past decade has witnessed the great success of deep learning in broad societal and commercial applications. However, conventional deep learning relies on fitting data with neural networks, which is known to produce models that lack resilience. The next-generation deep learning paradigm needs to deliver resilient models that promote robustness to malicious attacks, fairness among users, and privacy preservation. In this project, the investigators will collaboratively develop a comprehensive minimax learning theory that advances the fundamental understanding of minimax deep learning from the perspectives of optimization, resilience, and interpretability.

Enabling Reproducibility of Interactive Visual Data Analysis

Alex Lex
Reproducibility and justifiability are widely recognized as critical aspects of data-driven decision making in fields as varied as scientific research, business, healthcare, or intelligence analysis. This project is concerned with enabling reproducibility and justifiability of decisions in the data analysis process, specifically as it relates to visual data analysis. Visualization is an important tool for discovery, yet decisions made by humans based on visualizations of data are difficult to capture and to justify. This project will develop methods to justify, communicate, and audit decisions made based on visual analysis. This, in turn will lead to better outcomes, achieved with less effort and cost. The increasing use of visual analysis tools for decision making will make data analysis accessible to a broad variety of people, as visual analysis tools are generally easier to use than scripting languages and do not require extensive computational and statistical training. This research and its related activities increase accessibility and enhance the data analysis infrastructure for research and education.

To achieve these goals, this research will develop a framework for making visual analysis sessions not only reproducible but also reusable. The approach is based on tracking semantically meaningful provenance data during an interactive visual analysis session. Once a discovery is made, analysts can use this history to curate a succinct analysis story, adding justifications and explanations to make their analysis reproducible by others. Using a semi-automatic process, analysts will be able to make their actions data-aware, so that their analysis processes become robust to changes, such as updates in the data. A second contribution of the proposed work is the integration of visual analysis into computational analysis processes. While visualization is commonly used to present computational analysis results, the results of a visual analysis session are rarely used to feed into further computational processes. The techniques developed in this project will allow analysts to feed analysis results (selections, aggregations, filters, etc.) back into a computational environment. This will make it possible to use interactive visualization at any point in the data analysis process while maintaining reproducibility and enabling reuse. The expected results include new methods to capture user intent, create data stories from analysis processes, and to integrate computational and visual data analysis, leveraging the strength of both, human abilities and computational power. The results will be disseminated in publications and in the form of open source software, and accessible via the project website (http://vdl.sci.utah.edu/projects/2018-nsf-reproducibility/).

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Reproducible Visual Analysis of Multivariate Networks with Multinet

Miriah Meyer, Bryan Jones, Alexander Lex
Multivariate networks -- datasets that link together entities that are associated with multiple different variables -- are a critical data representation for a range of high-impact problems, from understanding how our bodies work to uncovering how social media influences society. These data representations are a rich and complex reflection of the multifaceted relationships that exist in the world. Reasoning about a problem using a multivariate network allows an analyst to ask questions beyond those about explicit connectivity alone: Do groups of social-media influencers have similar backgrounds or experiences? Do species that co-evolve live in similar climates? What patterns of cell-types support different types of brain functions? Questions like these require understanding patterns and trends about entities with respect to both their attributes and their connectivity, leading to inferences about relationships beyond the initial network structure. As data continues to become an increasingly important driver of scientific discovery, datasets of networks have also become increasingly complex. These networks capture information about relationships between entities as well as attributes of the entities and the connections. Tools used in practice today provide very limited support for reasoning about networks and are also limited in the how users can interact with them. This lack of support leaves analysts and scientists to piece together workflows using separate tools, and significant amounts of programming, especially in the data preparation step. This project aims fill this critical gap in the existing cyber-infrastructure ecosystem for reasoning about multivariate networks by developing MultiNet, a robust, flexible, secure, and sustainable open-source visual analysis system.

MultiNet aims to change the landscape of visual analysis capabilities for reasoning about and analyzing multivariate networks. The web-based tool, along with an underlying plug-in-based framework, will support three core capabilities: (1) interactive, task-driven visualization of both the connectivity and attributes of networks, (2) reshaping the underlying network structure to bring the network into a shape that is well suited to address analysis questions, and (3) leveraging provenance data to support reproducibility, communication, and integration in computational workflows. These capabilities will allow scientists to ask new classes of questions about network datasets, and lead to insights about a wide range of pressing topics. To meet this goal, we will ground the design of MultiNet in four deeply collaborative case studies with domain scientists in biology, neuroscience, sociology, and geology.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Visualizing Robust Features in Vector and Tensor Fields

Bei Wang
Vector and tensor fields provide a powerful language to describe physical phenomena in many scientific applications. In atmospheric sciences, vectors are used to represent air movements with speed and directions and to capture typical and atypical atmospheric conditions. In materials science, stress and strain tensors are used to specify the behaviors of material bodies experiencing deformations and to facilitate the study of material strength. The main objective of this project is to define and quantify robust features in vector and tensor fields and to derive scientifically meaningful visualization for knowledge discovery. Robust features are objects, structures, or regions of interest that are stable under small perturbations of the data that arise from measurement noise, numerical instability or simulation uncertainty. Robust features are defined and evaluated via close collaborations with domain scientists to help them discriminate spurious from essential structures in the data. In materials science, the extraction of robust features in stress tensor fields will help the materials scientists better characterize and predict 3D cracking for manufacturing stronger materials. In neuroscience, quantifying the robustness of degenerate elements in brain imaging will offer new metrics and visualization in characterizing tissue microstructure for disease diagnostics. In bioengineering, robust vortex extraction and tracking of 3D conduction velocity fields in the heart will help bioengineers develop new metrics that detect and characterize ischemic stress associated with a heart attack. In atmospheric sciences, extracting and visualizing robust features in wind data will help the atmospheric scientists establish situation awareness of hazardous weather conditions such as wildfires and to provide wildfire weather forecasting and resource planning for firefighting personnel. This project will also provide a unique environment for multidisciplinary activities and training opportunities for students in integrating visualization with scientific applications.

This project will establish a new approach to feature-based visualization with three interconnected aims. First, it will derive novel mathematical formulations of robust features for vector and tensor fields and their ensembles. Second, it will develop new robustness-driven algorithms in feature extraction, tracking, simplification, visual representation, and uncertainty visualization. Third, it will apply and evaluate the proposed framework via close collaborations with scientists in four high-impact application areas: materials science, neuroscience, bioengineering, and atmospheric sciences. Using simulated micro-mechanical fields in an uncracked polycrystal, the project will integrate robust features with visualization to improve the interpretability of micro-mechanical fields and the prediction of fatigue-failure surfaces. Using diffusion tensor imaging (DTI) from the Human Connectome Project, the project will investigate quantifiable characteristics of crossing fibers as part of a long-term goal for deep brain stimulator placement. Using 3D conduction velocity generated in volumes of swine and canine tissues, the project will generate feature-based signatures from vortex stability and evolution and use them, in the long term, for disease diagnostics and medical intervention. Using ensemble datasets generated from the High-Resolution Rapid Refresh Model (HRRR), the project will use robust features in the visualization and statistical analysis of atmospheric models to identify atypical atmospheric conditions for wildfire weather assessment. The research results will be instantiated by a collection of research papers and open-source software tools targeting the communities of collaborating scientists and the large research community. These software tools will be made available via GitHub under MIT or BSD licenses.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

EAGER: Understanding and Mitigating Misinformation in Visualizations on Social Media

Alexander Lex
In a time of crisis, such as during a hurricane or a global pandemic, social media is an important source of information for the general population. In these scenarios, data visualizations are often used to convey information that is critical for decision making by individuals. For example, a visualization of the path of a hurricane can inform the affected population about the need to prepare or evacuate; while a visualization about the prevalence of a disease in a certain area can inform personal choices, such as limiting interactions with others during a relevant time period. Visualizations, however, can be flawed, which can lead to misinterpretation of the data, and, in a crisis, lead to decisions with negative consequences. This project seeks to identify aspects of visualizations that makes them widely shared, identify flaws a visualization might have, and warn social media users about them. Ultimately, this project can lead to better responses to a crisis by the general population, and contribute to improving visualization literacy. Finally, this project will also enable the training of two graduate students, provide opportunities for undergraduate research, and curate material that can be leveraged by educators teaching about visualization design.

These goals will be achieved by applying existing and novel methods, such as topic modeling and calculating measures of social attention, to three large dataset of social media posts related to recent crisis. Using a qualitative coding approach, a taxonomy of design problems will be developed. This taxonomy will be used to label a large dataset. Finally, a prototype intervention in the form of a plug-in that warns of problematic visualizations, but also enables users to classify problems with visualizations they encounter, will be developed. The dataset and the annotations compiled in the course of this project will be shared publicly. The software created will be released under a permissive, non-viral open source license.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

FluoRender: Visualization-Based and Interactive Analysis for Multi-Channel Microscopy Data

Chuck Hansen
FluoRender is a software package for visualizing and analyzing 3D and 4D (3D over time) fluorescence microscopy data. This project will serve the needs of biologists utilizing confocal microscopy for understanding cell development in many organisms and addresses the big-data problem from the massive increase of imaging data from modern high-resolution fluorescence microscopes.

Specific Aim 1 : Visualization of an extended number of volume channels: FluoRender will be enhanced with the multichannel visualization capability by simultaneously supporting several tens to hundreds of channels, which can be acquired from multispectral imaging devices or by registering data of multiple scans. FluoRender will take advantage of the latest volume rendering techniques to visualize significantly improved signal intensity detail compared to pseudo-surfaces.

Specific Aim 2 : Interactive comparison and organization of volume channels: A package of measures will be implemented in FluoRender for directly comparing volume channels. Leveraging the OpenCL programming interface, shape comparisons will be performed interactively on graphics hardware, allowing compound measures for complex morphology as well as immediate visual feedback via multichannel visualization. Interactive comparison will further enable the development of functions for semiautomatic channel organization and multichannel colocalization analysis.

Specific Aim 3 : 4D tracking of structures with irregular and changing shapes: Tracking irregularly shaped and shape-changing structures will substantially expand FluoRender's application for developmental and morphological studies of intracellular organelles, cells, and tissues. This will include a comprehensive tracking system that integrates different modules and allows them to work in an iterative and integrated environment, allowing user-guided, progressive refinement of the segmentation and tracking results.

Specific Aim 4. Fully hardware-accelerated and customizable computing modules: FluoRender will be restructures using compute modules based on the OpenCL standard, which provides not only hardware-accelerated execution speed, but also convenience for customization and reuse. Computing modules will be integrated with visualization features, enabling interactive and visualization-centered analysis. Users will also be able to reorganize and build modules to customize specific workflows for great adaptability.

Public Health Relevance
FluoRender is a software package for visualizing and analyzing 3D and 4D (3D over time) fluorescence microscopy data. This project will serve the needs of biologists utilizing confocal microscopy for understanding cell development in many organisms and addresses the big-data problem from the massive increase of imaging data from modern high-resolution fluorescence microscopes.

CPS: Synergy: A Layered Framework of Sensors, Models, Land-Use Information and Citizens for Understanding Air Quality in Urban Environments

Miriah Meyer, Ross Whitaker, Kerry Kelly, Pierre-Emmanuel Gaillardon
Poor air quality has been linked to not just adverse health effects such as increased incidence of cardiac arrhythmia, lung cancer, heart disease, and mortality, but also to the vitality of a region’s economy. These issues are particularly important in cities such as Salt Lake City (SLC), where topography, climate, and urban expansion combine to create some of the worst air quality episodes in the country. Cities like SLC currently rely on small numbers of expensive sensors placed across a large geographic area to measure air quality, making local, neighborhood-level measurements impossible to determine. Meanwhile, new commodity technologies are leading to fine-grained, community-based strategies for measuring and communicating air quality. Leveraging both of these approaches, this project will develop and deploy a dense, distributed, and dynamic air quality cyber-physical framework -- focusing on fine particulate matter and using SLC as an urban testbed -- to produce neighborhood-level estimates of air quality. The framework includes a network of low-cost sensors, hosted and maintained through a citizen science effort and maker-kit approach.

This research will result in novel developments in three areas: (i) sensor development that focuses on dramatically reducing cost and a movement toward cheap, wearable, passive sensors; (ii) computational modeling that combines heterogeneous sensor measurements with information about weather, topography, and land use patterns; and (iii) visualization interface design that communicates air quality estimates over space and time, coupled with related uncertainty measurements. Each of these areas requires a multidisciplinary approach that integrates existing and novel insights about sensor networks, computational modeling, and sense-making of data, as well as leveraging an engaged and connected community of residents through citizen science.

SBIR Phase II Immediate Delivery of Massive Aerial Imagery to Farmers and Crop Consultants

Valerio Pascucci, Amy Gooch
This Small Business Innovation Research (SBIR) Phase II project will accelerate the adoption of data intensive precision agriculture, increasing yields while decreasing farm inputs such as fertilizers and pesticides. This project removes the software bottleneck (time and labor) in processing large aerial surveys taken by Unmanned Aerial Systems, enabling a cost-effective and timely process to deliver actionable information to farmers. Using frequent high-quality aerial scans, farmers may optimize the use of fertilizers and more finely control the amount of pesticides and herbicides necessary to increase crop yield. Furthermore, farmers mitigate costs and losses by being able to spot problem areas, minimize the spread of plant diseases, and identify issues such as standing water, irrigation malfunctions, and persistent automated machinery errors in planting or cultivation. This project provides special benefit for rural customers having inadequate internet infrastructure by eliminating the need to upload massive imagery to the cloud for processing. The technology is part of a broad initiative in agriculture addressing the need for large increases in food production by 2050 in response to the projected growth of the world’s population to over 9 Billion people.

This project will continue development of algorithms for on-the-fly orthorectification, stitching, and normalization of aerial image mosaics and their deployment in an easy-to-use software prototype. The Phase I already demonstrated industry-leading speeds for such image processing. The technology behind this research project is designed from the ground up to process massive data with less memory and increased speed relative to other approaches, enabled by a proprietary streaming image representation, that allows multichannel gigapixel and terapixel images to be treated as ordinary images. This Phase II supports new extensions to the software that simplify and accelerate delivering a stitched and analyzed map, such as prioritizing computation in regions of the image that a customer is exploring. This would effectively eliminate the delay between image acquisition on unmanned aerial vehicles and when it can be used. Crop consultants have identified this as a transformative capability, as it enables ground-truthing information derived from aerial imagery in the same field visit, saving time and labor. The performance gains in compute-limited environments supported by this project are a key link between new capabilities to gather information and a farmer’s ability to utilize it to increase productivity while reducing costs.

Topology-Preserving Data Sketching for Scientific Visualization

Bei Wang
We are experiencing an information overload from streams of data that arise from scientific instruments and simulations. For example, material scientists use molecular dynamics (MD) simulations to study how fluids (such as gas, oil, and water) interact with heterogeneous porous solids (such as ceramics, cement, and rock) to improve transport phenomena within porous materials, which play critical roles in our energy sector. Such simulations generate large, time-varying, and complex forms of data under different physical and chemical conditions. Keeping track of interesting phenomena and applying appropriate actions (such as storage, analysis, and visualization) while the simulation is running is necessary but challenging. To address this challenge, the goal is no longer to capture and store observations or simulation in detail, but rather to process data efficiently and approximately in order to create a summary - a sketch - which allows queries over large volumes of data to be answered quickly.

The objective of this research is to conduct a systematic study of topology-preserving data sketching techniques to improve visual exploration and understanding of large scientific data. The project will employ topological sketches, that is, compressed representations of the full data that preserve their important structural properties, to support analysis and visualization as the data are generated. Our proposed solution transforms data sketching ideas from statistics, geometry, and linear algebra to develop new topological sketches of complex data. Such sketches will exploit the high spatial resolution and temporal fidelity of in situ data in an intelligent and scalable way. They will reduce data in situ while preserving its structural properties, and subsequently support interactive data exploration. In addition, topological triggers will be integrated into an adaptive workflow to support anomaly detection, computational steering, and decision optimization. The multidisciplinary nature of the proposed work will be broadly applicable in many scientific areas, including applications in computational fluid dynamics and materials science.

Novel 3d Experiments and Simulations Combined with Genetic Optimization for Accelerated Design of Metallic Foams

Valerio Pascucci
Open-cell metallic foams are an exciting class of structural materials that comprise a network of interconnected metallic ligaments, resulting in an interesting foam architecture. These low-density materials have garnered much attention over the past two decades based on their recognized potential for use in multi-functional applications. For example, in addition to serving as light-weight, load-bearing structures, open-cell metallic foams have the potential to serve concurrently as electrodes for energy-storage devices, as hosts for newly generated bone and blood vessels in biomedical implants, or as impact absorbers and noise insulators for advanced high-speed ground transportation. Despite their potential, the widespread deployment of open-cell metallic foams for a broader range of multi-functional applications remains hampered by inefficient, trial-and-error manufacturing approaches. This Designing Materials to Revolutionize and Engineer our Future (DMREF) Grant Opportunities for Academic Liaison with Industry (GOALI) award supports a joint academic-industry research effort to enable more efficient and intelligent design of open-cell metallic foams, and to achieve precise control over their performance for targeted applications. The results will provide dramatic improvements for the industry by increasing both the manufacturing efficiency and the tailorability of the foams, which will help to expand deployment of the foams throughout the energy, defense, biomedical, aerospace, and automotive industries. The research team will host outreach activities to expose students in K-12, undergraduate, and graduate school to this multi-disciplinary STEM research.

This DMREF GOALI award supports research to enable an accelerated and performance-based design paradigm for open-cell metallic foams through the integration of emergent methods in 3D materials characterization with multi-scale modeling and Bayesian optimization. The new design paradigm will be made possible through the discovery of process-structure-property relationships in the foams. The specific objectives include: experimentally modifying manufacturing parameters to produce variants of open-cell metallic foams; performing 3D synchrotron-based crystal-orientation measurements and in-situ X-ray computed tomography experiments to gain unprecedented insight into the hierarchical structure and multi-scale deformation mechanisms of the foam; using high-fidelity, multi-scale (grain-to-continuum) finite-element modeling to investigate micromechanical behavior and predict performance of the as-manufactured foams; conducting virtual tests on synthetic-foam variants to further populate a metallic-foam design space; and using Bayesian optimization on the simulation-based results to enable selection of optimal hierarchical structures (i.e. topology and crystallography) for targeted performance metrics. The research will be a first to decouple the effects of ligament topology and underlying crystal structure on micromechanical behavior of open-cell metallic foams (including microbuckling, local accumulation of slip, and distribution of crack-nucleation sites), which is postulated to influence its performance.

A Scalable Framework for Visual Exploration and Hypotheses Extraction of Phenomics Data

Bei Wang
Understanding how gene by environment interactions result in specific phenotypes is a core goal of modern biology and has real-world impacts on such things as crop management. Developing and managing successful crop practices is a goal that is fundamentally tied to our national food security. By applying novel computational visual analytical methods, this project seeks to identify and unravel the complex web of interactions linking genotypes, environments and phenotypes. These methods will first need to be designed and developed into usable software applications that can handle large volumes of crop phenomics data. High-throughput sensing technologies collect large volumes of field data for many plant traits, such as flowering time, related to crop development and production. The maize cultivars used here come from multiple genotypes that have been grown under a variety of environmental conditions, in order to give the widest range of conditions for understanding the interactions. The resulting data sets are growing quickly, both in size and complexity, but the analytical tools needed to extract knowledge and catalyze scientific discoveries have significantly lagged behind. The methodologies to be developed in this project represent a systematic attempt at bridging this rapidly widening divide. The project is inherently interdisciplinary, involving close research partnerships among computer scientists, plant scientists, and mathematicians. The research outcomes will be tightly integrated with education using a multipronged approach that includes, among others, postdoctoral and student training (graduates and undergraduates), curriculum development for a new campus-wide interdisciplinary undergraduate degree in Data Analytics, conference tutorials for training phenomics data practitioners, and contribution to the recruitment and retention of underrepresented minorities (particularly women) in STEM fields through the Pacific Northwest Louis Stokes Alliance for Minority Participation.

This project will lead to the design and development of a new, scalable, visual analytics platform suitable for hypothesis extraction and refinement from complex phenomics data sets. Focus on hypothesis extraction is critical in the context of phenomics data sets because much of the high-throughput sensing data being generated in crop fields are generated in the absence of specifically formulated hypotheses. Extracting plausible hypotheses from the data represents an important but tedious task. To this end, this project will apply and develop new capabilities using emerging advanced algorithmic principles, particularly from the branch of mathematics called algebraic topology that studies shapes and structure of complex data. The research objectives are three-fold. First, the project will employ and extend emerging algorithmic techniques from algebraic topology to decode the structure of large, complex phenomics data. Second, an interactive visual analytic platform will be developed to facilitate knowledge discovery using the extracted topological structures. Lastly, the quality and validity of a new visual analytic platform designed by this team will be tested using real-world maize data sets as well as simulated inputs as testbeds. The developed framework will encode functions for scientists to delineate hypotheses of three kinds: i) genetic characterization of single complex traits; ii) genetic characterization of multiple traits that share potentially pleiotropic effects; and iii) decoding and detailed characterization of genotype-by-environmental interactions, in particular, through a collaborative pilot study of maize flowering and growth traits. The expected significance of the proposed work is that biologists will be able to extract different types of testable hypotheses from plant phenomics data sets by employing a new class of visual analytic tools, and thus obtain a deeper understanding of the interactions among genotypes, environments and phenotypes. The project is potentially transformative in two ways: i) it will introduce advanced mathematical and computational principles into mainstream phenomic data analysis; and ii) it will usher in a new era where biologists spearhead data-driven hypothesis extraction and discovery with the aid of interactive, informative, and intuitive tools. The project will have a direct impact on the state of software in phenomics for fundamental data-driven discovery. To facilitate broader community adoption, the project will integrate the tools into the CyVerse Institute, and to a community phenomics software outlet. It will also lead to the development of automated scientific workflows. Project website: http://tdaphenomics.eecs.wsu.edu/.

COVID - RAPID: Building a Visual Consensus Model of the SARS-CoV-2 Life Cycle

Janet Iwasa, Miriah Meyer
The COVID-19 epidemic has motivated hundreds (if not thousands) of biological researchers around the globe to redirect their research efforts towards the understanding of SARS-CoV-2. This is leading to an explosion of data and it will be essential to find ways to rapidly digest and integrate new information into a context that facilitates consensus building in the research community. How do researchers and the broader community stay abreast of this flood of information? And how can we quickly move towards building a consensus model of the SARS-CoV-2 life cycle that builds on this explosive body of scientific data and expertise? This work proposes to take a novel and intuitive approach to facilitate scientific discourse and dissemination through the development of: (1) detailed molecular 3-D depictions that put a diverse dataset into the context of the SARS-CoV-2 life cycle, and; (2) provide for annotation tools to be used by researchers to explore and capture scientific discussions that will speed up consensus building to promote a mechanistic understanding of how this virus works. If successful, the work will reduce the time of consensus building from years to months. In addition, a graduate student and postdoc will receive training at the intersection of biological and computer sciences.

Specifically, researchers will work with an international group of SARS-CoV-2 experts to develop detailed and accurate visualizations of all stages of the viral life cycle including cellular entry, RNA replication and transcription, and viral assembly and egress with known energy states, rates, and spatial accuracy. These 3-D visualizations, which will be made freely available online, will be used to stimulate discussions within the scientific community, and will be iteratively updated based on community feedback and new data. To facilitate consensus building, annotation tools will be developed to interactively describe the data used to generate the visualizations and will also mediate and capture scientific discourse surrounding the various molecular mechanisms involved in viral infection. This project will rapidly produce a rich and publicly accessible collection of knowledge about SARS-CoV-2 biology for the global community.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

OpenSpace: An Engine for Dynamic Visualization of Earth and Space Science for Informal Education and Beyond

Chuck Hansen
The American Museum of Natural History (AMNH), in collaboration with informal science institutions (ISI), NASA mission teams and Subject Matter Experts (SME), and academic partners, seeks support for a five-year project to enable STEM education and improve U.S. scientific literacy by engaging a broad spectrum of the American public and STEM learners in cutting-edge NASA science and engineering content.

This project will develop an open source software, called OpenSpace, for visualizing NASA astrophysics, heliophysics, planetary science, and Earth science mission engineering activities and science results for the general public, students, teachers, and citizen scientists everywhere. The project will develop and widely disseminate OpenSpace; create innovative and networked programs with ISI partners; produce educational resources for middle and high school teachers and students; and establish robust partnerships with NASA SMD missions, ISIs, and visualization research centers.

The project is based on the success of pilot efforts to visualize the New Horizons mission and heliophysics and space weather simulation data generated by NASA Goddard’s Community Coordinated Modeling Center. It builds on AMNH’s expertise in science visualization and its record of success in partnering with NASA to develop innovative programming, exhibitions, and Space Shows that engage, inspire, and educate students, teachers, and learners of all ages.

Drawing together a highly qualified and exceptionally talented team of scientists, educators, software engineers, and visualization specialists, the project’s aim is to build a pipeline for transmitting visualized science content from across NASA SMD divisions to ISIs, secondary school classrooms, and the public.

To do so, the project proposes the following objectives:

Develop OpenSpace into a robust and flexible interactive visualization software that supports the presentation of dynamic data sets and that is easily updated for the presentation of current science.
Form a network of ISIs to inform the development of OpenSpace and develop associated programming to engage and educate diverse audiences.
Disseminate OpenSpace via the web to individual users, including teachers as a key audience, with resources for leveraging it as an educational tool.

Project outcomes include:

The establishment of a pipeline connecting NASA SMD content and SMEs with ISIs, secondary school classrooms, and the public.
The development of a new and powerful educational tool for the visualization of a wide range of NASA SMD mission activities and data products.
Enhanced understanding and engagement in STEM among youth, informal and formal educators, and the general public.

Project objectives, activities, and outcomes are closely aligned with, and aim to fulfill, the SMD science education objectives of enabling STEM education, improving U.S. scientific literacy, and advancing national education goals of increasing and sustaining youth and public engagement in STEM and leveraging efforts through partnerships.

Because OpenSpace will be open source, it will be freely accessible to users. It is designed to be compatible with multi-video channel cluster operations for high-resolution wall displays and planetarium domes, as well as for single-channel polar rendering fisheye projections and flat screens, in 2D and 3D. A WebGL version will make it possible for anyone with Internet access to explore OpenSpace. Another core design principle of this project is the ability to network across the Internet to synchronize displays in different locations, creating opportunities for shared experiences of high profile NASA content, including live events. This open source project will have a life far beyond the award period, as it will provide science and education communities access to the source code to modify, enhance, and extend its functionality to best serve audiences in the future.

Extracting the Full Information Content of Astrophysical Data Cubes

Bei Wang
An IFU (Integral Field Unit Spectrometer) allows one to take a high-resolution spectrum at multiple physical locations within an external target. The signal from an astronomical target is distributed into a large number of spaxels (spatial pixels), each with noise from the sky and detectors, and a greatly varying signal to noise ratio across the bundle. IFU bundle technique gives rise to 3-dimensional astrophysical data cubes (two spatial directions and one frequency direction) that require advanced analysis techniques to extract their salient features. In many cases the complex kinematic structure of features of interest further complicates the problem. Furthermore, it is intrinsically difficult to visualize such data and common analysis techniques often involve slicing the data cube along a particular axis, either at a fixed frequency or a fixed spatial location.

A common type of data from IFU bundle technique is the Mapping Nearby Galaxies at APO (MaNGA) survey, which is part of the Sloan Digital Sky Survey IV (SDSS-IV). PI Phillips and PI Rosen have been working to analyze similar data cubes taken at radio frequencies with the ALMA telescope in Chile (see http://alma-tda.cspaul.com). They have been using sophisticated mathematical techniques known as topological data analysis, in particular the contour tree, in order to extract features and remove noise for visualizing data cubes very similar to the ones arise from IFU.

Objective
We would like to apply advanced data analysis and visualization techniques, in particular, those from topological data analysis, to data observed at UV, optical and infrared wavelengths, in order to extract features that are currently inaccessible. In particular, we would like to start by studying the SDSS-IV MaNGA dataset, to which Carnegie Institution for Science and the University of Utah (where the MaNGA reduction and analysis pipelines are run via the Center for High Performance Computing) have full access as Institutional members (the SDSS Data Scientist, Prof. Joel Brownstein of the University of Utah is a PI on this project).

Furthermore, we will explore the applicability of such techniques to other similar datasets that have been acquired using other IFU facilities.

Topological Analysis for Energetic Materials Characterization

Valerio Pascucci
This statement of work supports ongoing efforts towards improved analysis of characterization and surveillance data of energetic materials. The goals are to: 1) use topological segmentations to analyze microstructural changes under aging; 2) explore extending the analysis tools to characterize fine-prill materials; 3) develop techniques to quantify permeable surface area of a lower-density system; and 4) extract age-trendable features from2D-surface profile data.

Tasks

1. Analyze microstructural changes under aging: At various Aging points (in time-temperature space):

Determine matching scales and simplification levels to create best matching segmentations for each dataset
Develop techniques to affinely align pre- & post-aged data sets for maximal correspondence
Use per-grain matching to analyze material changes over time

2. Explore extending the analysis tools to the characterization of fine-prill materials:

In previous years the Utah technology could successfully analyze X-ray CT data for coarser-prill HE materials. Explore the effectiveness of such technology in performing similar analysis on X-ray CT data for fine-prill systems.

3. Develop techniques to quantify permeable surface area of lower-density systems:

The topological segmentation theory could be used to quantify the permeable surface area of lower-density (e.g., porous-powder) systems, and to compute the gas-flow rate through such a specimen under a given pressure-gradient. CONTINGENCY: Availability of high-quality micro-CT data.

4. Extract age-trendable features fromsurface profilometry data

Analyze 2D height-map data from pellet surfaces (measured using a surface profilometer) and device quantitative features that can be used to track age-related changes in material morphology and performance.

Advanced Visualization of Silent Error Propagation in HPC Applications

Valerio Pascucci
High Performance Computing (HPC) systems contain increasingly large numbers of components. This trend, combined with practical limitations on component reliability, makes HPC systems vulnerable to a wide range of faults. These faults degrade systems efficiency and even threaten the correctness of application results. The problem is expected to grow even more significant for Exascale systems. Designing resilient software to run efficiently on such hardware is challenging, and uncertainty about how failures affect programs only complicates the problem.

Disruptions to the micro‐architectural state of hardware components (e.g., caches, reorder buffers or pipeline registers), may cause these components to crash or compute erroneous results. These errors then propagate through layers of the software stack, including the runtime system, support libraries, and application logic. Local memory access to erroneous results can easily propagate the effects of errors across cores; and the remote memory access on modern networks propagates errors across nodes. The reordered memory accesses in use by memory systems introduces further difficulties by obscuring the consistency (ordering) of memory accesses when errors occur. Identifying the propagation of errors through space and time and quantifying it in terms developers can understand is a major problem for error recovery schemes. This is especially true for scientific applications that rely on complex physical or numerical invariants and for resilience techniques that need to identify consistent states.

The ultimate goal of this research is to provide a visualization of the propagation of errors through application and system software in order to identify for application developers the vulnerability of their data structures and code regions to different types of errors, and the way these errors propagate through application state and logic.

VisStore: Seamless Acquisition, Storage, and Distribution of Massive Imagery

Ease of Use and Deployment for a Fast, Scalable Data Movement Infrastructure

Publications in Visualization:

Page 6 of 23

Start
Prev
1
2
3
4
5
6
7
8
9
10
Next
End

OpenSpace: Changing the Narrative of Public Dissemination in Astronomical Visualization from What to How
A. Bock, E. Axelsson, C. Emmart, M. Kuznetsova, C. Hansen, A. Ynnerman. In IEEE Computer Graphics and Applications, Vol. 38, No. 3, IEEE, pp. 44--57. May, 2018.
DOI: 10.1109/mcg.2018.032421653

We present the development of an open-source software called OpenSpace that bridges the gap between scientific discoveries and public dissemination and thus paves the way for the next generation of science communication and data exploration. We describe how the platform enables interactive presentations of dynamic and time-varying processes by domain experts to the general public. The concepts are demonstrated through four cases: Image acquisitions of the New Horizons and Rosetta spacecraft, the dissemination of space weather phenomena, and the display of high-resolution planetary images. Each case has been presented at public events with great success. These cases highlight the details of data acquisition, rather than presenting the final results, showing the audience the value of supporting the efforts of the scientific discovery.

Outcomes of an electronic social network intervention with neuro-oncology patient family caregivers
M. Reblin, D. Ketcher, P. Forsyth, E. Mendivil, L. Kane, J. Pok, M. Meyer, Y.Wu, J. Agutter. In Journal of Neuro-Oncology, Springer Nature, pp. 1--7. May, 2018.
DOI: 10.1007/s11060-018-2909-2

Introduction

Informal family caregivers (FCG) are an integral and crucial human component in the cancer care continuum. However, research and interventions to help alleviate documented anxiety and burden on this group is lacking. To address the absence of effective interventions, we developed the electronic Support Network Assessment Program (eSNAP) which aims to automate the capture and visualization of social support, an important target for overall FCG support. This study seeks to describe the preliminary efficacy and outcomes of the eSNAP intervention.

Methods

Forty FCGs were enrolled into a longitudinal, two-group randomized design to compare the eSNAP intervention in caregivers of patients with primary brain tumors against controls who did not receive the intervention. Participants were followed for six weeks with questionnaires to assess demographics, caregiver burden, anxiety, depression, and social support. Questionnaires given at baseline (T1) and then 3-weeks (T2), and 6-weeks (T3) post baseline questionnaire.

Results

FCGs reported high caregiver burden and distress at baseline, with burden remaining stable over the course of the study. The intervention group was significantly less depressed, but anxiety remained stable across groups.

Conclusions

With the lessons learned and feedback obtained from FCGs, this study is the first step to developing an effective social support intervention to support FCGs and healthcare providers in improving cancer care.

TopoMS: Comprehensive topological exploration for molecular and condensed‐matter systems
H. Bhatia, A.G. Gyulassy, V. Lordi, J.E. Pask, V. Pascucci, P.T. Bremer. In Journal of Computational Chemistry, Vol. 39, No. 16, Wiley, pp. 936--952. March, 2018.
DOI: 10.1002/jcc.25181

We introduce TopoMS, a computational tool enabling detailed topological analysis of molecular and condensed‐matter systems, including the computation of atomic volumes and charges through the quantum theory of atoms in molecules, as well as the complete molecular graph. With roots in techniques from computational topology, and using a shared‐memory parallel approach, TopoMS provides scalable, numerically robust, and topologically consistent analysis. TopoMS can be used as a command‐line tool or with a GUI (graphical user interface), where the latter also enables an interactive exploration of the molecular graph. This paper presents algorithmic details of TopoMS and compares it with state‐of‐the‐art tools: Bader charge analysis v1.0 (Arnaldsson et al., 01/11/17) and molecular graph extraction using Critic2 (Otero‐de‐la‐Roza et al., Comput. Phys. Commun. 2014, 185, 1007). TopoMS not only combines the functionality of these individual codes but also demonstrates up to 4× performance gain on a standard laptop, faster convergence to fine‐grid solution, robustness against lattice bias, and topological consistency. TopoMS is released publicly under BSD License. © 2018 Wiley Periodicals, Inc.

Research and Education in Computational Science and Engineering
U. Ruede, K. Willcox, L. C. McInnes, H. De Sterck, G. Biros, H. Bungartz, J. Corones, E. Cramer, J. Crowley, O. Ghattas, M. Gunzburger, M. Hanke, R. Harrison, M. Heroux, J. Hesthaven, P. Jimack, C. Johnson, K. E. Jordan, D. E. Keyes, R. Krause, V. Kumar, S. Mayer, J. Meza, K. M. Mrken, J. T. Oden, L. Petzold, P. Raghavan, S. M. Shontz, A. Trefethen, P. Turner, V. Voevodin, B. Wohlmuth,, C. S. Woodward. In SIAM Review, Vol. 60, No. 3, SIAM, pp. 707--754. Jan, 2018.
DOI: 10.1137/16m1096840

This report presents challenges, opportunities and directions for computational science and engineering (CSE) research and education for the next decade. Over the past two decades the field of CSE has penetrated both basic and applied research in academia, industry, and laboratories to advance discovery, optimize systems, support decision-makers, and educate the scientific and engineering workforce. Informed by centuries of theory and experiment, CSE performs computational experiments to answer questions that neither theory nor experiment alone is equipped to answer. CSE provides scientists and engineers with algorithmic inventions and software systems that transcend disciplines and scales. CSE brings the power of parallelism to bear on troves of data. Mathematics-based advanced computing has become a prevalent means of discovery and innovation in essentially all areas of science, engineering, technology, and society; and the CSE community is at the core of this transformation. However, a combination of disruptive developments—including the architectural complexity of extreme-scale computing, the data revolution and increased attention to data-driven discovery, and the specialization required to follow the applications to new frontiers—is redefining the scope and reach of the CSE endeavor. With these many current and expanding opportunities for the CSE field, there is a growing demand for CSE graduates and a need to expand CSE educational offerings. This need includes CSE programs at both the undergraduate and graduate levels, as well as continuing education and professional development programs, exploiting the synergy between computational science and data science. Yet, as institutions consider new and evolving educational programs, it is essential to consider the broader research challenges and opportunities that provide the context for CSE education and workforce development.

ISAVS: Interactive Scalable Analysis and Visualization System
S. Petruzza, A. Venkat, A. Gyulassy, G. Scorzelli, F. Federer, A. Angelucci, V. Pascucci, P. T. Bremer. In ACM SIGGRAPH Asia 2017 Symposium on Visualization, ACM Press, 2017.
DOI: 10.1145/3139295.3139299

Modern science is inundated with ever increasing data sizes as computational capabilities and image acquisition techniques continue to improve. For example, simulations are tackling ever larger domains with higher fidelity, and high-throughput microscopy techniques generate larger data that are fundamental to gather biologically and medically relevant insights. As the image sizes exceed memory, and even sometimes local disk space, each step in a scientific workflow is impacted. Current software solutions enable data exploration with limited interactivity for visualization and analytic tasks. Furthermore analysis on HPC systems often require complex hand-written parallel implementations of algorithms that suffer from poor portability and maintainability. We present a software infrastructure that simplifies end-to-end visualization and analysis of massive data. First, a hierarchical streaming data access layer enables interactive exploration of remote data, with fast data fetching to test analytics on subsets of the data. Second, a library simplifies the process of developing new analytics algorithms, allowing users to rapidly prototype new approaches and deploy them in an HPC setting. Third, a scalable runtime system automates mapping analysis algorithms to whatever computational hardware is available, reducing the complexity of developing scaling algorithms. We demonstrate the usability and performance of our system using a use case from neuroscience: filtering, registration, and visualization of tera-scale microscopy data. We evaluate the performance of our system using a leadership-class supercomputer, Shaheen II.

CPU Volume Rendering of Adaptive Mesh Refinement Data
I. Wald, C. Brownlee, W. Usher, A. Knoll. In ACM SIGGRAPH Asia 2017 Symposium on Visualization, ACM Press, 2017.
DOI: 10.1145/3139295.3139305

Adaptive Mesh Refinement (AMR) methods are widespread in scientific computing, and visualizing the resulting data with efficient and accurate rendering methods can be vital for enabling interactive data exploration. In this work, we detail a comprehensive solution for directly volume rendering block-structured (Berger-Colella) AMR data in the OSPRay interactive CPU ray tracing framework. In particular, we contribute a general method for representing and traversing AMR data using a kd-tree structure, and four different reconstruction options, one of which in particular (the basis function approach) is novel compared to existing methods. We demonstrate our system on two types of block-structured AMR data and compressed scalar field data, and show how it can be easily used in existing production-ready applications through a prototypical integration in the widely used visualization program ParaView.

Revisiting Abnormalities in Brain Network Architecture Underlying Autism Using Topology-Inspired Statistical Inference,
S. Palande, V. Jose, B. Zielinski, J. Anderson, P.T. Fletcher, B. Wang. In Connectomics in NeuroImaging, Springer International Publishing, pp. 98--107. 2017.
DOI: 10.1007/978-3-319-67159-8_12

A large body of evidence relates autism with abnormal structural and functional brain connectivity. Structural covariance MRI (scMRI) is a technique that maps brain regions with covarying gray matter density across subjects. It provides a way to probe the anatomical structures underlying intrinsic connectivity networks (ICNs) through the analysis of the gray matter signal covariance. In this paper, we apply topological data analysis in conjunction with scMRI to explore network-specific differences in the gray matter structure in subjects with autism versus age-, gender- and IQ-matched controls. Specifically, we investigate topological differences in gray matter structures captured by structural covariance networks (SCNs) derived from three ICNs strongly implicated in autism, namely, the salience network (SN), the default mode network (DMN) and the executive control network (ECN). By combining topological data analysis with statistical inference, our results provide evidence of statistically significant network-specific structural abnormalities in autism, from SCNs derived from SN and ECN. These differences in brain architecture are consistent with direct structural analysis using scMRI (Zielinski et al. 2012).

Worksheets for Guiding Novices through the Visualization Design Process
S. McKenna, A. Lex, M. Meyer. In CoRR, 2017.

For visualization pedagogy, an important but challenging notion to teach is design, from making to evaluating visualization encodings, user interactions, or data visualization systems. In our previous work, we introduced the design activity framework to codify the high-level activities of the visualization design process. This framework has helped structure experts' design processes to create visualization systems, but the framework's four activities lack a breakdown into steps with a concrete example to help novices utilizing this framework in their own real-world design process. To provide students with such concrete guidelines, we created worksheets for each design activity: understand, ideate, make, and deploy. Each worksheet presents a high-level summary of the activity with actionable, guided steps for a novice designer to follow. We validated the use of this framework and the worksheets in a graduate-level visualization course taught at our university. For this evaluation, we surveyed the class and conducted 13 student interviews to garner qualitative, open-ended feedback and suggestions on the worksheets. We conclude this work with a discussion and highlight various areas for future work on improving visualization design pedagogy.

Exploration of Heterogeneous Data Using Robust Similarity
M. Mirzargar, R.T. Whitaker, R.M. Kirby. In CoRR, 2017.

Heterogeneous data pose serious challenges to data analysis tasks, including exploration and visualization. Current techniques often utilize dimensionality reductions, aggregation, or conversion to numerical values to analyze heterogeneous data. However, the effectiveness of such techniques to find subtle structures such as the presence of multiple modes or detection of outliers is hindered by the challenge to find the proper subspaces or prior knowledge to reveal the structures. In this paper, we propose a generic similarity-based exploration technique that is applicable to a wide variety of datatypes and their combinations, including heterogeneous ensembles. The proposed concept of similarity has a close connection to statistical analysis and can be deployed for summarization, revealing fine structures such as the presence of multiple modes, and detection of anomalies or outliers. We then propose a visual encoding framework that enables the exploration of a heterogeneous dataset in different levels of detail and provides insightful information about both global and local structures. We demonstrate the utility of the proposed technique using various real datasets, including ensemble data.

Visualizing Sensor Network Coverage with Location Uncertainty
T. Sodergren, J. Hair, J.M. Phillips, B. Wang. In CoRR, Vol. abs/1710.06925, 2017.

We present an interactive visualization system for exploring the coverage in sensor networks with uncertain sensor locations. We consider a simple case of uncertainty where the location of each sensor is confined to a discrete number of points sampled uniformly at random from a region with a fixed radius. Employing techniques from topological data analysis, we model and visualize network coverage by quantifying the uncertainty defined on its simplicial complex representations. We demonstrate the capabilities and effectiveness of our tool via the exploration of randomly distributed sensor networks.

Visualization in Meteorology---A Survey of Techniques and Tools for Data Analysis Tasks
M. Rautenhaus, M. Böttinger, S. Siemen, R. Hoffman, R.M. Kirby, M. Mirzargar, N. Rober, R. Westermann. In IEEE Transactions on Visualization and Computer Graphics, IEEE, pp. 1--1. 2017.
DOI: 10.1109/tvcg.2017.2779501

This article surveys the history and current state of the art of visualization in meteorology, focusing on visualization techniques and tools used for meteorological data analysis. We examine characteristics of meteorological data and analysis tasks, describe the development of computer graphics methods for visualization in meteorology from the 1960s to today, and visit the state of the art of visualization techniques and tools in operational weather forecasting and atmospheric research. We approach the topic from both the visualization and the meteorological side, showing visualization techniques commonly used in meteorological practice, and surveying recent studies in visualization research aimed at meteorological applications. Our overview covers visualization techniques from the fields of display design, 3D visualization, flow dynamics, feature-based visualization, comparative visualization and data fusion, uncertainty and ensemble visualization, interactive visual analysis, efficient rendering, and scalability and reproducibility. We discuss demands and challenges for visualization research targeting meteorological data analysis, highlighting aspects in demonstration of benefit, interactive visual analysis, seamless visualization, ensemble visualization, 3D visualization, and technical issues.

Taggle: Scalable Visualization of Tabular Data through Aggregation
K. Furmanova, S. Gratzl, H. Stitz, T. Zichner, M. Jaresova, M. Ennemoser, A. Lex, M. Streit. In CoRR, 2017.

Visualization of tabular data---for both presentation and exploration purposes---is a well-researched area. Although effective visual presentations of complex tables are supported by various plotting libraries, creating such tables is a tedious process and requires scripting skills. In contrast, interactive table visualizations that are designed for exploration purposes either operate at the level of individual rows, where large parts of the table are accessible only via scrolling, or provide a high-level overview that often lacks context-preserving drill-down capabilities. In this work we present Taggle, a novel visualization technique for exploring and presenting large and complex tables that are composed of individual columns of categorical or numerical data and homogeneous matrices. The key contribution of Taggle is the hierarchical aggregation of data subsets, for which the user can also choose suitable visual representations.The aggregation strategy is complemented by the ability to sort hierarchically such that groups of items can be flexibly defined by combining categorical stratifications and by rich data selection and filtering capabilities. We demonstrate the usefulness of Taggle for interactive analysis and presentation of complex genomics data for the purpose of drug discovery.

Reducing network congestion and synchronization overhead during aggregation of hierarchical data,
S. Kumar, D. Hoang, S. Petruzza, J. Edwards, V. Pascucci. In 2017 IEEE 24th International Conference on High Performance Computing (HiPC), IEEE, Dec, 2017.
DOI: 10.1109/hipc.2017.00034

Hierarchical data representations have been shown to be effective tools for coping with large-scale scientific data. Writing hierarchical data on supercomputers, however, is challenging as it often involves all-to-one communication during aggregation of low-resolution data which tends to span the entire network domain, resulting in several bottlenecks. We introduce the concept of indexing templates, which succinctly describe data organization and can be used to alter movement of data in beneficial ways. We present two techniques, domain partitioning and localized aggregation, that leverage indexing templates to alleviate congestion and synchronization overheads during data aggregation. We report experimental results that show significant I/O speedup using our proposed schemes on two of today's fastest supercomputers, Mira and Shaheen II, using the Uintah and S3D simulation frameworks.

Vietoris-Rips and Cech Complexes of Metric Gluings
M. Adamaszek, H. Adams, E. Gasparovic, M. Gommel, E. Purvine, R. Sazdanovic, B. Wang, Y. Wang, L. Ziegelmeier. In CoRR, 2017.

We study Vietoris-Rips and Cech complexes of metric wedge sums and metric gluings. We show that the Vietoris-Rips (resp. Cech) complex of a wedge sum, equipped with a natural metric, is homotopy equivalent to the wedge sum of the Vietoris-Rips (resp. Cech) complexes. We also provide generalizations for certain metric gluings, i.e. when two metric spaces are glued together along a common isometric subset. As our main example, we deduce the homotopy type of the Vietoris-Rips complex of two metric graphs glued together along a sufficiently short path. As a result, we can describe the persistent homology, in all homological dimensions, of the Vietoris-Rips complexes of a wide class of metric graphs.

Sheaf-Theoretic Stratification Learning
A. Brown, B. Wang. In CoRR, 2017.

In this paper, we investigate a sheaf-theoretic interpretation of stratification learning. Motivated by the work of Alexandroff (1937) and McCord (1978), we aim to redirect efforts in the computational topology of triangulated compact polyhedra to the much more computable realm of sheaves on partially ordered sets. Our main result is the construction of stratification learning algorithms framed in terms of a sheaf on a partially ordered set with the Alexandroff topology. We prove that the resulting decomposition is the unique minimal stratification for which the strata are homogeneous and the given sheaf is constructible. In particular, when we choose to work with the local homology sheaf, our algorithm gives an alternative to the local homology transfer algorithm given in Bendich et al. (2012), and the cohomology stratification algorithm given in Nanda (2017). We envision that our sheaf-theoretic algorithm could give rise to a larger class of stratification beyond homology-based stratification. This approach also points toward future applications of sheaf theory in the study of topological data analysis by illustrating the utility of the language of sheaf theory in generalizing existing algorithms.

Interactive Visual Exploration And Refinement Of Cluster Assignments
M. Kern, A. Lex, N. Gehlenborg, C. R. Johnson. In BMC Bioinformatics, Cold Spring Harbor Laboratory, April, 2017.
DOI: 10.1101/123844

Background:
With ever-increasing amounts of data produced in biology research, scientists are in need of efficient data analysis methods. Cluster analysis, combined with visualization of the results, is one such method that can be used to make sense of large data volumes. At the same time, cluster analysis is known to be imperfect and depends on the choice of algorithms, parameters, and distance measures. Most clustering algorithms don't properly account for ambiguity in the source data, as records are often assigned to discrete clusters, even if an assignment is unclear. While there are metrics and visualization techniques that allow analysts to compare clusterings or to judge cluster quality, there is no comprehensive method that allows analysts to evaluate, compare, and refine cluster assignments based on the source data, derived scores, and contextual data.

Results:
In this paper, we introduce a method that explicitly visualizes the quality of cluster assignments, allows comparisons of clustering results and enables analysts to manually curate and refine cluster assignments. Our methods are applicable to matrix data clustered with partitional, hierarchical, and fuzzy clustering algorithms. Furthermore, we enable analysts to explore clustering results in context of other data, for example, to observe whether a clustering of genomic data results in a meaningful differentiation in phenotypes.

Conclusions:
Our methods are integrated into Caleydo StratomeX, a popular, web-based, disease subtype analysis tool. We show in a usage scenario that our approach can reveal ambiguities in cluster assignments and produce improved clusterings that better differentiate genotypes and phenotypes.

Massively Parallel Simulations of Spread of Infectious Diseases over Realistic Social Networks
A. Bhatele, J. Yeom, N. Jain, C. J. Kuhlman, Y. Livnat, K. R. Bisset, L. V. Kale, M. V. Marathe. In 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), May, 2017.
DOI: 10.1109/ccgrid.2017.141

Controlling the spread of infectious diseases in large populations is an important societal challenge. Mathematically, the problem is best captured as a certain class of reaction-diffusion processes (referred to as contagion processes) over appropriate synthesized interaction networks. Agent-based models have been successfully used in the recent past to study such contagion processes. We describe EpiSimdemics, a highly scalable, parallel code written in Charm++ that uses agent-based modeling to simulate disease spreads over large, realistic, co-evolving interaction networks. We present a new parallel implementation of EpiSimdemics that achieves unprecedented strong and weak scaling on different architectures — Blue Waters, Cori and Mira. EpiSimdemics achieves five times greater speedup than the second fastest parallel code in this field. This unprecedented scaling is an important step to support the long term vision of real-time epidemic science. Finally, we demonstrate the capabilities of EpiSimdemics by simulating the spread of influenza over a realistic synthetic social contact network spanning the continental United States (∼280 million nodes and 5.8 billion social contacts).

A Virtual Reality Visualization Tool for Neuron Tracing
W. Usher, P. Klacansky, F. Federer, P. T. Bremer, A. Knoll, J. Yarch, A. Angelucci, V. Pascucci. In IEEE Transactions on Visualization and Computer Graphics, IEEE, 2017.
ISSN: 1077-2626
DOI: 10.1109/TVCG.2017.2744079

Tracing neurons in large-scale microscopy data is crucial to establishing a wiring diagram of the brain, which is needed to understand how neural circuits in the brain process information and generate behavior. Automatic techniques often fail for large and complex datasets, and connectomics researchers may spend weeks or months manually tracing neurons using 2D image stacks. We present a design study of a new virtual reality (VR) system, developed in collaboration with trained neuroanatomists, to trace neurons in microscope scans of the visual cortex of primates. We hypothesize that using consumer-grade VR technology to interact with neurons directly in 3D will help neuroscientists better resolve complex cases and enable them to trace neurons faster and with less physical and mental strain. We discuss both the design process and technical challenges in developing an interactive system to navigate and manipulate terabyte-sized image volumes in VR. Using a number of different datasets, we demonstrate that, compared to widely used commercial software, consumer-grade VR presents a promising alternative for scientists.

Progressive CPU Volume Rendering with Sample Accumulation
W. Usher, J. Amstutz, C. Brownlee, A. Knoll, I. Wald . In Eurographics Symposium on Parallel Graphics and Visualization, Edited by Alexandru Telea and Janine Bennett, The Eurographics Association, 2017.
ISBN: 978-3-03868-034-5
ISSN: 1727-348X
DOI: 10.2312/pgv.20171090

We present a new method for progressive volume rendering by accumulating object-space samples over successively rendered frames. Existing methods for progressive refinement either use image space methods or average pixels over frames, which can blur features or integrate incorrectly with respect to depth. Our approach stores samples along each ray, accumulates new samples each frame into a buffer, and progressively interleaves and integrates these samples. Though this process requires additional memory, it ensures interactivity and is well suited for CPU architectures with large memory and cache. This approach also extends well to distributed rendering in cluster environments. We implement this technique in Intel's open source OSPRay CPU ray tracing framework and demonstrate that it is particularly useful for rendering volumetric data with costly sampling functions.

Pathways for Theoretical Advances in Visualization
M. Chen, G. Grinstein, C. R. Johnson, J. Kennedy, M. Tory. In IEEE Computer Graphics and Applications, IEEE, pp. 103--112. July, 2017.

More than a decade ago, Chris Johnson proposed the "Theory of Visualization" as one of the top research problems in visualization. Since then, there have been several theory-focused events, including three workshops and three panels at IEEE Visualization (VIS) Conferences. Together, these events have produced a set of convincing arguments.

Page 6 of 23

Start
Prev
1
2
3
4
5
6
7
8
9
10
Next
End

SCI