- Rubel, Oliver;
- Prabhat;
- Wu, Kesheng;
- Childs, Hank;
- Meredith, Jeremy;
- Geddes, Cameron GR;
- Cormier-Michel, Estelle;
- Ahern, Sean;
- Weber, Gunther H;
- Messmer, Peter;
- Hagen, Hans;
- Hamann, Bernd;
- Bethel, E Wes
One of the central challenges in modern science is the need to quickly derive knowledge and understanding from large, complex collections of data. We present a new approach that deals with this challenge by combining and extending techniques from high performance visual data analysis and scientific data management. This approach is demonstrated within the context of gaining insight from complex, time-varying datasets produced by a laser wakefield accelerator simulation. Our approach leverages histogram-based parallel coordinates for both visual information display as well as a vehicle for guiding a data mining operation. Data extraction and subsetting are implemented with state-of-the-art index/query technology. This approach, while applied here to accelerator science, is generally applicable to a broad set of science applications, and is implemented in a production-quality visual data analysis infrastructure. We conduct a detailed performance analysis and demonstrate good scalability on a distributed memory Cray XT4 system. © 2008 IEEE.