Filament plots for data visualization Strawn, Nate
Applied and computational harmonic analysis,
September 2022, 2022-09-00, Volume:
60
Journal Article
Peer reviewed
Open access
The efficiency of modern computer graphics allows us to explore collections of space curves simultaneously with “drag-to-rotate” interfaces. This inspires us to replace “scatterplots of points” with ...“scatterplots of curves” to simultaneously visualize relationships across an entire dataset. Since spaces of curves are infinite dimensional, scatterplots of curves avoid the “lossy” nature of scatterplots of points. In particular, if two points are close in a scatterplot of points derived from high-dimensional data, it does not generally follow that the two associated data points are close in the data space. Andrews plots provide scatterplots of curves that perfectly preserve Euclidean distances, but simultaneous visualization of these graphs over an entire dataset produces “visual clutter” because graphs of functions generally overlap in 2D. We mitigate this “visual clutter” issue by constructing computationally inexpensive 3D extensions of Andrews plots. First, we construct optimally smooth 3D Andrews plots by considering linear isometries from Euclidean data spaces to spaces of planar parametric curves. We rigorously parametrize the linear isometries that produce (on average) optimally smooth curves over a given dataset. This parameterization of optimal isometries reveals many degrees of freedom, and (using recent results on generalized Gauss sums) we identify a particular member of this set which admits an asymptotic “tour” property that avoids certain local degeneracies as well. Finally, we construct unit-length 3D curves (filaments) from Bishop frames induced by 3D Andrews plots. We conclude with examples of filament plots for several standard datasets,1 illustrating how filament plots avoid “visual clutter”.
The increasing complexity and uncertainties of modern power systems are placing significant demands on signal monitoring techniques. This work proposes the Circular Trajectory Approach (CTA) for ...online sinusoidal signal distortion monitoring and visualization. CTA can detect distortions of a sinusoidal signal. Compared with existing waveform anomaly detection techniques, CTA is faster in detection and less computation intensive. It thus supports edge devices and online applications. CTA also offers a new means of sinusoidal signal distortion visualization. It can reveal the distorted sections in a sinusoidal cycle and clearly display the distortions. The proposed approach is tested on real data from an open source EPRI dataset.
We introduce Tilt Map , a novel interaction technique for intuitively transitioning between 2D and 3D map visualisations in immersive environments. Our focus is visualising data associated with areal ...features on maps, for example, population density by state. Tilt Map transitions from 2D choropleth maps to 3D prism maps to 2D bar charts to overcome the limitations of each. Our article includes two user studies. The first study compares subjects' task performance interpreting population density data using 2D choropleth maps and 3D prism maps in virtual reality (VR). We observed greater task accuracy with prism maps, but faster response times with choropleth maps. The complementarity of these views inspired our hybrid Tilt Map design. Our second study compares Tilt Map to: a side-by-side arrangement of the various views; and interactive toggling between views. The results indicate benefits for Tilt Map in user preference; and accuracy (versus side-by-side) and time (versus toggle).
The grammar of graphics is ubiquitous, providing the foundation for a variety of popular visualization tools and toolkits. Yet support for uncertainty visualization in the grammar graphics-beyond ...simple variations of error bars, uncertainty bands, and density plots-remains rudimentary. Research in uncertainty visualization has developed a rich variety of improved uncertainty visualizations, most of which are difficult to create in existing grammar of graphics implementations. ggdist, an extension to the popular ggplot2 grammar of graphics toolkit, is an attempt to rectify this situation. ggdist unifies a variety of uncertainty visualization types through the lens of distributional visualization, allowing functions of distributions to be mapped to directly to visual channels (aesthetics), making it straightforward to express a variety of (sometimes weird!) uncertainty visualization types. This distributional lens also offers a way to unify Bayesian and frequentist uncertainty visualization by formalizing the latter with the help of confidence distributions. In this paper, I offer a description of this uncertainty visualization paradigm and lessons learned from its development and adoption: ggdist has existed in some form for about six years (originally as part of the tidybayes R package for post-processing Bayesian models), and it has evolved substantially over that time, with several rewrites and API re-organizations as it changed in response to user feedback and expanded to cover increasing varieties of uncertainty visualization types. Ultimately, given the huge expressive power of the grammar of graphics and the popularity of tools built on it, I hope a catalog of my experience with ggdist will provide a catalyst for further improvements to formalizations and implementations of uncertainty visualization in grammar of graphics ecosystems. A free copy of this paper is available at https://osf.io/2gsz6. All supplemental materials are available at https://github.com/mjskay/ggdistpaper and are archived on Zenodo at doi:10.5281/zenodo.7770984.
We propose an efficient algorithm for solving orthogonal canonical correlation analysis (OCCA) in the form of trace-fractional structure and orthogonal linear projections. Even though orthogonality ...has been widely used and proved to be a useful criterion for visualization, pattern recognition and feature extraction, existing methods for solving OCCA problem are either numerically unstable by relying on a deflation scheme, or less efficient by directly using generic optimization methods. In this paper, we propose an alternating numerical scheme whose core is the sub-maximization problem in the trace-fractional form with an orthogonality constraint. A customized self-consistent-field (SCF) iteration for this sub-maximization problem is devised. It is proved that the SCF iteration is globally convergent to a KKT point and that the alternating numerical scheme always converges. We further formulate a new trace-fractional maximization problem for orthogonal multiset CCA and propose an efficient algorithm with an either Jacobi-style or Gauss-Seidel-style updating scheme based on the SCF iteration. Extensive experiments are conducted to evaluate the proposed algorithms against existing methods, including real-world applications of multi-label classification and multi-view feature extraction. Experimental results show that our methods not only perform competitively to or better than the existing methods but also are more efficient.
Interactive Data Comics Wang, Zezhong; Romat, Hugo; Chevalier, Fanny ...
IEEE transactions on visualization and computer graphics,
2022-Jan., 2022-01-00, 2022-1-00, 20220101, Volume:
28, Issue:
1
Journal Article
Peer reviewed
Open access
This paper investigates how to make data comics interactive. Data comics are an effective and versatile means for visual communication, leveraging the power of sequential narration and combined ...textual and visual content, while providing an overview of the storyline through panels assembled in expressive layouts. While a powerful static storytelling medium that works well on paper support, adding interactivity to data comics can enable non-linear storytelling, personalization, levels of details, explanations, and potentially enriched user experiences. This paper introduces a set of operations tailored to support data comics narrative goals that go beyond the traditional linear, immutable storyline curated by a story author. The goals and operations include adding and removing panels into pre-defined layouts to support branching, change of perspective, or access to detail-on-demand, as well as providing and modifying data, and interacting with data representation, to support personalization and reader-defined data focus. We propose a lightweight specification language, COMICSCRIPT, for designers to add such interactivity to static comics. To assess the viability of our authoring process, we recruited six professional illustrators, designers and data comics enthusiasts and asked them to craft an interactive comic, allowing us to understand authoring workflow and potential of our approach. We present examples of interactive comics in a gallery. This initial step towards understanding the design space of interactive comics can inform the design of creation tools and experiences for interactive storytelling.
In response to the rising popularity of online training, this study addresses the crucial need for effective assessment methods at PT XYZ. The research focuses on developing a comprehensive solution ...through a data visualization dashboard and a machine learning model. The data visualization dashboard, created using Tableau, provides an interactive platform for exploring training data. It offers valuable insights into employees learning progress and needs, empowering them to monitor their advancement and identify areas for improvement effectively. Simultaneously, a machine learning model was developed using Python and Google Collab, employing decision trees and random forest algorithms. The model exhibited promising results with an accuracy rate of 69% for decision trees and 70% for random forests, indicating its proficiency in predicting skill groups. Furthermore, the study rigorously evaluated the dashboard and machine learning model using a 20% holdout dataset, affirming their effectiveness. The dashboard, deployed on a web server, ensures accessibility to all PT XYZ employees, enhancing user experience and engagement. Notably, the dashboard's user-friendly interface allows employees to actively participate in their learning journey, while the machine learning model generates personalized training recommendations based on their progress and needs. In summary, this research provides a practical and innovative solution to the challenge of online training assessment at PT XYZ. By combining data visualization techniques and machine learning algorithms, the developed tools significantly enhance the efficiency and effectiveness of training programs. These findings contribute valuable insights into online training assessment methodologies and pave the way for improved learning experiences in the digital age.
Air pollution has become a serious public health problem for many cities around the world. To find the causes of air pollution, the propagation processes of air pollutants must be studied at a large ...spatial scale. However, the complex and dynamic wind fields lead to highly uncertain pollutant transportation. The state-of-the-art data mining approaches cannot fully support the extensive analysis of such uncertain spatiotemporal propagation processes across multiple districts without the integration of domain knowledge. The limitation of these automated approaches motivates us to design and develop AirVis, a novel visual analytics system that assists domain experts in efficiently capturing and interpreting the uncertain propagation patterns of air pollution based on graph visualizations. Designing such a system poses three challenges: a) the extraction of propagation patterns; b) the scalability of pattern presentations; and c) the analysis of propagation processes. To address these challenges, we develop a novel pattern mining framework to model pollutant transportation and extract frequent propagation patterns efficiently from large-scale atmospheric data. Furthermore, we organize the extracted patterns hierarchically based on the minimum description length (MDL) principle and empower expert users to explore and analyze these patterns effectively on the basis of pattern topologies. We demonstrated the effectiveness of our approach through two case studies conducted with a real-world dataset and positive feedback from domain experts.
A Deep Generative Model for Graph Layout Kwon, Oh-Hyun; Ma, Kwan-Liu
IEEE transactions on visualization and computer graphics,
2020-Jan., 2020-Jan, 2020-1-00, 20200101, Volume:
26, Issue:
1
Journal Article
Peer reviewed
Open access
Different layouts can characterize different aspects of the same graph. Finding a "good" layout of a graph is thus an important task for graph visualization. In practice, users often visualize a ...graph in multiple layouts by using different methods and varying parameter settings until they find a layout that best suits the purpose of the visualization. However, this trial-and-error process is often haphazard and time-consuming. To provide users with an intuitive way to navigate the layout design space, we present a technique to systematically visualize a graph in diverse layouts using deep generative models. We design an encoder-decoder architecture to learn a model from a collection of example layouts, where the encoder represents training examples in a latent space and the decoder produces layouts from the latent space. In particular, we train the model to construct a two-dimensional latent space for users to easily explore and generate various layouts. We demonstrate our approach through quantitative and qualitative evaluations of the generated layouts. The results of our evaluations show that our model is capable of learning and generalizing abstract concepts of graph layouts, not just memorizing the training examples. In summary, this paper presents a fundamentally new approach to graph visualization where a machine learning model learns to visualize a graph from examples without manually-defined heuristics.