Search Results

Now showing 1 - 10 of 75
  • Item
    Visual-Interactive Preprocessing of Multivariate Time Series Data
    (The Eurographics Association and John Wiley & Sons Ltd., 2019) Bernard, Jürgen; Hutter, Marco; Reinemuth, Heiko; Pfeifer, Hendrik; Bors, Christian; Kohlhammer, Jörn; Gleicher, Michael and Viola, Ivan and Leitte, Heike
    Pre-processing is a prerequisite to conduct effective and efficient downstream data analysis. Pre-processing pipelines often require multiple routines to address data quality challenges and to bring the data into a usable form. For both the construction and the refinement of pre-processing pipelines, human-in-the-loop approaches are highly beneficial. This particularly applies to multivariate time series, a complex data type with multiple values developing over time. Due to the high specificity of this domain, it has not been subject to in-depth research in visual analytics. We present a visual-interactive approach for preprocessing multivariate time series data with the following aspects. Our approach supports analysts to carry out six core analysis tasks related to pre-processing of multivariate time series. To support these tasks, we identify requirements to baseline toolkits that may help practitioners in their choice. We characterize the space of visualization designs for uncertainty-aware pre-processing and justify our decisions. Two usage scenarios demonstrate applicability of our approach, design choices, and uncertainty visualizations for the six analysis tasks. This work is one step towards strengthening the visual analytics support for data pre-processing in general and for uncertainty-aware pre-processing of multivariate time series in particular.
  • Item
    Guidance or No Guidance? A Decision Tree Can Help
    (The Eurographics Association, 2018) Ceneda, Davide; Gschwandtner, Theresia; May, Thorsten; Miksch, Silvia; Streit, Marc; Tominski, Christian; Christian Tominski and Tatiana von Landesberger
    Guidance methods have the potential of bringing considerable benefits to Visual Analytics (VA), alleviating the burden on the user and allowing a positive analysis outcome. However, the boundary between conventional VA approaches and guidance is not sharply defined. As a consequence, framing existing guidance methods is complicated and the development of new approaches is also compromised. In this paper, we try to bring these concepts in order, defining clearer boundaries between guidance and no-guidance. We summarize our findings in form of a decision tree that allows scientists and designers to easily frame their solutions. Finally, we demonstrate the usefulness of our findings by applying our guideline to a set of published approaches.
  • Item
    Supporting Visual Parameter Analysis of Time Series Segmentation with Correlation Calculations
    (The Eurographics Association, 2018) Eichner, Christian; Schumann, Heidrun; Tominski, Christian; Anna Puig and Renata Raidou
    Parameter analysis can be used to find out how individual parameters influence the output of an algorithm. We aim to support the visual parameter analysis of algorithms for the segmentation of time series. To this end, we automatically search for correlations between parameters and the segmentation outputs. Correlations are not only determined globally, but also locally within parameter subspaces. Calculated correlations are used to visually emphasize parameter and value ranges with high influence on the segmentation. By interactive exploration, the analyst can study the multidimensional parameter space in depth.
  • Item
    Visual Predictive Analytics using iFuseML
    (The Eurographics Association, 2018) Sehgal, Gunjan; Rawat, Mrinal; Gupta, Bindu; Gupta, Garima; Sharma, Geetika; Shroff, Gautam; Christian Tominski and Tatiana von Landesberger
    Solving a predictive analytics problem involves multiple machine learning tasks in a workflow. Directing such workflows efficiently requires an understanding of data so as to identify and handle missing values and outliers, compute feature correlations and to select appropriate models and hyper-parameters. While traditional machine learning techniques are capable of handling these challenges to a certain extent, visual analysis of data and results at each stage can significantly assist in optimal processing of the workflow. We present iFuseML , a visual interactive framework to support analysts in machine learning workflows via insightful data visualizations as well as natural language interfaces where appropriate. Our platform lets the user intuitively search and explore datasets, join relevant datasets using natural language queries, detect and visualize multidimensional outliers and explore feature relationships using Bayesian coordinated views. We also demonstrate how visualization assists in comparing prediction errors to guide ensemble models so as to generate more accurate predictions. We illustrate our framework using a house price dataset from Kaggle, where using iFuseML simplified the machine learning workflow and helped improve the resulting prediction accuracy.
  • Item
    Visualizing Multidimensional Data with Order Statistics
    (The Eurographics Association and John Wiley & Sons Ltd., 2018) Raj, Mukund; Whitaker, Ross T.; Jeffrey Heer and Heike Leitte and Timo Ropinski
    Multidimensional data sets are common in many domains, and dimensionality reduction methods that determine a lower dimensional embedding are widely used for visualizing such data sets. This paper presents a novel method to project data onto a lower dimensional space by taking into account the order statistics of the individual data points, which are quantified by their depth or centrality in the overall set. Thus, in addition to conveying relative distances in the data, the proposed method also preserves the order statistics, which are often lost or misrepresented by existing visualization methods. The proposed method entails a modification of the optimization objective of conventional multidimensional scaling (MDS) by introducing a term that penalizes discrepancies between centrality structures in the original space and the embedding. We also introduce two strategies for visualizing lower dimensional embeddings of multidimensional data that takes advantage of the coherent representation of centrality provided by the proposed projection method. We demonstrate the effectiveness of our visualization with comparisons on different kinds of multidimensional data, including categorical and multimodal, from a variety of domains such as botany and health care.
  • Item
    Feature-Driven Visual Analytics of Chaotic Parameter-Dependent Movement
    (The Eurographics Association and John Wiley & Sons Ltd., 2015) Luboschik, Martin; Röhlig, Martin; Bittig, Arne T.; Andrienko, Natalia; Schumann, Heidrun; Tominski, Christian; H. Carr, K.-L. Ma, and G. Santucci
    Analyzing movements in their spatial and temporal context is a complex task. We are additionally interested in understanding the movements' dependency on parameters that govern the processes behind the movement. We propose a visual analytics approach combining analytic, visual, and interactive means to deal with the added complexity. The key idea is to perform an analytical extraction of features that capture distinct movement characteristics. Different parameter configurations and extracted features are then visualized in a compact fashion to facilitate an overview of the data. Interaction enables the user to access details about features, to compare features, and to relate features back to the original movement. We instantiate our approach with a repository of more than twenty accepted and novel features to help analysts in gaining insight into simulations of chaotic behavior of thousands of entities over thousands of data points. Domain experts applied our solution successfully to study dynamic groups in such movements in relation to thousands of parameter configurations.
  • Item
    Combining the Automated Segmentation and Visual Analysis of Multivariate Time Series
    (The Eurographics Association, 2018) Bernard, Jürgen; Bors, Christian; Bögl, Markus; Eichner, Christian; Gschwandtner, Theresia; Miksch, Silvia; Schumann, Heidrun; Kohlhammer, Jörn; Christian Tominski and Tatiana von Landesberger
    For the automatic segmentation of multivariate time series domain experts at first need to consider a huge space of alternative configurations of algorithms and parameters. We assume that only a small subset of these configurations needs to be computed and analyzed to lead users to meaningful configurations. To expedite this search, we propose the conceptualization of a segmentation workflow. First, with an algorithmic segmentation pipeline, domain experts can calculate segmentation results with different parameter configurations. Second, in an interactive visual analysis step, domain experts can explore segmentation results to further adapt and improve segmentation pipeline in an informed way. In the interactive analysis approach influences of algorithms, parameters, and different types of uncertainty information are conveyed, which is decisive to trigger selective and purposeful re-calculations. The workflow is built upon reflections on collaborations with domain experts working in activity recognition, which also defines our usage scenario demonstrating the applicability of the workflow.
  • Item
    Towards Natural Language Empowered Interactive Data Analysis
    (The Eurographics Association, 2018) Turkay, Cagatay; Henkin, Rafael; Anna Puig and Renata Raidou
    The recent advances in natural language based interaction methodologies offer promising avenues to enhance the interactive processes within the human-machine dialogue of visual analytics. We envisage Multimodal Data Analytics as a novel approach for conducting data analysis that builds on the strengths of visual analytics and natural language as an expressive interaction channel. We investigate the potential enhancements from such a multimodal approach and discusses the preliminary outline for a structured methodology to study the role of natural language in data analytics. Our approach builds on a simple model of human machine dialogue for interactive data analysis which we then propose to instantiate as visual analytics workflows - representations to study and operationalise interactive data analysis routines empowered by natural language interaction.
  • Item
    Hunting High and Low: Visualising Shifting Correlations in Financial Markets
    (The Eurographics Association and John Wiley & Sons Ltd., 2018) Simon, Peter M.; Turkay, Cagatay; Jeffrey Heer and Heike Leitte and Timo Ropinski
    The analysis of financial assets' correlations is fundamental to many aspects of finance theory and practice, especially modern portfolio theory and the study of risk. In order to manage investment risk, in-depth analysis of changing correlations is needed, with both high and low correlations between financial assets (and groups thereof) important to identify. In this paper, we propose a visual analytics framework for the interactive analysis of relations and structures in dynamic, high-dimensional correlation data. We conduct a series of interviews and review the financial correlation analysis literature to guide our design. Our solution combines concepts from multi-dimensional scaling, weighted complete graphs and threshold networks to present interactive, animated displays which use proximity as a visual metaphor for correlation and animation stability to encode correlation stability. We devise interaction techniques coupled with context-sensitive auxiliary views to support the analysis of subsets of correlation networks. As part of our contribution, we also present behaviour profiles to help guide future users of our approach. We evaluate our approach by checking the validity of the layouts produced, presenting a number of analysis stories, and through a user study. We observe that our solutions help unravel complex behaviours and resonate well with study participants in addressing their needs in the context of correlation analysis in finance.
  • Item
    A User-based Visual Analytics Workflow for Exploratory Model Analysis
    (The Eurographics Association and John Wiley & Sons Ltd., 2019) Cashman, Dylan; Humayoun, Shah Rukh; Heimerl, Florian; Park, Kendall; Das, Subhajit; Thompson, John; Saket, Bahador; Mosca, Abigail; Stasko, John; Endert, Alex; Gleicher, Michael; Chang, Remco; Gleicher, Michael and Viola, Ivan and Leitte, Heike
    Many visual analytics systems allow users to interact with machine learning models towards the goals of data exploration and insight generation on a given dataset. However, in some situations, insights may be less important than the production of an accurate predictive model for future use. In that case, users are more interested in generating of diverse and robust predictive models, verifying their performance on holdout data, and selecting the most suitable model for their usage scenario. In this paper, we consider the concept of Exploratory Model Analysis (EMA), which is defined as the process of discovering and selecting relevant models that can be used to make predictions on a data source. We delineate the differences between EMA and the well-known term exploratory data analysis in terms of the desired outcome of the analytic process: insights into the data or a set of deployable models. The contributions of this work are a visual analytics system workflow for EMA, a user study, and two use cases validating the effectiveness of the workflow. We found that our system workflow enabled users to generate complex models, to assess them for various qualities, and to select the most relevant model for their task.