Title: Episodes and Topics in Multivariate Temporal Data
Authors: Andrienko, Natalia; Andrienko, Gennady; Shirato, Gota
Editors: Hauser, Helwig and Alliez, Pierre
Date: 2023-10-06
Year: 2023
ISSN: 1467-8659
DOI: https://doi.org/10.1111/cgf.14926
Handle: https://diglib.eg.org:443/handle/10.1111/cgf14926
License: Attribution 4.0 International License
Keywords: visualization; visual analytics; topic modeling

Abstract: The term ‘episode’ refers to a time interval in the development of a dynamic process or behaviour of an entity. Episode-based data consist of a set of episodes that are described using time series of multiple attribute values. Our research problem involves analysing episode-based data in order to understand the distribution of multi-attribute dynamic characteristics across a set of episodes. To solve this problem, we applied an existing theoretical model and developed a general approach that involves incrementally increasing data abstraction. We instantiated this general approach in an analysis procedure in which the value variation of each attribute within an episode is represented by a combination of symbols treated as a ‘word’. The variation of multiple attributes is thus represented by a combination of ‘words’ treated as a ‘text’. In this way, the set of episodes is transformed into a collection of text documents. Topic modelling techniques applied to this collection find groups of related (i.e. repeatedly co-occurring) ‘words’, which are called ‘topics’. Given that the ‘words’ encode variation patterns of individual attributes, the ‘topics’ represent patterns of joint variation of multiple attributes. In the following steps, analysts interpret the topics and examine their distribution across all episodes using interactive visualizations. We test the effectiveness of the procedure by applying it to two types of episode-based data with distinct properties and introduce a range of generic and data type-specific visualization techniques that can support the interpretation and exploration of topic distribution.
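
The encoding step described in the abstract (attribute variation to ‘word’, episode to ‘text’) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the three-symbol alphabet (L, M, H), the equal-length segmentation, the per-attribute thresholds, and the names `encode_series` and `episode_to_document` are all hypothetical. The resulting token lists could then be passed to any topic modelling implementation, which is not shown.

```python
from statistics import mean


def encode_series(values, low, high, n_segments=3):
    """Encode one attribute's time series within an episode as a 'word'.

    Assumption (not from the paper): the series is cut into n_segments
    roughly equal parts, and each part's mean becomes one symbol:
    'L' below `low`, 'H' above `high`, otherwise 'M'.
    """
    n = len(values)
    if n < n_segments:
        raise ValueError("series is shorter than the number of segments")
    symbols = []
    for i in range(n_segments):
        # Integer boundaries guarantee non-empty segments when n >= n_segments.
        segment = values[i * n // n_segments:(i + 1) * n // n_segments]
        m = mean(segment)
        symbols.append("L" if m < low else "H" if m > high else "M")
    return "".join(symbols)


def episode_to_document(episode, thresholds, n_segments=3):
    """Turn one episode (attribute name -> time series) into a 'text'.

    Each 'word' is prefixed with its attribute name so that identical
    variation patterns of different attributes remain distinct tokens.
    """
    words = []
    for attr, values in episode.items():
        low, high = thresholds[attr]
        words.append(f"{attr}:{encode_series(values, low, high, n_segments)}")
    return words


# Hypothetical toy data: two episodes, two attributes each.
episodes = [
    {"speed": [1, 1, 5, 5, 9, 9], "load": [8, 8, 8, 8, 2, 2]},
    {"speed": [9, 9, 5, 5, 1, 1], "load": [2, 2, 5, 5, 8, 8]},
]
thresholds = {"speed": (3, 7), "load": (3, 7)}  # assumed (low, high) cut-offs

documents = [episode_to_document(e, thresholds) for e in episodes]
for doc in documents:
    print(" ".join(doc))
```

In this sketch, each episode becomes a short list of tokens such as `speed:LMH`; a topic model run over many such lists would group tokens that repeatedly co-occur, which corresponds to the joint variation patterns the abstract calls ‘topics’.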