liu.seSök publikationer i DiVA
Ändra sökning
Länk till posten
Permanent länk

Direktlänk
Chatzimparmpas, AngelosORCID iD iconorcid.org/0000-0002-9079-2376
Publikationer (5 of 5) Visa alla publikationer
Ploshchik, I., Chatzimparmpas, A. & Kerren, A. (2023). MetaStackVis: Visually-Assisted Performance Evaluation of Metamodels. In: Proceedings of the 16th IEEE Pacific Visualization Symposium (PacificVis '23), visualization notes track, IEEE, 2023: . Paper presented at 16th IEEE Pacific Visualization Symposium (PacificVis '23), Seoul, Korea, April 18-21, 2023 (pp. 207-211). IEEE
Öppna denna publikation i ny flik eller fönster >>MetaStackVis: Visually-Assisted Performance Evaluation of Metamodels
2023 (Engelska)Ingår i: Proceedings of the 16th IEEE Pacific Visualization Symposium (PacificVis '23), visualization notes track, IEEE, 2023, IEEE , 2023, s. 207-211Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Stacking (or stacked generalization) is an ensemble learning method with one main distinctiveness from the rest: even though several base models are trained on the original data set, their predictions are further used as input data for one or more metamodels arranged in at least one extra layer. Composing a stack of models can produce high-performance outcomes, but it usually involves a trial-and-error process. Therefore, our previously developed visual analytics system, StackGenVis, was mainly designed to assist users in choosing a set of top-performing and diverse models by measuring their predictive performance. However, it only employs a single logistic regression metamodel. In this paper, we investigate the impact of alternative metamodels on the performance of stacking ensembles using a novel visualization tool, called MetaStackVis. Our interactive tool helps users to visually explore different singular and pairs of metamodels according to their predictive probabilities and multiple validation metrics, as well as their ability to predict specific prob- lematic data instances. MetaStackVis was evaluated with a usage scenario based on a medical data set and via expert interviews.

Ort, förlag, år, upplaga, sidor
IEEE, 2023
Serie
IEEE Pacific Visualization Symposium, ISSN 2165-8765, E-ISSN 2165-8773
Nyckelord
Visual analytics, information visualization, interaction, stacking, metamodels, ensemble learning
Nationell ämneskategori
Datavetenskap (datalogi) Människa-datorinteraktion (interaktionsdesign)
Forskningsämne
Datavetenskap, Informations- och programvisualisering
Identifikatorer
urn:nbn:se:liu:diva-193719 (URN)10.1109/PacificVis56936.2023.00030 (DOI)001016413500024 ()2-s2.0-85163320910 (Scopus ID)979-8-3503-2124-1 (ISBN)
Konferens
16th IEEE Pacific Visualization Symposium (PacificVis '23), Seoul, Korea, April 18-21, 2023
Forskningsfinansiär
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Anmärkning

Funding: ELLIIT environment for strategic research in Sweden

Tillgänglig från: 2023-05-15 Skapad: 2023-05-15 Senast uppdaterad: 2025-11-17
Chatzimparmpas, A., Martins, R. M., Kucher, K. & Kerren, A. (2022). FeatureEnVi: Visual Analytics for Feature Engineering Using Stepwise Selection and Semi-Automatic Extraction Approaches. IEEE Transactions on Visualization and Computer Graphics, 28(4), 1773-1791
Öppna denna publikation i ny flik eller fönster >>FeatureEnVi: Visual Analytics for Feature Engineering Using Stepwise Selection and Semi-Automatic Extraction Approaches
2022 (Engelska)Ingår i: IEEE Transactions on Visualization and Computer Graphics, ISSN 1077-2626, E-ISSN 1941-0506, Vol. 28, nr 4, s. 1773-1791Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

The machine learning (ML) life cycle involves a series of iterative steps, from the effective gathering and preparation of the data—including complex feature engineering processes—to the presentation and improvement of results, with various algorithms to choose from in every step. Feature engineering in particular can be very beneficial for ML, leading to numerous improvements such as boosting the predictive results, decreasing computational times, reducing excessive noise, and increasing the transparency behind the decisions taken during the training. Despite that, while several visual analytics tools exist to monitor and control the different stages of the ML life cycle (especially those related to data and algorithms), feature engineering support remains inadequate. In this paper, we present FeatureEnVi, a visual analytics system specifically designed to assist with the feature engineering process. Our proposed system helps users to choose the most important feature, to transform the original features into powerful alternatives, and to experiment with different feature generation combinations. Additionally, data space slicing allows users to explore the impact of features on both local and global scales. FeatureEnVi utilizes multiple automatic feature selection techniques; furthermore, it visually guides users with statistical evidence about the influence of each feature (or subsets of features). The final outcome is the extraction of heavily engineered features, evaluated by multiple validation metrics. The usefulness and applicability of FeatureEnVi are demonstrated with two use cases and a case study. We also report feedback from interviews with two ML experts and a visualization researcher who assessed the effectiveness of our system.

Ort, förlag, år, upplaga, sidor
IEEE COMPUTER SOC, 2022
Nyckelord
Feature selection, feature extraction, feature engineering, machine learning, visual analytics, visualization
Nationell ämneskategori
Datavetenskap (datalogi) Människa-datorinteraktion (interaktionsdesign)
Forskningsämne
Datavetenskap, Informations- och programvisualisering
Identifikatorer
urn:nbn:se:liu:diva-183356 (URN)10.1109/TVCG.2022.3141040 (DOI)000761227900006 ()34990365 (PubMedID)
Forskningsfinansiär
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Tillgänglig från: 2022-03-03 Skapad: 2022-03-03 Senast uppdaterad: 2024-10-28Bibliografiskt granskad
Chatzimparmpas, A., Martins, R. M., Kucher, K. & Kerren, A. (2021). StackGenVis: Alignment of Data, Algorithms, and Models for Stacking Ensemble Learning Using Performance Metrics. Paper presented at IEEE Conference on Visual Analytics Science and Technology (IEEE VAST 2020), 25-30 October 2020, Virtual Conference. IEEE Transactions on Visualization and Computer Graphics, 27(2), 1547-1557
Öppna denna publikation i ny flik eller fönster >>StackGenVis: Alignment of Data, Algorithms, and Models for Stacking Ensemble Learning Using Performance Metrics
2021 (Engelska)Ingår i: IEEE Transactions on Visualization and Computer Graphics, ISSN 1077-2626, E-ISSN 1941-0506, Vol. 27, nr 2, s. 1547-1557Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

In machine learning (ML), ensemble methods—such as bagging, boosting, and stacking—are widely-established approaches that regularly achieve top-notch predictive performance. Stacking (also called "stacked generalization") is an ensemble method that combines heterogeneous base models, arranged in at least one layer, and then employs another metamodel to summarize the predictions of those models. Although it may be a highly-effective approach for increasing the predictive performance of ML, generating a stack of models from scratch can be a cumbersome trial-and-error process. This challenge stems from the enormous space of available solutions, with different sets of data instances and features that could be used for training, several algorithms to choose from, and instantiations of these algorithms using diverse parameters (i.e., models) that perform differently according to various metrics. In this work, we present a knowledge generation model, which supports ensemble learning with the use of visualization, and a visual analytics system for stacked generalization. Our system, StackGenVis, assists users in dynamically adapting performance metrics, managing data instances, selecting the most important features for a given data set, choosing a set of top-performant and diverse algorithms, and measuring the predictive performance. In consequence, our proposed tool helps users to decide between distinct models and to reduce the complexity of the resulting stack by removing overpromising and underperforming models. The applicability and effectiveness of StackGenVis are demonstrated with two use cases: a real-world healthcare data set and a collection of data related to sentiment/stance detection in texts. Finally, the tool has been evaluated through interviews with three ML experts.

Ort, förlag, år, upplaga, sidor
IEEE Computer Society Digital Library, 2021
Nyckelord
stacking, stacked generalization, ensemble learning, visual analytics, visualization
Nationell ämneskategori
Datavetenskap (datalogi) Människa-datorinteraktion (interaktionsdesign)
Forskningsämne
Datavetenskap, Informations- och programvisualisering
Identifikatorer
urn:nbn:se:liu:diva-189510 (URN)10.1109/TVCG.2020.3030352 (DOI)000706330100132 ()33048687 (PubMedID)2-s2.0-85099566430 (Scopus ID)2020 (Lokalt ID)2020 (Arkivnummer)2020 (OAI)
Konferens
IEEE Conference on Visual Analytics Science and Technology (IEEE VAST 2020), 25-30 October 2020, Virtual Conference
Tillgänglig från: 2022-10-24 Skapad: 2022-10-24 Senast uppdaterad: 2022-11-17
Chatzimparmpas, A., Martins, R. M., Kucher, K. & Kerren, A. (2021). VisEvol: Visual Analytics to Support Hyperparameter Search through Evolutionary Optimization. Paper presented at 23rd EG/VGTC Conference on Visualization (EuroVis '21), 14-18 June 2021, Zürich, Switzerland. Computer graphics forum (Print), 40(3), 201-214
Öppna denna publikation i ny flik eller fönster >>VisEvol: Visual Analytics to Support Hyperparameter Search through Evolutionary Optimization
2021 (Engelska)Ingår i: Computer graphics forum (Print), ISSN 0167-7055, E-ISSN 1467-8659, Vol. 40, nr 3, s. 201-214Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

During the training phase of machine learning (ML) models, it is usually necessary to configure several hyperparameters. This process is computationally intensive and requires an extensive search to infer the best hyperparameter set for the given problem. The challenge is exacerbated by the fact that most ML models are complex internally, and training involves trial-and-error processes that could remarkably affect the predictive result. Moreover, each hyperparameter of an ML algorithm is potentially intertwined with the others, and changing it might result in unforeseeable impacts on the remaining hyperparameters. Evolutionary optimization is a promising method to try and address those issues. According to this method, performant models are stored, while the remainder are improved through crossover and mutation processes inspired by genetic algorithms. We present VisEvol, a visual analytics tool that supports interactive exploration of hyperparameters and intervention in this evolutionary procedure. In summary, our proposed tool helps the user to generate new models through evolution and eventually explore powerful hyperparameter combinations in diverse regions of the extensive hyperparameter space. The outcome is a voting ensemble (with equal rights) that boosts the final predictive performance. The utility and applicability of VisEvol are demonstrated with two use cases and interviews with ML experts who evaluated the effectiveness of the tool.

Ort, förlag, år, upplaga, sidor
John Wiley & Sons, 2021
Nyckelord
visualization, visual analytics, interpretable machine learning, explainable machine learning, hyperparameter search, evolutionary optimization
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:liu:diva-176506 (URN)10.1111/cgf.14300 (DOI)000667924000017 ()
Konferens
23rd EG/VGTC Conference on Visualization (EuroVis '21), 14-18 June 2021, Zürich, Switzerland
Forskningsfinansiär
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Tillgänglig från: 2021-06-14 Skapad: 2021-06-14 Senast uppdaterad: 2022-11-22Bibliografiskt granskad
Chatzimparmpas, A., Martins, R. M., Jusufi, I., Kucher, K., Rossi, F. & Kerren, A. (2020). The State of the Art in Enhancing Trust in Machine Learning Models with the Use of Visualizations. Paper presented at 22nd EG/VGTC Conference on Visualization (EuroVis '20), STAR track, 25-29 May 2020, Norrköping, Sweden. Computer graphics forum (Print), 39(3), 713-756
Öppna denna publikation i ny flik eller fönster >>The State of the Art in Enhancing Trust in Machine Learning Models with the Use of Visualizations
Visa övriga...
2020 (Engelska)Ingår i: Computer graphics forum (Print), ISSN 0167-7055, E-ISSN 1467-8659, Vol. 39, nr 3, s. 713-756Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Machine learning (ML) models are nowadays used in complex applications in various domains such as medicine, bioinformatics, and other sciences. Due to their black box nature, however, it may sometimes be hard to understand and trust the results they provide. This has increased the demand for reliable visualization tools related to enhancing trust in ML models, which has become a prominent topic of research in the visualization community over the past decades. To provide an overview and present the frontiers of current research on the topic, we present a State-of-the-Art Report (STAR) on enhancing trust in ML models with the use of interactive visualization. We define and describe the background of the topic, introduce a categorization for visualization techniques that aim to accomplish this goal, and discuss insights and opportunities for future research directions. Among our contributions is a categorization of trust against different facets of interactive ML, expanded and improved from previous research. Our results are investigated from different analytical perspectives: (a) providing a statistical overview, (b) summarizing key findings, (c) performing topic analyses, and (d) exploring the data sets used in the individual papers, all with the support of an interactive web-based survey browser. We intend this survey to be beneficial for visualization researchers whose interests involve making ML models more trustworthy, as well as researchers and practitioners from other disciplines in their search for effective visualization techniques suitable for solving their tasks with confidence and conveying meaning to their data.

Ort, förlag, år, upplaga, sidor
John Wiley & Sons, 2020
Nyckelord
trustworthy machine learning, visualization, interpretable machine learning, explainable machine learning
Nationell ämneskategori
Datavetenskap (datalogi)
Forskningsämne
Datavetenskap, Informations- och programvisualisering
Identifikatorer
urn:nbn:se:liu:diva-189512 (URN)10.1111/cgf.14034 (DOI)000549627300053 ()2-s2.0-85088145692 (Scopus ID)
Konferens
22nd EG/VGTC Conference on Visualization (EuroVis '20), STAR track, 25-29 May 2020, Norrköping, Sweden
Tillgänglig från: 2022-10-24 Skapad: 2022-10-24 Senast uppdaterad: 2022-11-17
Organisationer
Identifikatorer
ORCID-id: ORCID iD iconorcid.org/0000-0002-9079-2376

Sök vidare i DiVA

Visa alla publikationer