liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
FeatureEnVi: Visual Analytics for Feature Engineering Using Stepwise Selection and Semi-Automatic Extraction Approaches
Department of Computer Science and Media Technology, Linnaeus University, Växjö, Sweden.ORCID iD: 0000-0002-9079-2376
Department of Computer Science and Media Technology, Linnaeus University, Växjö, Sweden.ORCID iD: 0000-0002-2901-935X
Linköping University, Department of Science and Technology, Media and Information Technology. Linköping University, Faculty of Science & Engineering. Department of Computer Science and Media Technology, Linnaeus University, Växjö, Sweden. (iVis, INV)ORCID iD: 0000-0002-1907-7820
Linköping University, Department of Science and Technology, Media and Information Technology. Linköping University, Faculty of Science & Engineering. Department of Computer Science and Media Technology, Linnaeus University, Växjö, Sweden. (iVis, INV)ORCID iD: 0000-0002-0519-2537
2022 (English)In: IEEE Transactions on Visualization and Computer Graphics, ISSN 1077-2626, E-ISSN 1941-0506, Vol. 28, no 4, p. 1773-1791Article in journal (Refereed) Published
Abstract [en]

The machine learning (ML) life cycle involves a series of iterative steps, from the effective gathering and preparation of the data—including complex feature engineering processes—to the presentation and improvement of results, with various algorithms to choose from in every step. Feature engineering in particular can be very beneficial for ML, leading to numerous improvements such as boosting the predictive results, decreasing computational times, reducing excessive noise, and increasing the transparency behind the decisions taken during the training. Despite that, while several visual analytics tools exist to monitor and control the different stages of the ML life cycle (especially those related to data and algorithms), feature engineering support remains inadequate. In this paper, we present FeatureEnVi, a visual analytics system specifically designed to assist with the feature engineering process. Our proposed system helps users to choose the most important feature, to transform the original features into powerful alternatives, and to experiment with different feature generation combinations. Additionally, data space slicing allows users to explore the impact of features on both local and global scales. FeatureEnVi utilizes multiple automatic feature selection techniques; furthermore, it visually guides users with statistical evidence about the influence of each feature (or subsets of features). The final outcome is the extraction of heavily engineered features, evaluated by multiple validation metrics. The usefulness and applicability of FeatureEnVi are demonstrated with two use cases and a case study. We also report feedback from interviews with two ML experts and a visualization researcher who assessed the effectiveness of our system.

Place, publisher, year, edition, pages
IEEE COMPUTER SOC , 2022. Vol. 28, no 4, p. 1773-1791
Keywords [en]
Feature selection, feature extraction, feature engineering, machine learning, visual analytics, visualization
National Category
Computer Sciences Human Computer Interaction
Research subject
Computer Science, Information and software visualization
Identifiers
URN: urn:nbn:se:liu:diva-183356DOI: 10.1109/TVCG.2022.3141040ISI: 000761227900006PubMedID: 34990365OAI: oai:DiVA.org:liu-183356DiVA, id: diva2:1642064
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsAvailable from: 2022-03-03 Created: 2022-03-03 Last updated: 2024-10-28Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMedVideo

Authority records

Chatzimparmpas, AngelosMartins, Rafael MessiasKucher, KostiantynKerren, Andreas

Search in DiVA

By author/editor
Chatzimparmpas, AngelosMartins, Rafael MessiasKucher, KostiantynKerren, Andreas
By organisation
Media and Information TechnologyFaculty of Science & Engineering
In the same journal
IEEE Transactions on Visualization and Computer Graphics
Computer SciencesHuman Computer Interaction

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 177 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf