liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Effective dimensionality for principal component analysis of time series expression data
Linköping University, The Institute of Technology. Linköping University, Department of Science and Technology.ORCID iD: 0000-0003-0528-9782
NORDITA, Copenhagen, Denmark.
Division of Mechatronics, Chalmers University of Technology, Gothenburg, Sweden.
2003 (English)In: Biosystems (Amsterdam. Print), ISSN 0303-2647, E-ISSN 1872-8324, Vol. 71, no 3, p. 311-317Article in journal (Refereed) Published
Abstract [en]

Large-scale expression data are today measured for thousands of genes simultaneously. This development has been followed by an exploration of theoretical tools to get as much information out of these data as possible. Several groups have used principal component analysis (PCA) for this task. However, since this approach is data-driven, care must be taken in order not to analyze the noise instead of the data. As a strong warning towards uncritical use of the output from a PCA, we employ a newly developed procedure to judge the effective dimensionality of a specific data set. Although this data set is obtained during the development of rat central nervous system, our finding is a general property of noisy time series data. Based on knowledge of the noise-level for the data, we find that the effective number of dimensions that are meaningful to use in a PCA is much lower than what could be expected from the number of measurements. We attribute this fact both to effects of noise and the lack of independence of the expression levels. Finally, we explore the possibility to increase the dimensionality by performing more measurements within one time series, and conclude that this is not a fruitful approach. © 2003 Elsevier Ireland Ltd. All rights reserved.

Place, publisher, year, edition, pages
2003. Vol. 71, no 3, p. 311-317
Keywords [en]
Dimensionality, Expression data, Noise effects, PCA
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:liu:diva-46465DOI: 10.1016/S0303-2647(03)00128-XOAI: oai:DiVA.org:liu-46465DiVA, id: diva2:267361
Available from: 2009-10-11 Created: 2009-10-11 Last updated: 2017-12-13

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Hörnquist, Michael

Search in DiVA

By author/editor
Hörnquist, Michael
By organisation
The Institute of TechnologyDepartment of Science and Technology
In the same journal
Biosystems (Amsterdam. Print)
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 358 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf