liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Klassificering av vinkvalitet
Linköping University, Department of Computer and Information Science, The Division of Statistics and Machine Learning.
Linköping University, Department of Computer and Information Science, The Division of Statistics and Machine Learning.
2017 (Swedish)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesisAlternative title
A classification of wine quality (English)
Abstract [en]

The data used in this paper is an open source data, that was collected in Portugal over a three year period between 2004 and 2007. It consists of the physiochemical parameters, and the quality grade of the wines.

This study focuses on assessing which variables that primarily affect the quality of a wine and how the effects of the variables interact with each other, and also compare which of the different classification methods work the best and have the highest degree of accuracy.

The data is divided into red and white wine where the response variable is ordered and consists of the grades of quality for the different wines. Due to the distribution in the response variable having too few observations in some of the quality grades, a new response variable was created where several grades were pooled together so that each different grade category would have a good amount of observations.

The statistical methods used are Bayesian ordered logistic regression as well as two data mining techniques which are neural networks and decision trees.

The result obtained showed that for the two types of wine it is primarily the alcohol content and the amount of volatile acid that are recurring parameters which have a great influence on predicting the quality of the wines.

The results also showed that among the three different methods, decision trees were the best at classifying the white wines and the neural network were the best for the red wines.

Place, publisher, year, edition, pages
2017. , p. 34
Keywords [sv]
Bayesiansk statistik, ordnad logistisk regression, beslutsträd, neurala nätverk, klassificering, vin, vinkvalitet
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:liu:diva-139932ISRN: LIU-IDA/STAT-G--17/007—SEOAI: oai:DiVA.org:liu-139932DiVA, id: diva2:1135019
Subject / course
Statistics
Supervisors
Examiners
Available from: 2017-08-22 Created: 2017-08-22 Last updated: 2017-08-22Bibliographically approved

Open Access in DiVA

fulltext(2037 kB)269 downloads
File information
File name FULLTEXT01.pdfFile size 2037 kBChecksum SHA-512
6b7eb470a59865ecd8615f1bbf5b3f87d3f628e107d2615a9062d439089d9e2786568ee09525e5b62127b4107e198064ab8b71130f30f49779f4163ad61a2422
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Brouwers, JackThellman, Björn
By organisation
The Division of Statistics and Machine Learning
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 269 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 838 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf