Influence of variable selection and sample size on classification results with CLASSY

Bibliographic details
Title: Influence of variable selection and sample size on classification results with CLASSY
Authors: Fr Hindriks, H Vandervoet, Jan Hemel, W Vanderslik
Source: Analytica Chimica Acta. 220:119-134
Publication info: Elsevier BV, 1989.
Publication year: 1989
Subject terms: Variable (computer science), Training set, Chemistry, Sample size determination, Statistics, Environmental Chemistry, Feature selection, Biochemistry, Spectroscopy, Selection (genetic algorithm), Reliability (statistics), Analytical Chemistry, Multivariate classification
Description: To investigate the influence of selection of variables and sample size on the performance of the multivariate classification method CLASSY, these parameters were varied systematically. In addition to the usual classificatory performance, the reliability of the assigned probabilities is considered. A small training set with only about five variables was shown to yield satisfactory results. After the variables had been ranked according to decreasing utility for the classification, the inclusion of many variables made the probabilities more unreliable. This over-confidence was not easily remedied by adding training objects. The classificatory ability was not affected by using more variables than necessary.
ISSN: 0003-2670
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::aa96b461b1d70b53b87fbb5225a7b064
https://doi.org/10.1016/s0003-2670(00)80256-1
Rights: CLOSED
Accession number: edsair.doi...........aa96b461b1d70b53b87fbb5225a7b064
Database: OpenAIRE