The aim of this paper is to introduce a new approach to outlier analysis in which the detection is carried out on data with a hierarchical structure and a complex pattern of variability, e.g. pupils in classes, employees in firms, etc. In particular, we analyze the data collected by the Italian National Evaluation Institute of the Ministry of Education (INVALSI) in which the micro units –students- are nested within classes and schools, with a strong presence of outliers at the second level -class- of hierarchy. By the analysis of within class variability, we have developed a procedure to detect outlier units at class level combining the factorial analysis with a fuzzy clustering approach. The purpose of this method is to go over the dichotomous logic which classifies each unit as outlier or not outlier (hard clustering), computing an “outlier level” measure for each unit and in such a way calibrating the correction of overstimation of children ability due to the outlier presence.

A fuzzy clustering approach to improve the accuracy of Italian student data. An experimental procedure to correct the impact of outliers on assessment test scores

QUINTANO C;
2009-01-01

Abstract

The aim of this paper is to introduce a new approach to outlier analysis in which the detection is carried out on data with a hierarchical structure and a complex pattern of variability, e.g. pupils in classes, employees in firms, etc. In particular, we analyze the data collected by the Italian National Evaluation Institute of the Ministry of Education (INVALSI) in which the micro units –students- are nested within classes and schools, with a strong presence of outliers at the second level -class- of hierarchy. By the analysis of within class variability, we have developed a procedure to detect outlier units at class level combining the factorial analysis with a fuzzy clustering approach. The purpose of this method is to go over the dichotomous logic which classifies each unit as outlier or not outlier (hard clustering), computing an “outlier level” measure for each unit and in such a way calibrating the correction of overstimation of children ability due to the outlier presence.
2009
outlier correction
data accuracy
assessment test scores
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12570/17378
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact