Your browser doesn't support javascript.
loading
Identifying biologically implausible values in big longitudinal data: an example applied to child growth data from the Brazilian food and nutrition surveillance system.
de Mello E Silva, Juliana Freitas; de Jesus Silva, Natanael; Carrilho, Thaís Rangel Bousquet; Jesus Pinto, Elizabete de; Rocha, Aline Santos; Pedroso, Jéssica; Silva, Sara Araújo; Spaniol, Ana Maria; da Costa Santin de Andrade, Rafaella; Bortolini, Gisele Ane; Paixão, Enny; Kac, Gilberto; de Cássia Ribeiro-Silva, Rita; Barreto, Maurício L.
Afiliação
  • de Mello E Silva JF; Centre for Data and Knowledge Integration for Health, Gonçalo Moniz Institute, Oswaldo Cruz Foundation, Salvador, BA, Brazil.
  • de Jesus Silva N; Centre for Data and Knowledge Integration for Health, Gonçalo Moniz Institute, Oswaldo Cruz Foundation, Salvador, BA, Brazil.
  • Carrilho TRB; ISGlobal, Hospital Clínic. Universitat de Barcelona, Barcelona, Spain.
  • Jesus Pinto E; Nutritional Epidemiology Observatory, Josué de Castro Nutrition Institute, Federal University of Rio de Janeiro, Rio de Janeiro, RJ, Brazil.
  • Rocha AS; Department of Obstetrics and Gynaecology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada.
  • Pedroso J; Centre for Data and Knowledge Integration for Health, Gonçalo Moniz Institute, Oswaldo Cruz Foundation, Salvador, BA, Brazil.
  • Silva SA; Federal University of Recôncavo da Bahia, Santo Antônio de Jesus, BA, Brazil.
  • Spaniol AM; Centre for Data and Knowledge Integration for Health, Gonçalo Moniz Institute, Oswaldo Cruz Foundation, Salvador, BA, Brazil.
  • da Costa Santin de Andrade R; Food and Nutrition Coordinating Unit, Ministry of Health, Brasília, DF, Brazil.
  • Bortolini GA; Food and Nutrition Coordinating Unit, Ministry of Health, Brasília, DF, Brazil.
  • Paixão E; Food and Nutrition Coordinating Unit, Ministry of Health, Brasília, DF, Brazil.
  • Kac G; Food and Nutrition Coordinating Unit, Ministry of Health, Brasília, DF, Brazil.
  • de Cássia Ribeiro-Silva R; Food and Nutrition Coordinating Unit, Ministry of Health, Brasília, DF, Brazil.
  • Barreto ML; Food and Nutrition Coordinating Unit, Ministry of Health, Brasília, DF, Brazil.
BMC Med Res Methodol ; 24(1): 38, 2024 Feb 15.
Article em En | MEDLINE | ID: mdl-38360575
ABSTRACT

BACKGROUND:

Several strategies for identifying biologically implausible values in longitudinal anthropometric data have recently been proposed, but the suitability of these strategies for large population datasets needs to be better understood. This study evaluated the impact of removing population outliers and the additional value of identifying and removing longitudinal outliers on the trajectories of length/height and weight and on the prevalence of child growth indicators in a large longitudinal dataset of child growth data.

METHODS:

Length/height and weight measurements of children aged 0 to 59 months from the Brazilian Food and Nutrition Surveillance System were analyzed. Population outliers were identified using z-scores from the World Health Organization (WHO) growth charts. After identifying and removing population outliers, residuals from linear mixed-effects models were used to flag longitudinal outliers. The following cutoffs for residuals were tested to flag those -3/+3, -4/+4, -5/+5, -6/+6. The selected child growth indicators included length/height-for-age z-scores and weight-for-age z-scores, classified according to the WHO charts.

RESULTS:

The dataset included 50,154,738 records from 10,775,496 children. Boys and girls had 5.74% and 5.31% of length/height and 5.19% and 4.74% of weight values flagged as population outliers, respectively. After removing those, the percentage of longitudinal outliers varied from 0.02% (<-6/>+6) to 1.47% (<-3/>+3) for length/height and from 0.07 to 1.44% for weight in boys. In girls, the percentage of longitudinal outliers varied from 0.01 to 1.50% for length/height and from 0.08 to 1.45% for weight. The initial removal of population outliers played the most substantial role in the growth trajectories as it was the first step in the cleaning process, while the additional removal of longitudinal outliers had lower influence on those, regardless of the cutoff adopted. The prevalence of the selected indicators were also affected by both population and longitudinal (to a lesser extent) outliers.

CONCLUSIONS:

Although both population and longitudinal outliers can detect biologically implausible values in child growth data, removing population outliers seemed more relevant in this large administrative dataset, especially in calculating summary statistics. However, both types of outliers need to be identified and removed for the proper evaluation of trajectories.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Estatura / Gráficos de Crescimento Tipo de estudo: Prognostic_studies / Risk_factors_studies / Screening_studies Limite: Child / Female / Humans / Male País/Região como assunto: America do sul / Brasil Idioma: En Revista: BMC Med Res Methodol Assunto da revista: MEDICINA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Brasil País de publicação: Reino Unido

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Estatura / Gráficos de Crescimento Tipo de estudo: Prognostic_studies / Risk_factors_studies / Screening_studies Limite: Child / Female / Humans / Male País/Região como assunto: America do sul / Brasil Idioma: En Revista: BMC Med Res Methodol Assunto da revista: MEDICINA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Brasil País de publicação: Reino Unido