Parametric quantile regression models for continuous proportions data : an evaluation of mean versus median modeling beyond beta

dc.contributor.advisorBurger, Divan A.
dc.contributor.emailu16001223@tuks.co.zaen_US
dc.contributor.postgraduateWeideman, Maricelle
dc.date.accessioned2025-02-12T12:50:40Z
dc.date.available2025-02-12T12:50:40Z
dc.date.created2025-04
dc.date.issued2025-02
dc.descriptionMini Dissertation (MSc (Advanced Data Analytics))--University of Pretoria, 2025.en_US
dc.description.abstractIn the modeling of bounded data, outliers or influential values present unique challenges, particularly when dealing with continuous proportions data. Traditional models, such as the beta regression model, although widely adopted, lack robustness against outliers, thus motivating the need for alternative models capable of addressing these limitations. This dissertation provides a comprehensive evaluation of various parametric models, including the beta, beta rectangular, Kumaraswamy, and Johnson-t models, emphasizing their robustness in handling outliers. A simulation study was conducted to examine the performance of each model under scenarios with and without outliers, measuring bias and coverage for key parameters. Results confirm the beta regression model’s sensitivity to outliers, as evidenced by increased bias and reduced coverage when influential values were introduced. In contrast, the Johnson-t regression model maintained stability in both bias and coverage, demonstrating greater resilience in outlier-inclusive datasets. Application to the Australian Institute of Sport data set further validated these findings, highlighting the Johnson-t model’s effectiveness in achieving robust median regression compared to mean-based approaches, which were less reliable with outliers. This study concludes that while beta regression remains popular for bounded data, the Johnson-t regression model offers a preferable alternative due to its robustness in median modeling, a critical factor in data analysis where influential values cannot be ignored.en_US
dc.description.availabilityUnrestricteden_US
dc.description.degreeMSc (Advanced Data Analytics)en_US
dc.description.departmentStatisticsen_US
dc.description.facultyFaculty of Natural and Agricultural Sciencesen_US
dc.description.sdgSDG-09: Industry, innovation and infrastructureen_US
dc.identifier.citation*en_US
dc.identifier.doihttps://cran.r-project.org/web/packages/DAAG/DAAG.pdfen_US
dc.identifier.otherA2025en_US
dc.identifier.urihttp://hdl.handle.net/2263/100786
dc.language.isoen_USen_US
dc.publisherUniversity of Pretoria
dc.rights© 2023 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectSustainable Development Goals (SDGs)en_US
dc.subjectBounded dataen_US
dc.subjectOutliersen_US
dc.subjectRobustnessen_US
dc.subjectMedian regressionen_US
dc.subjectMean-based modelsen_US
dc.subjectParametric modelsen_US
dc.titleParametric quantile regression models for continuous proportions data : an evaluation of mean versus median modeling beyond betaen_US
dc.typeMini Dissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Weideman_Parametric_2025.pdf
Size:
5.49 MB
Format:
Adobe Portable Document Format
Description:
Mini Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: