Commercial Shop Price Assessment: Comparing Machine Learning and Ordinary Least Squares for Improved Accuracy
Keywords:
Commercial Valuation, Machine Learning, Random Forest, Ordinary Least SquaresAbstract
This study compares the predictive performance of OLS and five ML algorithms in valuing commercial shop properties using 2,480 transactions from Kuala Lumpur from 2013 to 2023. While OLS showed limited predictive power, the Random Forest algorithm, applied with log-transformed target variables, achieved superior accuracy (R² = 0.9974, RMSE = 0.03, MAPE = 0.02%). These findings support the use of machine learning as a reliable and efficient alternative for property valuation, offering enhanced precision and scalability in commercial real estate assessment.
References
Abidoye, R. B., & Chan, A. P. (2018). Achieving property valuation accuracy in developing countries: the implication of data source. International Journal of Housing Markets and Analysis, 11(3), 573-585.
Antipov, E. A., & Pokryshevskaya, E. B. (2012). Mass appraisal of residential apartments: An application of Random Forest for valuation. Expert Systems with Applications, 39(2), 1772–1778. https://doi.org/10.1016/j.eswa.2011.08.077
Bourassa, S. C., Hoesli, M., Mayer, M., & Stalder, N. (2025). Reflections on hedonic price modeling. Journal of European Real Estate Research.
Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32.
Chikodili, N. B., Abdulmalik, M. D., Abisoye, O. A., & Bashir, S. A. (2020, November). Outlier detection in multivariate time series data using a fusion of K-medoid, standardized euclidean distance and Z-score. In International Conference on Information and Communication Technology and Applications (pp. 259-271). Cham: Springer International Publishing.
Chong, W. T., Lim, L. H., & Masron, T. (2020). Hedonic House Price Modelling: Comparing OLS and Machine Learning Methods. International Journal of Housing Markets and Analysis, 13(1), 105–123. https://doi.org/10.1108/IJHMA-06-2019-0060
Ismail, S. (2006). Spatial autocorrelation and real estate studies: A literature review. Malaysian Journal of Real Estate, 1(1), 1-13.
Jamaludin, M. H., Ismail, S., & Ismail, N. (2021). The Development of Penang Shop Price Index (PSPI) Using Laspeyres Hedonic Price Model. In Journal of the Malaysian Institute of Planners VOLUME (Vol. 19).
Jang, D., & Lee, B. (2022). When machine learning meets social science: A comparative study of ordinary least square, stochastic gradient descent, and support vector regression for exploring the determinants of behavioral intentions to tuberculosis screening. Asian Communication Research, 19(3), 101-118.
Johnson, S., Elms, J., Madhavan, K., Sugasi, K., Sharma, P., Kurban, H., & Dalkilic, M. M. (2023, August). Are They What They Claim: A Comprehensive Study of Ordinary Linear Regression Among the Top Machine Learning Libraries in Python. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Long Beach, CA, USA (pp. 6-10).
Khamis, R., Mohamed, A. A., & Mohd, R. (2020). Real estate valuation using machine learning: A review. International Journal of Advanced Science and Technology, 29(6), 1905–1917.
Lucas, R. E. (1975). Hedonic price functions. Economic Inquiry, 13(2), 157-178.
Malpezzi, S. (2003). Hedonic pricing models: A selective and applied review. In T. O’Sullivan & K. Gibb (Eds.), Housing Economics and Public Policy (pp. 67–89). Blackwell Science.
Mayer, M., Bourassa, S. C., Hoesli, M., & Scognamiglio, D. (2019). Estimation and updating methods for hedonic valuation. Journal of European real estate research, 12(1), 134-150.
Owusu-Ansah, A. (2018). Construction and application of property price indices. Routledge.
Pusat Maklumat Hartanah Negara (NAPIC). (2021). Manual Definisi NAPIC (3.0).
Rosen, S. (1974). Hedonic prices and implicit markets: product differentiation in pure competition. Journal of political economy, 82(1), 34-55.
Sauerbrei, W., Perperoglou, A., Schmid, M., Abrahamowicz, M., Becher, H., Binder, H., ... & TG2 of the STRATOS initiative Abrahamowicz Michal Becher Heiko Binder Harald Dunkler Daniela Harrell Frank Heinze Georg Perperoglou Aris Rauch Geraldine Royston Patrick Sauerbrei Willi. (2020). State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues. Diagnostic and prognostic research, 4(1), 3.
Selim, H. (2009). Determinants of house prices in Turkey: Hedonic regression versus artificial neural network. Expert Systems with Applications, 36(2), 2843–2852. https://doi.org/10.1016/j.eswa.2008.01.044
Topraklı, A. Y. (2025). AI-driven valuation: a new era for real estate appraisal. Journal of European Real Estate Research, 18(1), 105-120.
Yin, L., Recker, W., & Chen, A. (2021). Improving property valuation with machine learning: Evidence from the real estate market. Computers, Environment and Urban Systems, 85, 101549. https://doi.org/10.1016/j.compenvurbsys.2020.101549
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Junainah Mohamad, Intan Faiqah Hamizah Mohd Firazan, Suraya Masrom, Abdul Rehman Gila

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.