Commercial Shop Price Assessment: Comparing Machine Learning and Ordinary Least Squares for Improved Accuracy

Authors

  • Junainah Mohamad Department of Built Environment Studies and Technology, Faculty of Built Environment, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia
  • Intan Faiqah Hamizah Mohd Firazan Department of Built Environment Studies and Technology, Faculty of Built Environment, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia
  • Suraya Masrom College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, Perak Branch, Tapah Campus, Malaysia
  • Abdul Rehman Gila Knight Foundation School of Computing and Information Sciences, Florida International University, USA

Keywords:

Commercial Valuation, Machine Learning, Random Forest, Ordinary Least Squares

Abstract

This study compares the predictive performance of OLS and five ML algorithms in valuing commercial shop properties using 2,480 transactions from Kuala Lumpur from 2013 to 2023. While OLS showed limited predictive power, the Random Forest algorithm, applied with log-transformed target variables, achieved superior accuracy (R² = 0.9974, RMSE = 0.03, MAPE = 0.02%). These findings support the use of machine learning as a reliable and efficient alternative for property valuation, offering enhanced precision and scalability in commercial real estate assessment.

References

Abidoye, R. B., & Chan, A. P. (2018). Achieving property valuation accuracy in developing countries: the implication of data source. International Journal of Housing Markets and Analysis, 11(3), 573-585.

Antipov, E. A., & Pokryshevskaya, E. B. (2012). Mass appraisal of residential apartments: An application of Random Forest for valuation. Expert Systems with Applications, 39(2), 1772–1778. https://doi.org/10.1016/j.eswa.2011.08.077

Bourassa, S. C., Hoesli, M., Mayer, M., & Stalder, N. (2025). Reflections on hedonic price modeling. Journal of European Real Estate Research.

Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32.

Chikodili, N. B., Abdulmalik, M. D., Abisoye, O. A., & Bashir, S. A. (2020, November). Outlier detection in multivariate time series data using a fusion of K-medoid, standardized euclidean distance and Z-score. In International Conference on Information and Communication Technology and Applications (pp. 259-271). Cham: Springer International Publishing.

Chong, W. T., Lim, L. H., & Masron, T. (2020). Hedonic House Price Modelling: Comparing OLS and Machine Learning Methods. International Journal of Housing Markets and Analysis, 13(1), 105–123. https://doi.org/10.1108/IJHMA-06-2019-0060

Ismail, S. (2006). Spatial autocorrelation and real estate studies: A literature review. Malaysian Journal of Real Estate, 1(1), 1-13.

Jamaludin, M. H., Ismail, S., & Ismail, N. (2021). The Development of Penang Shop Price Index (PSPI) Using Laspeyres Hedonic Price Model. In Journal of the Malaysian Institute of Planners VOLUME (Vol. 19).

Jang, D., & Lee, B. (2022). When machine learning meets social science: A comparative study of ordinary least square, stochastic gradient descent, and support vector regression for exploring the determinants of behavioral intentions to tuberculosis screening. Asian Communication Research, 19(3), 101-118.

Johnson, S., Elms, J., Madhavan, K., Sugasi, K., Sharma, P., Kurban, H., & Dalkilic, M. M. (2023, August). Are They What They Claim: A Comprehensive Study of Ordinary Linear Regression Among the Top Machine Learning Libraries in Python. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Long Beach, CA, USA (pp. 6-10).

Khamis, R., Mohamed, A. A., & Mohd, R. (2020). Real estate valuation using machine learning: A review. International Journal of Advanced Science and Technology, 29(6), 1905–1917.

Lucas, R. E. (1975). Hedonic price functions. Economic Inquiry, 13(2), 157-178.

Malpezzi, S. (2003). Hedonic pricing models: A selective and applied review. In T. O’Sullivan & K. Gibb (Eds.), Housing Economics and Public Policy (pp. 67–89). Blackwell Science.

Mayer, M., Bourassa, S. C., Hoesli, M., & Scognamiglio, D. (2019). Estimation and updating methods for hedonic valuation. Journal of European real estate research, 12(1), 134-150.

Owusu-Ansah, A. (2018). Construction and application of property price indices. Routledge.

Pusat Maklumat Hartanah Negara (NAPIC). (2021). Manual Definisi NAPIC (3.0).

Rosen, S. (1974). Hedonic prices and implicit markets: product differentiation in pure competition. Journal of political economy, 82(1), 34-55.

Sauerbrei, W., Perperoglou, A., Schmid, M., Abrahamowicz, M., Becher, H., Binder, H., ... & TG2 of the STRATOS initiative Abrahamowicz Michal Becher Heiko Binder Harald Dunkler Daniela Harrell Frank Heinze Georg Perperoglou Aris Rauch Geraldine Royston Patrick Sauerbrei Willi. (2020). State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues. Diagnostic and prognostic research, 4(1), 3.

Selim, H. (2009). Determinants of house prices in Turkey: Hedonic regression versus artificial neural network. Expert Systems with Applications, 36(2), 2843–2852. https://doi.org/10.1016/j.eswa.2008.01.044

Topraklı, A. Y. (2025). AI-driven valuation: a new era for real estate appraisal. Journal of European Real Estate Research, 18(1), 105-120.

Yin, L., Recker, W., & Chen, A. (2021). Improving property valuation with machine learning: Evidence from the real estate market. Computers, Environment and Urban Systems, 85, 101549. https://doi.org/10.1016/j.compenvurbsys.2020.101549

Published

2025-09-25

How to Cite

Mohamad, J., Hamizah Mohd Firazan, I. F., Masrom, S., & Gila, A. R. (2025). Commercial Shop Price Assessment: Comparing Machine Learning and Ordinary Least Squares for Improved Accuracy . Environment-Behaviour Proceedings Journal, 10(33). Retrieved from https://ebpj.e-iph.co.uk/index.php/EBProceedings/article/view/7256