Predicting House Sale Price Using Fuzzy Logic, Artificial Neural Network and K-Nearest Neighbor
Predicting House Sale Price Using Fuzzy Logic, Artificial Neural Network and K-Nearest Neighbor
Predicting House Sale Price Using Fuzzy Logic, Artificial Neural Network and K-Nearest Neighbor
Abstract—Determining the value of land and home are market in Hong Kong [6]. Several machine learning methods
regularly determined at the earliest by the seller, however are compared to get the best prediction of house pricing [7].
determining the right price in the sales process will affect the
buyer's desire to elect and bid. Special characteristics in In advance, spatial analysis is used to determine house and
Indonesia, tax object value (NJOP) and location parameters are land prices [8]. Jeffrey et. al discussed the spatial effects on the
high influence to the price. In this paper we proposed the dynamics of house prices in the USA that have a significant
prediction of land and house value using several methods. Fuzzy influence in urban house price growth rates [9]. In addition,
logic, Artificial Neural Network and K-Nearest Neighbor are real estate pricing in Vienna (Austria) was investigated that
compared in this paper to discover the most appropriate method requires recognition of spatial heterogeneity in housing prices
that can be used as a reference for determining the price by the [10]. However, only few studies that apply the prediction
sellers. Google Maps is used to represent the spatial data for model using spatial data as analysis factor for the prediction.
prediction parameter. The variables that used in the methods are
NJOP of land, the locations, the age, NJOP of house, and the In this study, several prediction methods were compared to
valuable location of the land. The experimental methods are find out the best predicted results for determining the selling
tested by comparing between the real price transaction and the price of a house compared to the real price. The methods
prediction using MAPE formula. (Fuzzy, Neural Network, k-nearest neighbor) were tested to
predict house prices in Indonesia. In this study, spatial data and
Keywords—House Price, Fuzzy, Artificial Neural Networks, K- attribute data are employed as determinant parameters in the
Nearest Neighbor. determination of house and land prices. Spatial data consists of
the selling value of the tax object (NJOP) of the land and the
I. INTRODUCTION strategic location of the land, while the attribute data includes
House is one of the important needs for people which has the selling value of tax object (NJOP) of the building, the
function as a place for rest and gather with family [1]. In the condition of the house and the age of the house. This study
housing market, the initial prices are an important factor in the uses spatial data in the form of a strategic location of the land,
process of buying and selling houses and land. Determining the called accessibility level that influence by infrastructure
initial selling price of a house or land usually depends on the facilities such as public health services, education, downtown
seller, however determining the right price in the sales process and economy. In spatial data the Geographic Information
will affect the buyer's desire to bid and make selections. The System (GIS) has three elements, namely vertices, lines (arcs),
initial price for each house and land are varied according to and broad (polygons) in the form of vectors or raster
residential facilities and home geographical conditions. In representing geometric topology, size, shape, position and
Indonesia, one of the parameters to determine the selling price direction.
of the house that can be quantity calculated are the value of the
tax object (NJOP) of the land and the value of the tax object of II. METHOD
the building [2]. Both parameters are influenced by several A. Fuzzy Logic
factors such as strategic location and also age of building.
Fuzzy logic was first introduced by Lotfi Zadeh in 1965.
The initial house prices prediction is challenging and Fuzzy logic is a mathematical framework used to represent
requires the best method to get the best prediction accuracy. In uncertainty. The presentation of set A with the zero-one
the predicting the sale price of a house that has an uncertainty membership function, otherwise known as the crisp set can be
parameter, fuzzy logic becomes one of the solutions that can be represented by the following equation [12].
used in solving the problem [3-4]. Moreover, artificial neural
network methods are used to predict house selling prices [5]. In μ_(A ) (x)={ _(0 if x ∉ A)^(1 if x ∈ A) (1)
addition to using fuzzy logic and artificial neural networks, The above equation shows that the Fuzzy set is a
predictions can also use the K-Nearest Neighbors algorithm, generalization of the crisp set by allowing the membership
for estimating residential prices for the residential property function to retrieve the values in the interval [0, 1]. In other
words, the membership function of the crisp set can only take a
172
= Value of Real Transaction
= Prediction Value
n = Number of data
III. PROPOSED MODEL
The real house prices in Pedurungan Sub-district of
Semarang City are used in this study. The training data consists
of 7 predictor’s attributes and 1 label attribute. In order to
improve the data quality and eliminating duplication,
preprocessing was applied. Fig. 2. Membership fuction of NJOP-L
173
TABLE I. FIRST RULEBASE FUZZY
Sales Value
of Taxable House Output Predicted
No House Age
Object Condition House Prices
Building
1 LOW LOW LOW LOW
2 LOW MID LOW LOW
3 LOW HIGH LOW LOW
4 LOW LOW MID LOW
5 LOW MID MID LOW
Fig. 7. Membership Function of Predicted House Prices Value 6 LOW HIGH MID LOW
7 LOW LOW HIGH HIGH
8 LOW MID HIGH LOW
9 LOW HIGH HIGH LOW
10 MID LOW LOW HIGH
11 MID MID LOW LOW
12 MID HIGH LOW LOW
13 MID LOW MID HIGH
14 MID MID MID LOW
15 MID HIGH MID LOW
16 MID LOW HIGH HIGH
17 MID MID HIGH HIGH
Fig. 8. Membership Function of Predicted Land Prices Value 18 MID HIGH HIGH LOW
19 HIGH LOW LOW HIGH
20 HIGH MID LOW HIGH
21 HIGH HIGH LOW LOW
22 HIGH LOW MID HIGH
23 HIGH MID MID HIGH
24 HIGH HIGH MID LOW
25 HIGH LOW HIGH HIGH
26 HIGH MID HIGH HIGH
27 HIGH HIGH HIGH HIGH
174
.
175
REFERENCES [8] Basu, Sabyasachi, and Thomas G. Thibodeau. "Analysis of spatial
autocorrelation in house prices." The Journal of Real Estate Finance and
[1] Raharjo, N. P. “Dinamika pemenuhan Kebutuhan Perumahan Economics 17, no. 1, pp. 61-85. 1998
Masyarakat Berpenghasilan rendah”. Semarang: Magister Teknik
[9] Cohen, Jeffrey P., Yannis M. Ioannides, and Win Wirathip
Pembangunan wilayah dan Kota Universitas Diponegoro, p. 30. 2010.
Thanapisitikul. "Spatial effects and house price dynamics in the USA."
[2] Fahirah, F., Basong, A. & Tagala, H. H. “Identifikasi Faktor yang Journal of Housing Economics 31, pp. 1-13. 2016.
Mempengaruhi Nilai Jual Lahan dan Bangunan pada Perumahan Tipe
Sederhana”. Jurnal Smartek, Volume 4, pp. 251 - 269 . 2010. [10] Helbich, Marco, and Daniel A. Griffith. "Spatially varying coefficient
models in real estate: Eigenvector spatial filtering and alternative
[3] Kuşan, Hakan, Osman Aytekin, and İlker Özdemir. "The use of fuzzy approaches." Computers, Environment and Urban Systems 57, pp. 1-11.
logic in predicting house selling price." Expert systems with 2016
Applications 37, no. 3, pp. 1808-1813. 2010.
[11] Klir, George, and Bo Yuan. Fuzzy sets and fuzzy logic. Vol. 4. New
[4] Gerek, Ibrahim Halil. "House selling price assessment using two Jersey: Prentice hall, 1995.
different adaptive neuro-fuzzy techniques." Automation in Construction
41, pp. 33-39. 2014. [12] Sivanandam, S., Sumanthi, S. & Deepa, S. Introduction to Fuzzy Logic
Using Matlab. Heidelberg: Springer-Verlag Berlin. 2007.
[5] Peterson, Steven, and Albert Flanagan. "Neural network hedonic pricing
models in mass real estate appraisal." Journal of Real Estate Research [13] Sayyed Mohsen Vazirizade, Saeed Nozhati, and Mostafa Allameh
31, no. 2, pp. 147-164. 2009 Zadeh, "Seismic Reliability Assessment Of Structure Using Artificial
Neural Network," Journal of Building Engineering, vol. XI, pp. 230-
[6] Cheung, Simon KC, and Sahminan Sahminan. "A Localized Model for 235, 2017.
Residential Property Valuation: Nearest Neighbor with Attribute
Differences." International Real Estate Review 20, no. 2, pp. 221-250. [14] Larose, Daniel T. "K‐nearest neighbor algorithm." Discovering
2017. Knowledge in Data: An Introduction to Data Mining, pp. 90-106. 2005
[15] Murai,GIS Work Book. Tokyo: Institute of Industrial Science. 1999.
[7] Byeonghwa Park and Jae Kwon Bae, "Using Machine Learning
Algorithms For Housing Price Prediction: The Case Of Fairfax County, [16] Chtiara, C. Implementasi sistem sistem informasi geografis (SIG)
Virginia Housing Data," Expert Systems with Applications, vol. XLII, Universitas Indonesia (UI) berbasis Web dengan menggunakan google
pp. 2928-2934, 2015. maps api. Universita Indonesia. Jakarta: Jurnal UI. 2008
176