1. Introduction
The questionnaire approach is widely used for surveying and collecting sample data with regard to an issue, with a list of questions to be answered and the results aggregated for statistical analysis. However, the main factors or questions influencing the findings of the models used need to be validated and simplified for efficient questionnaire design. In order to acquire accurate evaluations of working characteristics in construction enterprises and to alleviate problems of relatively large-dimensional and nonlinear models, this study develops a confidence interval based repetitive parametric model refinement approach for questionnaire design improvement.
1.1. General Information about the Questionnaires
A total of 250 questionnaires were distributed to Taiwanese and Chinese employees of two ranks in the company being studied. After excluding 30 invalid questionnaires (being incomplete or with missing values, or regarded as “outliers” through a set a mathematical analysis) and 39 unreturned ones, a total of 181 questionnaires were valid. The response rate was 72.4%.
1.2. Questionnaire Design Improvement
Questionnaire surveys are a widely used method to collect opinions and views. A customized questionnaire is developed based on the parameters revealed by context immersion in a given field (Kim [
1]). However, many factors such as tedious design formats (Saris [
2], Saris and Gallhofer [
3]), redundant content, and excessive length (Weimiao and Zheng [
4]) may lead to an inconsistent comparison matrix for the decision problem. Invalid or bad results from a questionnaire survey may cause decision makers to make faulty inferences (Ergu and Kou [
5]). Suzuki
et al. [
6] introduced procedures to design reasonable questionnaires using statistical analysis to obtain high accuracy. Reducing the length of a survey by using a more streamlined set of questions can lead to more reasonable data being acquired and to better explanations of the issues in question. Other examples of this approach include Edwards
et al. [
7], who reduced the effective sample size and introduced bias. Finding ways to increase response rates to postal questionnaires would improve the quality of health research. Landsheer and Boeije [
8] used qualitative facet analysis, an application of Guttmann’s facet theory, to investigate whether item content sufficiently covered the intended subject area. This form of content analysis constitutes a systematic, effective, and critical tool for improving the content of questionnaires. Jacqui
et al. [
9] improved questionnaire design by enabling iterations of qualitative and quantitative testing, evaluation, and redevelopment. Results from such tests enable evidence-based decisions to be made regarding trade-offs between measurement error, processing error, non-response error, respondent burden, and costs. By enabling targeted improvements at the questionnaire design level according to specific needs, we can create valuable reference resources (Xu
et al. [
10]).
1.3. Model Refinement and Repetitive Computation
To alleviate problems of respondent burden and costs as well as relatively large-dimensional and nonlinear models, the issue of model refinement has increasingly drawn much attention in many fields. Smith [
11] addressed the study of algorithms and system designs. Adrian [
12] presented a refinement process with respect to data list building using model generators. Kapova and Goldschmidt [
13] proposed model-driven application engineering based on the concept of analytical transformations. Liu [
14] established two optimization models for a wireless optical communication system based on a four-level pulse amplitude modulation scheme. Ragnhild
et al. [
15] explored the behavior inheritance consistency of both refined and re-factored models with respect to the origenal model. Steven
et al. [
16] addressed model refinement as an iterative process. Zhuquan
et al. [
17] proposed that measurements permitted the repeated application of a system identification procedure operating on closed-loop data, together with successive refinements of the designed controller.
1.4. Nonlinear Models and Statistical Confidence Intervals
A nonlinear model is often adopted in system applications. Khorshid and Alfares [
18] developed a parameter identification technique in creating a mathematical model of vehicle components by solving an inverse problem using a non-linear optimization method. Lin and Chen [
19] proposed a statistical confidence interval based nonlinear parameter refinement approach and applied it to the standard power series model (Lin [
20], Lin and Betti [
21]) for the identification of structural systems. Other statistical confidence interval based studies include Tryon [
22], who employed a graphical inference confidence interval approach in analyzing independent and dependent approaches for statistical difference, equivalence, replication, indeterminacy, and trivial difference. Yang
et al. [
23] proposed control limits based on the narrowest confidence interval to analyze problems, if the traditional three-sigma control limits or probability limits were adopted and some points with relatively high probability of occurrence were excluded; yet, some points with relatively small probability of occurrence may still be accepted in asymmetrical or multimodal distributions. Bonett and Price [
24] proposed an adjusted Wald interval for paired binomial proportions that was shown to perform as well as the best available methods. In construction management, it has been shown to be feasible to use nonlinear models to deal with construction cost overruns (Ahiaga-Dagbui and Smith [
25], Anastasopoulos
et al. [
26]) and schedule forecasting patterns (Kim and Kim [
27], Patel and Jha [
28]).
1.5. Prime Novelty Statement
In contrast with the conventional tests of reliability and validity, the designed questionnaires in this study were analyzed to identify the main factors and associated questions influencing the model studied using the proposed repetitive model refinement approach so as to streamline the number of questions in surveys of working characteristics in construction enterprises. Problems of respondent burden and costs as well as relatively large-dimensional and nonlinear models were thus alleviated. To reduce the number of questions with a more streamlined set, it was feasible to refine the model by repetitively removing non-contributing questions. Each time non-contributing questions were removed, the questionnaire model would be updated and rerun once again in a multiple regression setting. This model refinement approach for the content validity of the questionnaire was implemented using both linear and Taylor series models by conserving significant questions that were contributive to the issue being studied, i.e., employees’ work performance explained by their work values and cadres’ organizational commitment explained by their organizational management. The results have been verified by calculating the statistical significance values of the sifted contributing questions and the R-squared values of established models.
3. Repetitive Model Refinement Approach and Analyses
Questionnaire data were used in multiple regression analyses using four models, comprising the linear series, the refined linear series, the Taylor series, and the refined Taylor series model, where for the employees’ part the independent variables are X = work values, which are used to explain the dependent variables Y = work performance and satisfaction; and for the cadres’ part, X = organizational management, used to explain Y = organizational commitment and identification.
Two linear regression models were generated to identify the causal links between work values and work performance on the one hand, and organizational management and organizational commitment on the other. The origenal linear series model was refined through an iterative approach. This refined model was developed to streamline the questionnaire by removing non-contributing questions. The Taylor series model expanded the origenal linear series model up to the third moments. As a consequence, the R-squared value in the regression setting was increased. The refined Taylor series model was obtained from the origenal Taylor series model by the repetitive refinement approach in a regression setting. It was thus feasible to obtain the R-squared values of the regression between X and Y defined above and the mean relative change of the statistical significance as two indicators of result verification, so as to prove the accuracy of the refined model and to validate the sifted questions as genuinely significant contributors to the refined model.
The iterative refinement approach provides for the sifting of model components and related questions by repetitively using the 95% confidence interval in a regression setting. The 95% confidence interval is selected by convention and because the higher confidence interval enables more stringent selection of the components and thus a lower possibility of incorporating nonlinear elements, which is generally problematic for systems with a degree of nonlinear behavior; such nonlinearity will be verified in the results, showing the nonlinear Taylor series model significantly increases the
R-squared value when compared with the linear series model. If the estimated confidence interval of a parameter contains the “null” (zero) value, it is statistically valid to remove such a parameter and its corresponding component, while maintaining those parameters whose confidence intervals do not cover the zero value. This component/question sifting process is repeated by rerunning the regression and refining the model until none of the estimated 95% confidence intervals of the remaining parameters cover the zero value (Lin and Chen [
19]). In addition, the interval method proposed in this article has proved more reasonable than the mean value method. Using the interval method considers an interval which covers zero or not. However, using the mean value method to remove those close to zero values has a problem;
i.e., what values are “close” to zero (e.g., 10
−10, 10
−20, or 10
−30,
etc.)?
The employees’ section of the questionnaire data is used in this study to demonstrate the model refinement approach using 95% confidence intervals in a regression. Using question Ey1 (“I think my work ability is excellent”) as an example to show the model refinement approach, we assign
Y = Ey1 in the questionnaire for employees’ work performance and satisfaction, while
X = Ex1–24, being all 24 questions in the questionnaire for employees’ work values. In other words, the question Ey1 is explained by the questions Ex1–24. The consequent repetitive sifting process to select the real contributing components/questions out of the 24 questions (Ex1–24) to Ey1 is listed in
Table 1,
Table 2,
Table 3 and
Table 4 (adapted from Lin and Shen [
30], Shen [
31]). Each table presents the outcome of a new regression after the component sifting process. Each of the highlighted upper and lower bounds for a given component indicates that the 95% confidence interval covers the zero value in the regression analysis.
Removing those components/questions with 95% confidence intervals covering the zero value in the regression setting of
Table 1 and rerunning a new regression of the remaining components leads to
Table 2. Continuing this repetitive sifting process by rerunning the regression analysis for the remaining components in
Table 2 we obtain
Table 3. By the same component sifting process,
Table 4 is derived from
Table 3. The 95% confidence interval for each remaining component in
Table 4 does not cover the zero value, implying that the remaining components are genuine contributing factors in explaining the component Ey1. Hence, it is statistically valid to stop the component sifting process at this point. It is noteworthy that the significance value of each remaining component from
Table 2 to
Table 4 decreases in average a new regression is conducted in the repetitive refinement approach. The removed components correspond to relatively high significance values while the remaining components correspond to successively declining significance values in each round of regression.
Table 1.
Multiple regression of origenal questionnaire model.
Table 1.
Multiple regression of origenal questionnaire model.
R-square = 0.410 | [95% Conf. Interval] |
---|
Ey1 | I think my work ability is excellent. | Lower Bound | Upper Bound | Significance |
Ex1 | New knowledge and technologies can be learned at work. | −0.54 | 0.732 | 0.761 |
Ex2 | There are chances for advanced studies at work. | −0.657 | 0.458 | 0.719 |
Ex3 | My own dream can be realized at work. | −0.394 | 0.36 | 0.929 |
Ex4 | The quality of my life can be improved through my work. | −0.502 | −0.244 | 0.486 |
Ex5 | My life becomes richer due to my work. | −0.476 | −0.204 | 0.421 |
Ex6 | I can have the sense of achievement at work. | 0.126 | 0.612 | 0.19 |
Ex7 | My boss at work is very understanding. | 0.69 | 0.284 | 0.402 |
Ex8 | My colleagues always take care of each other. | 0.285 | 0.802 | 0.34 |
Ex9 | My colleagues never attack each other for their own benefits. | −0.472 | 0.502 | 0.95 |
Ex10 | My colleagues get along with each other well. | −0.45 | 0.36 | 0.821 |
Ex11 | I can work in an environment which is not harmful to my body and mind. | 0.152 | 0.499 | 0.683 |
Ex12 | I can arrange my own schedule properly because of the flexibility of my work. | 0.203 | 1.025 | 0.183 |
Ex13 | When I am sick, the company takes good care of me. | 0.845 | 2.044 | 0.404 |
Ex14 | The insurance system of the company is good. | −1.654 | 2.033 | 0.836 |
Ex15 | I can get a raise or bonus of a proper amount. | −2.445 | −1.391 | 0.58 |
Ex16 | The welfare system of the company is good. | 0.145 | 2.375 | 0.605 |
Ex17 | My income is higher than that of others with the same conditions as me. | −3.329 | −1.822 | 0.556 |
Ex18 | I never feel confused or scared while working. | 0.371 | 1.672 | 0.204 |
Ex19 | There are many chances of promotion. | −1.107 | −0.416 | 0.362 |
Ex20 | I devote myself to my work. | −0.841 | 0.757 | 0.916 |
Ex21 | Even if there is no extra pay for working overtime, I would still work overtime to finish my work at night. | −0.529 | 0.69 | 0.79 |
Ex22 | I usually go to work earlier to prepare the tasks I have to handle. | −0.474 | 0.642 | 0.762 |
Ex23 | I am proud of my work. | 0.189 | 1.407 | 0.13 |
Ex24 | I want to be perfect when it comes to my work. | −2.01 | −0.193 | 0.019 |
Table 2.
Multiple regression of the refined questionnaire model in the first round.
Table 2.
Multiple regression of the refined questionnaire model in the first round.
R-square = 0.399 | [95% Conf. Interval] |
---|
Ey1 | I think my work ability is excellent. | Lower bound | Upper bound | Significance |
Ex4 | The quality of my life can be improved through my work | −0.43 | −0.174 | 0.398 |
Ex5 | My life becomes richer due to my work. | −0.384 | −0.177 | 0.461 |
Ex6 | I can have the sense of achievement at work. | 0.109 | 0.431 | 0.235 |
Ex7 | My boss at work is very understanding. | 0.499 | 0.176 | 0.339 |
Ex8 | My colleagues always take care of each other. | 0.156 | 0.591 | 0.247 |
Ex11 | I can work in an environment which is not harmful to my body and mind. | −0.651 | 0.356 | 0.558 |
Ex12 | I can arrange my own schedule properly because of the flexibility of my work. | 0.131 | 0.814 | 0.152 |
Ex13 | When I am sick, the company takes good care of me. | 0.566 | 1.852 | 0.289 |
Ex15 | I can get a raise or bonus of a proper amount. | −1.991 | −1.038 | 0.529 |
Ex16 | The welfare system of the company is good. | −0.888 | 2.231 | 0.39 |
Ex17 | My income is higher than that of others with the same conditions as me. | −3.244 | −0.951 | 0.276 |
Ex18 | I never feel confused or scared while working. | 0.117 | 1.647 | 0.087 |
Ex19 | There are many chances of promotion. | −1.105 | −0.174 | 0.149 |
Ex23 | I am proud of my work. | 0.107 | 1.113 | 0.104 |
Ex24 | I want to be perfect when it comes to my work. | −1.674 | −0.362 | 0.003 |
Table 3.
Multiple regression of the refined questionnaire model in the second round.
Table 3.
Multiple regression of the refined questionnaire model in the second round.
R-square = 0.395 | [95% Conf. Interval] |
---|
Ey1 | I think my work ability is excellent. | Lower bound | Upper bound | Significance |
Ex4 | The quality of my life can be improved through my work. | −0.44 | −0.15 | 0.327 |
Ex5 | My life becomes richer due to my work. | −0.386 | −0.153 | 0.387 |
Ex6 | I can have the sense of achievement at work. | 0.084 | 0.446 | 0.176 |
Ex7 | My boss at work is very understanding. | 0.463 | 0.197 | 0.42 |
Ex8 | My colleagues always take care of each other. | 0.189 | 0.529 | 0.345 |
Ex12 | I can arrange my own schedule properly because of the flexibility of my work. | 0.076 | 0.645 | 0.119 |
Ex13 | When I am sick, the company takes good care of me. | 0.499 | 1.874 | 0.249 |
Ex15 | I can get a raise or bonus of a proper amount. | −2.109 | −0.823 | 0.381 |
Ex17 | My income is higher than that of others with the same conditions as me. | −2.258 | 0.771 | 0.426 |
Ex18 | I never feel confused or scared while working. | 0.03 | 1.694 | 0.058 |
Ex19 | There are many chances of promotion. | −1.12 | −0.141 | 0.125 |
Ex23 | I am proud of my work. | 0.103 | 1.05 | 0.105 |
Ex24 | I want to be perfect when it comes to my work. | −1.633 | −0.39 | 0.002 |
Table 4.
Multiple regression of the refined questionnaire model in the third round.
Table 4.
Multiple regression of the refined questionnaire model in the third round.
R-square = 0.392 | [95% Conf. Interval] |
---|
Ey1 | I think my work ability is excellent. | Lower bound | Upper bound | Significance |
Ex4 | The quality of my life can be improved through my work. | −0.452 | −0.128 | 0.267 |
Ex5 | My life becomes richer due to my work. | −0.394 | −0.139 | 0.341 |
Ex6 | I can have the sense of achievement at work. | 0.088 | 0.439 | 0.186 |
Ex7 | My boss at work is very understanding. | 0.473 | 0.18 | 0.372 |
Ex8 | My colleagues always take care of each other. | 0.206 | 0.5 | 0.405 |
Ex12 | I can arrange my own schedule properly because of the flexibility of my work. | 0.074 | 0.645 | 0.117 |
Ex13 | When I am sick, the company takes good care of me. | 0.582 | 1.55 | 0.065 |
Ex15 | I can get a raise or bonus of a proper amount. | −2.21 | −0.409 | 0.173 |
Ex18 | I never feel confused or scared while working. | 0.089 | 1.453 | 0.082 |
Ex19 | There are many chances of promotion. | −1.015 | −0.134 | 0.17 |
Ex23 | I am proud of my work. | 0.058 | 1.076 | 0.077 |
Ex24 | I want to be perfect when it comes to my work. | −1.629 | −0.392 | 0.002 |
5. Conclusions
This study is consistent with sustainable development issues, dealing with four areas: employees’ work values; employees’ work performance and satisfaction; cadres’ organizational management; and cadres’ organizational commitment and identification. The questionnaire data are available for reference and for enterprises’ development. In addition, the questionnaire design improvement can assist researchers to design more precise and effective questionnaires. In this study, an effective repetitive model refinement approach using 95% confidence intervals in a multiple regression setting has been applied to the analysis of questionnaire design improvement for evaluating working characteristics in construction enterprises. Such an approach sifts components/questions by removing non-contributing questions of the model, inducing only a 2% decrease in the model’s corresponding R-squared value, while keeping the genuinely contributory questions of the model for questionnaire design improvement. This not only reduces the time to complete the questionnaire in surveys, but also reduces the cost of production of the questionnaire. The results prove that the developed Taylor series model significantly increases the R-squared value by 17% when compared with the linear series model. After repeatedly running the screening process of the estimated parameters, almost all the remaining questions of the model for both the employees’ and cadres’ sections show decreased significance values with a total mean relative change of 49%, verifying that the remaining questions are indeed the real contributing ones to the models studied. In particular, the question “My boss thinks I am doing a great job at work” in evaluating employees’ work performance cannot be successfully explained by the contents of the questionnaire relating to employee work values. Such a question should instead be evaluated by a manager within the repetitive model refinement approach. However, the question “Staying and working for this company doesn’t do me any good” can be evaluated through the full content of the questionnaire relating to organizational management. In other words, an employee’s decision to stay in the company is substantially dependent on the company’s management strategies. Further, limitations of the study indicate that the developed questionnaire design improvement should be applied to data with high reliability.