Abstract

Industry is an important pillar of the national economy, and industrial projects are the most complex and difficult to manage and control in the construction industry; thus, the resource scheduling control of industrial projects is one of the core issues for industrial construction projects. The performance rate of the contract time periods of previous industrial construction projects has been very low. In scheduling control based on case-based reasoning (CBR), the goal is to implement preventive measures by referring to existing scheduling control cases and control the scheduling of resources through reasoning on emergency measures to prevent scheduling control deviations. In this paper, the rough set approach is used to represent the case feature information in a case reasoning model for industrial project scheduling control, attribute reduction is used to determine the weights of the feature attributes in the rough set representation, and the similarity between cases is calculated for case retrieval. The accuracy of the rough-set-based similarity calculation is verified through matrix similarity calculations and a visual analysis of the all closeness centrality and weighted all degree centrality of the corresponding complex network; thus, similar cases of industrial project scheduling control are identified. To verify the applicability and effectiveness of the proposed methodology, a typical coal chemical general contract project case is carried out. The rough set comprehensive similarity results were 0.733, 0.621, 0.536, 0.614, 0.559, 0.950, 0.708, 0.546, 0.733, 0.664, 0.526, and 0.743, and the matrix similarity results were 0.417, 0.583, 0.417, 0.417, 0.417, 0.833, 0.417, 0.500, 0.417, 0.500, 0.333, and 0.500. The results showed that the case retrieval accuracy of traditional matrix similarity is not as high as the rough set comprehensive similarity, so is the most similar case to the target case Y. Case retrieval results indicate that the proposed methodology can provide a good similar case selection strategy with project managers, and the final required preventive measures for the target case can be found. Based on the identified similar cases, preventive measures for scheduling control are formulated to effectively prevent scheduling deviations of industrial projects.

1. Introduction

With the development of science and technology, industrial construction projects are becoming increasingly large and complex. Industrial construction projects are large in scale and involve large investments, complex process systems, dense layouts, open-air environments, automatic control, high construction technology requirements, many design companies, many suppliers, many construction companies, many collaborating companies, many work interface relations, and long construction periods, which make it difficult to control the resource scheduling problem. The performance rate of the contract time period is typically very low, and it is difficult to deliver projects within contractual deadlines. Furthermore, for project management perspective, the construction industry is an empirical industry. Looking for similar past projects is crucial to project management; it can help project managers learn from past experiences and avoid past mistakes. The urgent problem of industrial project is scheduling control for success. Therefore, a new method of industrial project scheduling control based on reasoning concerning emergency measures is needed to ensure the contract time period performance of industrial projects.

The project scheduling problem was first proposed in the 1960s, and the research of Conway et al. [1] on the scheduling problem is usually regarded as the formal beginning of scheduling theory research. Subsequently, Pritsker et al. [2] proposed a resource-constrained project scheduling problem in 1969. With further research, project scheduling theory has been expanded and enriched. Since the 1980s, with the development of computer technology and the rise of artificial intelligence algorithms, the focus of scheduling research has shifted from theory to applications, and scholars have begun to devote themselves to solving complex project scheduling problems. At present, great progress has been made in scheduling theory in terms of theoretical modeling, algorithm design, and experimental verification of the project scheduling problem [1, 2].

Pritsker et al. [2] first proposed the basic resource-constrained project scheduling problem (RCPSP) model. RCPSPs represent an important class of combinatorial optimization problems in operations research. RCPSP research includes research on robust RCPSPs, random RCPSPs, fuzzy RCPSPs, the critical chain method, and interfering project scheduling theory.

Aritigues et al. [3] took minimizing the absolute regret value under all scenarios as the goal when solving a robust RCPSP and adopted heuristic algorithms based on integer programming and scenario relaxation to solve for the earliest execution strategy for each activity. Wu et al. [4] proposed four cloud-theory-based simulated annealing (CSA) hyperheuristic algorithms incorporating seven low-level heuristics to solve a robust two-stage assembly flowshop problem with scenario-dependent processing times. Wu et al. [5] also proposed five heuristics, adopting combined two-scenario-based processing times to produce initial solutions and then improve each solution through pairwise interchange.

Case-based reasoning (CBR) is an important branch of artificial intelligence in which existing cases are used to solve new problems. Based on the existing cases, the degrees of similarity between different engineering cases are based on selected case feature attributes and their weight coefficients in order to identify similar cases that can be used to solve new problems. Rough set theory is a data analysis theory proposed by Polish mathematician Pawlak et al. [6] in 1982, which is considered a new mathematical tool for dealing with fuzzy and uncertain problems. A large amount of literature shows that the application of rough set in case-based reasoning is very correct. Previous researches on scheduling control mainly focused on resource optimization, which neither considered the reference significance of similar case measures to resource optimization nor applied case-based reasoning to scheduling control.

The existing literature review had made a lot of contributions for case retrieval of industrial projects, but there is no systematic and scientific method.

Since the beginning of the 21st century, case-based reasoning has undergone rapid development and has been applied in the contexts of construction safety accidents, risk management, emergency decision-making, network public opinion analysis, and so on. Yan and Li [7] applied ontology-supported case-based reasoning to emergency treatment for construction safety accidents. Tan et al. [8] proposed an emergency decision-making method based on case-based reasoning in a heterogeneous information environment. In 2020, Xie et al. [9] proposed a case-based reasoning model for emergency network public opinion. Case-based reasoning models have been successfully applied for the emergency treatment of project management and project control. Zhai et al. [10] have taken advantage of advanced artificial intelligence techniques, in particular, the case-based reasoning approach, to estimate the reference evapotranspiration and therefore to calculate the amount of irrigation water in grape farming. For improving the current case-based reasoning approach, especially the solution revision part, this paper proposes a learning-based adaptation strategy by fully making use of the hidden information in the case base. Cao et al. [11] proposed a novel reasoning strategy based on an extensible P-Graph. In the proposed methodology, P-Graph is regarded as the basic method of superstructure optimization. In the absence of available solutions, a reasoning strategy is integrated into extensible P-Graph to obtain new feedstock scheduling solutions. Park et al. [12] proposed a case-based reasoning- (CBR-) based model for estimating the time when the first repair will be needed after the completion of construction, even in phases where maintenance-related information is scarce. CBR and fuzzy-analytic hierarchy process (AHP) were employed as research methodologies. The method will help in the preliminary estimation of the repair time of building components.

Industrial projects are the most complex and difficult to manage and control in the construction industry; accordingly, there are still deficiencies in theoretical applications with the goal of effectively using past construction experience and selecting appropriate construction cases to guide the scheduling control of proposed industrial projects. The research question is how to effectively carry out scheduling control through similar cases.

Therefore, the application of a case-based reasoning model for emergency treatment and deviation control in project scheduling can compensate for the deficiencies of project scheduling control theory. The similarity calculation and visual analysis of the all closeness centrality and weighted all degree centrality are the way to solve this problem.

2. Reasoning Model for Emergency Measures in Industrial Project Scheduling Control

The proposed reasoning model for emergency measures in industrial project scheduling control uses rough set theory to approach the selection of feature attributes for similarity measurement. The weights of feature attributes can be determined by using only the existing information, without assuming any preparatory information or additional data. The reasoning model combines rough set theory and case-based reasoning to solve the problems of feature information representation and case similarity calculation. The proposed reasoning model for emergency measures in industrial project scheduling control is divided into four steps: feature information representation, case retrieval, case recommendation, and case base maintenance. The proposed model is shown in Figure 1.

2.1. Feature Information Representation

With the incorporation of the basic concept of rough set theory, the feature information of a sample case can be expressed as an information system . Here, , where is the i-th case in the historical case set for industrial project scheduling control; U is a nonempty finite set. , where C is a limited set that represents the feature attribute information relevant to scheduling control, such as technical factors, procurement factors, construction factors, and management factors, whereas D is the decision attribute set, which indicates the preventive measures taken, such as resource allocation and scheme optimization; . V is the set of attribute values. f is an information function of the form . An industrial project scheduling control case has m feature attributes, expressed as , where is the soft value of the m-th feature attribute of . The set of sample cases can be expressed as , where n represents the number of sample cases. The target case is represented by Y, where .

2.2. Rough Set for Case Retrieval

For case retrieval, the target case is compared with the sample cases by calculating the similarities between cases to identify which sample cases are most similar to the target case, based on which the corresponding preventive measures can be output.

The comprehensive similarity between target case Y and sample case is denoted by . The calculation formula is as follows:where is the similarity between and , which are the j-th feature attributes of target case Y and sample case , respectively, and is the weight of attribute j. The weights satisfy . By comparing the comprehensive similarity scores, the most similar cases are selected for reference.

2.2.1. Determination of Attribute Weights

Through the concept of the rough degree in rough set theory as well as the dependence and importance of the attributes, redundant attributes are simplified, and useful feature information is extracted. Thus, rough set theory is used to assign weights to the case feature attributes.

If is a subset of case attribute A, then the indistinguishability relation determined by is

For X ∈ U, the subset is called the lower approximation set of X, also known as the positive domain of X, denoted by .

Therefore, the dependence of decision attribute D on the conditional attribute set (indicator) is as follows:

In accordance with the above, the importance degree of attribute in the case group can be calculated via the following formula:

The obtained attribute importance degree is normalized to obtain the weight of attribute :

2.2.2. Determination of Attribute Similarity

A record of industrial project scheduling control event information will contain various types of feature information; thus, it is necessary to determine a method of calculating the similarity in accordance with the way in which these feature attributes are expressed. In this model, the main feature expression types are numerical and symbolic.(1)Numerical features are represented by precise numerical values, such as resource allocation deviation and progress deviation. The Manhattan distance formula is used for the similarity calculation:(2)The properties of a symbolic feature are represented by exact text. For the similarity calculation, when the attribute value of the target case is the same as that of the sample case for the considered attribute, the similarity is 1; otherwise, it is 0. The formula is as follows:

2.3. Matrix Similarity Calculation and Complex Network Visualization Analysis for Case Retrieval

For the comprehensive similarity between the target case Y and sample case , the calculation formula is defined as follows:

The similarity between the target case Y and the sample case is represented in the form of a matrix with elements , where represents the similarity between target case Y and sample case in terms of their corresponding values and of the j-th feature attribute. The value of can be 1 or 0, where 1 indicates that target case Y and sample case are similar in terms of the corresponding feature information and 0 indicates no similarity.

A complex network is also used to represent the similarity between target case Y and each sample case , where  = 1 means that the cases are connected by an edge and  = 0 means that the cases are not connected; more generally, the value of represents the weight of the complex network arc between the two cases.

The complex network centrality result is a vector that includes the centrality indices of each node and the entire network, calculated as the and of the target case Y. These centrality indices are used to identify similar cases based on visualizations of the corresponding complex networks.

2.4. Case Recommendation

After case retrieval and similarity calculation, when the similarity between a retrieved sample case and the target case is high, the preventive measures corresponding to the sample case are confirmed by experts as appropriate preventive measures for the target case; when the differences between the sample cases and the target case are very large, various aspects of the preventive measures corresponding to many sample cases may need to be combined to form a suitable scheme through repeated revision by experts. After assessment, the preventive measures for the target case are determined.

2.5. Case Base Maintenance

The process of case base maintenance includes adding and deleting cases and adjusting the structure of the case base.

Regarding the addition and deletion of cases, experts will evaluate whether a case should be reused after the final solution measures for the problem case have been output. Representative cases of each case type in the case library will be selected for continued storage, and redundant and out-of-date cases will be selectively deleted.

Regarding case base structure adjustment, in accordance with the features of the industrial project scheduling control event domain and newly updated domain information in combination with the particular situation that is the current target of reasoning, the structure of the case base is adjusted regularly to ensure its quality and retrieval efficiency.

3. Case Study

The chemical, petrochemical, thermoelectric, metallurgy, and other industrial projects all adopt the same design, procurement, and construction standards, while chemical projects are the most complex of the whole industrial projects, so a coal chemical general contract project is typical.

A coal chemical industry EPC project is taken here as an example because of the many scheduling control problems that arose during the project execution process, as follows:(1)In the detailed design stage, the process plant/unit frame structure had to be changed from the original concrete structure design scheme to a steel structure design scheme, which caused delays in the frame construction schedule.(2)In the intermediate stage of procurement, the key long-term reactor transportation process was affected by transportation difficulties delaying the arrival of equipment, leading to delays in the reactor equipment installation period.(3)In the later stage of construction, the allocated construction manpower was insufficient, which caused delays in the construction process.

To avoid the occurrence of dispatching control accidents, preventive measures in scheduling control are necessary in advance.

3.1. Feature Information Representation Based on the Rough Set Reasoning Model

The selection of feature attributes is based on the item attributes that affect scheduling control, such as contract period, contract amount, process technology, changes to contract scope, changes to major design scheme, delay of key equipment, change in construction scheme, allocation of direct construction labor, schedule impact of safety accidents, scope of total time deviation, and node type affected by schedule node deviation. These characteristics and attributes cover all aspects of project execution, including contract, technology, schedule, cost, quality, safety, design, and construction. The value assignment is based on the experience of previous industrial projects.

The feature information is represented by attributes , with the value assignments shown in Table 1.

To facilitate calculation, 12 previous cases of industrial project scheduling control associated with preventive measures are selected from the case database, expressed as . The feature attribute values for each sample case are expressed as , and the target case is similarly represented by . The case features are shown in Table 2.

3.2. Case Retrieval Based on the Rough Set Reasoning Model
3.2.1. Attribute Weight Calculation

According to the rough set calculation method, the weights are calculated as follows:

After the removal of attribute ,

Thus, the importance degree of ’ is

After the removal of attribute ,

Thus, the importance degree of ’ is

After the removal of attribute ,

Thus, the importance degree of ’ is

Similarly, it is calculated that

Then, the weight of each attribute can be calculated as follows:

Thus, the following values are obtained:

3.2.2. Attribute Similarity Calculation

Following the similarity calculation method presented in the previous section, the similarity of the feature attributes between the target case and each sample case is calculated, where , , , , , and are symbolic attributes and , , , , and are numerical attributes. The calculation results are shown in Table 3.

For example, the similarity between sample case and target case Y in terms of the contract period, , and the similarity between sample case and target case Y in terms of are calculated as follows:

3.2.3. Case Similarity Calculation

Using formula (1), the comprehensive similarity between the target case and each sample case is calculated. The calculation results are shown in Table 4.

In accordance with the comprehensive similarity and the basic information of the cases, the most similar case is finally determined to be .

3.3. Case Retrieval Based on the Matrix Similarity Calculation and Complex Network Visualization Analysis

In the matrix similarity calculation, the comparative analysis matrix is used to record the information on various feature comparisons in a simple form. This matrix can then be used to find the correlation between the target case and a sample case once the information on each feature for the target case Y and each sample case has been determined and compared in a standard form.

For example, the similarity between target case Y and sample case can be determined based on the feature information for each node type (C1C12: contract period, contract amount, process technology, changes to contract scope, changes to major design scheme, delay of key equipment, change in construction scheme, allocation of direct construction labor, schedule impact of safety accidents, schedule impact of quality accidents, scope of total time deviation, and node type affected by schedule node deviation).

The matrix similarity calculation allows similar cases to be grouped together, important parameter information (contract period, contract amount, process technology, etc.) to be extracted, and the case correlations to be described based on a complex network correlation diagram. The selected parameter information or feature information is not necessarily the general comparison standard but rather represents the relatively important parameter information or feature information for the target case as extracted and summarized from the general comparison standard. Then, the similarity matrix is constructed according to the parameter information, and the similarity matrix between cases is obtained. Finally, the correlation diagram between the target case and the sample cases is drawn based on the similarity matrix between cases and used to obtain the correlation degrees between cases (see Tables 5 and 6 for details).

According to the similarity matrix between cases, the similarity analysis diagram is drawn and analyzed by means of complex network theory. The and are calculated; see Figures 2 and 3, respectively, for details.

In Figures 2 and 3, the green node represents the target case, and the grey nodes represent sample cases.

In Figure 2, it is clearly seen that the centrality between the target case Y and sample case is 1. The weight of the edge between target case Y and sample case is 10, higher than that of the edge between the target case Y and any other sample case , indicating that is closest to the target case.

Similarly, in Figure 3, it is clearly seen that the values of the target case Y and the sample case are 71 and 69, respectively. Sample case has a node weight of 69, larger than that of any of the other sample cases . Therefore, the most similar case to the target case Y can be identified as case .

Such a case similarity evaluation can assist in determining the correlation degree between target case Y and sample case . The similarity between target case Y and sample case is expressed as a percentage. When there is absolutely no similarity between the cases, this situation is expressed by a value of 0%; when the feature information is identical, indicating a perfect correlation between the cases, the similarity is 100%. In general, the similarity between the two cases will be between 0% and 100%. A value between 1% and 40% is considered to mean that the cases show no similarity, values of 41–60% represent cases with low similarity, values of 61–80% represent cases with high similarity, and values of 81% to 100% represent cases with very high similarity.

For example, in the above example, the similarity percentages based on the twelve feature information indicators between the target case Y and the sample cases are as shown in Table 7 (reported to two decimal places).

The similarity percentages of sample cases , , , , , , , , , , and are between 41% and 60%, which means that these are cases with low similarity to the target case. The similarity percentage of is 83.33%, between 81% and 100%, which means that it shows a very high similarity to the target case. Therefore, we choose as the most similar case to the target case Y.

3.4. The Applicability, Effectiveness, and Reliability for Case Retrieval Based on the Rough Set Reasoning Model

The above rough set case retrieval successfully calculated the similar case with the target case, indicating that rough set case retrieval is applicable to the reasoning concerning emergency measures for industrial project scheduling control.

After comparing the similarity between rough set case retrieval and matrix case retrieval, it can be seen that, for 12 similar cases, matrix case retrieval calculated 5 kinds of similarity, the highest similarity is 0.833, while rough set case retrieval calculated 11 kinds of similarity, the highest similarity is 0.950. The calculation results are shown in Table 8. The results showed that the case retrieval accuracy of traditional matrix similarity is not as high as the rough set comprehensive similarity, so the effectiveness and reliability of rough set case retrieval are superior to the traditional case retrieval method.

3.5. Formulation of Preventive Measures for the Target Case

According to the results of the above case similarity calculation, the preventive schemes associated with sample case are also the proposed scheduling control schemes for target case Y. The recommended schemes are listed in Table 9.

By comparing the basic engineering situations between sample case and target case Y, we can see that the basic situation is roughly similar for both cases. Therefore, the proposed solutions for can be directly output as the accident prevention measures for target case Y after some slight modifications, as shown in Table 10.

In the preliminary planning stage of such a coal chemical industry EPC project, the project management personnel should review the design, procurement, and construction scheme; then, follow-up on the case execution record should be completed during project development in accordance with the evaluation results of the project team and department experts; and finally, the new case should be stored in the case database to prepare for matching with similar cases in the future for the recommendation of solutions. Hence, similar projects can adopt the same scheme to prevent the occurrence of scheduling control deviations.

4. Discussion

This study contributes to the literature by exploratively examining the similar projects of scheduling control. There has been limited research into the scheduling control. How to find similar cases of scheduling control through intelligent computing is a direction of scheduling control, and how to establish an effective scheduling similar case base is another direction of scheduling control.

5. Conclusions

Through case study, the preventive schemes associated with sample case are also the proposed scheduling control schemes for target case Y, and the proposed solutions for can be directly output as the accident prevention measures for target case Y for the target case scheduling control.

Deviations in project scheduling control will cause delays in project duration and increases in cost. It is necessary and significant for the planning and execution of new projects to prevent scheduling control deviations by referring to the scheduling control cases of previous projects. In this paper, rough set theory is introduced into the CBR method; the case feature information is represented in the form of an information table, and the weights of the feature attributes are effectively determined in accordance with historical data. The CBR model can be used to solve the problem of project scheduling control case similarity calculation while effectively avoiding dependence on subjective experience, improving the validity and credibility of the case retrieval results, and providing a new direction and basis for the formulation of preventive measures for early warning and feedback tracking feedback in industrial project scheduling control. The application of the rough-set-based similarity calculation and the complex network to scheduling control is demonstrated by a case study.

This paper has obtained the following conclusions:(1)Rough set theory is incorporated into CBR. The feature attribute weights are determined by using attribute reduction to calculate the similarity between cases, and analyses based on matrix similarity calculation and the visualization of complex network density degree centrality and degree weights are added to review the rough set calculation in order to ensure accurate case similarity calculation and case retrieval.(2)The proposed reasoning model for emergency measures can be applied in the scheduling control of industrial projects, which is an excellent way to provide effective case support and decision data for the improvement of early warning and feedback tracking theory in project scheduling control.(3)A knowledge base for reasoning concerning emergency measures for industrial project scheduling control has a certain application value for the subsequent execution of industrial projects and future scheduling control cases. In view of the importance of early warning and feedback tracking in industrial project scheduling control, further research is warranted.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This study was supported by the National Natural Science Foundation of China (51808424 and 51478384) and the Industrial Building Environment and Energy Conservation Innovation Team of China (2017KCT-14).

Supplementary Materials

Table S1: feature information for sample industrial project scheduling control cases. Table S2: matrix similarity for sample industrial project scheduling control cases. (Supplementary Materials)