A machine learning approach for flow shop scheduling problems with alternative resources, sequence-dependent setup times, and blocking

Benda, Frank; Braune, Roland; Doerner, Karl F.; Hartl, Richard F.

doi:10.1007/s00291-019-00567-8

Regular Article
Open access
Published: 18 November 2019

Volume 41, pages 871–893, (2019)

OR Spectrum

Frank Benda ORCID: orcid.org/0000-0001-5553-1683^1,2,
Roland Braune³,
Karl F. Doerner^3,4 &
…
Richard F. Hartl³

Abstract

In proposing a machine learning approach for a flow shop scheduling problem with alternative resources, sequence-dependent setup times, and blocking, this paper seeks to generate a tree-based priority rule in the form of a well-performing decision tree (DT) for dispatching jobs. A further goal of our research was to generate a generic DT and random forest (RF) that yield competitive results for instance scenarios that structurally differ from the training instances. The proposed DT relies on high-quality solutions obtained using a constraint programming (CP) formulation. Novel aspects include a unified representation of job sequencing and machine assignment decisions, as well as the generation of RFs to counteract overfitting behaviour. To show the performance of the proposed approaches, different instance scenarios for two objectives (makespan and total tardiness minimisation) were implemented, based on randomised problem data. The background of this approach is a real-world physical system of an industrial partner that represents a typical shop floor for many production processes, such as furniture and window construction. A comparison of the DT and RF approaches with two priority dispatching rules, the original CP solutions, and tight lower bounds retrieved from a strengthened mixed-integer programming (MIP) formulation shows that the proposed machine learning approach performs well in most instance sets for the makespan objective and in all sets for the total tardiness objective.

1 Introduction

With the emergence of topics related to smart manufacturing and data analytics, established machine learning approaches may gain new life as a means of achieving deeper insights into production processes (Wuest et al. 2016). Machine learning approaches can be used to capture complex processing environments in such a way that scheduling policies, in particular dispatching rules, can be derived.

In this context, we propose a machine learning approach that is used to generate a tree-based priority dispatching rule for material movement decisions. The subject of this paper is a problem setting based on a real-world physical system of our industrial partner—a business consultancy located in Vienna. In essence, it can be classified as a hybrid flow shop scheduling problem with alternative resources, sequence-dependent setup times, limited intermediate buffers, and blocking. A transport resource with limited capacity is also part of the model configuration.

We intend to show that our machine learning approach performs well in such scheduling problems, with makespan and total tardiness minimisation as objectives, using solution information obtained from optimisation runs of a constraint programming (CP) solver that can provide high-quality, feasible solutions. These solutions are then read in by a deterministic shop floor simulator and transformed into training examples. Finally, the training examples are used to build a decision tree (DT). The training examples consist of pairwise comparisons of all possible material movements, their attributes, and the corresponding classification. To calculate the classification error, we build the tree on a training set and test it on both the training set and a test set. We use a cross-validation procedure to estimate the off-training-set error rate of the tree. The DT acts as a classifier, indicating whether a possible job movement should be conducted or not.
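The construction of pairwise training examples and the cross-validation split described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation used in the paper: the attribute names (`proc_time`, `due_date`), the encoding of a comparison as attribute differences, the decision to keep only pairs that involve the executed move, and the function names `pairwise_examples` and `k_fold_splits` are all hypothetical.

```python
from itertools import permutations


def pairwise_examples(candidates, chosen):
    """Build labelled examples from one dispatching decision.

    candidates: dict mapping a move id to its attribute dict (hypothetical attributes).
    chosen: the move that the reference solution (e.g. from the CP solver) executed.
    Each example compares move a with move b; the label is "yes" if a is the
    executed move and "no" if b is. Pairs without the executed move are skipped
    (an assumption made here), since they carry no preference information.
    """
    examples = []
    for a, b in permutations(candidates, 2):
        if chosen not in (a, b):
            continue
        features = {
            f"diff_{key}": candidates[a][key] - candidates[b][key]
            for key in candidates[a]
        }
        examples.append((features, "yes" if a == chosen else "no"))
    return examples


def k_fold_splits(n_examples, k):
    """Yield (train_indices, test_indices) for k-fold cross-validation."""
    indices = list(range(n_examples))
    for fold in range(k):
        test = indices[fold::k]                         # every k-th example
        train = [i for i in indices if i % k != fold]   # all remaining examples
        yield train, test


# Toy decision point with three possible material movements.
moves = {
    "J1->M1": {"proc_time": 5, "due_date": 20},
    "J2->M1": {"proc_time": 3, "due_date": 15},
    "J3->M2": {"proc_time": 4, "due_date": 30},
}
examples = pairwise_examples(moves, chosen="J2->M1")
```

Each fold produced by `k_fold_splits` would be used to fit a tree on the training indices and measure the misclassification rate on the held-out indices; averaging over the folds gives an estimate of the off-training-set error.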

Prior literature offers different approaches to exploit the training examples for building a DT. Shahzad and Mebarki (2012) generate the training examples using an optimisation module that solves instances based on tabu search. Olafsson and Li (2010) transform dispatching lists, created by a simulated scheduler in combination with a weighted earliest due date rule (EDD), into a training set.

To the best of our knowledge, the combination of our flow shop scheduling problem characteristics has not yet been discussed in the literature. Flow shops with limited intermediate buffers in general are considered by Brucker et al. (2003) and Leisten (1990). Hall and Sriskandarajah (1996) describe blocking as a lack of storage, noting that the flow shop problem with a finite buffer is a common scheduling problem. Mascis and Pacciarelli (2002) introduce blocking caused by a processed job waiting to be moved to the next machine. In our case, we consider limited buffers both before and after the processing slot of a machine. The limited storage space in front of (and behind) the machines prohibits accumulating arbitrary amounts of material between processing stages and can thus lead to blocking behaviour if all the available buffer space on a stage has been used up. Ruiz et al. (2005) describe sequence-dependent setup times caused by switching between two different job types on a processing slot. In comparison with these approaches, we deal with greater complexity regarding the shop floor and order configuration.

For machine learning in similar environments, Doh et al. (2014) suggest a decision tree-based approach to select a priority rule combination, depending on the current state of the shop floor. For our configuration, such an approach is not suitable because the goal is to develop a priority rule instead of choosing one.

The contribution of this paper is threefold. First, we extract information from solutions provided by a CP solver, based on an appropriate formulation of the problem, to generate the training data. Second, the generated dispatching rule directly combines the job sequencing with the machine assignment in one step. The decision involves not only the next job to be moved but also the machine to which the job should be transported. This approach allows for more complex decision making. Third, we embed our DT in a random forest (RF) approach to counteract overfitting.
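To make the second and third contributions concrete, the following sketch treats every (job, machine) pair as a single candidate decision and lets several pairwise classifiers vote on each comparison. It is an illustration only: the selection rule (the move that wins the most pairwise comparisons), the majority vote, the toy classifiers, and all names are assumptions made here and should not be read as the voting procedure the paper defines later.

```python
def candidate_moves(waiting_jobs, eligible_machines):
    """Enumerate joint (job, machine) decisions, rather than first choosing
    a job and then choosing a machine for it."""
    return [(job, m) for job in waiting_jobs for m in eligible_machines[job]]


def forest_prefers(trees, a, b):
    """Majority vote of several pairwise classifiers. Each tree is a callable
    returning True if move a should be preferred over move b."""
    votes = sum(1 for tree in trees if tree(a, b))
    return votes * 2 > len(trees)


def select_move(moves, trees):
    """Pick the move that wins the most pairwise comparisons
    (ties are broken in favour of the move listed first)."""
    def wins(a):
        return sum(1 for b in moves if b != a and forest_prefers(trees, a, b))
    return max(moves, key=wins)


# Hypothetical processing times per (job, machine) pair.
proc = {("J1", "M1"): 5, ("J1", "M2"): 7, ("J2", "M1"): 3}

# Toy stand-ins for trained trees; real trees would be learned from data.
trees = [
    lambda a, b: proc[a] < proc[b],
    lambda a, b: proc[a] <= proc[b],
    lambda a, b: a[0] < b[0],
]

moves = candidate_moves(["J1", "J2"], {"J1": ["M1", "M2"], "J2": ["M1"]})
best = select_move(moves, trees)
```

Because a single tree can fit peculiarities of its training examples, aggregating the votes of several trees is the usual way a forest dampens overfitting; the toy classifiers above only mimic that aggregation step.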

The content of the paper is organised as follows: Section 2 contains the problem statement in detail. In Sect. 3, we describe how DTs and RFs are actually built and applied to the problem at hand. The configuration and generation of test instances finally used in the computational experiments are presented in Sect. 4. The computational results for the DT and RF approach are summarised and discussed in Sect. 5. Finally, the discussion in Sect. 6 briefly reviews the contributions and offers suggestions for future research.

2 Problem statement

We implemented an abstract model based on the basic shop floor configuration of the industrial partner; Fig. 1 depicts an exemplary configuration of the shop floor. Three different types of jobs have to be processed on every processing stage, starting at the raw material stage. The processing stages consist of one or more parallel identical or unrelated machines (depending on the scenario). After having passed the last processing stage, all the jobs are collected in a box. As soon as all the jobs of an order are finished, this order is marked as executed. The order constraint is important for the total tardiness objective function.
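The two objectives can be illustrated with a short sketch. It assumes, as a reading of the order constraint above, that each order has its own due date and that tardiness is measured on the completion time of the last job of the order; the data layout and function names are hypothetical.

```python
def order_completion_times(job_completion, job_order):
    """An order is executed only when its last job has been finished."""
    completion = {}
    for job, finished_at in job_completion.items():
        order = job_order[job]
        completion[order] = max(completion.get(order, 0), finished_at)
    return completion


def makespan(job_completion):
    """Completion time of the last job overall."""
    return max(job_completion.values())


def total_tardiness(order_completion, due_dates):
    """Sum of the amounts by which orders finish after their due dates."""
    return sum(max(0, c - due_dates[o]) for o, c in order_completion.items())


# Toy data: order A consists of J1 and J2, order B of J3.
job_completion = {"J1": 10, "J2": 14, "J3": 9}
job_order = {"J1": "A", "J2": "A", "J3": "B"}
due_dates = {"A": 12, "B": 10}

orders = order_completion_times(job_completion, job_order)
c_max = makespan(job_completion)
tardiness = total_tardiness(orders, due_dates)
```

In this toy case order A is late by two time units because of J2, even though J1 finished before the due date, which is why the grouping of jobs into orders matters for the tardiness objective.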

The machines are equipped with buffer slots, a processing slot, and a transport slot. For every processed job that is situated on a transport slot of a machine, a crane movement has to be initiated. The crane as a limited transport resource is able to move one job at a time from a transport slot or raw material stage to a buffer slot of a subsequent stage. A job transport between two machines can only be conducted by picking up the job at a transport slot and moving it to the first buffer slot of the subsequent machine. Overtaking other jobs on slots of that machine is not allowed. Blocking might occur if all buffer slots of each machine within a stage are fully occupied, which also would create blocking on machines in preceding stages.
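A minimal data structure for the slots and the blocking condition might look as follows. This is a sketch under assumptions: the class `Machine`, its fields, and the two helper functions are hypothetical, and the raw material stage and the final collection box are left out for brevity.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Machine:
    buffer_capacity: int
    buffer: List[str] = field(default_factory=list)  # FIFO queue, no overtaking
    processing: Optional[str] = None                 # job on the processing slot
    transport: Optional[str] = None                  # finished job awaiting the crane

    def can_accept(self):
        """True if at least one buffer slot is free."""
        return len(self.buffer) < self.buffer_capacity


def stage_is_blocked(stage):
    """A stage cannot receive jobs once every buffer slot of every machine is occupied."""
    return all(not machine.can_accept() for machine in stage)


def feasible_crane_moves(stages):
    """List (job, from_stage, to_stage, target_machine_index) moves that the
    crane could start now. The crane carries one job at a time from a transport
    slot to a buffer slot of a machine in the subsequent stage."""
    moves = []
    for s in range(len(stages) - 1):
        for machine in stages[s]:
            if machine.transport is None:
                continue
            for idx, target in enumerate(stages[s + 1]):
                if target.can_accept():
                    moves.append((machine.transport, s, s + 1, idx))
    return moves


# Toy shop floor: J1 waits on a transport slot in stage 0.
stage0 = [Machine(1, transport="J1")]
stage1 = [Machine(1, buffer=["J2"]), Machine(1)]
moves_before = feasible_crane_moves([stage0, stage1])

# Filling the last free buffer slot blocks stage 1, and therefore J1 in stage 0.
stage1[1].buffer.append("J3")
moves_after = feasible_crane_moves([stage0, stage1])
```

Once `moves_after` is empty, J1 stays on its transport slot, so the machine in stage 0 cannot release its next finished job; this is how blocking propagates to preceding stages.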

Finally, job processing is subject to sequence-dependent setup times. Depending on the type of the preceding job, additional work might be necessary to set up the machine for the next job. It is assumed here that the setup activity can already start before the next job has arrived at the machine.
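The effect of allowing the setup to start before the job arrives can be sketched as below. The setup matrix keyed by (preceding type, next type), its values, and the function names are assumptions chosen for illustration.

```python
def start_with_early_setup(machine_free_at, job_arrival, prev_type, next_type, setup):
    """Earliest processing start when the setup may begin as soon as the
    machine is free, even if the next job has not yet arrived."""
    setup_done = machine_free_at + setup[(prev_type, next_type)]
    return max(setup_done, job_arrival)


def start_with_late_setup(machine_free_at, job_arrival, prev_type, next_type, setup):
    """For contrast: the setup starts only once the job is at the machine."""
    return max(machine_free_at, job_arrival) + setup[(prev_type, next_type)]


# Hypothetical sequence-dependent setup times between job types A and B.
setup = {("A", "A"): 0, ("A", "B"): 4, ("B", "A"): 2, ("B", "B"): 0}

# Machine free at t=10, type B job arrives at t=12 after a type A job.
early = start_with_early_setup(10, 12, "A", "B", setup)
late = start_with_late_setup(10, 12, "A", "B", setup)
```

With an early setup, part of the setup overlaps with the transport of the job, so processing can begin sooner than if the machine waited for the job to arrive before being set up.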

Table 1 List of attributes
