Abstract
Several newly approved protein-based therapeutics in the past decade are manufactured in modern production plants with automated systems for process control and comprehensive data archival. The hundreds of process parameters and key output variables for several production batches in the vast historical databases provide a valuable resource to improve process understanding and robustness. Multivariate data analysis is a critical process analytical technology tool to unearth any hidden patterns within process trends and identify key parameters for enhancing process performance and product quality. Cell culture process data from more than hundred “trains” comprising production as well as inoculum bioreactors was investigated in this study. Each batch encompasses over 130 on-line and off-line temporal parameters. A maximum margin support vector algorithm was coupled with a kernel-based machine learning approach to develop multivariate predictive models for critical cell culture performance parameters. A differential weighting scheme was incorporated in the model to prioritize the process parameters with strong associations with process outcome and to identify key performance indicators at every stage of the production train. Model evaluations indicate that cell culture performance can be accurately predicted several days before harvest and downstream purification. Further, multiple parameters in the inoculum and early stages of production bioreactors were identified as precocious markers of the final process outcome. This process-data driven approach for knowledge discovery in manufacturing processes represents an important step towards implementing a real-time decision making scheme based on critical product and process traits.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aggarwal S. 2008. What’s fueling the biotech engine-2007. Nat Biotechnol 26(11): 1227–1233.
Chang CC, Lin CJ. 2001. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm.
Charaniya S, Hu WS, Karypis G. 2008. Mining bioprocess data: opportunities and challenges. Trends Biotechnol 26(12): 690–699.
Glassey J, Montague GA, Ward AC, Kara BV. 1994. Artificial neural network based experimental design procedures for enhancing fermentation development. Biotechnol Bioeng 44(4): 397–405.
Kirdar AO, Conner JS, Baclaski J, Rathore AS. 2007. Application of multivariate analysis toward biotech processes: case study of a cell-culture unit operation. Biotechnol Prog 23(1): 61–7.
Noble WS. 2004. Support vector machine applications in computational biology. In: Scholkofp B, Tsuda K, Vert J. (eds.), Kernel methods in computational biology. MIT Press, Cambridge, MA, pp. 71–92.
Stephanopoulos G, Locher G, Duff MJ, Kamimura R, Stephanopoulos G. 1997. Fermentation database mining by pattern recognition. Biotechnol Bioeng 53(5): 443–452.
Vapnik VN. 1998. Statistical learning theory. New York, NY: Wiley-Interscience. 736p.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media B.V.
About this paper
Cite this paper
Charaniya, S., Le, H., Mills, K., Johnson, K., Karypis, G., Hu, WS. (2012). Towards Enhancing Manufacturing Process Performance Through Multivariate Data Mining. In: Jenkins, N., Barron, N., Alves, P. (eds) Proceedings of the 21st Annual Meeting of the European Society for Animal Cell Technology (ESACT), Dublin, Ireland, June 7-10, 2009. ESACT Proceedings, vol 5. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-0884-6_43
Download citation
DOI: https://doi.org/10.1007/978-94-007-0884-6_43
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-0883-9
Online ISBN: 978-94-007-0884-6
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)