Towards Enhancing Manufacturing Process Performance Through Multivariate Data Mining

Charaniya, Salim; Le, Huong; Mills, Keri; Johnson, Kevin; Karypis, George; Hu, Wei-Shou

doi:10.1007/978-94-007-0884-6_43

Part of the book series: ESACT Proceedings (ESACT, volume 5)

Abstract

Several protein-based therapeutics approved in the past decade are manufactured in modern production plants with automated systems for process control and comprehensive data archival. The vast historical databases from these plants hold hundreds of process parameters and key output variables for several production batches, a valuable resource for improving process understanding and robustness. Multivariate data analysis is a critical process analytical technology tool for uncovering hidden patterns within process trends and identifying key parameters for enhancing process performance and product quality. Cell culture process data from more than a hundred “trains”, each comprising production as well as inoculum bioreactors, were investigated in this study. Each batch encompasses over 130 on-line and off-line temporal parameters. A maximum-margin support vector algorithm was coupled with a kernel-based machine learning approach to develop multivariate predictive models for critical cell culture performance parameters. A differential weighting scheme was incorporated in the model to prioritize the process parameters most strongly associated with process outcome and to identify key performance indicators at every stage of the production train. Model evaluations indicate that cell culture performance can be accurately predicted several days before harvest and downstream purification. Further, multiple parameters in the inoculum and early stages of the production bioreactors were identified as early markers of the final process outcome. This data-driven approach to knowledge discovery in manufacturing processes represents an important step towards implementing a real-time decision-making scheme based on critical product and process traits.
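The abstract does not spell out how the kernels or the differential weights were computed, so the following is only a minimal sketch of the general idea under stated assumptions. It builds one RBF kernel per process parameter from that parameter's time profile, weights each kernel by its kernel-target alignment with a binary outcome label, and sums the weighted kernels into a single Gram matrix. The RBF kernel, the alignment-based weighting, the parameter names (`viable_cell_density`, `agitation`), and the toy numbers are all illustrative assumptions, not the authors' method or data.

```python
import math

def rbf(u, v, gamma=0.5):
    """RBF kernel between two equal-length time profiles."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-gamma * sq_dist)

def gram(profiles, gamma=0.5):
    """Gram matrix over all runs for a single process parameter."""
    return [[rbf(u, v, gamma) for v in profiles] for u in profiles]

def alignment(K, y):
    """Kernel-target alignment <K, yy^T> / (||K||_F * ||yy^T||_F), y in {-1, +1}."""
    n = len(y)
    num = sum(K[i][j] * y[i] * y[j] for i in range(n) for j in range(n))
    k_norm = math.sqrt(sum(K[i][j] ** 2 for i in range(n) for j in range(n)))
    return num / (k_norm * n)  # ||yy^T||_F equals n for +/-1 labels

def weighted_kernel(runs, y, gamma=0.5):
    """Combine per-parameter kernels, weighting each by its (clipped) alignment."""
    grams = {name: gram(profiles, gamma) for name, profiles in runs.items()}
    scores = {name: max(alignment(K, y), 0.0) for name, K in grams.items()}
    total = sum(scores.values()) or 1.0  # avoid division by zero
    weights = {name: s / total for name, s in scores.items()}
    n = len(y)
    K = [[sum(weights[p] * grams[p][i][j] for p in grams) for j in range(n)]
         for i in range(n)]
    return weights, K

# Hypothetical toy data: four runs, two parameters, three time points each.
runs = {
    "viable_cell_density": [[1.0, 2.0, 3.0], [1.1, 2.1, 3.1],
                            [0.5, 0.8, 1.0], [0.4, 0.9, 1.1]],
    "agitation": [[1.0, 1.0, 1.0], [0.0, 0.0, 0.0],
                  [1.0, 1.0, 1.0], [0.0, 0.0, 0.0]],
}
outcome = [1, 1, -1, -1]  # +1 = high-performing run, -1 = low-performing run

weights, K = weighted_kernel(runs, outcome)
print(weights)
```

In this toy case the parameter whose profile separates the two outcome classes receives nearly all of the weight, while the parameter unrelated to the outcome receives almost none. The combined matrix `K` could then be handed to any support vector machine solver that accepts a precomputed kernel.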

Author information

Authors and Affiliations

Genentech, Inc., Oceanside, CA, 92056, USA
Salim Charaniya
Department of Chemical Engineering and Materials Science, University of Minnesota, Minneapolis, MN, 55455, USA
Huong Le & Wei-Shou Hu
Genentech, Inc, Vacaville, CA, 95688, USA
Keri Mills & Kevin Johnson
Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN, 55455, USA
George Karypis

Corresponding author

Correspondence to Wei-Shou Hu.

Editor information

Editors and Affiliations

University College Dublin, National Institute of Bioprocessing Rese, Engineering Building, Belfield, Dublin, Ireland
Nigel Jenkins
National Institute for Cellular Biotechn, Dublin City University, Dublin, Ireland
Niall Barron
Tecnologica (IBET), ITQB, Instituto de Biologia Experimental e, Oeiras, Portugal
Paula Alves

About this paper

Cite this paper

Charaniya, S., Le, H., Mills, K., Johnson, K., Karypis, G., Hu, WS. (2012). Towards Enhancing Manufacturing Process Performance Through Multivariate Data Mining. In: Jenkins, N., Barron, N., Alves, P. (eds) Proceedings of the 21st Annual Meeting of the European Society for Animal Cell Technology (ESACT), Dublin, Ireland, June 7-10, 2009. ESACT Proceedings, vol 5. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-0884-6_43

DOI: https://doi.org/10.1007/978-94-007-0884-6_43
Published: 25 July 2011
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-0883-9
Online ISBN: 978-94-007-0884-6
eBook Packages: Biomedical and Life Sciences (R0)
