Introduction

Machining is a high-rate severe plastic deformation (SPD) manufacturing process. The process can be described using the idealized model shown in Fig. 2a. The imposed thermomechanical loading is fairly extreme with imposed strains as large as \(\gamma \sim 10\), deformation rates up to 105s− 1, and cutting temperatures as high as 0.6𝜃 (homologous temperature) [1]. These imposed deformation conditions result in microstructure refinement in both the deformed chip and the component surface [2,3,4,5,6, 6,7,8]. The corresponding mechanical properties of both the chip and the workpiece surface are naturally sensitive to the produced structures [3, 8,9,10]. Therefore, identifying the process-structure-property (PSP) relationships that characterize machining is critical for establishing a synergistic framework where designers, materials scientists, and manufacturers can cooperate to engineer functional surfaces. PSP relationships are often expressed via mechanism maps as shown in Fig. 1. Furthermore, the SPD structures produced in machining bear a resemblance to structures produced in other SPD processes, such as equal channel angular extrusion [11], high pressure torsion [12], and dynamic processes where shear banding may occur [13,14,15]. Therefore, the merit in studying machining as a high-rate SPD process translates to other fields as well.

Fig. 1
figure 1

Machining process-structure-property map [7]

The predominant microstructure evolution mechanism in machining under ambient conditions is either continuous or discontinuous dynamic recrystallization (CDRX or DDRX) [3, 9]. CDRX is driven by the formation of dislocation cells that transform to low angle boundary (LAB) sub-grain structures, and finally relative sub-grain rotations generate high angle boundary (HAB)-refined grains [9, 16]. DDRX is more closely related to classic recrystallization where new grains nucleate and grow, often near existing grain boundaries [17]. Since the mechanism driving CDRX is driven by lattice rotations, the structure evolution can be quantified by considering measures of crystallographic misorientation [8, 18,19,20,21]. Mechanical constitutive property measurements are usually limited to hardness since the produced samples are small in scale (machined chips and workpiece surface) [3, 7,8,9,10].

Materials informatics (MI) is an emerging field within the materials community which, like cheminformatics and bioinformatics, seeks to employ statistics for addressing important domain science problems [22,23,24,25]. Materials research is conducted utilizing statistical approaches for establishing data-driven models, quantifying uncertainty, and the design and planning of experiments. MI addresses the fundamental challenge in materials research, identifying PSP relationships, by building mathematically rigorous models. The models, which may be data-driven or mixed data/physics models, may then be exploited for the design of functional materials. Recent works have established reduced-order structure-property (SP) models for single-phase polycrystalline systems [26, 27]. These authors utilized generalized spherical harmonics (GSH) to quantify bulk textures and used spatial statistics to quantify the spatial structure describing various simulated microstructure realizations. Another recent work utilized a deep adversarial learning model coupled with a Gaussian process (GP) Bayesian design criteria for computational materials design [28].

In this work, we study the evolution of pure copper subject to a high-rate SPD machining process. Microstructure is quantified using orientation imaging microscopy (OIM). A microstructure statistic which quantifies the local crystal spatial misorientation is derived. This is done by utilizing a GSH basis to describe the crystallographic orientation and a unique spatial autocorrelation function, which exploits the orthogonality of the GSH basis. Constitutive mechanical properties are quantified using spherical nanoindentation tests. Finally, a multiple output Gaussian process regression (MOGPR) model is developed, which captures the full PSP relationships as well as their associated uncertainties. The model is flexible and is well suited for handling multiple kinds of data, e.g., multi-fidelity modeling.

Experimental Methods

Oxygen-free high conductivity copper (OFHC Cu) bars were obtained from a supplier (McMaster Carr). The material was subjected to SPD via a machining process. Tube turning experiments were carried out to emulate the idealized two-dimensional orthogonal cutting experiment shown in Fig. 2a. High speed steel cutting tools with nominal rake angles α = 5,15,25,45 were used for all experiments. A constant feed (or undeformed chip thickness) (to) of 300μ m was prescribed for all tests. The prescribed geometry was chosen to impose large shear strains in the primary shear zone, which the machining theory predicts to be \(\gamma \sim 1-8\) [1]. Four cutting speeds V = 0.20,0.33,0.50,1.00m ⋅s− 1 were studied, which generate strain rates \(\sim 10^{3}- 10^{4} \text {s}^{-1}\). Higher cutting speeds correspondingly yield increases in chip temperatures as there is less time available for diffusion of heat away from the chip. From the measured cutting forces, the chip temperatures were estimated to reach \(\sim 165~^{\circ }\text {C}\) for the lowest rake angle (highest strain) and fastest cutting speeds utilized [1]. Generated chips fell into a quench tank filled with water to freeze the as-machined microstructure.

Fig. 2
figure 2

a Machining process schematic. Controllable parameters include cutting speed (V ), the uncut chip thickness (to), and rake angle (α). b Chips and metallographic sample

Collected chips were mounted in epoxy as shown in Fig. 2b. Small sample-to-sample deviations in the chip orientation within the casting will affect the perceived two-dimensional morphology of observed micrographs. Furthermore, OIM results will be affected due to uncertainty in the reference sample orientation. Therefore, special care was taken to mount samples such that the observed cross section correspond as closely as possible to the idealized two-dimensional orthogonal configuration. Grinding of the metallographic samples was performed to reach the “mid chip” (1mm) thickness which is far from free boundaries and therefore minimally affected by side flow transverse to the direction of chip flow. Samples were subsequently mechanically polished with up to 1μ m diamond suspension polish. Final surface preparation was performed via vibratory polishing in a Buehler VibroMet 2. A Tescan Mira XMH field emission scanning electron microscope (FE-SEM) was utilized to image the generated microstructures. A backscatter emissions (BSE) detector was utilized for all imaging as it was found to yield images with extremely good contrast (see, Fig. 3). A EDAX Hikari EBSD detector with TSL OIM analysis was utilized for orientation imaging (Fig. 1).

Fig. 3
figure 3

BSE-SEM and EBSD images of the generated microstructures. Top images correspond to process conditions that impose less strain relative to the bottom images. BSE and EBSD images are not coincident

Nanoindentation experiments were performed on a Agilent G200 nanoindenter with an XP head and continuous stiffness monitoring (CMS). A 100μ m diamond indenter was used for all experiments. Spherical indentation stress-strain protocols were utilized to further process experimental data [29, 30]. The derived indentation stress-strain curves capture the mechanical response of the material deformed beneath the indenter. The corresponding contact radius for these experiments varied between 10 and 20 μ m. The microstructures considered vary greatly in their degree of refinement. Under some conditions, very fine structures (d < 1μ m) were generated suggesting that the obtained indentation responses are likely well homogenized. Coarser structures, however, suggest that the local material heterogeneity may introduce additional response variation. In our analysis, we will account for this by attempting to establish mean property quantities.

Microhardness measurements were performed using a Buehler series 1600 microhardness tester. A diamond tip Vickers indenter loaded to 500 g was used for all tests.

Methods

Microstructure Quantification

BSE and EBSD micrographs for two different process conditions are shown in Fig. 3. Images at larger values of the rake angle α (or smaller values of strain since γα− 1) produced correspondingly coarser microstructures; therefore, larger fields of view were required at these settings. The field of view at each setting is illustrated in Fig. 4. The total number of raster steps in each image was maintained at 300 × 300 to avoid unnecessarily long scans.

Fig. 4
figure 4

EBSD images of the various microstructures produced via machining

In Fig. 3, it is clear from both the BSE and EBSD scans that the microstructures are morphologically different. In the α = 25 BSE image, however, it is difficult to discern which features are grain boundaries; the BSE image is sensitive to defect structures besides grain boundaries. An even clearer pattern is visible in Fig. 4, particularly at low rake angles of 5 and 15; with increasing cutting speed, it appears as if the structure becomes smeared. Statistically, it can be stated that crystal orientations are more spatially correlated at higher cutting speeds than at lower cutting speeds. Furthermore, this pattern is also present with increasing rake angle. Consider an experiment where a point is chosen randomly in the micrograph for 5,1.00 m ⋅s− 1, and we note the crystal orientation at the chosen pixel and at a location 5 μ m to its right. Subsequently, the same experiment is performed on the micrograph for (5,0.20 m ⋅s− 1). On average, over many repetitions, the two pixels from 5,1.00 m ⋅s− 1 would yield more “similar” orientations than in the micrograph for 5,0.20 m ⋅s− 1. It is this feature that we wish to quantify and exploit for assessing microstructural anisotropy.

Recent advances in the MI community have established statistically rigorous methods for quantifying stochastic material systems [24]. In this work we quantify microstructure via crystallographic orientation which can be quantified using the Bunge-Euler angles \(\boldsymbol {g}=\left (\phi _{1},{\Phi },\phi _{2}\right )\), which are continuously defined over the fundamental zone (FZ) [31]. The probability of finding orientation g at spatial location x is \(f_{x}\left (\boldsymbol {g}\right )\) [26, 32]. Note that in the MI literature this quantity is referred to as the microstructure function [33].

Spatial correlations between microstructure states can be quantified through the use of spatial statistics [24, 34]. The simplest of the n-point spatial statistics is two-point statistics. These quantities capture spatial correlations by considering the vector distance between two points. The example posed earlier in this section used two-point statistics to qualitatively describe the “spread” of crystals. Formally, the two-point statistics can be described by a conditional probability as follows:

$$ \begin{array}{ccc} p\left( \boldsymbol{g},\boldsymbol{g}^{\prime}|\boldsymbol{t}\right) = \frac{1}{|\mathcal{X}|} {\int}_{\mathcal{X}} f_{\boldsymbol{x}}\left( \boldsymbol{g}\right)f_{\boldsymbol{x}+\boldsymbol{t}}\left( \boldsymbol{g}^{\prime}\right) d\mathcal{X}, \end{array} $$
(1)

where t is the vector that separates two points in the microstructure, g is the microstructure state at the tail of the vector, and \(\boldsymbol {g}^{\prime }\) is the microstructure state at the head of the vector. Note that if \(\boldsymbol {g}^{\prime }=\boldsymbol {g}\), then this quantity describes autocorrelation. Also, note that this quantity is solely a function of the difference in spatial location between two points (t); therefore, this definition assumes stationarity of the microstructure.

Consider now that we wish to obtain a compact representation of fx(g). There have been several works that have adopted the use of GSH for describing this quantity in polycrystalline systems [26, 27, 32]. Using a GSH basis fx(g) can be rewritten as follows:

$$ \begin{array}{ccc} f_{\boldsymbol{x}}\left( \boldsymbol{g}\right) = \sum\limits_{\mu,n,l} F_{l\boldsymbol{x}}^{\mu n} \dot{\dot{T}}_{l}^{\mu n}\left( \boldsymbol{g}\right), \end{array} $$
(2)

where μ, n, l represent multiple indices for multiple sums, and \(F_{l\boldsymbol {x}}^{\mu n}\) is the complex-valued GSH coefficient at x which corresponds to the complex-valued GSH basis \(\dot {\dot {T}}_{l}^{\mu n}\). Note that the \(\dot {\dot {T}}_{l}^{\mu n}\) preserve crystal symmetries and are orthogonal to their complex conjugate \(\dot {\dot {T}}_{l}^{\mu n *}\). The coefficients \(F_{l\boldsymbol {x}}^{\mu n}\) can be obtained in the analogous way to how Fourier coefficients are determined (i.e. by exploiting orthogonality) \(F_{l\boldsymbol {x}}^{\mu n} = (2l + 1) \int f_{\boldsymbol {x}}\left (\boldsymbol {g}\right ) \dot {\dot {T}}_{l}^{\mu n*}\left (\boldsymbol {g}\right ) d\boldsymbol {g}\). In the case where spatial bin x is occupied by a single orientation go (an individual pixel in an indexed EBSD scan), then \(F_{l\boldsymbol {x}}^{\mu n} = (2l + 1) \dot {\dot {T}}_{l}^{\mu n*}\left (\boldsymbol {g}_{o}\right )\).

Naturally, the next step is to redefine the two-point statistics using the GSH basis representation. One practical consideration is that there are an infinite number of g to chose from since it is a continuous function. In recent works, this problem is overcome by computing spatial statistics over the complex-valued GSH coefficients themselves [26, 27]. The interpretation is that the different microstructure states are described by the different GSH coefficients indexed over μ, n, l. However, in this work, we will introduce one additional definition which produces a different interpretation of the spatial statistics. Here, we define an averaged quantity for the spatial autocorrelation, which averages over all g. In doing so, information about texture is lost, but this new definition is well suited for capturing the local misorientation or local morphological spatial behavior. Therefore, we define as follows:

$$ \begin{array}{ccc} \bar{p}_{\boldsymbol{t}} = \frac{1}{V_{FZ}}{\int}_{FZ} p\left( \boldsymbol{g},\boldsymbol{g}|\boldsymbol{t}\right) d \boldsymbol{g}, \end{array} $$
(3)

where VFZ is the fundamental zone volume. Again, this quantity describes the spatial autocorrelation of crystallographic orientation averaged over all possible crystal orientations. Some information (texture) is lost but the structural morphological information is retained. The advantage of adopting this strategy is that often very large scans are needed to capture texture, which is inherently a volume-averaged quantity. Therefore, texture requires a large representative volume element (RVE) to be statistically representative of the material as a whole. Conversely, local morphological features may be representative at much smaller RVE length scales.

Combining the GSH representation of Eq. 2, definition in Eq. 3, and two-point statistics in Eq. 1, the following expressions may be derived as follows:

$$ \begin{array}{lll} \bar{p}_{\boldsymbol{t}} &=\displaystyle \frac{1}{|\mathcal{X}|}\frac{1}{V_{FZ}} \int\limits_{\mathcal{X}} \int\limits_{FZ} f_{\boldsymbol{x}}\left( \boldsymbol{g}\right)f^{*}_{\boldsymbol{x}+\boldsymbol{t}}\left( \boldsymbol{g}\right) d \boldsymbol{g} d\mathcal{X} \\ &=\displaystyle \frac{1}{|\mathcal{X}|}\frac{1}{V_{FZ}} \int\limits_{\mathcal{X}}\int\limits_{FZ} \sum\limits_{\mu,n,l} F_{l\boldsymbol{x}}^{\mu n} \dot{\dot{T}}_{l}^{\mu n}\left( \boldsymbol{g}\right) \sum\limits_{\mu,n,l} F_{l\boldsymbol{x}+\boldsymbol{t}}^{\mu n *} \dot{\dot{T}}_{l}^{\mu n *}\left( \boldsymbol{g}\right) d \boldsymbol{g} d\mathcal{X} \\ &=\displaystyle \frac{1}{|\mathcal{X}|}\frac{1}{V_{FZ}} \frac{1}{2l + 1} \int\limits_{\mathcal{X}} \sum\limits_{\mu,n,l} F_{l\boldsymbol{x}}^{\mu n} F_{l\boldsymbol{x}+\boldsymbol{t}}^{\mu n *} d\mathcal{X} \\ &=\displaystyle \frac{1}{|\boldsymbol{S}|}\frac{1}{V_{FZ}} \frac{1}{2l + 1} \sum\limits_{\boldsymbol{s}= 1}^{\boldsymbol{S}} \sum\limits_{\mu,n,l} F_{l\boldsymbol{s}}^{\mu n} F_{l\boldsymbol{s}+\boldsymbol{t}}^{\mu n *} \end{array} $$
(4)

where \(f^{*}_{\boldsymbol {x}+\boldsymbol {t}}\left (\boldsymbol {g}\right )\) is the complex conjugate. Since f is a real valued function then f = f. This trick enables significant simplification when computing the product of the two large sums since we are using an orthogonal basis; \({\int }_{FZ} \dot {\dot {T}}_{l}^{\mu n} \dot {\dot {T}}_{l^{\prime }}^{\mu ^{\prime } n^{\prime } *} d\boldsymbol {g} = \left (2l + 1\right )^{-1}\) if all the indices “match” else 0. A similar manipulation was found in [32], but in their case, it was for computing localization relationships and not spatial autocorrelations. In fact, the definition introduced in Eq. 3 was purposefully introduced to exploit the orthogonality found in the GSH basis similar to what was done in [32]. This simplification only works for the case of autocorrelation; the orthogonality cannot be exploited when considering cross-correlations. The final expression obtained is a function (mean) of the autocorrelation statistics derived in [26, 27]. However, our derivation can be justified with some novel physical interpretation (mean autocorrelation over all g).

Note that although fx(g) is described using a truncated GSH expansion each of the GSH coefficients themselves is a complex-valued continuous variable. This treatment allows for gradations of similarity between pixels. For instance, pixels misoriented by only a few degrees will yield higher autocorrelation than pixels with large misorientation. If instead the continuous-valued microstructure state (orientation g) was discritized using a “binning” strategy [35], then pixels with similar orientations that happen to fall into different bins would erroneously suggest a lack of autocorrelation. Furthermore, binning of the three-dimensional orientation space would be cumbersome and inefficient [32].

The final line of Eq. 4 discretizes the spatial domain over \(\mathcal {X}\) into a two-dimensional-binned spatial domain over \(\mathcal {S}\) which corresponds to the EBSD scan pixels. The final expression is a convolution over S, which can be efficiently computed using discrete Fourier transforms (DFTs) [24]. The quantity |S| is the total number of spatial bins considered, e.g., total number of pixels in a image. Note for partial scans, scans where a portion of the image contains unreliable or “bad” measurements, recent algorithms have been established that account for this complication by modification of Eq. 4 [36].

The proposed microstructure descriptor is sensitive to the degree of GSH discretization introduced in Eq. 2. If too few terms are used in the sum, then it may be possible that fx(g) will be unable to accurately describe certain orientations present in the observed micrographs. Consequently, the morphologies associated with those inadequately resolved orientations will be neglected in the mean autocorrelation function (4). There are two recent works which address the question of GSH truncation when quantifying spatial microstructure data. Paulson et al. published a work on the homogenization of elastic and inelastic properties of polycrystalline HCP systems using a similar MI approach [26]. For HCP systems l = (0,2) yields 6 terms and l = (0,2,4) yields 15 terms in Eq. 2. In their study, they considered various crystallographic textures and found that truncation at 15 terms yielded marginally better results than at 6 terms. Yabansu, Patel, and Kalidindi found that truncation with l = (0,4), yielding a total of 10 terms, was suitable for building reduced-order elastic localization relationships in polycrystalline FCC systems [32]. Therefore, since there is evidence that both localization and homogenization relationships can be captured with minimal terms, we argue that a ten term GSH truncation should be sufficient for adequately describing the microstructures studied in this work.

Feature Selection and Bootstrapping

The previous section describes a rigorous method for quantifying the microstructure. The mean two-point statistics, \(\bar {p}_{\boldsymbol {t}}\), derived however is of the same dimensionality as t. Correspondingly, t is a vector that can be placed into the microstructure; hence, in this case, it is bounded by the size of the EBSD scans/images. Therefore, \(\bar {p}_{\boldsymbol {t}} \in \mathcal {R}^{N\times M}\) where N and M are the height and width of the images measured in pixels. All EBSD scans in this work are square; hence, the dimensionality of each statistic derived from the images is N2. Therefore, it is clear that for interpretability of the results some dimensionality reduction will be necessary. In this work, we utilize unsupervised principal component analysis (PCA), which computes a statistically optimal basis for describing the full feature space. PCA has been employed successfully in many MI works for compact representation of microstructure statistics [24, 26, 27, 37,38,39,40,41,42]. Dimensionality reduction is achieved by suitably truncating the basis expansion and using the basis weights (PC weights) to describe the data. This is analogous to Fourier representation of a one-dimensional signal where the Fourier coefficients can compactly describe the signal.

Another consideration when constructing the microstructure feature space is the need to ensure rotational invariance of the images. Consider that small deviations in how the samples are mounted in the microscope may result in angular rotation of the images, which therefore affects the microstructure statistics. Looking ahead at Fig. 8, careful inspection reveals that the statistics are slightly rotationally misoriented relative to one another. Failure to capture this experimental artifact could result in falsely discriminating two otherwise statistically identical microstructures. Rotational invariance is introduced by utilizing the methods found in [37]. Full details of this method are found in the referenced work and are not reproduced here.

Finally, a strategy is needed to obtain measurements of the dispersion of the PC weights. A naive and experimentally costly strategy would require that multiple EBSD scans be taken. From the dispersion (variance, covariance) measures, hypothesis testing could be performed or data-driven models could be built. This approach would be extremely expensive as each single scan is costly to obtain. An alternative strategy is to use the single observations and obtain dispersion estimates from bootstrapping of the images [43]. A similar strategy was utilized in [44, 45] for generating computationally efficient statistical volume elements (SVEs). Niezgoda, Yabansu, and Kalidindi utilized bootstrapping to obtain estimates of the structural variance of three-dimensional-simulated microstructures [46].

Bootstrapping seeks to establish dispersion estimates for mean quantities by a resampling of the data [43]. It is appealing because no distributional assumptions are needed (e.g., normality). Furthermore, it can be used to obtain dispersion estimates for complicated functions of the observed data. Consider that we make N observations of a normally distributed quantity X but we want the mean and mean-variance of some complicated function f(X).

In our setting, the data are the EBSD scans and the transformation is the pipeline that transforms the EBSD scans to \(\bar {p}_{\boldsymbol {t}}\) and then to the truncated PC weights. Special care is also needed to preserve the spatial correlation structure present in our data. Therefore, we used a strategy that is analogous to bootstrapping time-series data [43]: 7.5 μ m × 7.5 μ m images were sub-sampled four times and used to reconstruct a tiled 15 μ m × 15 μ m image. The 7.5 μ m × 7.5 μ m tiles were obtained by randomly selecting pixels from the image and then obtaining 3.75 μ m worth of pixels left, right, above, and below the selected point. In the case where the randomly selected pixel was within 3.75 μ m of a boundary the sub-sampled image was obtained by “wrapping” around the original image. For computing spatial statistics, this is acceptable since the convolution in Eq. 4 assumes periodic boundary conditions, which is equivalent to assuming that the image “wraps” around itself. This is shown schematically in Fig. 5. Each resampled 15 μ m × 15 μ m image corresponds to a single bootstrapped sample. For each setting, 100 bootstrapped samples were generated. The entire ensemble was then utilized to establish a PC basis and the corresponding PC weights for each bootstrapped sample were determined. The mean and variance of these bootstrapped PC weights were utilized to establish the mean and mean-dispersion at each unique process setting.

Fig. 5
figure 5

Bootstrapping schematic for estimating confidence bounds on mean feature statistics. a Original EBSD scan and corresponding random samples. b Reconstruction from random sampling and associated bootstrapped mean spatial crystallographic autocorrelation \(\bar {p}_{\boldsymbol {t}}\) sample

Multiple Output Gaussian Process Regression

A data-driven model is needed to efficiently map the controllable process parameters to the material quantities of interest. In this setting, the structure behaves as an intermediate variable that fundamentally controls the physics and is responsible for the exhibited properties. A statistical interpretation is that the structure variable is a latent variable; it is critically important but is either not possible to observe or perhaps can only be observed with great effort. This is an important consideration when identifying the relevant length scales and corresponding salient microstructural features. For instance, consider that TEM micrographs are rich with information at the lowest length scales but are costly to obtain. Conversely, optical micrographs are relatively easy to obtain but may have limited utility for certain problems, for instance, properties that are dependent on the lower length scale physics. Process-property models can sometimes capture the underlying relationships [47]; however, inclusion of structure into the modeling pipeline is preferred [24]. The justification is that structure physically governs the underlying property behavior and inclusion of such information may alleviate potential ambiguities associated with non-unique process-property mappings.

Modeling of PSP relationships traditionally follows a sequential strategy where the process-structure (PS) and structure-property (SP) relationships are established independently of one another and combined in sequence [40]. This is illustrated in the top of Fig. 6. A difficultly associated with such a framework is that it is not straightforward to quantify uncertainty propagation between PS and SP models. The PS model accepts process parameters as inputs, which are considered to be deterministic. The output microstructure estimates are naturally stochastic since the microstructure observations are stochastic. Computing confidence bounds for the output microstructure estimates is trivial in most statistical frameworks. Additional care however is needed in the subsequent SP modeling step when transferring forward stochastic structure estimates. As was just argued, the PS outputs are stochastic; hence, the SP inputs are stochastic. However, most data-driven models assume the model inputs to be deterministic.

Fig. 6
figure 6

Schematic of two modeling strategies for establishing PSP linkages. Note that italicized P refers to the process and normal font P represents properties. a A sequential strategy where process-structure and structure-property models are built independently and predictions flow sequentially, b jointly developed model using multiple output GP structure, which captures possible cross-correlations in the structure-property structure

Another limitation of the sequential PSP modeling strategy is that information is not shared across the PS and SP models. Consider that the model of interest is actually the full PSP model. This model is of course built using the two PS and SP submodels, which are usually established independently. A better PSP model could perhaps be established if the PS and SP models were built concurrently or perhaps with iteration; the best PS and SP models established independently may not produce the best PSP model.

GPR is a nonparametric curve fitting technique [48, 49]. Unlike traditional linear and nonlinear regression, nonparametric methods do not require a priori knowledge of the trends’ functional form. Instead, the data is assumed to come from a Gaussian data generating process where observations, yi and yj, may be correlated based on their proximity to each other, xixj. Future predictions, y(x), can be shown to be a weighted average of all the \(\left [y_{1},\ldots ,y_{N}\right ]\) where the weights depend on the proximity of x relative to all observations in the dataset \(\left [\boldsymbol {x}_{1},\ldots , \boldsymbol {x}_{N}\right ]\). The final form of the GPR statistical model is closely related to kernel regression and smoothing methods [48, 50].

In classical regression, the statistical inference or learning is performed by optimally estimating the unknown regression coefficients. In the GPR setting, the inference is performed by estimating the unknown statistical hyperparameters. These quantities define the correlation structure, which is embedded in the collected observations. For instance, correlation length scales are used to precisely quantify the relative measures of “proximity” mentioned above. In some problems, xixj = 1 may be an insignificant difference; yet in other instances, this may be large.

In this work, we attempt to address both these considerations by utilizing a multiple output Gaussian process regression (MOGPR) model for simultaneously identifying the full PSP model (Fig. 6). Multiple output implies that y need not be a scalar. This model choice offers several promising features not available using a sequential strategy. Firstly, structure and properties are modeled together as a function of process inputs using a multivariate normal structure to quantify structure-property correlations. In this way, process-property is possible; however, the model will also infer possible SP correlations when present. The SP cross-correlation jointly considers the full PSP linkage rather than independent submodels. Secondly, quantifying the SP variables simultaneously in a multiple output setting allows for easy uncertainty quantification of all relevant quantities including their cross-correlation structure. Finally, the MOGPR framework is flexible in its treatment of data and enables the inclusion of partial datasets with missing data. For instance consider a study where there are two microstructure descriptors. One is obtained using efficient experiments such as optical microscopy. The other descriptor is obtained using TEM and is therefore costly to acquire. The dataset may therefore contain many times more optical images than TEM images. However, in establishing the SP linkages standard regression models require both covariates for each individual property measurement. Clearly, such a framework cannot pair the two descriptors since one is much more numerous! The state of the art in this setting is to implement a transfer learning model which enables sharing of information between the two kinds of structure data [51]. The MOGPR model can automatically accommodate this setup. Additional details on the GPR framework, estimation of hyperparameters, prediction estimates, and details on the implementation used in this work are found in Appendix A.

Multi-Fidelity Property Modeling

In this work, structural descriptors come from the PC weights of the mean crystallographic autocorrelation function (\(\hat {p}_{\boldsymbol {t}}\)). Property measurements are obtained using spherical nanoindentation. The indentation stress-strain yield strength is used to quantify material strength [29]. The fraction explained variance (Fig. 7) illustrates that two PC components capture 97% of the observed variance. Therefore, in this study, M = 3 where j = 1,2 are the first two PC weights and j = 3 is the indentation yield, e.g., the MOGPR model represents the vector \(\left (PC_{1},PC_{2},Y_{\text {ind}}\right )\). Yind has some physically meaningful interpretation but is a somewhat noisy observation (see, Fig. 16). This variation is inherited from various sources including microstructure and surface characteristics. In Figs. 3 and 4, it is clear that the indenter could possibly engage different crystallographic orientations from test to test. Furthermore, there is also morphological heterogeneity across microstructures. Although the final contact radius using a 100 μ m indenter is on the order of 10–20 μ m, the contact radius is at the yield point roughly 1–2 μ m. Even using a larger 500 μ m indenter would not produce RVEs of crystallographic orientation and larger indenters (the next available indenter is 1500 μ m) are not feasible due to the load limits of the machine and the size of our samples (the smallest is 500 μ m in thickness). A brute-force strategy would require EBSD imaging of every SVE indentation site, which is experimentally costly. Finally, the response is sensitive to nanoscale asperities on the prepared surfaces, which introduces variation in the form of noise.

Fig. 7
figure 7

Rotationally invariant mean spatial crystallographic autocorrelation basis and accumulated variance explained statistics

Therefore, in this work, our strategy is to simply homogenize over these effects; therefore, we have conducted many repeated indentation experiments for each unique process setting. However, a complimentary strategy is available that allows the combination of nanoindentation data with cheaper lower fidelity property data. In the statistics community, this is referred to as multi-fidelity modeling [52,53,54]. For this work, we consider the Vickers microhardness (HV) as a cheap property measure. The justification is that the spherical indentation stress-strain protocols enable granular interpretation of both elastic and post-elastic behavior of the indented material whereas hardness does not. Nevertheless, microhardness is easy to obtain and therefore may aid in bolstering confidence in our inferences. Additionally, the hardness data is less noisy because it is less sensitive to the previously described heterogeneities since the volume of material probed is much larger; diagonals produced during indentation at a 500 g of load were on the order of 80–100 μ m. A key assumption here is that Yind and HV follow the same trends. We will introduce some flexibility, however, in case they do not follow the same trends or if they do not follow the same trends under certain process settings. The necessary statistical framework for incorporation of multi-fidelity property data may be found in Appendix B.

Results

The mean crystallographic autocorrelation for each micrograph is shown in Fig. 8. Note that these autocorrelation statistics are empirical quantities as they are computed directly from the data using Eq. 4, which is free from any parametric assumptions. It is important to acknowledge this as subsequent modeling is performed by directly comparing these statistics and therefore the same field of view (FOV) must always be used. All the statistics shown in Fig. 8 have a field of view of 15 μ m. Therefore, images obtained at α = 25,45, which have FOV of 45 and 105 μ m, were subsampled. The analysis therefore does not consider autocorrelation information available at larger correlation lengths in these images. However, this “clipping” is necessary to maintain identical scales across all the empirically computed autocorrelations.

Fig. 8
figure 8

Mean spatial crystallographic autocorrelation \(\bar {p}_{\boldsymbol {t}}\) for each process setting. Note that for direct comparison of these statistics must be over the same length scale, therefore larger image statistics cropped down to 15 μ m

Bootstrapped samples of the rotationally invariant mean crystallographic autocorrelation are shown in Fig. 9. Recall that 97% of the variance can be captured with a truncated PCA expansion using only two principal components (see, Fig. 7). Also shown in Fig. 9 is the predicted MOGPR path in PC space. The bootstrapped samples visually appear to generate scatter close to a bivariate normal distribution. Both the degree of scatter and the correlation in the scatter varies for each unique process setting. Therefore, the components of the observation error covariance matrix, Σ, which correspond to these structural variables were prescribed using frequent estimates for each unique process setting. This simplification is justified since the scope of our work is to quantify and model mean quantities. Additionally, bootstrapping is an effective method for estimating the dispersion of statistics; therefore, the hyperparameter inference in Eq. A.8 can be simplified. Furthermore, since the repetitions themselves only capture dispersion information of the data, and the observation error is specified, it is only necessary to utilize the mean value structure variables, \(\bar {\boldsymbol {PC}}_{i}\), when building the MOGPR model. This final point saves a great deal of computational burden associated with inverting C + Σ. This simplification requires only 16 two-dimensional mean values rather than the full data set.

Fig. 9
figure 9

Mean PC1 and PC2 evolution over process settings and GP model path prediction. Shown data are the 100 bootstrap samples at each process setting and each corresponding mean (⊕)

In Figs. 10 and 11, the structure-property relations are shown. Note that SP data are not paired; there is not a “corresponding” property measure for each micrograph. Visualization, however, requires pairing; therefore, the mean values and the associated confidence intervals are shown for experimental data. The mean MOGPR path and the confidence region are also shown. Note that there is a clear distinction between the confidence region of the mean and confidence region of future observations. Future observations will also contain some observation errors and would therefore have a correspondingly larger confidence region. At V = 1.00m ⋅s− 1, the trends appear to change despite the behavior being fairly consistent across cutting speeds V < 1.00m ⋅s− 1. This experimental setting corresponds to the largest imposed temperatures since \({\Delta } t \sim 1/V\); hence, there is less time available for conduction of heat away from the generated chips [55].

Fig. 10
figure 10

Mean PC1 and Yind evolution over process settings and GP model path prediction and 95% confidence region. Error bars correspond to mean variation for Yind and the bootstrapped variation for PC1

Fig. 11
figure 11

Mean PC2 and Yind evolution over process settings and GP model path prediction and 95% confidence region. Error bars correspond to mean variation for Yind and the bootstrapped variation for PC2

PS relationships are shown in Figs. 1213 and 15. It is clear that the rake angle, α, has the greatest influence on the generated structures. This agrees with intuition as α controls the geometric configuration of the experiment, and therefore has the greatest impact on the imposed shear strains γ. Deformation conversely drives structural refinement and evolution via the DRX mechanism [3].

Fig. 12
figure 12

Mean PC1 evolution versus α and the corresponding GP model prediction and 95% confidence bounds. Error bars correspond to the bootstrapped variation for PC1

Fig. 13
figure 13

Mean PC2 evolution versus α and the corresponding GP model prediction and 95% confidence bounds. Error bars correspond to the bootstrapped variation for PC2

Fig. 14
figure 14

Mean PC1 evolution versus V and the corresponding GP model prediction and 95% confidence bounds. Error bars correspond to the bootstrapped variation for PC1

Fig. 15
figure 15

Mean PC2 evolution versus V and the corresponding GP model prediction and 95% confidence bounds. Error bars correspond to the bootstrapped variation for PC2

Finally, the process-property maps are shown in Fig. 16. Note that process-property implicitly considers structural relationships via the MOGPR model. The Vickers hardness data generally follows trends similar to the indentation yield. At the highest cutting speed, V = 1.00m ⋅s− 1, there is a significant decrease in hardness/strength going from α = 15 to α = 5. This is only observed at the highest speed, which suggests that physically this anomalous behavior is driven by thermal effects.

Fig. 16
figure 16

Mean Yind and HV evolution versus α and the corresponding GP model prediction and 95% confidence bounds.

Discussion

The proposed mean crystallographic autocorrelation spatial statistic is an effective measure of microstructural morphology. The power of this metric is that it quantifies morphology without the need to explicitly define microstructural features. A common assumption when analyzing EBSD data is to define a threshold misorientation value for defining high angle boundaries. At other times, the misorientation distribution function (ODF) itself is utilized as a metric, but this necessitates identification of grain boundaries, which is again based on assumed threshold values [5]. Since our statistic only captures morphological features, it may be well suited in settings where the scan size is smaller than what is required for accurately quantifying texture. Crystallographic texture is a homogenized quantity; therefore, larger scans are typically necessary to accurately capture the representative crystallographic texture. The 15μ m × 15μ m images in Fig. 4 are certainly not sufficient for identifying texture but can still be used for quantifying morphological features.

Physical interpretation of the obtained microstructure evolution results is possible by considering the PCA bases shown in Fig. 7. Recall that \(\bar {p}_{\boldsymbol {t}}\) measures the degree of spatial crystallographic autocorrelation (similarity). The first principal basis corresponding to PC1 is highly localized with large negative values towards the center of the basis, some positive asymmetric values away from 𝜃 = 0, and slightly positive in the remainder of the region. The peaked negative region corresponds to a length of about 10 pixels which is 500nm (50nm/pixel). Note that this corresponds to the refined crystallite size observed at the largest strains. Conversely, PC2 has an even sharper, but faint, negative peak in the center, positive values in the 0.5 − 2μ m range, and negative values at large distances. Therefore, one contribution of the PC1 basis is to control a high autocorrelation region concentrated within a 500-nm region. PC2 captures competing autocorrelation trends in the 0.5 − 2μ m and > 3 μ m range. Therefore, it is reasonable that PC1 is observed to displays the greatest sensitivity to the applied rake angle (Figs. 12 and 13). As the rake angle is decreased, strains are increased, DRX drives refinement, and therefore pixels only retain autocorrelation with very close neighbor points (roughly within a crystal). However, PC1 does not appear to significantly change with cutting speed (Fig. 14). This is because cutting speed does not influence spatial similarity at these small scales. PC2, however, does appear to be sensitive to cutting speed (Fig. 15), and this sensitivity decreases with increasing rake angle (decreasing strain). This implies that at large imposed strains, as the cutting speed is increased, similarity of crystal orientation extends to include larger neighborhoods in the 0.5 − 2μ m region. This observation agrees with the process physics where it is known that cutting temperatures increase with both increasing speeds and strains. Additional straining drives heat generation via plastic dissipation and increased cutting speeds limit the efficacy of conduction to remove heat away from the process zone. At higher temperatures, DRX is less impactful [3]; thus, there is less misorientation and hence crystal similarity extends over larger spatial distances (less misorientation). Therefore, PC2 is sensitive to thermal effects, which are implicitly tied to the cutting speeds. With respect to the rake angle, PC2 has a significant quadratic interaction and this complex behavior may be explained as follows. At high rake angles (low strains), the similarity extends over large distances (> 3μ m) and PC2 is negative, which yields large positive autocorrelation values at large distances. With increasing strain (decreasing rake angle), there is less autocorrelation at large length scales but correlations in the intermediate values (0.5 − 2μ m) persist and hence PC2 increases. However, this trend reverses at the lowest rake angles (highest strains) when the autocorrelation becomes extremely localized (< 500 nm); thus, less similarity is observed in the 0.5 − 2μ m range. These interactions are complex because each basis captures several coupled physical features (e.g., PC2 captures negative long range and positive medium range autocorrelation). Furthermore, the bases must interact and balance their respective contributions in order to describe the changing physics at different machining process settings.

Figure 16 illustrates that the Vickers hardness and indentation yield produce similar trends with respect to the rake angle. For reference, the mean virgin material hardness is HV = 87.5 ± 5.0 (95% confidence interval). At large rake angles (low strains), the generated chips have higher hardness than the virgin material but produce lower range properties relative to measurements at small rake angles (larger strains). This observation is in accordance with deformation induced strain hardening. For cutting speeds V = 0.20,0.33,0.50m ⋅s− 1, the hardness appears to saturate with decreasing rake angle, which indicates that additional straining does not drive an increase in hardness. At the lowest cutting speed, however, indentation yield produces a fairly linear trend which decreased with increasing speed. Therefore, hardness and yield do not always share a one-to-one correspondence, but nevertheless, the inclusion of hardness is informative. At the highest speed and lowest rake angle (highest strain), there is a significant decrease in both hardness and strength. This is likely driven by recovery processes, which occur due to the higher cutting temperatures experienced under these conditions.

In this study, we only consider structural morphology and therefore neglect crystallographic effects. This is one potential source of the scatter observed in Fig. 16. The local crystallographic orientation of the indented site will likely influence the indentation response. However, for simplicity, we adopt a strategy where this was neglected and instead homogenized over many observations. When crystallographic information is desirable, the stand-alone GSH representation (which quantifies the ODF) may be augmented as additional features to \(\bar {p}_{\boldsymbol {t}}\). Another possibility is to use the strategy established in [26, 27] and use the paired two-point statistics between each of the GSH coefficients. Recall that the GSH representation is a sum over multiple indices (μ, n, l), and in this work, we truncate to 10 terms. Each of these 10 terms can be used as a measure of microstructural state. Therefore, these state descriptors may be used to compute two-point spatial correlations [26]. Including constraints and symmetry considerations, it may be shown that there are 2 ⋅ 10 − 1 = 19 unique correlation pairs [26]. The derived expression in Eq. 4 happens to be the mean over all autocorrelation pairs considered in [26]. The derivation in this work is fairly compact and proves that this mean quantity has a physical interpretation and is a descriptor of morphological spatial crystallographic “spread”, which includes misorientation.

Bootstrapping methodology appears to be an effective method for quantifying the dispersion of microstructure in reduced order PC space, as shown in Fig. 9. In our regression model, bootstrapping is useful as it eliminates the need to estimate the measurement error variances when training the MOGPR model—instead, they can be estimated directly from bootstrapping. Note, however, that bootstrapping of correlated data requires that the original sample be sufficiently large such that it “contains” the relevant correlation length scales. In our setting, the correlation length scales, particularly at large rake angles (low strain), are larger than the image field of view. Nevertheless, the bootstrapped variance estimates will reflect this artifact; inadequately sized images will yield more variance. Additionally, the disparity in autocorrelation at the lower spatial length scales is sufficiently significant that trends are still clear despite “missing” information at very large length scales.

The MOGPR model is effective at quantifying PSP relationships and provides estimates for coupled SP uncertainties. A natural concern, however, is that perhaps the obtained hyperparameters, \(\hat {\boldsymbol {\Phi }}\) in Eq. A.8, neglect SP relationships. In Eq. A.6, the SP linkage is captured through the cross-correlation matrix S, which must be inferred from the observed data. This matrix quantifies the covariance (or correlation) between all the outputs considered (PC1, PC2, Yind). The case where structure-property linkages are neglected the covariance matrix would take a block form as follows:

$$ \begin{array}{llll} \boldsymbol{C} = \left[\begin{array}{cccc} \boldsymbol{C}_{11} + \boldsymbol{\Sigma}_{11} & \boldsymbol{C}_{21} + \boldsymbol{\Sigma}_{21} & {\cdots} & \boldsymbol{0} \\ \boldsymbol{C}_{12} + \boldsymbol{\Sigma}_{12} & \boldsymbol{C}_{22} + \boldsymbol{\Sigma}_{22} & {\cdots} & \boldsymbol{0}\\ {\vdots} & {\vdots} & {\ddots} & {\vdots} \\ \boldsymbol{0} & {\cdots} & \boldsymbol{0} & \boldsymbol{C}_{M} + \boldsymbol{\Sigma}_{M} \end{array}\right], \end{array} $$
(5)

which suggests no correlation between the PC s and Yind. This degenerate case corresponds to two independent GP models; one for the PS and another for process-properties. Yet, another degenerate case corresponds to a diagonal covariance structure where no correlation exists between any of the considered variables; thus, the result is M independent GP models. However, consider that this model is data-driven; therefore, it is possible that perhaps a process-property relationship does exist. In fact, there is some recent evidence in the literature that suggests that these mappings are plausible in some settings [56]. The inclusion of structural information is physically motivated and is expected to yield better performance as the data is much richer if structure information is included. The merit of the MOGPR model is that all possibilities may be considered at once; if a direct process-property linkage exists then the model will identify it. Note that it may seem inappropriate to assume structure-structure cross-correlations between the PC weights as PCA theory generates PC weights which are independent, e.g., \(\text {Cov}\left (PC_{1},PC_{2}\right )= 0\). However, this is only true in the unsupervised setting; PC weights are independent when nothing is known about the process settings. The PC basis and weights are computed from the unlabeled pt ensemble of observations. In the MOGPR model, correlation between PC1 and PC2 is possible because the correlation is conditional on also knowing the process settings. Two uncorrelated random variables may become correlated when conditioned on a third random variable related to the first two. Clearly, in the second case, the two otherwise independent experiments become correlated due to the extra information.

Cross-validation results using a leave-one-unique-process-setting-out strategy are displayed in Fig. 17. The cross-validation results may be exactly computed from the fully trained model by employing a shortcut formula (see, Appendix C). Four different results are shown to illustrate a few key points: (a) cross-validation using Yind property data not including output cross-correlations (the case of M independent GP models); (b) cross-validation using Yind property data with SP cross-correlations; (c) cross-validation using Yind and HV as coupled properties with no SP cross-correlations; and finally, (d) cross-validation considering all available property data (Yind and HV ) and including output cross-correlations. Notice that strategy (d), which considers all property data and all correlations, yields the best cross-validation error (25% improvement in Yind prediction relative to model (a)). Therefore, inclusion of the hardness data did improve the overall model performance. Furthermore, each increase in model complexity provides slight improvements over the previous model. In general, however, this may not always be the case. GPR models are also prone to over-fitting when there is an imbalance between model complexity and data. For this reason, some researchers prefer to use cross-validation strategies for model training [57].

Fig. 17
figure 17

Cross-validation results removing one unique process setting at a time

Conclusions

In this work, we studied a severe plastic deformation machining process which drives microstructure evolution via continuous dynamic recrystallization. Various stages of microstructure evolution were captured by considering a wide range of rake angles, which induce a wide range of shear strains. Rate and temperature effects were considered by varying the cutting speed. Large strain conditions produced submicron crystal structures, whereas low-strain experiments yielded highly deformed structures, which still resembled the coarse parent material. At the largest strains, a dependence on the cutting speed was observed with higher cutting speeds producing structures with lower crystallographic misorientations. Generalized spherical harmonics were used to efficiently quantify the local orientation state and a novel autocorrelation spatial statistic was derived that captures orientation “spread” or misorientation. The novel descriptor is physically intuitive and targets morphological information present in the orientation imaging data. A data-driven multiple output Gaussian process regression model was established for quantifying process-structure-property linkages. The model is flexible, enables inclusion of various kinds of structure and property data, does not necessitate fully paired input data, captures the full process-structure-property pipeline, and produces coupled uncertainty estimates associated with future predictions.