Abstract
Physics-informed neural networks have emerged as a promising approach for solving partial differential equations. However, the computation of structural mechanics problems remains a challenge for them because the governing equations for bending are fourth-order nonlinear partial differential equations. Here we develop a multi-level physics-informed neural network framework in which an aggregation model is built by combining multiple neural networks, each involving only first-order or second-order partial differential equations that represent different physics information such as the geometrical, constitutive, and equilibrium relations of the structure. The proposed framework demonstrates a remarkable advancement over classical physics-informed neural networks in terms of accuracy and computation time. The proposed method holds the potential to become a promising paradigm for structural mechanics computation and to facilitate the intelligent computation of digital twin systems.
Introduction
Structural mechanics computation plays an important role in the field of civil and structural engineering and is fundamental to the design and analysis of structures. In particular, the emergence of the Digital Twin (DT)1,2,3 requires establishing a virtual digital model and implementing real-time simulation to accurately replicate physical structures. Therefore, the accurate and efficient computation of structural behavior is critical in a DT system4,5. Currently, the Finite Element Method (FEM) is commonly used for structural computation, and several commercial FEM software packages (e.g., ANSYS and ABAQUS) are available. However, there exist three main limitations associated with FEM, especially when it serves as a computational module in a DT system6: (i) the repetitive process of establishing and computing FEM models can hardly satisfy real-time requirements because of its cumbersome and time-consuming procedure; (ii) integrating encapsulated FEM software into DT systems is challenging; and (iii) FEM cannot handle infinite-___domain problems in which the boundary conditions or initial conditions may be unknown. In addition, FEM cannot solve inverse problems, e.g., the inference of structural parameters from the structural response.
With the rise of artificial intelligence (AI) in recent years, especially the advances in deep learning developed from neural networks, AI technology has been broadly applied in the field of engineering structures7,8,9,10 for tasks such as damage identification11,12,13,14, model optimization15,16, and performance prediction17,18,19,20. However, data-driven network models rely on large amounts of high-quality data and lack physical interpretability. Some studies explain such black-box models by employing SHAP (SHapley Additive exPlanations) to quantify the contribution of each input feature to the output prediction21,22,23, yet such physics-explaining machine learning still struggles to directly learn prior physical knowledge. To address this issue, the physics-informed neural network (PINN) was proposed24,25,26, which can learn the prior knowledge of partial differential equations (PDEs) with or without labeled data. Moreover, PINN has the advantage of solving practical problems in which the boundary or initial conditions imposed on the boundary are unknown27. In addition, a well-trained PINN model with generalization capability can be regarded as a surrogate model, greatly enhancing computational efficiency and enabling real-time prediction. PINN has therefore attracted increasing attention since it was first proposed28 and has been applied in various fields such as shape optimization29, thermochemical curing30, quantification of cracks31, metal additive manufacturing32, seismic response prediction33,34, nonlinear modeling35, structural design36, material defect identification37, subsurface flow problems38, and engineering problems in heterogeneous domains39. There have also been several extensions in computational science, including solving PDE problems with optimized frameworks40,41,42,43,44,45,46, surrogate modeling47, multi-physics simulation frameworks48, fluid mechanics49,50,51,52,53, solid mechanics54,55,56,57, transfer learning58, and computational elastodynamics59. In general, the applications of PINN can be divided into two categories: forward and inverse problems. The inverse problem aims to determine unknown physical parameters of a system from known outputs by combining data-driven neural networks with the learned physical information. The forward problem implements PINN as a solver for PDEs without relying on labeled data, which can be regarded as an unsupervised learning approach that learns physical laws instead of labeled data. Previous studies on the forward problem have mainly focused on optimizing the PINN framework for the specific PDEs considered44.
Compared with FEM, which requires the generation of fine discrete meshes, PINNs directly model the continuous field, making them a meshless method. Moreover, PINNs demonstrate remarkable flexibility in solving problems by combining simulated data from FEM with the relevant physical laws for training neural networks60. However, in the field of structural mechanics computation, establishing a PINN for bending structures is still a challenging task because of the inherent complexity of solving higher-order PDEs. The governing equations for the bending calculation of structures are fourth-order nonlinear PDEs featuring differential terms of multiple orders, which significantly degrades the accuracy of PINN for higher-order differentiation. More importantly, the presence of multi-order differential terms in the PDEs can result in unstable convergence or even non-convergence because of the significant disparity in the weights assigned to these differential terms42.
In this study, to address the above-mentioned challenge in computational structural mechanics, we propose a multi-level physics-informed neural network framework (herein termed ml-PINN) that enables the intelligent computation of bending structures governed by fourth-order nonlinear PDEs. Through comparison with FEM results, we validate the accuracy of ml-PINN for the solution of bending structures, including typical beam and shell structures. To our knowledge, this is the first application of PINN as a promising paradigm in computational structural mechanics. The proposed method provides a promising perspective for structural mechanics computation and holds potential as an intelligent computational module embedded in structural DT systems.
Results
Framework of ml-PINN
The classical PINN methodology for solving high-order PDEs utilizes automatic differentiation to directly evaluate the high-order differential terms based on a single neural network (NN) (Fig. 1). Nevertheless, using a single NN for multi-order differentiation and multiple parameters can significantly increase the amount of computation, resulting in extended computation time and unstable convergence (the relevant content is elaborated in the Supplementary Information).
Lb() and Lp() represent the loss functions from boundary constraints and from the partial differential equation (PDE), respectively. LB/C and LPDE represent the losses of the boundary conditions and of the physical PDE. Here, ω represents the differentiated physical variable as well as the output of the single NN, and ω′, ω″, and ω(n) are its first-order, second-order, and n-th-order derivatives. {W, b}i represents the model parameters of the neural networks (NNs).
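For illustration, a minimal sketch of this classical single-network setup for the beam problem is shown below. It is written in the TensorFlow 1.x style noted under 'Code availability'; the tanh activation, layer sizes, and the values of EI and q are illustrative assumptions rather than the authors' exact settings.

```python
import tensorflow as tf  # TensorFlow 1.x, as noted under 'Code availability'

def fnn(x, hidden=(20, 20, 20), name="w_net"):
    """Fully connected tanh network mapping collocation coordinates to one scalar output."""
    h = x
    with tf.variable_scope(name, reuse=tf.AUTO_REUSE):
        for i, n in enumerate(hidden):
            h = tf.layers.dense(h, n, activation=tf.nn.tanh, name="hidden_%d" % i)
        return tf.layers.dense(h, 1, activation=None, name="out")

x_f = tf.placeholder(tf.float32, (None, 1))   # interior collocation points
EI, q = 1.0, -1.0                             # illustrative (normalized) stiffness and load

w = fnn(x_f)                                  # single NN approximating the deflection w(x)
w_x = tf.gradients(w, x_f)[0]                 # 1st derivative via automatic differentiation
w_xx = tf.gradients(w_x, x_f)[0]              # 2nd derivative
w_xxx = tf.gradients(w_xx, x_f)[0]            # 3rd derivative
w_xxxx = tf.gradients(w_xxx, x_f)[0]          # 4th derivative: costly and hard to balance

# Residual of the single fourth-order governing equation EI * w'''' = q
loss_pde = tf.reduce_mean(tf.square(EI * w_xxxx - q))
# The boundary terms must also be expressed through derivatives of this one network
# (e.g., a moment constraint M = 0 becomes EI * w_xx = 0 at the beam ends).
```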
To tackle this problem, we propose a multi-level physics-informed neural network (ml-PINN). Two strategies are incorporated in the ml-PINN: (i) an aggregation model is established by combining multiple NNs, in which the ensemble involves only first-order or second-order PDEs representing the geometrical, constitutive, and equilibrium relations of the structure; this effectively reduces the order of the original higher-order PDEs. (ii) The loss functions corresponding to each NN with lower-order PDEs are linked and combined into one loss function with similar weights for all variables; additionally, the loss function for boundary constraints can be defined directly on each individual NN output rather than through differential terms. These treatments enable successful solutions of structural bending calculations. The specific process is described below.
The ml-PINN model is established to predict the vertical displacements of bending structures, including beam and shell structures. The beam structure can be regarded as a one-dimensional (1D) scenario, and the corresponding ml-PINN computation framework for the beam is presented in Fig. 2. The beam is subjected to a uniformly distributed load q (Fig. 2a), and the vertical displacement ω, along with other important parameters including the section angle θ, curvature κ, bending moment M, and shear force FQ, can be obtained simultaneously by the ml-PINN model. The governing equation for the beam structure is a fourth-order differential equation relating q and ω, which can be transformed into five first-order equations. These five equations correspond to the three fundamental equations of structural mechanics (the geometric equation, the stress-strain equation, and the equilibrium equation), with the physical variables (i.e., θ, κ, M, and FQ) as transition parameters (see 'Methods' for details). Figure 2b illustrates how these physical variables are related during the transition from q to ω. Figure 2c presents the ml-PINN framework and illustrates the computation process for beam structures. First, separate neural networks (NNs) are established to output the individual physical parameters (ω, θ, M, and FQ; see 'Model setting'), which means that each parameter is trained independently within its own dedicated NN. This strategy facilitates convergence of training by reducing the complexity of the NN for each parameter (more details can be found in the 'Discussion' section). Second, the derivatives of the NN outputs, including dω/dx, dθ/dx, dM/dx, and dFQ/dx, are obtained by automatic differentiation. Last, the multiple NNs are connected through their loss functions to form a comprehensive loss function that includes both the loss functions from the differential equations (Lp) and those from the boundary constraints (Lb). During the iterative computation, the parameters of the multiple NNs are updated to minimize the loss function until convergence is achieved.
a Diagrammatic sketch of the beam structure. b Physical relationship between the relevant variables that transforms the uniform load q into the vertical displacement ω by utilizing the three typical equations. M represents the bending moment and EI is the bending stiffness. The intermediate variables, including the shear force FQ, curvature κ, rotation angle θ, and displacement ω, are output from each neural network. c Schematics of the ml-PINN framework for the beam structure. Lb() and Lp() represent the loss functions from boundary constraints and from the PDEs, respectively. Lp(ω′, θ), Lp(θ′, κ), Lp(M′, FQ), and Lp(FQ′, q) are the loss functions corresponding to each differential equation. Lb(ω), Lb(θ), and Lb(M) are the loss functions from the boundary constraints on the vertical displacement, section rotation angle, and bending moment, respectively. {W, b}i represents the model parameters of the neural networks (NNs).
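A minimal sketch of this multi-network construction for the beam is given below, following the four-network layout described under 'Model setting' and obtaining κ algebraically from M. The simply supported boundary terms, the constants, the tanh activation, and the sign convention are illustrative assumptions rather than the authors' exact implementation.

```python
import tensorflow as tf  # TensorFlow 1.x

def fnn(x, hidden=(10, 10, 10), name="net"):
    """Small fully connected tanh network; one such network per physical variable."""
    h = x
    with tf.variable_scope(name, reuse=tf.AUTO_REUSE):
        for i, n in enumerate(hidden):
            h = tf.layers.dense(h, n, activation=tf.nn.tanh, name="hidden_%d" % i)
        return tf.layers.dense(h, 1, activation=None, name="out")

x_f = tf.placeholder(tf.float32, (None, 1))   # interior collocation points
x_b = tf.placeholder(tf.float32, (None, 1))   # boundary points (the two beam ends)
EI, q = 1.0, -1.0                             # illustrative constants

# One network per variable: deflection, rotation angle, bending moment, shear force
w, th = fnn(x_f, name="w"), fnn(x_f, name="theta")
M, FQ = fnn(x_f, name="M"), fnn(x_f, name="FQ")
kappa = M / EI                                # stress-strain relation applied algebraically

# Only first-order derivatives are needed to couple the networks
w_x, th_x = tf.gradients(w, x_f)[0], tf.gradients(th, x_f)[0]
M_x, FQ_x = tf.gradients(M, x_f)[0], tf.gradients(FQ, x_f)[0]

def mse(r):
    return tf.reduce_mean(tf.square(r))

loss_pde = mse(w_x - th) + mse(th_x - kappa) + mse(M_x - FQ) + mse(FQ_x - q)

# Boundary constraints act directly on network outputs (simply supported: w = 0, M = 0)
loss_bc = mse(fnn(x_b, name="w")) + mse(fnn(x_b, name="M"))

loss = loss_pde + loss_bc
train_op = tf.train.AdamOptimizer(learning_rate=1e-3).minimize(loss)
```

Because every residual couples only first-order derivatives of dedicated networks, the individual loss terms remain on comparable scales, which is the weight-balancing effect that the framework relies on.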
For the two-dimensional (2D) scenario of shell structures, the computation framework of ml-PINN is presented in Fig. 3. Similar to the beam structures, the approach to reducing the PDE order involves decomposing it into lower-order PDEs (i.e., the three typical equations). However, for shell structures these decomposed equations contain second-order differential terms, such as the relationship between ωNN(x, y) and κNN(x, y) as well as that between MNN(x, y) and q(x, y) (see Fig. 3b). Moreover, the framework for shell structures is more complicated owing to the increase in dimension, the presence of second-order partial derivatives, and the larger number of boundary constraints (Fig. 3c). Hence, training the framework for shell structures requires more iterations, and thus more time, to achieve convergence.
a Diagrammatic sketch of shell structures. b Physical relationship between the relevant variables that transforms the uniform load q into the vertical displacement ω by utilizing the three typical equations. Mx and My represent the bending moments of the shell in the x-direction and y-direction, respectively, and Mxy denotes the torque about the z-direction. The vertical displacement is in the z-direction, and the directions of the curvatures (κx, κy, κxy) correspond to those of the bending moments (Mx, My, Mxy). The intermediate variables, including the shear force FQ, curvature κ, and displacement ω, are output from each neural network. c Schematics of the ml-PINN framework for shell structures. Lb() and Lp() represent the loss functions from boundary constraints and from the PDEs, respectively. Lp(ω″, κ), Lp(κ, M), and Lp(M″, q) are the loss functions corresponding to the PDEs. Lb(ω), Lb(ω′), and Lb(M) are the loss functions of the boundary constraints on the vertical displacement, section rotation angle, and bending moment, respectively.
Computational results of bending structures
We first consider the computation of 1D beam structures under four different loading conditions and compare the ml-PINN results for beam deformation with the corresponding FEM results. As previously mentioned, arbitrarily distributed points, termed collocation points, are required to train the NNs. A total of 25 randomly distributed collocation points along the beam structure are used for training (Fig. 4a); the influence of the number of points on the computational results is examined in the sensitivity analysis below.
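Collocation points of this kind can be generated by simple uniform random sampling. The sketch below uses the point counts quoted in the text for the beam and, further on, for the shell, and assumes a normalized unit ___domain and a particular edge labeling purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(seed=0)

# Beam (1D): 25 random interior collocation points on the normalized span x/l in [0, 1],
# plus the two end points that carry the boundary-constraint losses
x_interior = rng.uniform(0.0, 1.0, size=(25, 1))
x_boundary = np.array([[0.0], [1.0]])

# Shell (2D): 1000 random interior points and 50 points on each of the four edges S1-S4
xy_interior = rng.uniform(0.0, 1.0, size=(1000, 2))
edges = [
    np.column_stack([rng.uniform(0.0, 1.0, 50), np.zeros(50)]),  # edge at y = 0
    np.column_stack([rng.uniform(0.0, 1.0, 50), np.ones(50)]),   # edge at y = 1
    np.column_stack([np.zeros(50), rng.uniform(0.0, 1.0, 50)]),  # edge at x = 0
    np.column_stack([np.ones(50), rng.uniform(0.0, 1.0, 50)]),   # edge at x = 1
]
xy_boundary = np.concatenate(edges, axis=0)  # 200 boundary points in total
```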
a Randomly distributed training data for beam structures. b Verification for case 1 (simply supported beam). The boundary displacement constraint is ω | x/l=0,x/l=1 = 0, and the vertical uniform load is q = −1. c Verification for case 2 (double-clamped beam). The displacement ω and turning angle θ at the ends of the beam are constrained as ω | x/l=0,x/l=1 = 0 and θ | x/l=0,x/l=1 = 0, with a vertical uniform load q = −1. d Verification for case 3. The turning angles θ at the beam ends are constrained, namely θ | x/l=0,x/l=1 = 0, while the displacement constraints are given separately as ω | x/l=0 = 0 and ω | x/l=1 = 1. e Verification for case 4. The constraints on the displacements ω and turning angles θ are defined as ω | x/l=0,x/l=1 = 0, θ | x/l=0 = 0, and θ | x/l=1 = 1, respectively.
The loads exerted on the beam can be divided into the external force (i.e., the uniform load q) and the boundary constraints (i.e., the displacement ω and rotation angle θ imposed at the two ends of the beam). The specific conditions for the different loading cases are listed in Table 1. Figure 4b–e compares the deformation results of ml-PINN and FEM under the four loading conditions. The vertical displacement computed by ml-PINN agrees well with that obtained from FEM: the beam deformation profiles from ml-PINN and FEM almost overlap (relative error <0.5%), indicating that ml-PINN achieves accuracy comparable to FEM in bending beam computation.
We further applied the ml-PINN to solve 2D shell structures. Table 2 lists four cases corresponding to various loading conditions exerted on the shell. A total of 1000 collocation points are randomly distributed within the interior zone, along with 50 points on each boundary (S1–S4) (Fig. 5a, b). The comparison indicates that the results computed by ml-PINN and FEM are very close (Fig. 5c–f). The error contours show that the deviation between ml-PINN and FEM near the corners of the shell is more pronounced than in the interior, especially for cases 3 and 4 (Fig. 5e, f). The primary reason is that the additional constraints imposed at the corners make convergence more difficult in these regions. Nevertheless, the overall errors remain low (relative errors below 2%), which means that the proposed ml-PINN model can accurately solve the 2D structural mechanics problem, specifically the deformation of bending shells under various loading conditions.
a Diagram of the boundaries. ω and θ represent the displacement and rotation angle imposed on the boundaries, respectively. Note that the direction of θ is indicated by a blue arrow according to the right-hand screw rule, and ω acts in the z-direction. b Distribution of collocation points. The random data points include points in the interior zone Ω and points on the boundaries S1-S4. c Verification for case 1. Only one opposite pair of boundary displacements is constrained, ω | S∈S3,S4 = 0. Note that the three contour plots in the top row represent, from left to right, the prediction of the proposed multi-level physics-informed neural network (labeled PINN in the figure), the finite element method prediction (labeled FEM), and the relative error. The two plots at the bottom present the extracted deformation results along the x- and y-directions (as illustrated by the dotted lines in the FEM contour). d Verification for case 2. All surrounding boundary displacements are set to ω | S∈S1,S2,S3,S4 = 0. e Verification for case 3. The displacement ω | S∈S1,S2,S3,S4 = 0 and section rotation angle θ | S∈S1,S2,S3,S4 = 0 are imposed on all surrounding boundaries. f Verification for case 4. The displacement ω | S∈S1,S2,S3,S4 = 0 and section rotation angle θ | S∈S1,S2,S3,S4 = 1 are imposed, and the uniform load is q = −1.
Sensitivity analysis of collocation points
For the ml-PINN computation, the number of collocation points is critical because it affects the resolution of the computational results. Hence, a sensitivity analysis of the collocation points is required. Figure 6 shows a comparison of ml-PINN predictions obtained with different numbers of collocation points (np). For the beam structure case, we tested 10, 25, 50, and 100 randomly distributed points. For the shell structure case, we used 10, 100, 500, and 1000 randomly distributed points in the interior area while keeping 50 data points fixed on each boundary of the shell structure.
a Beam structure case with different numbers of collocation points. The number of data points np in the interior zone is 10, 25, 50, or 100. b Shell structure case with different numbers of collocation points. np represents the number of data points in the interior zone, given as 100, 500, 1000, or 2000; nb denotes the number of data points on the boundaries.
For the 1D beam (Fig. 6a), the computational error is relatively high (close to 3%) when rather sparse collocation points are used (np = 10), while the error drops below 1% as the number of collocation points increases (np ≥ 25). Thus, to balance computational precision and speed, np = 25 was adopted. In general, more collocation points perform better than fewer: with too few collocation points, the data do not carry enough information to support an accurate prediction, whereas once the number of collocation points reaches a certain value (np = 25 for the beam in the studied case), the prediction becomes almost independent of the number of input data points. Note that the appropriate number of collocation points can be determined by sensitivity analysis in each application.
For the 2D shell computation (Fig. 6b), Table 3 lists a quantitative comparison of the loss residuals and relative errors for different numbers of collocation points. It is evident that densely distributed points are required to achieve good accuracy, and the error is reduced to approximately 2% when np = 1000. Thus, considering both computational accuracy and efficiency, we used 1000 collocation points to compute the shell structure shown in Fig. 5.
Discussion
In summary, this study proposes an ml-PINN framework for structural mechanics computation with high-order PDEs, aiming to predict the response of bending beam and shell structures under different loading conditions. The PDEs of bending structures involve multiple physical parameters, such as the vertical displacement ω, section angle θ, and bending moment M. These parameters are correlated through differential terms (e.g., θ∝∂ω, κ∝∂2ω, and q∝∂4ω, as described in the 'Methods' section) and are subject to boundary constraints. Hence, the classical PINN framework contains multiple loss functions of different orders of magnitude, one for each differential term. The disparity in magnitude between these loss functions is substantial owing to the different orders of differentiation, which results in non-convergence or only local convergence. In contrast, the proposed ml-PINN adopts multiple NNs, one for each intermediate physical parameter such as θ, κ, and M, to achieve both local and global convergence, meanwhile reducing the order of the differential terms through the transition of the intermediate parameters (e.g., q∝∂FQ, FQ∝∂M, κ∝∂θ, and θ∝∂ω in the beam, and q∝∂2M and κ∝∂2ω in the shell). In other words, the ability of the ml-PINN to solve high-order PDEs derives from the underlying physical relationships, i.e., the three typical equations describing the correlation between these physical variables. The proposed ml-PINN framework significantly enhances computational accuracy, reducing the errors of the beam and shell computations to 0.5% and 2%, respectively, surpassing classical PINN, while achieving roughly four times shorter computation time. This strategy of employing ml-PINN to reduce the order of PDEs can also be extended to other solid mechanics problems governed by PDEs with similar properties.
Currently, the primary disadvantage of ml-PINN is its time-consuming training process, a general issue of PINN when solving forward problems. However, once the ml-PINN is well trained, it can serve as a surrogate model enabling efficient end-to-end prediction. Overall, the results and discussion above indicate that ml-PINN achieves accuracy comparable to FEM for solving bending structures. Moreover, unlike FEM, which requires repeated model establishment (e.g., mesh generation, parameter adjustment, and calibration) and computation for different conditions, ml-PINN possesses generalization learning capabilities and can rapidly predict the results for different loading cases once the neural networks (NNs) are well trained. Figure 7a shows the ml-PINN framework for generalization learning, in which the physical variables related to the loading conditions (u and q) are treated as additional input parameters of the network. Figure 7b exhibits the direct prediction performance under different loading conditions, indicating that incorporating generalization into ml-PINN significantly improves computational efficiency while maintaining accuracy comparable to FEM.
a Framework of ml-PINN for generalization learning, in which the physical variables related to the loading conditions (u and q) are incorporated as input parameters of the networks. u and q represent the displacement applied at the right boundary and the uniform load imposed over the entire region, respectively. b Direct prediction results of ml-PINN for bending beam cases under different loading conditions. FEM denotes the result predicted by the finite element method, and Generalized PINN denotes the result predicted by the proposed ml-PINN.
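At the implementation level, generalization learning amounts to widening the network input so that the load case is part of the query. A minimal sketch (TensorFlow 1.x style, with illustrative layer sizes and an assumed tanh activation) is given below.

```python
import tensorflow as tf  # TensorFlow 1.x

# The coordinate x is concatenated with the load-case parameters u (imposed boundary
# displacement) and q (uniform load), so a single trained model can be queried directly
# for new loading conditions without retraining.
x = tf.placeholder(tf.float32, (None, 1))
u = tf.placeholder(tf.float32, (None, 1))
q = tf.placeholder(tf.float32, (None, 1))

h = tf.concat([x, u, q], axis=1)                # network input of size 3 instead of 1
for i in range(3):                              # hidden layers (sizes illustrative)
    h = tf.layers.dense(h, 20, activation=tf.nn.tanh, name="gen_hidden_%d" % i)
w_pred = tf.layers.dense(h, 1, name="gen_out")  # deflection for the queried load case
```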
Methods
Differential equations of beam structures
The governing equation describing the deformation of a bending beam is a fourth-order differential equation, given by
where q(x) represents the uniform load on the beam, EI denotes flexural stiffness, and ω represents vertical displacement. To reduce the order of Eq. (1), several intermediate variables are introduced, and Eq. (1) can be decomposed into five first-order differential equations:
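Under the same sign convention, and labeled to match the classification that follows, these can be written as

\(\frac{d{F}_{Q}}{dx}=q(x)\)    (2a)

\(\frac{dM}{dx}={F}_{Q}\)    (2b)

\(M=EI\kappa\)    (2c)

\(\frac{d\theta }{dx}=\kappa\)    (2d)

\(\frac{d\omega }{dx}=\theta\)    (2e)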
where θ and κ denote the section angle and curvature of the beam, respectively, and M and FQ represent the bending moment and shear force. In structural mechanics, these equations are commonly referred to as the equilibrium equations (Eqs. (2a) and (2b)), the stress-strain equation (Eq. (2c)), and the geometric equations (Eqs. (2d) and (2e)).
There are two types of boundary constraints: (i) the vertical displacement \(\overline{\omega }\) and section angle \(\overline{\theta }\) are prescribed on the boundary st1, i.e., \(\omega {|}_{{s}_{t1}}=\overline{\omega }\) and \(\theta {|}_{{s}_{t1}}=\overline{\theta }\).
In particular, \(\overline{\omega }=0\) and \(\overline{\theta }=0\) corresponds to a clamped (fixed) end; (ii) the vertical displacement \(\overline{\omega }\) and bending moment \(\overline{M}\) are imposed on the boundary st2, i.e., \(\omega {|}_{{s}_{t2}}=\overline{\omega }\) and \(M{|}_{{s}_{t2}}=\overline{M}\).
For a simply supported beam, the boundary condition is specified as \(\overline{\omega }=0\) and \(\overline{M}=0\).
Similar to the governing equations of the beam structure described above, the governing PDEs for shell structures also comprise the geometric equation, the stress-strain equation, and the equilibrium equation, as given by,
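In a standard Kirchhoff plate form consistent with the quantities defined below (the signs and the factor on the twist term follow one common convention and may differ in detail from the authors' exact expressions):

\({\kappa }_{x}=-\frac{{\partial }^{2}\omega }{\partial {x}^{2}},\quad {\kappa }_{y}=-\frac{{\partial }^{2}\omega }{\partial {y}^{2}},\quad {\kappa }_{xy}=-2\frac{{\partial }^{2}\omega }{\partial x\partial y}\)    (geometric)

\({\{{M}_{x},{M}_{y},{M}_{xy}\}}^{T}={\bf{D}}{\{{\kappa }_{x},{\kappa }_{y},{\kappa }_{xy}\}}^{T},\quad {\bf{D}}=\frac{E{t}^{3}}{12(1-{\nu }^{2})}\left[\begin{array}{ccc}1 & \nu & 0\\ \nu & 1 & 0\\ 0 & 0 & \frac{1-\nu }{2}\end{array}\right]\)    (stress-strain)

\(\frac{{\partial }^{2}{M}_{x}}{\partial {x}^{2}}+2\frac{{\partial }^{2}{M}_{xy}}{\partial x\partial y}+\frac{{\partial }^{2}{M}_{y}}{\partial {y}^{2}}+q(x,y)=0\)    (equilibrium)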
where ω denotes the displacement in the z-direction; κx, κy, and κxy represent the curvatures of the shell in the x-direction and y-direction and the twist about the z-direction, respectively; Mx, My, and Mxy denote the bending moments of the shell in the x-direction and y-direction and the torque about the z-direction, respectively; D, E, t, and ν denote the elastic relationship matrix, the elastic modulus, the shell thickness in the z-direction, and the Poisson ratio of the material, respectively; and q(x, y) is the known uniformly distributed load exerted on the shell.
Similarly, two types of boundary constraints are defined on the two boundaries St1 and St2, respectively, i.e., \(\omega {|}_{{S}_{t1}}=\overline{\omega }\) and \(\frac{\partial \omega }{\partial n}{|}_{{S}_{t1}}=\overline{\theta }\) on St1, and \(\omega {|}_{{S}_{t2}}=\overline{\omega }\) and \({M}_{n}{|}_{{S}_{t2}}=\overline{M}\) on St2,
where \(\overline{\omega }\), \(\overline{\theta }\), and \(\overline{M}\) represent the prescribed vertical displacement, section angle, and bending moment, respectively, and n denotes the normal direction of the boundary.
Loss function of ml-PINN
The loss function of ml-PINN is defined by the residuals of the PDEs and the boundary constraints. For the beam structure, the PDE loss function Lp is composed of residuals from the geometric equations (Lp(ω′, θ) and Lp(θ′, κ)) as well as the equilibrium equations (Lp(M′, FQ) and Lp(FQ′, q)), as depicted below. Note that no separate loss function is established for the stress-strain equation because of the simple linear relation between κ and M.
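In a mean-squared-residual form consistent with the notation defined below (the 1/np averaging is an assumption, and κNN denotes the curvature obtained from MNN through the stress-strain relation), these terms can be sketched as

\({L}_{p}(\omega ^{\prime} ,\theta )=\frac{1}{{n}_{p}}\sum _{x\in l}\left\Vert \frac{d{\omega }_{NN}}{dx}-{\theta }_{NN}\right\Vert _{2}^{2},\quad {L}_{p}(\theta ^{\prime} ,\kappa )=\frac{1}{{n}_{p}}\sum _{x\in l}\left\Vert \frac{d{\theta }_{NN}}{dx}-{\kappa }_{NN}\right\Vert _{2}^{2}\)

\({L}_{p}(M^{\prime} ,{F}_{Q})=\frac{1}{{n}_{p}}\sum _{x\in l}\left\Vert \frac{d{M}_{NN}}{dx}-{F}_{QNN}\right\Vert _{2}^{2},\quad {L}_{p}({F}_{Q}^{\prime} ,q)=\frac{1}{{n}_{p}}\sum _{x\in l}\left\Vert \frac{d{F}_{QNN}}{dx}-q\right\Vert _{2}^{2}\)

\({L}_{p}={L}_{p}(\omega ^{\prime} ,\theta )+{L}_{p}(\theta ^{\prime} ,\kappa )+{L}_{p}(M^{\prime} ,{F}_{Q})+{L}_{p}({F}_{Q}^{\prime} ,q)\)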
where np represents the number of collocation points in the whole region, l represents the interior zone of the computational ___domain of the beam structure, \(\sum ||\cdot |{|}_{2}^{2}\) denotes the sum of squared 2-norms, and \(\frac{d{\omega }_{NN}}{dx}\), \(\frac{d{\theta }_{NN}}{dx}\), \(\frac{d{M}_{NN}}{dx}\), and \(\frac{d{F}_{QNN}}{dx}\) correspond to the first-order derivatives of the NN outputs for ω, θ, M, and FQ, respectively.
The loss function of the boundary constraints, Lb, comprises the loss functions for the constrained vertical displacement Lb(ω), the constrained section angle Lb(θ), and the constrained bending moment Lb(M), namely,
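In the same mean-squared form (again assuming the 1/nb normalization):

\({L}_{b}(\omega )=\frac{1}{{n}_{b}}\sum _{x\in {l}_{b}}\left\Vert {\omega }_{NN}-\overline{\omega (x)}\right\Vert _{2}^{2},\quad {L}_{b}(\theta )=\frac{1}{{n}_{b}}\sum _{x\in {l}_{b}}\left\Vert {\theta }_{NN}-\overline{\theta (x)}\right\Vert _{2}^{2},\quad {L}_{b}(M)=\frac{1}{{n}_{b}}\sum _{x\in {l}_{b}}\left\Vert {M}_{NN}-\overline{M(x)}\right\Vert _{2}^{2}\)

\({L}_{b}={L}_{b}(\omega )+{L}_{b}(\theta )+{L}_{b}(M)\)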
where nb denotes the number of collocation points in the boundary regions and lb represents the boundary regions of the beam structure. \(\overline{\omega (x)}\), \(\overline{\theta (x)}\), and \(\overline{M(x)}\) represent the displacement, rotation angle, and bending moment prescribed at the boundary, respectively. The total loss function Ltotal of the beam structure is obtained by adding Lp and Lb together, namely, \({L}_{total}={L}_{p}+{L}_{b}\).
Similarly, for the shell structures, the PDE loss function Lp consists of residuals from the geometric equation Lp(ω″, κ), the stress-strain equation Lp(κ, M), and the equilibrium equation Lp(M″, q), which are written as
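Sketched in the same mean-squared form, with the plate sign convention adopted above (indicative of the structure of these terms rather than the authors' exact expressions):

\({L}_{p}(\omega ^{\prime\prime} ,\kappa )=\frac{1}{{n}_{p}}\sum _{(x,y)\in \Omega }\left(\left\Vert {\kappa }_{NNx}+\frac{{\partial }^{2}{\omega }_{NN}}{\partial {x}^{2}}\right\Vert _{2}^{2}+\left\Vert {\kappa }_{NNy}+\frac{{\partial }^{2}{\omega }_{NN}}{\partial {y}^{2}}\right\Vert _{2}^{2}+\left\Vert {\kappa }_{NNxy}+2\frac{{\partial }^{2}{\omega }_{NN}}{\partial x\partial y}\right\Vert _{2}^{2}\right)\)

\({L}_{p}(\kappa ,M)=\frac{1}{{n}_{p}}\sum _{(x,y)\in \Omega }\left\Vert {M}_{NN}-{\bf{D}}{\kappa }_{NN}\right\Vert _{2}^{2}\)

\({L}_{p}(M^{\prime\prime} ,q)=\frac{1}{{n}_{p}}\sum _{(x,y)\in \Omega }\left\Vert \frac{{\partial }^{2}{M}_{NNx}}{\partial {x}^{2}}+2\frac{{\partial }^{2}{M}_{NNxy}}{\partial x\partial y}+\frac{{\partial }^{2}{M}_{NNy}}{\partial {y}^{2}}+q\right\Vert _{2}^{2}\)

with \({L}_{p}={L}_{p}(\omega ^{\prime\prime} ,\kappa )+{L}_{p}(\kappa ,M)+{L}_{p}(M^{\prime\prime} ,q)\).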
where Ω represents the interior zone of the computational ___domain of the shell structure; \(\frac{{\partial }^{2}{\omega }_{NN}}{\partial {x}^{2}}\), \(\frac{{\partial }^{2}{\omega }_{NN}}{\partial {y}^{2}}\), and \(\frac{{\partial }^{2}{\omega }_{NN}}{\partial x\partial y}\) represent the second-order partial derivatives of the NN output of the displacement ωNN(x, y); κNN(x, y) is the neural network (NN) output of the curvature κ; and \(\frac{{\partial }^{2}{M}_{NNx}}{\partial {x}^{2}}\), \(\frac{{\partial }^{2}{M}_{NNy}}{\partial {y}^{2}}\), and \(\frac{{\partial }^{2}{M}_{NNxy}}{\partial x\partial y}\) denote the second-order partial derivatives of the NN output of the bending moment MNN(x, y).
The loss functions of the boundary constraints mainly include those for the constrained vertical displacement Lb(ω), the constrained section rotation Lb(ω′), and the constrained bending moment Lb(Mn), namely,
where S1, S2, S3, and S4 represent different boundary regions within the shell structure (Fig. 5a); \(\overline{\omega (x,y)}\), \(\overline{\theta (x,y)}\) and \(\overline{M(x,y)}\) represent the displacement, rotation angle, and bending moment at the boundaries.
Finally, the total loss function Ltotal of the shell structure is likewise written as the sum of the PDE and boundary contributions, \({L}_{total}={L}_{p}+{L}_{b}\).
The NNs are automatically optimized to update the weights and biases of the networks by minimizing the total loss function, namely, \({\{W,b\}}^{* }={\arg \min }_{\{W,b\}}{L}_{total}\).
Model setting
All networks are trained using the Adam algorithm with a learning rate of 10−3. For the 1D beam structure, a total of four neural networks (NNs) were established to predict ω, θ, M, and FQ, namely ωNN, θNN, MNN, and FQNN. Each of the four fully connected feedforward NNs in ml-PINN consists of an input layer with 1 node, 3 hidden layers with 10 nodes each, and an output layer with 1 node. For the classical PINN (c-PINN) used for comparison, a single fully connected feedforward neural network (NN) was constructed with an input layer containing 1 node, 3 hidden layers with 20 nodes each, and an output layer with 1 node, so that c-PINN and ml-PINN possess the same capacity in terms of fully connected layers. Similarly, for the 2D shell structure with ml-PINN, three NNs were set up: ωNN, κNN (κx, κy, and κxy), and MNN (Mx, My, and Mxy). ωNN consists of an input layer with 2 nodes, 3 hidden layers with 30 nodes each, and an output layer with 1 node, whereas κNN and MNN were constructed with an input layer with 2 nodes, 3 hidden layers with 20 nodes each, and an output layer with 3 nodes.
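As a concrete reading of this shell configuration, the three networks can be assembled as sketched below (TensorFlow 1.x style; the tanh activation is an assumption, as the text does not state the activation function).

```python
import tensorflow as tf  # TensorFlow 1.x

def fnn(xy, width, n_out, name):
    """Fully connected feedforward network with 3 hidden layers, as described above."""
    h = xy
    with tf.variable_scope(name, reuse=tf.AUTO_REUSE):
        for i in range(3):
            h = tf.layers.dense(h, width, activation=tf.nn.tanh, name="hidden_%d" % i)
        return tf.layers.dense(h, n_out, activation=None, name="out")

xy = tf.placeholder(tf.float32, (None, 2))           # (x, y) collocation points of the shell

w_nn = fnn(xy, width=30, n_out=1, name="w")          # deflection: 2-30-30-30-1
kappa_nn = fnn(xy, width=20, n_out=3, name="kappa")  # (kappa_x, kappa_y, kappa_xy): 2-20-20-20-3
M_nn = fnn(xy, width=20, n_out=3, name="M")          # (M_x, M_y, M_xy): 2-20-20-20-3

# The total loss is assembled from the residuals defined above and minimized with Adam:
# train_op = tf.train.AdamOptimizer(learning_rate=1e-3).minimize(total_loss)
```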
Data availability
The simulated data were generated with the software ABAQUS and are used for the verification of the computational results predicted by the ml-PINN framework. These readily available data are bundled with the accompanying code, and their source information is provided within this paper.
Code availability
The source programs of the ml-PINN computation can be found in the Supplementary Code. The ml-PINN framework is implemented using TensorFlow 1.15, an open-source Python package. The code for this article will be available on GitHub (https://github.com/he-weiwei/ml-PINN.git).
References
Jiang, Y., Yin, S., Li, K., Luo, H. & Kaynak, O. Industrial applications of digital twins. Philos. Trans. R Soc. A 379, 20200360 (2021).
Austin, M., Delgoshaei, P., Coelho, M. & Heidarinejad, M. Architecting smart city digital twins: Combined semantic model and machine learning approach. J. Manage. Eng. 36, 04020026 (2020).
Li, X. et al. Big data analysis of the Internet of things in the digital twins of smart city based on deep learning. Future Gener. Comp. SY 128, 167–177 (2022).
Jiang, F., Ma, L., Broyd, T. & Chen, K. Digital twin and its implementations in the civil engineering sector. Autom. Constr. 130, 103838 (2021).
Bado, M. F., Tonelli, D., Poli, F., Zonta, D. & Casas, J. R. Digital twin for civil engineering systems: an exploratory review for distributed sensing updating. Sensors 22, 3168 (2022).
Wagg, D. J., Worden, K., Barthorpe, R. J. & Gardner, P. Digital twins: state-of-the-art and future directions for modeling and simulation in engineering dynamics applications. ASCE-ASME J. Risk. Uncert. Eng. Syst. Part B Mech. Eng. 6, 030901 (2020).
Dimiduk, D. M., Holm, E. A. & Niezgoda, S. R. Perspectives on the impact of machine learning, deep learning, and artificial intelligence on materials, processes, and structures engineering. Integ. Mater. Manuf. Innov. 7, 157–172 (2018).
Salehi, H. & Burgueño, R. Emerging artificial intelligence methods in structural engineering. Eng. Struct. 171, 170–189 (2018).
Sun, L., Shang, Z., Xia, Y., Bhowmick, S. & Nagarajaiah, S. Review of bridge structural health monitoring aided by big data and artificial intelligence: From condition assessment to damage detection. J. Struct. Eng. 146, 04020073 (2020).
Tapeh, A. T. G. & Naser, M. Z. Artificial intelligence, machine learning, and deep learning in structural engineering: a scientometrics review of trends and best practices. Arch. Comput. Methods Eng. 30, 115–159 (2023).
Baduge, S. K. et al. Artificial intelligence and smart vision for building and construction 4.0: machine and deep learning methods and applications. Autom. Constr. 141, 104440 (2022).
Cha, Y. J., Choi, W. & Büyüköztürk, O. Deep learning-based crack damage detection using convolutional neural networks. Comput. Aided Civ. Infrastruct. Eng. 32, 361–378 (2017).
Dorafshan, S., Thomas, R. J. & Maguire, M. Comparison of deep convolutional neural networks and edge detectors for image-based crack detection in concrete. Constr. Build. Mater. 186, 1031–1045 (2018).
Bao, Y., Tang, Z., Li, H. & Zhang, Y. Computer vision and deep learning–based data anomaly detection method for structural health monitoring. Struct. Health. Monit. 18, 401–421 (2019).
Zhu, Z. & Brilakis, I. Parameter optimization for automated concrete detection in image data. Autom. Constr. 19, 944–953 (2010).
Sharafi, P., Teh, L. H. & Hadi, M. N. Shape optimization of thin-walled steel sections using graph theory and ACO algorithm. J. Constr. Steel Res. 101, 331–341 (2014).
Rahman, J., Ahmed, K. S., Khan, N. I., Islam, K. & Mangalathu, S. Data-driven shear strength prediction of steel fiber reinforced concrete beams using machine learning approach. Eng. Struct. 233, 111743 (2021).
Kwon, S. J. & Song, H. W. Analysis of carbonation behavior in concrete using neural network algorithm and carbonation modeling. Cem. Concr. Res. 40, 119–127 (2010).
Hossain, M., Gopisetti, L. S. P. & Miah, M. S. Artificial neural network modelling to predict international roughness index of rigid pavements. Int. J. Pavement Res. Technol. 13, 229–239 (2020).
Nilsen, V. et al. Prediction of concrete coefficient of thermal expansion and other properties using machine learning. Constr. Build. Mater. 220, 587–595 (2019).
Lai, D., Demartino, C. & Xiao, Y. Interpretable machine-learning models for maximum displacements of RC beams under impact loading predictions. Eng. Struct. 281, 115723 (2023).
Onchis, D. M. & Gillich, G. R. Stable and explainable deep learning damage prediction for prismatic cantilever steel beam. Comput. Ind. 125, 103359 (2021).
Koeppe, A., Bamer, F., Selzer, M., Nestler, B. & Markert, B. Explainable artificial intelligence for mechanics: physics-explaining neural networks for constitutive models. Front. Mater. 8, 824958 (2022).
Raissi, M., Perdikaris, P. & Karniadakis, G. E. Physics informed deep learning (part i): Data-driven solutions of nonlinear partial differential equations. https://arxiv.org/abs/1711.10561 (2017).
Raissi, M., Perdikaris, P. & Karniadakis, G. E. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 378, 686–707 (2019).
Baydin, A. G., Pearlmutter, B. A., Radul, A. A. & Siskind, J. M. Automatic differentiation in machine learning: a survey. J. Mach. Learn. Res. 18, 1–43 (2018).
Meng, C., Seo, S., Cao, D., Griesemer, S. & Liu, Y. When physics meets machine learning: a survey of physics-informed machine learning. https://arxiv.org/abs/2203.16797 (2022).
Karniadakis, G. E. et al. Physics-informed machine learning. Nat. Rev. Phys. 3, 422–440 (2021).
Liu, S. et al. Physics-informed machine learning for composition-process-property design: shape memory alloy demonstration. Appl. Mater. Today 22, 100898 (2021).
Niaki, S. A., Haghighat, E., Campbell, T., Poursartip, A. & Vaziri, R. Physics-informed neural network for modelling the thermochemical curing process of composite-tool systems during manufacture. Comput. Methods Appl. Mech. Eng. 384, 113959 (2020).
Shukla, K., Di Leoni, P. C., Blackshire, J., Sparkman, D. & Karniadakis, G. E. Physics-informed neural network for ultrasound nondestructive quantification of surface breaking cracks. J. Nondestr. Eval. 39, 1–20 (2020).
Zhu, Q., Liu, Z. & Yan, J. Machine learning for metal additive manufacturing: predicting temperature and melt pool fluid dynamics using physics-informed neural networks. Comput. Mech. 67, 619–635 (2021).
Zhang, R., Liu, Y. & Sun, H. Physics-guided convolutional neural network (PhyCNN) for data-driven seismic response modeling. Eng. Struct. 215, 110704 (2020).
Ren, P. et al. SeismicNet: Physics-informed neural networks for seismic wave modeling in semi-infinite ___domain. Comp. Phys. Comm. 295, 109010 (2024).
Zhang, R., Liu, Y. & Sun, H. Physics-informed multi-LSTM networks for metamodeling of nonlinear structures. Comput. Methods Appl. Mech. Eng. 369, 113226 (2020).
Lu, X., Liao, W., Zhang, Y. & Huang, Y. Intelligent structural design of shear wall residence using physics-enhanced generative adversarial networks. Earthq. Eng. Struct. Dyn. 51, 1657–1676 (2022).
Chen, Y., Lu, L., Karniadakis, G. E. & Negro, L. D. Physics-informed neural networks for inverse problems in nano-optics and metamaterials. Opt. Express 28, 11618–11633 (2020).
Tartakovsky, A. M., Marrero, C. O., Perdikaris, P., Tartakovsky, G. D. & Barajas-Solano, D. Physics-informed deep neural networks for learning parameters and constitutive relationships in subsurface flow problems. Water Resour. Res. 56, e2019WR026731 (2020).
Rezaei, S., Harandi, A., Moeineddin, A., Xu, B. X. & Reese, S. A mixed formulation for physics-informed neural networks as a potential solver for engineering problems in heterogeneous domains: comparison with finite element method. Comput. Methods Appl. Mech. Eng. 401, 115616 (2022).
Yang, L., Zhang, D. & Karniadakis, G. E. Physics-informed generative adversarial networks for stochastic differential equations. SIAM J. Sci. Comput. 42, A292–A317 (2020).
Jagtap, A. D., Kharazmi, E. & Karniadakis, G. E. Conservative physics-informed neural networks on discrete domains for conservation laws: applications to forward and inverse problems. Comput. Methods Appl. Mech. Eng. 365, 113028 (2020).
Yu, J., Lu, L., Meng, X. & Karniadakis, G. E. Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems. Comput. Methods Appl. Mech. Eng. 393, 114823 (2022).
Zhang, D., Lu, L., Guo, L. & Karniadakis, G. E. Quantifying total uncertainty in physics-informed neural networks for solving forward and inverse stochastic problems. J. Comput. Phys. 397, 108850 (2019).
Gao, H., Zahr, M. J. & Wang, J. X. Physics-informed graph neural Galerkin networks: a unified framework for solving PDE-governed forward and inverse problems. Comput. Methods Appl. Mech. Eng. 390, 114502 (2022).
Haghighat, E., Bekar, A. C., Madenci, E. & Juanes, R. A nonlocal physics-informed deep learning framework using the peridynamic differential operator. Comput. Methods Appl. Mech. Eng. 385, 114012 (2021).
Meng, X., Li, Z., Zhang, D. & Karniadakis, G. E. PPINN: Parareal physics-informed neural network for time-dependent PDEs. Comput. Methods Appl. Mech. Eng. 370, 113250 (2020).
Zhu, Y., Zabaras, N., Koutsourelakis, P. S. & Perdikaris, P. Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data. J. Comput. Phys. 394, 56–81 (2019).
Hennigh, O. et al. NVIDIA SimNet™: an AI-accelerated multi-physics simulation framework. in Computational Science–ICCS 2021: 21st International Conference, Krakow, Poland, June 16-18, Proceedings, Part V. 447–461 (Springer International Publishing, Cham, 2021)
Sun, L. & Wang, J. X. Physics-constrained Bayesian neural network for fluid flow reconstruction with sparse and noisy data. Theor. Appl. Mech. Lett. 10, 161–169 (2020).
Wang, J. X., Wu, J. L. & Xiao, H. Physics-informed machine learning approach for reconstructing Reynolds stress modeling discrepancies based on DNS data. Phys. Rev. Fluids 2, 034603 (2017).
Eivazi, H., Tahani, M., Schlatter, P. & Vinuesa, R. Physics-informed neural networks for solving Reynolds-averaged Navier-Stokes equations. Phys. Fluids 34, 075117 (2022).
Sun, L., Gao, H., Pan, S. & Wang, J. X. Surrogate modeling for fluid flows based on physics-constrained deep learning without simulation data. Comput. Methods Appl. Mech. Eng. 361, 112732 (2020).
Erichson, N. B., Muehlebach, M. & Mahoney, M. W. Physics-informed autoencoders for Lyapunov-stable fluid flow prediction. https://arxiv.org/abs/1905.10866 (2019).
Li, W., Bazant, M. Z. & Zhu, J. A physics-guided neural network framework for elastic plates: comparison of governing equations-based and energy-based approaches. Comput. Methods Appl. Mech. Eng. 383, 113933 (2021).
Nguyen-Thanh, V. M., Zhuang, X. & Rabczuk, T. A deep energy method for finite deformation hyperelasticity. Eur. J. Mech. A/Solid 80, 103874 (2020).
Haghighat, E., Amini, D. & Juanes, R. Physics-informed neural network simulation of multiphase poroelasticity using stress-split sequential training. Comput. Methods Appl. Mech. Eng. 397, 115141 (2022).
Haghighat, E., Raissi, M., Moure, A., Gomez, H. & Juanes, R. A Physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics. Comput. Methods Appl. Mech. Eng. 379, 113741 (2021).
Xu, C., Cao, B. T., Yuan, Y. & Meschke, G. Transfer learning based physics-informed neural networks for solving inverse problems in engineering structures under different loading scenarios. Comput. Methods Appl. Mech. Eng. 405, 115852 (2023).
Rao, C., Sun, H. & Liu, Y. Physics-informed deep learning for computational elastodynamics without labeled data. J. Eng. Mech. 147, 8 (2021).
Mitusch, S. K., Funke, S. W. & Kuchta, M. Hybrid FEM-NN models: combining artificial neural networks with the finite element method. J. Comput. Phys. 446, 110651 (2021).
Acknowledgements
The authors would like to acknowledge the financial support from the National Natural Science Foundation of China (Grant No. 52108434 and 52108137) and the Hunan Provincial Innovation Foundation For Postgraduate (Grant No. CX20240461).
Author information
Authors and Affiliations
Contributions
Conceptualization: He, Li, Kong, and Deng; Investigation: He and Li; Visualization: He and Li; Supervision: Kong and Deng; Writing—original draft: He and Li; Writing—review and editing: He, Li, Kong, and Deng.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Engineering thanks Daniel Pitchforth and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary handling editors: Mengying Su, Miranda Vinay, Rosamund Daw. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
He, W., Li, J., Kong, X. et al. Multi-level physics informed deep learning for solving partial differential equations in computational structural mechanics. Commun Eng 3, 151 (2024). https://doi.org/10.1038/s44172-024-00303-3