Abstract
At present, the parameters of the controllers in hot rolling roughing microtension control systems are not adaptively adjustable to variations in working conditions, which compromises both width accuracy and production stability. To address this issue, this paper introduces an ATKB-PID adaptive micro tension control method. This method incorporates a linear attention layer and utilizes a K-Nearest Neighbors (KNN) algorithm to predict the optimal learning rate and inertia coefficient under actual operating conditions. Furthermore, an objective function is tailored to production indices to enhance model performance. Comparative experiments with both established and recently introduced controllers demonstrate that the proposed ATKB-PID method exhibits a smaller steady-state error and quicker adjustment time. The ATKB-PID control method is well-suited for the complex and dynamic microtension control demands in thermal roughing processes, showing promising application potential.
Similar content being viewed by others
Introduction
In the steel making process, micro tension control in the roughing mill of hot rolling has a direct effect on plate width accuracy and production stability1. This control effect will further affect the finishing process and thus the overall quality of the hot rolled coil2. At present, the hot rolling roughing mill micro-tension control mostly relies on empirically presetting PID controller parameters, which have many shortcomings3. Production involves fine-tuning the main drive speed of the vertical roller mill motor according to the difference between the actual tension between the vertical and flat rolls and the theoretical tension to achieve micro-tension control4. The target speed and load of the vertical roller mill motor change frequently, resulting in fixed PID parameters, making it difficult to achieve the desired control effect, which affects the precision of steel plate width and product quality. In addition, manual adjustment of PID parameters is inefficient and greatly reduces the stability and efficiency of production5.
At present, there are two main solutions to optimize the micro-tension control: the first is a more accurate prediction of the PID parameters6, some scholars have improved the effect of micro-tension control by improving the mechanism model7, but still can not adapt to the complexity of working conditions on the site; the second is the design of adaptive controllers, the most typical adaptive controller is the BP-PID8, which can be adapted to adjust the parameters of the controller for different conditions to improve the stability and efficiency of production and meet the production stability and product quality requirements. Improve the stability and efficiency of production to meet the technical requirements. At present, BP-PID has been widely used in various industrial fields, Ren et al. proposed an active anti-disturbance pitch controller using BP-PID algorithm for different working conditions of wind turbines to meet the engineering needs9, and BP-PID has also been applied to the lateral stability control of electric vehicles10, DC electronic load systems11 and so on. However, its control performance is affected by two factors: the parameter settings (learning rate and inertia coefficient)12 and the susceptibility of the BP algorithm to gradient vanishing13. Adaptive controllers with extremely irrational parameter settings can adapt to different operating conditions, but it will lead to its long control time, so predicting roughly reasonable parameters through different operating conditions can effectively shorten the control time of the BP-PID controller14, and Adithiyaa et al. have already proved the ability of KNN to predict the parameters15; and Wang et al. have proposed to introduce the attention mechanism into the BP algorithm to help update the weight matrix according to the inputs, which effectively solves the problem that the BP algorithm itself is prone to gradient vanishing16. Moreover, the objective function has been shown to have a significant effect on the steady-state error, overshoot amount, control time and other indices in adaptive control tasks17. Considering that the controlled object is discrete and has a large time step, the system is a black box, and the PID controller in the hot rolling system is difficult to replace, it is best to make improvements based on the PID controller.
Therefore, we propose a novel adaptive micro tension control method specifically applied to the hot rolling roughing process. Firstly, we introduce a linear attention mechanism, which is more suitable for fast response scenarios compared to ordinary attention mechanisms, to optimize the BP neural network to avoid its potential gradient vanishing problem. Secondly, we adopt the KNN algorithm to achieve the learning rate and inertia coefficient of the adaptive controller parameters according to the working conditions. Finally, we design a specific objective function to improve the performance of the adaptive controller according to the control task to ensure that the control effect of the micro tension control of the hot rolling roughing mill meets the engineering needs18. In this paper, this micro-tension control method is referred to as the ATKB-PID micro-tension control method.
Methods
In this chapter, we introduce how we designed the ATKB-PID controller specifically for the discrete and black-box system of hot continuous rolling.
The shortcomings of the BP-PID controller
The BP neural network’s popularity stems from its capabilities in parallel computing and adaptive learning, operating without the need for detailed system information on nonlinear control objects, relying only on input-output data11. The control principle of the BP-PID controller is illustrated in Fig. 1.
Nonetheless, the BP algorithm’s data transition from input to hidden layers lacks information on input value relationships, reducing correlation and potentially prolonging adaptive control, thus impacting controller performance19.
Controller incorporating Linear attention
Attention has become a key technique in the field of deep learning in recent years, and the attention mechanism can automatically focus on important information while ignoring irrelevant information when processing data20. Introducing an attention mechanism into the controller allows it to extract deeper information without changing the information’s dimension, thereby enhancing the adaptive controller’s performance. Traditional attention mechanisms are slower in inference due to the softmax function. In contrast, linear attention mechanisms, which do not use softmax classification, offer faster inference speeds and are better suited for fast-response adaptive control scenarios21. The input layer of the BP neural network connects to the attention layer, where the corresponding weights are determined by calculating attention scores using the softmax function22.
These weights, derived from attention scores and the softmax function, are then used to calculate the weighted vectors that are input into the hidden layer of the BP neural network. The specific network structure is shown in Fig. 2, and the traditional attention mechanism can be mathematically represented as Eq. 1.
In Eq. 1, \(\:Q\bullet\:{K}^{T}\) yields a matrix of \(\:n\times\:n\), making the complexity of model is \(\:O\left({n}^{2}\right)\).
The structure of the linear attention layer is shown in Fig. 3.
The linear attention mechanism can be mathematically represented as Eq. 2. The complexity of the model using the linear attention is \(\:O\left(n\right)\), much less than \(\:O\left({n}^{2}\right)\).
KNN rectification of BP-PID controller parameters
The ATKB-PID controller uses an incremental PID algorithm. The input \(\:{net}_{i}^{\left(2\right)}\left(k\right)\) from attention layer and output \(\:{O}_{i}^{\left(2\right)}\left(k\right)\) of the intermediate layer nodes in the ATKB neural network are as shown in Eq. 3:
In Eq. 3, \(\:{\omega\:}_{ij}^{\left(2\right)}\) is the weighted coefficients between the input layer neurons and the hidden layer neurons, and the function \(\:f\left(x\right)=\frac{{e}^{x}-{e}^{-x}}{{e}^{x}+{e}^{-x}}\). Further, the inputs and outputs of the network’s output layer can be obtained as shown in Eq. 4.
In Eq. 4, \(\:{\omega\:}_{li}^{\left(3\right)}\) is the weighted coefficients between the hidden layer neurons and the output layer neurons.
The deviation between the system’s actual output and the expected output is denoted as \(\:e\left(k\right)={O}_{i}-{x}_{j}\). By employing an incremental PID control algorithm, the value of the PID control quantity \(\:u\left(k\right)\) can be calculated. The result of its discretization is shown in Eq. 5:
In the Eq. 5, \(\:{\text{T}}_{\text{I}}\) and \(\:{\text{T}}_{\text{D}}\) represent the integral and derivative time parameters, respectively. Further, the control increment of the PID can be obtained, as shown in Eq. 6:
The most common objective function for a BP-PID controller is shown in Eq. 7:
The model parameter update formula, derived using the gradient descent method, is shown in Eq. 8:
In Eq. 8, \({\text{h}}\left( {\text{x}} \right)\) is a non-negative Sigmoid function: \(h\left( x \right) = \frac{{e^{x} }}{{e^{x} + e^{{ - x}} }}\). \(h^{\prime } \left( x \right) = h\left( x \right)\left( {1 - h\left( x \right)} \right)\), \(\:{f}^{{\prime\:}}\left(x\right)=(1-{f}^{2}\left(x\right))/2\), \(\:{\upomega\:}\) is the matrix of the implied layer coefficients, \(\:{\upalpha\:}\) is the inertia coefficient, and \(\:{\upeta\:}\) is the learning rate.\(\:\:{\upalpha\:}\) and \(\:{\upeta\:}\) have a more significant effect on the BP-PID controller performance. In this paper, the E1 motor of the vertical mill of the 1500 hot rolling roughing mill is modeled by SIMULINK. The learning rate of the controller is set to 0.3, and the influence of the inertia coefficient on the control effect is verified through experiments.
In Fig. 4, a smaller inertia coefficient increases system sensitivity and speed, resulting in shorter regulation times and smaller overshoots. Conversely, a larger inertia coefficient provides greater damping, enhancing system stability. However, an excessively large inertia coefficient can slow down the BP-PID controller’s response, leading to poor control effects.
The target speed of the vertical roller E1 motor is set to 100 rpm, and the motor load is 500 NM, \(\:{\upalpha\:}\)= 0.3. Experiments are conducted to verify the influence of the learning rate on the control effect of the BP-PID controller. The experiment is shown in Fig. 5.
In Fig. 5, the control effect is best when the learning rate is \(\:{\upeta\:}=0.2\), the overshoot is small, the rise time is short, and the regulation time is short; when the learning coefficient is too large, the response of the BP-PID controller is too slow, and the control effect is poor. By adjusting the size of the learning coefficient, the fast response and stability of the system can be balanced.
In summary, the inertia coefficient and the learning coefficient strongly influence the control effect of the BP-PID controller. In the hot rolling roughing process, the speed and load of the E1 motor of the vertical roller mill fluctuate continuously with varying working conditions, thereby increasing the complexity of the controller’s requirements. The control effect of the controller under different loads is tested. The target speed of the motor is set to 100 rpm, and the load is selected from the actual roughing process load range. The experimental results are shown in Fig. 6.
In Fig. 6, increasing the load on the E1 motor results in a significant rise in system rise time, a slight decrease in overshoot, and a marked increase in regulation time. Therefore, different loads at various target speeds can significantly affect the control effect for the E1 motor of the roughing vertical roller mill in the hot rolling process. In this paper, we employ the KNN algorithm to predict the appropriate inertia and learning coefficients for the E1 motor’s target speed and load in the roughing mill. Additionally, we use it to design the objective function.
ATKB-PID controller
The control schematic of the modified ATKB-PID controller used in this paper is shown in Fig. 7.
The input parameters for the roughing vertical roller mill E1 include the target motor speed and motor load size, which serve as inputs for the KNN algorithm. The KNN model then predicts the appropriate inertia and learning coefficients for the controller as output through a classification process. The ATKB-PID controller adjusts the inertia and learning coefficients of the BP-PID controller using a machine learning approach. It also enhances and optimizes the objective function. This approach prevents poor control of the E1 motor of the vertical roller mill, which can result from improper parameter settings, making the ATKB-PID controller more adaptable to various working conditions.
KNN algorithm
KNN is a simple and efficient machine learning classification algorithm that does not need to be trained in advance, and the effect depends on the value of K. It is suitable for rapid prediction of micro-tension control in the roughing process of hot rolling under complex working conditions. When the K value is small, the model is complex and the training error is small, but the generalisation is not good; when the K value is large, the model is simple and the training error is large, but at the same time the generalisation is excellent14.
The actual production of E1 vertical roller motor has a variety of working conditions, requiring a variety of load size and motor speed combinations. The selection of the optimal K value for the KNN algorithm was determined through cross-validation, considering various working conditions of the E1 vertical roller motor. Our dataset, comprising 100 samples, covers a wide range of loads and motor speeds encountered in typical hot rolling processes. In practical engineering, the optimal controller parameters of 4–6 working conditions in the data set are the same. Considering that the time complexity of KNN is proportional to the size of K value, as shown in Eq. 9, this paper chooses K value 4.
Objective function
The effect of the objective function on the model is very significant, and a large number of studies have improved the model performance by designing the objective function23. In this paper, a new objective function is designed, as shown in Eq. 10, to replace the objective function in Eq. 7.
In Eq. 10, \(\:e\left(k\right)\) is the absolute error, \(\:{\text{N}}_{{\upsigma\:}}\)is the number of overshoots, and k is the weighting coefficient. In the micro tension control problem in the roughing region of hot rolling, accuracy, rapidity and overshoot are important indicators of model performance24. Therefore, the objective function designed in this paper uses the integral of the absolute error time to improve the model rapidity and accuracy, the integral of the absolute error squared to improve the model accuracy, and the number of overshoots as a penalty term to increase the damping and reduce the number of oscillations25. We compared the common objective function with the newly designed objective function when the load was 500NM and the target speed was 100 rpm, as shown in Table 1.
Results
In the experiment, the main configuration of the ATKB-PID controller is as follows: the K value is set to 4, and the initial weight matrices for the hidden layer and the output layer are random numbers ranging from 0 to 0.5. The parameter values of the objective function for \(\:{n}_{1}\), \(\:{n}_{2}\), \(\:{n}_{3}\) are 0.4, 0.4, 0.2. Time step is 0.1s. The vertical roll E1 mill motor is tested at a target speed of 100 rpm and a load of 500 NM, and the results of the experiment are shown in Fig. 8.
In Fig. 8, the ATKB-PID controller outperforms both the ordinary BP-PID and RBF-PID controllers, exhibiting a smaller overshoot. While its rise time is slightly longer than the RBF-PID controller’s, its regulation time is significantly shorter.
Recently, R. Muduli et al.26 proposed a reinforcement learning-based RL-PID controller, suitable for similar working conditions. Experiments were conducted to observe the performance after the mill rolls bite the steel, a point at which the load changes dramatically, and the results are depicted in Fig. 9.
Figure 9shows that during steel rolling, once the mill rolls bite the steel, the ATKB-PID controller demonstrates superior speed regulation compared to both the RL-PID and PSO-PID controllers. Additionally, we incorporated the modified GWO-RL27 (Reinforcement Learning and Grey Wolf Optimizer) control method, which is an improved version of GWO-NN, as a comparative method for this task. However, under the initial working conditions, their performance is similar. The relevant experimental parameters of the five controllers are shown in Table 2.
The ATKB-PID controller has a better control effect on different speeds and loads of the E1 motor of the roughing standing roller mill.
In Fig. 10, the load of the motor is set to 500 NM, and the target speed of the motor is controlled at 40 rpm, 80 rpm, 120 rpm, 160 rpm, and 200 rpm. The ATKB-PID controller can regulate and stabilize the target speeds of the E1 motor of the stick mill. For different speeds, the ATKB-PID controller is stabilized within 0.3 s, and the PID parameters are successfully calibrated to suit the actual working conditions so that the algorithm can meet the needs of actual production. The learning rate and inertia coefficient of the BP algorithm of ATKB-PID with operating conditions are shown in Fig. 11.
Figure 10 shows the detailed data of each experiment, and the control effect is shown in Table 3.
According to Fig. 10; Table 2, there is not much difference in the control effect of the ATKB-PID controller when the target speed of the motor changes. The difference between the regulation time and the rise time is small, and the steady-state error is 0, which simultaneously satisfies the accuracy and rapidity requirements, and the control effect is significantly improved.
The range of conditions that may occur in more detailed group experiments for a 1450 production line during the hot rolling roughing process. The possible motor loads and motor target speeds are grouped. The possible ranges of motor load are divided into 200 NM, 400NM, 600NM, 800NM, and 1000NM, the possible ranges of motor target speed are divided into 40 rpm, 80 rpm, 120 rpm, 160 rpm, and 200 rpm, and the most important adjustment time is tested. The test results are shown in the Table 4.
When the load of the E1 motor of the vertical roller mill is 1000 NM, the target speed is 120 rpm, the load is 1000 NM, the target speed is 200 rpm, the load is 800 NM, the target speed is 200 rpm, and the control effect of the ATKB-PID controller is poor. The regulation times are 0.368 s, 0.382 s and 0.326 s, respectively, and the regulation time under other conditions is less than 0.3 s. The growth of the regulation time due to the different sizes of motor loads is not obvious, so the ATKB-PID controller has a good control effect. The control effect of the above three samples is generally due to the faster load and motor target speed, which are located in the hot rolling roughing vertical roller mill E1 motor normal load and target speed range of the edge of the value, which rarely occur. This also led to the fact that when we designed the data set used by KNN, the data contained less extreme conditions in the production process, which led to the deviation of KNN’s prediction of controller parameters in the experiment, resulting in poor control effect. This is a possible future improvement direction to address extreme case parameter prediction while maintaining model complexity and considering industrial applications.
To sum up, the ATKB-PID controller proposed in this paper achieves the expected results and is able to satisfy both the rapidity and accuracy requirements to solve the micro tension control problem of hot strip roughing mills.
Discussion
-
1.
This paper proposes a micro tension control method for the roughing process that features a small steady-state error, short rise time, and short regulation time. The method’s accuracy, speed, and feasibility are verified through experiments.
-
2.
The ATKB-PID controller proposed in this paper integrates a linear attention mechanism and adjusts the inertia and learning coefficients of the BP-PID controller using the KNN algorithm. It also designs an objective function to meet the complex requirements of micro tension control in the roughing process of hot strip milling.
-
3.
In this paper, for hot rolling roughing mill E1 motors, the experimental range exceeds the actual working range, and the ATKB-PID controller still maintains a good control effect.
-
4.
In this study, one of the main challenge we faced was how to maintain control effectiveness while reducing computational costs and simplifying the implementation process. We achieved this by employing a structurally simpler Linear-attention and designing it to allow for a smaller range of values for K, thereby reducing computational costs and the difficulty of its practical application.
-
5.
Since actual production seldom experiences motor target speeds and loads near the maximum values of the set range, the ATKB-PID controller’s performance at these limits is not optimal. However, there is potential for improvement in these scenarios.
Data availability
The data that support the findings of this study are available on request from the corresponding author, upon reasonable request.
References
Zhong, Y. et al. Strip width spread prediction in rough rolling process based on mechanism modeling and optimization. J. Iron. Steel Res. Int. 30(12), 2416–2424 (2023).
De Beer, J. & de Beer, J. Future technologies for energy-efficient iron and steel making. Potential. Ind. Energy-Effic. Improv. Long. Term 93–166 (2000).
Özgür, A., Uygun, Y. & Hütt, M. T. A review of planning and scheduling methods for hot rolling mills in steel production. Comput. Ind. Eng. 151, 106606 (2021).
Choi, I. S., Rossiter, J. A. & Fleming, P. J. Looper and tension control in hot rolling mills: a survey. J. Process. Control. 17(6), 509–521 (2007).
Tong, C. et al. Application of fuzzy controller to hot strip mill micro tension control. Iron Steel (China) 33(5), 36–38 (1998).
Khramshin, V. R. et al. Improvement of electric and mechanical system for automated strip tension control at continuous wide-strip hot-rolling mill. Procedia Eng. 150, 11–17 (2016).
Wang, S. & He, F. Control technology and strategy of tension control system. In Proceedings of 2018 Chinese Control and Decision Conference (CCDC) 2620–2625 (IEEE, 2018).
Kurpe, O. et al. Electric motors power modes at synchronization of roughing rolling stands of hot strip mill. In Proceedings of 2020 IEEE Problems of Automated Electrodrive. Theory and Practice (PAEP) 1–4 (IEEE, 2020).
Ren, H. et al. Variable pitch active disturbance rejection control of wind turbines based on BP neural network PID. IEEE Access 8, 71782–71797 (2020).
Huang, G. et al. A BP-PID controller-based multi-model control system for lateral stability of distributed drive electric vehicle. J. Franklin Inst. 356(13), 7290–7311 (2019).
He, K. & Jiang, L. Research on DC Electronic load system based on BP Neural Network PID Control. Acad. J. Eng. Technol. Sci. 5, 56–61 (2022).
Lei, L., Wang, H. & Cheng, Y. An improved BP algorithm based on global revision factor and its application to PID control. In Proceedings of Advances in Neural Networks-ISNN 2006: Third International Symposium on Neural Networks, Chengdu, China, May 28-June 1, 2006, Part II vol. 3 1019–1026 (Springer, 2006),
Xi, W. D. et al. BPAM: Recommendation based on BP neural network with attention mechanism. In Proceedings of IJCAI 3905–3911 (2019).
Zhang, S. Challenges in KNN classification. IEEE Trans. Knowl. Data Eng. 34(10), 4663–4675 (2021).
Adithiyaa, T., Chandramohan, D. & Sathish, T. Optimal prediction of process parameters by GWO-KNN in stirring-squeeze casting of AA2219 reinforced metal matrix composites. Mater. Today Proc. 21, 1000–1007 (2020).
Wang, C. D. et al. A BP neural network based recommender framework with attention mechanism. IEEE Trans. Knowl. Data Eng. 34(7), 3029–3043 (2020).
Li, Y. et al. Lateral stability control for intelligent commercial vehicle based on reconstructed objective function method. Actuators 13(4) (2024).
Park, C. J. & Hwang, I. C. Width control systems with roll force automatic width control and finishing vertical mill automatic width control in hot strip mill. J. Mater. Process. Technol. 206(1–3), 143–151 (2008).
Wang, M. & An, A. An adaptive PID control based on BP neural network for the voltage of MFC. In Proceedings of 2020 Chinese Automation Congress (CAC) 7040–7045 (IEEE, 2020).
Serrano, S. & Smith, N. A. Is attention interpretable? Preprint at http://arxiv.org/abs/1906.03731 (2019).
Han, D., Pan, X., Han, Y., Song, S. & Huang, G. Flatten transformer: Vision transformer using focused linear attention. In Proceedings of the IEEE/CVF International Conference on Computer Vision 5961–5971 (2023).
Katharopoulos, A., Vyas, A., Pappas, N. & Fleuret, F. Transformers are rnns: Fast autoregressive transformers with linear attention. In Proceedings of International Conference on Machine Learning 5156–5165 (2020).
Villarrubia, G. et al. Artificial neural networks used in optimization problems. Neurocomputing 272, 10–16 (2018).
Fashoto, G. S. & Sani, S. Design and implementation of MOLP problems with fuzzy objective functions using approximation and equivalence approach. IAENG Int. J. Comput. Sci. 48(2) (2021).
Cao, J. & Lin, J. A study on formulation of objective functions for determining material models. Int. J. Mech. Sci. 50(2), 193–204 (2008).
Muduli, R., Jena, D. & Moger, T. Application of reinforcement learning-based adaptive PID controller for automatic generation control of multi-area power system. IEEE Trans. Autom. Sci. Eng. (2024).
Zamfirache, I. et al. Neural network-based control using actor-critic reinforcement learning and grey wolf optimizer with experimental servo system validation. Expert Syst. Appl. 225 (2023).
Acknowledgements
This work was supported by National Key Research and Development Program of China (Grant No. 2022YFB3304804) and China Iron and Steel Research Institute Group (Grant No. KNRCPY-JT00-23013).
Author information
Authors and Affiliations
Contributions
Xinkai Xu and Zheng Qi wrote the main manuscript text and constructed a model for experimentation . Xiangrong Song prepared Fig. 1 and Introduction. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Xu, X., Song, X. & Qi, Z. ATKB-PID: an adaptive control method for micro tension under complex hot rolling conditions. Sci Rep 15, 2050 (2025). https://doi.org/10.1038/s41598-025-85960-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-85960-w