Heat Exchanger Data Driven Model Using GMDH Methodology Iraci Martinez Pereira Gonçalves and Daniel Kao Sun Ting Ipen – Instituto de Pesquisas Energéticas e Nucleares São Paulo SP CEP 05508-970 Travessa R, 400, Brasil martinez@ipen.br Abstract - This paper presents a heat exchanger model developed in the Matlab platform. The model was used to investigate the use of the GMDH (Group Method for Data Handling) methodology to predict system variables, generating analytical redundancy. This work is a part of a Monitoring and Diagnosis System for Fault Detection using the GMDH datadriven model to be applied to the IEA-R1 Ipen nuclear research reactor. The GMDH is an algebraic model for system characterization. It provides a general framework for characterizing the relationship among a set of state variables of a process and is used for generating estimates of critical variables in an optimal data-driven model form. In this work we studied the following aspects: normalization, number of input data points and noise level. The GMDH methodology implemented is working properly since the magnitude of the regularity criterion is controlled by the magnitude of the machine rounding error, indicating that the method is not introducing additional errors and, most of all, the error propagation is stable. The main results are: data normalization is fundamental to obtain better precision, the larger the number of data points, the lesser the error, the error decreases with the decreasing of noise level. I. INTRODUCTION Modern science and engineering are based on using firstprinciple models to describe physical, biological, and social systems. Such an approach starts with a basic scientific model and then builds upon them various applications in mechanical engineering or electrical engineering. Under this approach experimental data are used to verify the underlying first-principle models and to estimate some of the model parameters that are difficult to measure directly. However, in many applications the underlying first principles are unknown or the systems under study are too complex to be mathematically described. With the growing use of computers and lowcost sensors for data collection, there is a great amount of data being generated by such systems. In the absence of first-principle models, such readily available data can be used to derive models by estimating useful relationships between a system's variables. Thus there is currently a paradigm shift from the classical modeling based on first principles to developing models from data. This paper presents a heat exchanger model developed in the Matlab platform. The model was used to investigate the use of the GMDH (Group Method of Data Handling) methodology to predict system variables, generating analytical redundancy. This work is a part of a Monitoring and Diagnosis System for Fault Detection using the GMDH data-driven model to be applied to the IEA-R1 Ipen nuclear research reactor [Gonçalves et al., 2001] and [Upadhyaya et al., 2000]. II. GMDH – GROUP METHOD OF DATA HANDLING The Group Method of Data Handling (GMDH) is an algebraic method for predicting system states, controller and actuator functions [Farlow, 1984] and [Ferreira, 1999]. The GMDH constructs a model, of a desired output as a function of a set of related inputs from a subsystem, by a successive polynomial approximation. The general relationship has the form shown in (1) where {x1, x2,…,xm} is a vector of input variables and y is the variable to be predicted. This formulation can be extended to the prediction of multiple outputs {y1, y2, … , yn}. ?? ? = = = + + + = m 1 i m 1 j j i ij m 1 i i i x x c x b a y ??? = = = + m 1 i m 1 j m 1 k k j i ijk x x x d L (1) Fig. 1 shows a typical node of a GMDH modeling layer with the basic quadratic predictor. The model parameters such as {A,B,C,D,E,F}, are estimated from a leastsquares fit using N observations of the input and output variables. Fig. 2 illustrates that the predicted values of Y are propagated successively to higher layers of the algorithm, with the approximation of Ypred improving at successive stages. At each stage of the approximation, Ypred is formed from pairs of input signals (to that layer), and new values of the predicted variable are propagated pairwise to the next layer. The iteration is continued until the mean-squared error between the predicted and the measured values of the output variable attains a desired value. Parsimony in model fitting is achieved by comparing the fractional prediction errors from one generation to the next, and by terminating the algorithm when the error is a minimum or when the errors from successive approximation stages is less than a preset limit. .....