https://doi.org/10.31449/inf.v48i11.5941 Informatica 48 (2024) 15 –32 15 Profit Estimation Model and Financial Risk Prediction Combining Multi-Scale Convolutional Feature Extractor and BGRU Model Yangyan Liu 1* , Bolin Pan 2 1 School of Economics and Management, Guangxi Science & Technology Normal University, Laibing 546199, China 2 School of Economic Management, Ningxia Institute of Science and Technology, Shizuishan 753000, China E-mail: liuyangyan001@163.com * Corresponding author Keywords: convolutional neural network, feature extraction, bidirectional gated recurrent unit, profit estimate, financial risk Received: March 21, 2024 In response to the inaccuracy of financial risk prediction and profit prediction for enterprises, a financial risk prediction model based on the graph networks was designed. This experiment combined multi-scale feature extraction and sequence analysis methods. In addition, the model adopted a structurally concise and effective bidirectional gated recurrent unit to capture temporal relationships in time series data. The profit prediction model was combined multi-scale advantages and attention mechanisms. The latter enhanced the recognition and utilization of influential features, which improved predictive ability and practical value. These results confirmed that the accuracy of this model significantly improved to 98.03% after iterative training. The F1 score of the financial risk prediction model reached 0.98, demonstrating an excellent performance. The profit prediction model performed better than other models in both regression and classification problem indicators, with an error close to 0 and a mean square error of 0.0232. This indicated that the model had extremely high prediction accuracy. Therefore, both models have strong predictive ability and have practical application significance. Povzetek: Razvit je finančni model, ki združuje multi-lestvični konvolucijski ekstraktor lastnosti in BGRU model za natančno napovedovanje dobička in finančnih tveganj podjetij. Model z uporabo pozornostnih mehanizmov in sekvenčne analize dosega visoko zmogljivost in praktično vrednost v finančni analizi ter obvladovanju tveganj. 1 Introduction In the current globalized economic environment, the estimation of corporate profitability and the prediction of financial risk becomes the important links in financial analysis and investment decision-making. On the one hand, it is crucial to accurately estimate a company's profit prospects for investors. Because it directly relates to the investment and return of capital. On the other hand, financial risk prediction is particularly important for corporate management. Because it can help enterprises identify potential financial problems in a timely manner, formulate corresponding countermeasures, and ensure the stable development of the enterprise [1-3]. As information technology develops, data analysis technology becomes a key force driving innovation, especially the application of artificial intelligence in the financial field. In recent years, deep learning technologies have emerged and developed. These technologies have made significant achievements in multiple fields such as image recognition, speech processing, and natural language processing. In this context, the application of deep learning techniques to profit estimate and financial risk prediction begins to receive widespread attention from academia and industry. Specifically, Convolutional Neural Network (CNN) shows great potential in processing time-series data due to its excellent feature extraction capabilities [4-6]. Gated Recurrent Unit (GRU) is an effective variant of recurrent neural networks, which has value in processing sequence information [7]. The financial data of enterprises are high-dimensional and contain rich time-series characteristics. However, traditional machine learning techniques are inadequate in dealing with such problems. In addition, there are many factors that need to be considered due to the complexity of financial data. How to effectively integrate these factors has become a significant challenge [8-10]. In this context, this study aims at combing multi-scale convolutional feature extractions with bio-heuristic for a bidirectional gated recurrent single model. A novel profit estimate and financial risk prediction framework is constructed, aiming to achieve more accurate and robust performance in predicting future profits and risks. The study first proposes the research purpose. Secondly, the financial risk prediction and profit estimate models are 16 Informatica 48 (2024) 15 –32 Y . Liu et al. designed. Then the model performance is validated. Finally, the research conclusion is obtained. In recent years, academic discussions on financial prediction automation continue to increase. Peng et al. proposed a novel predictive tool that utilized a model to describe the volatility of financial returns as numerous potential statistical distributions. Meanwhile, the predictive accuracy was optimized by analyzing the best case of these distributions. These results confirmed that the model performed better than most traditional prediction methods [11]. Cao et al. analyzed and predicted the financial crisis of listed companies from the perspective of constructing financial risk early warning systems for e-commerce enterprises, using methods based on deep learning technology. Meanwhile, they designed a financial stability warning system to achieve the early detection and handling of crises [12]. In profit forecasting, Sirisha et al. fused three neural network models to form a comprehensive model. This model was used to process past financial data to predict the future economic situation of the enterprise. These results confirmed that the model exhibited an accuracy prediction rate of over 90% [13]. Kim et al. developed a multivariate model under self-attention mechanism, aiming to plot the Bitcoin prices. These results confirmed that the prediction results of this model had high accuracy [14]. Shajalal et al. proposed an innovative deep learning network architecture aimed at predicting the stock risk of market products. They used a series of data balancing techniques to train data balance. Then the balanced data were used as the basis for training deep neural networks. These results confirmed the effectiveness and accuracy of the model in predicting product shortages [15]. He et al. developed two flood forecasting models in the application of Bidirectional Gated Recurrent Unit (BGRU). These two models were based on BGRU and Backpropagation Neural Network (BPNN), as well as bidirectional Long Short-Term Memory (LSTM) and BNN, respectively. These two models surpassed both unidirectional and bidirectional models in terms of prediction accuracy and reliability [16]. Dangut et al. developed a rare fault recognition model that relied on deep hybrid learning methods. They conducted empirical analysis using large-scale records from the central maintenance system of large aircraft. These experiments confirmed that this model improved accuracy by 18% and recall by 5%, effectively identifying rare faults [17]. Li et al. created a deep learning model that integrated end-to-end CNN and BGRU for extracting and classifying spatiotemporal information. These experiments confirmed that the model could effectively distinguish healthy rats from diseased rats, with a classification accuracy of up to 98.73%. This study had important reference significance for the clinical diagnosis of Parkinson's disease [18]. In summary, in recent years, researchers have increasingly focused on building efficient, accurate, and stable early warning systems in automated financial forecasting. The improved neural networks and deep learning-based methods are important. These methods play important roles in enterprise financial risk prediction and the development of financial stability warning systems. There are various prediction tools, such as self-attention mechanisms based on multivariate long short-term models. From the perspective of prediction, the proposal of these tools undoubtedly opens up new avenues for financial prediction. This study combines the method of multi-scale convolutional feature extraction, which has advantages over traditional models in information acquisition. The addition of BGRU enables the prediction model to better capture time series information, effectively improving the accuracy of profit estimate and financial risk predictions. The literature summary is shown in Table 1. Table 1: Literature summary Author Literature dimension Content Conclusion Li et al. Risk prediction Design and improve a neural network model, using data from listed companies from 2017 to 2020 as the basic dataset for empirical analysis The prediction accuracy of the improved model exceeds 80% Peng et al. Design a risk prediction model based on financial volatility as a statistical distribution The model performs better than most traditional prediction tools Cao et al. Construct an early warning model for e-commerce enterprise financial risk based on deep learning The model can achieve early risk detection and risk handling Sirisha et al. Profit forecast Design a comprehensive model that integrates three types of neural networks to predict the future profit situation of enterprises The model accuracy prediction rate exceeds 90% Kim et al. Design a multivariable model based on self-attention mechanism to The prediction results of the model have high accuracy and superiority Profit Estimation Model and Financial Risk Prediction Combining … Informatica 48 (2024) 15 –32 17 predict the profitability of Bitcoin Shajalal et al. Propose a deep learning network architecture to predict the risk of product shortage and product profit in the market The effectiveness and accuracy of the model are superior He et al. Application of BGRU model Propose a flood prediction model that integrates BGRU and BNN The model has advantages over traditional prediction methods in terms of prediction accuracy and reliability Dangut et al. Design a rare fault recognition model for large aircraft based on deep hybrid learning method This model improves prediction accuracy by 18% and recall by 5% Li et al. Design a deep learning model that integrates end-to-end CNN and BGRU for clinical diagnosis of Parkinson's disease The model can effectively distinguish healthy and diseased rats, with a classification accuracy of up to 98.73% Current research has predicted enterprise risks and profits from multiple perspectives. However, all methods are single risk or profit predictions, which have not formed an integrated system for predicting risks and profits simultaneously. This study presents a concise and effective bidirectional GRU and attention mechanism. This method not only improves the predictive ability of the model, but also integrates risk prediction and profit prediction. Then enterprises can make comprehensive decisions and avoid information conflicts caused by different prediction systems. 2 Design of financial risk and profit prediction models A novel financial risk prediction model, namely Multi-Scale Bidirectional Gated Recurrent Unit (M-BU), is designed based on BGRU. In addition, a profit estimate model is designed, which is the Multi-Scale Adaptive Bidirectional Gated Recurrent Unit (M-AU). The financial risk prediction model is based on the graph networks and adopts a strategy combining multi-scale feature extraction technology and sequence analysis methods. Convolutional layers with different sizes of convolutional kernels are used to extract rich features. Meanwhile, advanced techniques such as Batchnorm and Mish functions are used to prevent overfitting while maintaining the model's generalization ability. BGRU can effectively capture temporal relationships in time series data. The profit estimate model combines multi-scale advantages and attention mechanisms. The latter enhances the recognition and utilization of influential features by assigning different weights to different features. 2.1 Design of financial risk prediction model The first step is the analysis and reorganization of the sample data in all financial risk prediction model designs. The essential step before data analysis is to determine which variables should be included in the research scope [19-21]. The study sets a criterion to exclude variables with a missing degree exceeding one-third in the dataset. Because these variables can provide insufficient information to facilitate the analysis. The missing data will be filled in. The dynamic data imputation method is adopted instead of simply replacing the mean or minimum values [22-24]. The study observes that the values of many financial variables mostly fluctuate around 30% of their mean after in-depth analysis of the data distribution of various financial variables. Therefore, the strategy for handling missing data is as follows: a random number based on the fluctuation range is used to replace the average value of the variables. The calculation method is represented by formula (1). Missing value Mean value+Randomness Mean value =  (1) A distortion rate is defined higher than 20% when encountering samples with severe data missing in generating a dataset. This means that the information of these samples is not sufficient to support a comprehensive analysis. Samples containing numerous missing values will be excluded under these conditions. There may still be situations where data is scarce despite going through layers of screening. For example, some corporate samples may only cover some financial variables. The study also decides to delete the incomplete data samples. Dealing with outliers is equally essential. Then this study applies the rule based on triple standard deviation to identify and remove outliers. The study uses the upper and lower quartiles to define and handle outliers based on the distribution of data from each year given the differences in data from different years. The key to this step is to eliminate data points that may distort the analysis results. The final stage of data processing is quantitative standardization. Different variables may have different scales and distribution intervals. Therefore, the study uses the min-max method to transform all financial variables into a unified standard interval of 0 to 1. Min-max standardization refers to adjusting each variable to a uniform size. This can eliminate the impact of 18 Informatica 48 (2024) 15 –32 Y . Liu et al. different measurement units and maintain comparability between different variables. Standardization involves subtracting the minimum value of each variable from its value across all data points. Meanwhile, the variable is divided by the amplitude of the variable range to calculate the standardized data points [25-27]. The calculation method is represented by formula (2). ( ) ( ) ( ) min max min xx x xx −  = − (2) In formula (2), ( ) min x and ( ) max x represent the minimum and maximum values of the financial index, respectively. A completely new pattern is designed. This pattern enhances the ability to mine important information in financial data by integrating two main methods: multi-level feature recognition and sequence analysis. Firstly, a feature extraction module with multi-scale perception capability is designed to capture risk conditions associated with numerous financial indicators. The innovative inspiration for this strategy comes from the concept of graph networks. This module can integrate convolution kernels of various sizes and successfully extract rich feature levels from financial data. Figure 1 shows the structure of convolution and feature recognition. f11 f12 f21 x11 x21 f22 x12 x22 x31 x32 x13 x23 x33 o11 o12 o21 o22 S*S n*n CBM Concat Figure 1: Convolutional and feature recognition structures Small-scale convolution kernels distinguish subtle local patterns, while large-scale convolution kernels help to interpret external environmental correlations. This model can capture and apply the complex interactions between various financial indicators. The Batchnorm method is utilized to prevent overfitting of the model. Firstly, the mean is calculated using formula (3). 1 1 m Bi i x m  = =  (3) Afterwards, the data variance is calculated using formula (4). ( ) 2 2 1 1 m B i B i x m  = =−  (4) Normalization is represented by formula (5). 2 ˆ iB i B x x   − = + (5) The final calculation result can be obtained, represented by formula (6). ˆ ii yx  =+ (6) In formula (6),  represents the scaling variable.  is a translation variable. The Mish function is utilized as the activation function, represented by formula (7). ( ) ( ) tanh ln 1 x Mish x e =  + (7) In multi-scale feature extraction, convolution operation is first required to obtain the feature representation vector, represented by formula (8). ( ) ( ) s m s conv x s conv x       (8) In formula (8), x represents the eigenvector. Then the feature representation vector can be output using formula (9). ( ) , x concat s m  = (9) The model also includes a pooling layer, aiming at screening and enhancing favorable features. This model has the ability to select the main information in the feature vectors and exclude non-deterministic interference information. The ability is achieved by applying filters of various sizes in the max pooling operation of multiple feature images [28-30]. This approach not only reduces the dimensionality of the data, Profit Estimation Model and Financial Risk Prediction Combining … Informatica 48 (2024) 15 –32 19 but also reduces the overall complexity of the model to accelerate the training objectives. The key to model design is to adopt sequence learners after completing feature extraction. Figure 2 shows the pooling structure. 13 17 5 9 25 6 0 2 31 0 17 8 86 13 39 72 9 25 88 72 72 25 9 88 Figure 2: Pooled structure The study utilizes the BGRU structure, which can capture temporal continuity in financial data and simultaneously learn past and future information. As a result, more accurate risk prediction can be achieved. The design of BGRU is concise and effective, reducing the parameter complexity of LSTM, but still retaining the ability to learn long-range dependencies. It is possible to determine when to retain or discard information by judging the gating mechanism. As a result, the efficiency of neural networks can be improved in processing time series data. Figure 3 shows the structure of the financial risk prediction model. S*S n*n CBM Concat Pooling S*S n*n CBM Concat Pooling GRU GRU GRU GRU GRU GRU GRU GRU GRU o 1 o 2 o 3 l 1 l 2 l n y 1 y 2 Figure 3: Structure of financial risk prediction model The model reset gate is represented by formula (10). ( ) 1 t t xr t hr r Z X W H W b  − = + + (10) In formula (10),  represents the activation function. 1 t H − is the implicit state of the previous node. t X refers to the current input. W means weight. The update gate is represented by formula (11). ( ) 1 t t xs t hs s Z X W H W b  − = + + (11) The memory gate is represented by formula (12). ( ) ( ) 1 tanh ~ t t xh t t hh h H X W R H W b − = + + (12) Finally, M-BU combines multi-level spatial features with the temporal features learned by BGRU. As a result, a comprehensive prediction of the company's financial risk can be achieved. The dual advantages of feature mining and sequence analysis provide strong data support for model decision-making. Firstly, a multi-level convolutional network performs feature extraction by using convolutional kernels of different sizes. Therefore, clear hierarchical financial data features can be extracted. Next, the pooling operation further optimizes the feature 20 Informatica 48 (2024) 15 –32 Y . Liu et al. vectors, reduces dimensionality, and removes noise. Finally, these features are integrated into a financial risk foresight model that takes into account the time dimension through the BU network. Table 2 shows the parameters of the network structure. Table 2: Structural parameters of financial risk prediction model Number of layers Layer name Nuclear size Output size Parameter quantity 0 Input layer - (38, 1) 0 1 One-dimensional convolutional layer 1 5×5 (38, 4.2) 25.2 1 One-dimensional convolutional layer 2 3×3 (38, 4.1) 16.8 2 One-dimensional convolutional layer 3 5×5 (38, 8.4) 168.4 2 One-dimensional convolutional layer 4 3×3 (38, 8.2) 104.8 3 Merge layers - (38, 16.6) 0 4 Bidirectional GRU layer 1 64 (16, 128.4) 39600 5 Bidirectional GRU layer 2 64 (16, 128.2) 74160 6 Dropout - (16, 128) 0 7 Leveling layer - (1, 2048.8) 0 8 Fully connected layer - (1, 2) 4100 2.2 Design of financial profit prediction model The initial step involves collecting economic report data from numerous listed entities in the stock market in the design of the financial profit estimate model. Incomplete or unrelated information is discarded in the screening process to maintain the accuracy of the developed model. Economic data contain many core indicators, such as the proportion of corporate capital benefits and fixed asset profitability. They undoubtedly have analytical significance, but not all indicators have equal influence on predicting economic benefits. The M-AU model has been designed as a solution in response to this situation. This model is committed to improving and enhancing the efficiency of classical methods. Traditional BGRU often assumes that different features have the same influence when processing time series data. In view of this, the study decides to prioritize the importance of features to enhance the model's discriminative ability. M-AU has allocated weights to different features by inheriting the attention mechanism and improving the model. Meanwhile, more computing power is applied to features that have a significant impact on expected results, thereby enhancing the model's ability to identify and evaluate impacts. Figure 4 shows the structure of the profit estimate model. S*S n*n CBM Concat Pooling S*S n*n CBM Concat Pooling GRU GRU GRU GRU GRU GRU o 1 o 2 o 3 l 1 l 2 l n y 1 y 2 Attention mechanism layer Fully connected layer Multi scale feature extraction Sequence learning Profit Estimation Model and Financial Risk Prediction Combining … Informatica 48 (2024) 15 –32 21 Figure 4: Structure of profit forecast model This mechanism does not exist alone, but can be combined with other network structures. The identification rate of key information can be improved by applying it in various stages of model processing. More precisely, the typical design of M-AU includes using filters of various scales to perform deep feature mining of financial data and identify data characteristics of different scales. Subsequently, this model is able to learn the connections between time series data by utilizing the bidirectional cyclicity of the BGRU structure. The attention mechanism introduced on this basis assigns specific weights to each hidden layer state, represented by formula (13). ( ) tt a score h = (13) In formula (13), t h represents the output of the multiscale extractor. These weights are normalized through the softmax function, representing the weighted impact of each feature on the final estimation result. This model integrates weighted features in subsequent stages and obtains the final prediction result through the fully connected layer and softmax function. The normalized weights are represented by formula (14). ( ) 1 exp / n t t t t p a a = =  (14) Finally, resource allocation is based on the attention mechanism, represented by formula (15). 1 n t t t i y p h = =  (15) Overall, M-AU is committed to improving the accuracy of financial data analysis through exquisite structural selection and algorithm integration. This provides a more reliable scientific basis for financial decision-making. Figure 5 shows the model training process. Allocate attention resources Entering GRU for learning Hidden layer calculation completed? Calculate hidden layer values Calculate loss function Calculate loss function Output layer bias value Calculate loss function Partial derivative of loss function Update weights End Y N Start Different scale convolution kernel convolution operations Input training data Data reconciliation Obtain multi-scale feature vectors Calculate vector weights Vector weight normalization Calculate hidden layer values Hidden layer deviation value Figure 5: Model training process The M-AU architecture innovatively integrates hierarchical financial risk tools and focusing techniques in the latest financial data analysis and expectation framework. This model has new insights in financial information assessment and trend prediction. This mode focuses on decoding continuous data and can identify the main characteristics closely related to prediction performance, thereby improving prediction accuracy and reliability. M-AU processes financial information with meticulous techniques under this approach and can tailor precise enterprise value evaluations for fund investors. M-AU adopts a dual scale filter and analyzes the input financial information layer by layer in the structural design. Therefore, the richness and complexity of the data 22 Informatica 48 (2024) 15 –32 Y . Liu et al. can be ensured while mining the information. This stage is crucial in capturing patterns that may be overlooked, such as using filters of different scales to simultaneously reveal short-term and long-term financial trends, adding a three-dimensional perspective to the decision-making process. BGRU is used to analyze historical and potential trends after the financial risk analysis. BGRU provides a comprehensive analytical perspective for understanding and predicting complex financial time series and successfully captures the temporal dependencies of data. The introduction of a focus layer can improve the pattern prediction function. Hierarchy, as a filter in the network, emphasizes the recognition of important features by weighting the feature vectors of convolutional layers. This indicates that in decision-making, patterns can automatically identify the most relevant data without relying on artificially set feature importance. Therefore, the mode's ability to adjust and formulate strategies can be significantly enhanced in dealing with complex backgrounds. The network architecture, as an ultimate output, presents the superimposed characteristics as predicted values through the combination of compact layers and softmax incentive function. Then the values are further transformed into net profit prediction for the next cycle. The output results provides potential economic performance predictions for fund operators and decision-makers. Meanwhile, potential risks can be quantitatively measured and returned on funds. Table 3 shows the structural parameters. Table 3: Structural parameters of profit prediction model Number of layers Layer name Nuclear size Output size Parameter quantity 0 Input layer - (38, 1) 0 1 One-dimensional convolutional layer 1 5×5 (38, 4) 24.5 1 One-dimensional convolutional layer 2 3×3 (38, 4) 16.8 2 One-dimensional convolutional layer 3 5×5 (38, 8) 169.2 2 One-dimensional convolutional layer 4 3×3 (38, 8) 105.6 3 Merge layers - (38, 16) 0 4 Bidirectional GRU layer 1 64 (16, 128) 39608 5 Bidirectional GRU layer 2 64 (16, 128) 74256 6 Fully connected layer 1 - (128, 16) 275.5 7 Attention mechanism layer - (16, 128) 0 8 Flatten layer - (1, 2048) 0 9 Fully connected layer 2 - (1, 1) 2051.3 The exquisite design of M-AU provides efficient analytical tools for financial analysis professionals at the practical application level. Therefore, a thorough understanding of the economic situation and revenue potential of enterprises can be enhanced. Overall, the economic benefit estimation framework based on M-AU continuously analyzes and practices the latest data analysis technologies to enhance the certainty of economic analysis. A framework is formed by combining multi-level convolutional networks, bidirectional control loops, and focusing mechanisms. This framework can efficiently process time series data, identify complex economic patterns, and accurately predict the company's future economic performance. The M-AU prediction mechanism is expected to transform into a disruptive tool in the financial industry with the increase of data and the continuous refinement of patterns. This provides theoretical support for business strategy formulation and opens up new perspectives for financial investors in risk and return assessment. First, the data are divided into training and test sets according to 70% and 30% arm strength to ensure that the model has sufficient generalization ability for unseen data. In addition, the study introduces L2 regularization technique to reduce the risk of overfitting. Regularization works by adding a penalty term to the loss function, which can limit the weight of the model and avoid excessive model complexity. The early stop method is used to control overfitting. This means that if the loss of the validation set does not improve within 10 consecutive training cycles, the training process will be stopped. Indicators closely related to the recent financial health and liquidity ability of the enterprise are selected in the risk prediction, such as asset liability, current, and quick ratios. Because these indicators can fully reflect the solvency and cash flow status of the enterprise. These indicators are closely related to enterprise risks, which can reflect the enterprise's risk situation in real-time. Indicators directly related to the company's profitability are selected in the profit forecasting, such as revenue growth rate, gross profit margin, net profit margin, etc. These indicators can directly reflect the company's profitability and potential, which can predict the company's future profitability performance. Profit Estimation Model and Financial Risk Prediction Combining … Informatica 48 (2024) 15 –32 23 3 Effectiveness of financial risk prediction and profit prediction models The performance analysis of financial risk prediction and profit estimate model included three parts. Firstly, the performance of financial risk prediction was analyzed. Secondly, the performance of the profit estimate model was analyzed. Finally, a comprehensive prediction effect analysis was conducted using actual enterprise cases. 3.1 Financial risk prediction effect The aim of this experiment is to analyze and predict data from Special Treatment (ST) and non-ST companies using machine learning models. The study divided the dataset into training and testing sets based on standard proportions to effectively evaluate the performance of the model. Table 4 shows the experimental setup. Table 4: Structural parameters of profit prediction model Dataset settings Dataset categories Number of ST companies Number of non-ST companies Total Training set 252 11500 11752 Test set 85 4760 4845 Total 335 16260 16595 Parameter settings Set project Detail settings Dataset partitioning ratio Training set: Test set=7:3 Learning rate 0.001 Optimizer Adam Training rounds 50 Dropout application layer Fully connected layer In Table 4, the training set includes 252 data points from ST company and 11500 data points from non-ST company, totaling 11752 samples. The test set includes 85 data points from ST company and 4760 data points from non-ST company, totaling 4845 samples. The entire dataset consists of 335 ST company data points and 16260 non-ST company data points, totaling 16595 samples. In addition, the Dropout technique was introduced in the fully connected layer of the model to improve the generalization ability of the model and prevent overfitting. This means that a certain proportion of neurons will be randomly discarded during training. This can increase the sparsity of the model structure, forcing the model to learn effectively through a small number of features, thereby improving its performance on unseen data. The models used for comparison are as follows: CNN, GRU, Multi-Scale Gated Recurrent Unit (M-GU), Support Vector Machine (SVM), FastText, Deep Long Short-Term Memory (D-LSTM), Character-Level Convolutional Neural Network (Char-CNN), Character-Level Convolutional Recurrent Neural Network(Char-CRNN), Word-Level Convolutional Neural Network with Attention(Word-CNN-Att), Deep Recurrent Neural Network (DRNN), and Deep Bidirectional Gated Recurrent Unit (D-BU). First, the study collected data from multiple financial databases and publicly available company information for ST and non-ST companies in the experimental process. The focus was on financial indicators during the data collection process, such as debt ratio and current ratio, and data cleaning, preprocessing. Meanwhile, feature selection steps were carried out. Then the experiment began by using the Adam optimizer and setting appropriate learning rates and training rounds. The experiment was divided into three stages. Firstly, the learning mode and generalization ability of the model were trained and tested, followed by financial risk prediction. Secondly, financial indicators were accurately predicted and profit forecasts were made through parameter adjustments and test data validation. Finally, J company was chosen as the empirical research object to evaluate the actual effectiveness of predictions. Figure 6 shows the parameter variation effect. 24 Informatica 48 (2024) 15 –32 Y . Liu et al. 2 0 4 6 Epoch 0.092 0.094 0.096 0.098 0.100 0.102 0.104 0.106 8 10 (b) Change in loss function 2 0 4 6 Epoch 0.97800 0.97825 0.97850 0.97875 0.97900 0.97925 0.97950 0.97975 Accuracy 8 10 (c) Change in accuracy 0.2 0 0.4 0.6 Dropout 90 91 92 93 94 96 97 98 0.8 1.0 (a) Dropout value impact Accuracy Trough point Accuracy Figure 6: Parameter variation effect In Figure 6, the loss function values in the database showed a significant decreasing trend as the iteration increased. This indicated that the model's predictive ability gradually improved in consistency with the actual situation. Meanwhile, the accuracy of the model also significantly improved as the loop tests increased. The proportion of values close to 1 continued to increase, further confirming the improvement in the performance of the research model. In addition, the classification accuracy was the highest when the dropout value was 0.4, reaching 98.03%. Table 5 shows a comparison of vertical improvements. Table 5: Vertical improvement comparison Model Data set Accuracy F1 score Number of parameters Training time Memory footprint SVM A 0.9498 0.945 15000 100.50s 500MB SVM B 0.9376 0.938 15000 95.75s 500MB CNN A 0.9672 0.967 250000 25.09s 1.2GB CNN B 0.9549 0.955 250000 30.17s 1.2GB GRU A 0.9532 0.953 80000 197.78s 750MB GRU B 0.9485 0.948 80000 205.89s 750MB BGRU A 0.9702 0.97 120000 349.05s 1GB BGRU B 0.9667 0.966 120000 360.11s 1GB M-GU A 0.9742 0.974 100000 237.45s 900MB M-GU B 0.9689 0.968 100000 245.96s 900MB M-BU A 0.9803 0.98 150000 377.93s 1.1GB M-BU B 0.9757 0.975 150000 390.07s 1.1GB In Table 5, M-BU achieved an accuracy of 98.03% on dataset A, which was superior to other models such as BGRU's 97.02%, GRU's 95.32%, and the classical CNN model's 96.72%. M-BU also performed well on dataset B, achieving an accuracy of 97.57%. M-BU still maintained a leading position compared to BGRU's 96.67%, GRU's 94.85%, and CNN's 95.49%. This high accuracy meant that M-BU more accurately identified potential risk points when predicting financial risk. In addition, M-BU also performed excellently in F1 scores, reaching 0.980 and 0.975 on datasets A and B, respectively, significantly higher than other models. This indicated that M-BU had Profit Estimation Model and Financial Risk Prediction Combining … Informatica 48 (2024) 15 –32 25 higher reliability in predicting positive and negative samples. The training time of M-BU was longer compared to other models, with 377.93 seconds and 390.07 seconds on datasets A and B, respectively. This reflected the additional time investment required for model training, partly due to the predictable complexity of the M-BU structure compared to general GRU. However, this additional time investment was reasonable and beneficial considering that training was a one-time cost and the model was used multiple times after deployment. M-BU had 150000 parameters in terms of parameter quantity, slightly more than other competitive models such as GRU's 80000. The larger number of parameters corresponded to the learning ability of the model, increasing the risk of overfitting and the demand for computing resources. M-BU was approximately 1.1GB in terms of memory usage. This indicated that M-BU could perform effective training with certain computing resources. Figure 7 shows a horizontal comparison. 68 70 72 74 76 78 Types Accuracy (%) (e) Yah. A. 82 84 86 88 90 92 94 96 98 FastText D-LSTM Char-CNN Char-CRNN Word-CNN-Att DRNN D-BGRU M-BU Accuracy (%) Types (a) AG 94 95 96 97 98 99 100 Accuracy (%) Types (b) DBP FastText D-LSTM Char-CNN Char-CRNN Word-CNN-Att DRNN D-BGRU M-BU 90 91 92 93 94 95 96 97 98 99 Accuracy (%) (c) Yelp P. Types FastText D-LSTM Char-CNN Char-CRNN Word-CNN-Att DRNN D-BGRU M-BU 56 58 60 62 64 66 68 70 72 Accuracy (%) Types (d) Yelp R. FastText D-LSTM Char-CNN Char-CRNN Word-CNN-Att DRNN D-BGRU M-BU FastText D-LSTM Char-CNN Char-CRNN Word-CNN-Att DRNN D-BGRU M-BU Figure 7: Horizontal comparison In Figure 7, the performance of M-BU was excellent on different datasets in a series of recent horizontal comparison analysis of models. M-BU not only demonstrated excellent processing capabilities compared to other cutting-edge technologies in comparative studies, but also reflected unique insights into prediction accuracy. 26 Informatica 48 (2024) 15 –32 Y . Liu et al. M-BU achieved a classification accuracy of 95.7% in the AG dataset, surpassing the 95.2% of D-BU in the same series. M-BU had shown enormous potential in the DBP dataset, leading with an accuracy of 99.6%, which was more accurate than D-BU's 99.5%. M-BU achieved an accuracy of 98.0% in the Yelp P. dataset, which was the highest among all comparison models and significantly better than the DRNN model's 97.3%. The improvement of this data indicated that M-BU had better recognition ability when dealing with complex text content. M-BU also demonstrated strong predictive performance in the Yelp R. dataset, with a classification accuracy of 70.5%, surpassing D-BU's 69.3%. This indicated that M-BU could more accurately capture semantic details and user emotional tendencies in handling user comments. M-BU still maintained a leading position in the Yah. A. dataset, with an accuracy of 77.6% being the best performance among all comparison models. This ratio was significantly higher than D-BU's 76.7%, demonstrating its high applicability and stability in a wide range of fields. 3.2 Profit forecast effect The study first analyzed the effect of parameter changes on M-AU in the profit estimate effect analysis. Figure 8 shows the variation pattern of the loss function value with increasing iteration. 2 0 4 6 Epoch -0.10 0.00 0.10 0.20 0.30 0.40 0.50 0.60 Loss 8 10 (b) MS-ABGRU 2 0 4 6 Epoch 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 Loss 8 10 (c) MS-BGRUA 0.2 0 0.4 0.6 Epoch 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.8 1.0 (a) Dropout value impact Mse Trough point Smooth point Smooth point Figure 8: The parameter variation effect of M-AU Figure 8 shows the variation pattern of the loss function value with increasing iteration. These data confirmed that as iterations increased, the loss function value significantly decreased, approaching a small value close to 0. This change indicated that this model was increasingly approaching the optimal fit to the training data. A Mean Square Error (MSE) of 0.0232 was obtained by evaluating the performance of the model on the test set. This provided an important basis for studying and evaluating the prediction accuracy of the model. On the other hand, experiments with different dropout values showed that they had a significant impact on the prediction error of the model. The model reached the lowest MSE on the test set especially when the dropout value was set to 0.6, which was 0.0349. This result suggested that the model could better generalize to unseen data and avoid overfitting under this setting. Therefore, it was recommended to set dropout to 0.6 to optimize model performance under the current experimental conditions. Figure 9 shows the horizontal comparison of M-AU. Profit Estimation Model and Financial Risk Prediction Combining … Informatica 48 (2024) 15 –32 27 0.6 0.7 0.8 0.9 0.5 100 200 300 400 D-LSTM F1 1.0 500 600 (a) AG Word-CNN-Att Char-CRNN Epochs Our model 0.6 0.7 0.8 0.9 0.5 100 20 0 300 400 F1 1.0 500 600 (b) DBP Epochs 0.6 0.7 0.8 0.9 0.5 100 200 300 400 F1 1.0 500 600 (c) Yelp P. Epochs 0.6 0.7 0.8 0.9 0.5 100 200 300 400 F1 1.0 500 600 (d) Yelp R. Epochs D-LSTM Word-CNN-Att Char-CRNN Our model D-LSTM Word-CNN-Att Char-CRNN Our model D-LSTM Word-CNN-Att Char-CRNN Our model Figure 9: Horizontal comparison of M-AU In Figure 9, the design model consistently maintained an advantage in F1 values among these four subgraphs representing different datasets, with the curve located above the F1 value curve of the other models. Therefore, the design model performed better on commonly used datasets. Table 6 shows a comparison of the vertical improvements of M-AU. Table 6: Comparison of M-AU vertical improvements Dataset name Model MSE MAE MAD RMSE R² score Accuracy Recall F1 score Financial dataset A CNN 0.1978 0.5684 0.5044 0.4447 0.81 0.75 0.72 0.735 Financial dataset A GRU 0.1898 0.5447 0.484 0.4356 0.83 0.78 0.74 0.76 Financial dataset A BGRU 0.1547 0.4439 0.3944 0.3933 0.86 0.82 0.79 0.805 Financial dataset A M-BU 0.1089 0.3125 0.2777 0.33 0.9 0.85 0.83 0.84 Financial dataset A M-AU 0.0232 0.1191 0.1058 0.1523 0.96 0.92 0.91 0.915 Financial dataset B CNN 0.2056 0.5792 0.5164 0.4533 0.78 0.73 0.7 0.715 Financial dataset B GRU 0.1945 0.5512 0.4998 0.441 0.81 0.76 0.73 0.745 Financial dataset B BGRU 0.1602 0.4589 0.4129 0.4002 0.85 0.81 0.78 0.795 Financial dataset B M-BU 0.1137 0.3291 0.2905 0.3372 0.88 0.83 0.81 0.82 Financial dataset B M-AU 0.0263 0.1256 0.1128 0.1621 0.94 0.9 0.88 0.89 28 Informatica 48 (2024) 15 –32 Y . Liu et al. In Table 6, the comparison indicators are MSE, Mean Absolute Error (MAE), Mean Absolute Deviation (MAD), and Root Mean Squared Error (RMSE). M-AU was only 0.0232 on MSE on financial dataset A, which was about 88% lower than CNN's 0.1978. This demonstrated its high accuracy in estimating true profit values. The MAE also significantly decreased from CNN's 0.5684 to 0.1191, further confirming that M-AU had higher prediction accuracy at the average level. Similarly, M-AU led the MAD index at 0.1058, while CNN was 0.5044. This trend was equally evident in RMSE and R2 scores. The RMSE of M-AU reached 0.1523, which was much lower than CNN's 0.4447, reflecting that M-AU had smaller errors in predicting various data points. Meanwhile, the R2 score of M-AU reached 0.96, close to perfect predictive ability, far exceeding CNN's 0.81. This demonstrated that M-AU was significantly better than other models in overall data fitting. In terms of financial dataset B, the performance of all models has decreased compared to dataset A. This might be attributed to the complexity of dataset B, but M-AU continued to maintain its superiority. The MSE and MAE of M-AU were 0.0263 and 0.1256, respectively, slightly increasing compared to dataset A, but still leading other models. M-AU demonstrated strong performance in classification performance indicators such as accuracy, recall, and F1 score on both datasets. The accuracy of M-AU reached 0.92 and 0.9, recall rates were 0.91 and 0.88, and F1 scores were 0.915 and 0.89, respectively. These were all higher than other models. For example, the accuracy of CNN on dataset A was 0.75, the recall was 0.72, and the F1 score was 0.735, which highlighted the significant improvement of M-AU in classification accuracy. 3.3 Comprehensive case analysis The J company selected for the study is listed electronic technology Co., Ltd., which focuses on the manufacturing of high-end consumer electronics products and has approximately 200 employees. This company's main products include smartphones and wearable devices to meet the market's demand for high-quality electronic devices. Table 7 shows the case results. Table 7: Case results Financial risk prediction Risk indicators Description Estimation Prediction interval Remarks Financial risk level Model-based risk classification Low risk - Risk is effectively managed Asset liability ratio The percentage of liabilities to assets 0.45 43%-47% Control at a safe level Current ratio Display short-term solvency 2.1 2.0-2.2 Good short-term liquidity Quick ratio Exclude the solvency of inventory 1.5 1.4-1.6 Quick assets in good condition Interest coverage ratio Operate profit before interest expenses 6 5.5-6.5 Interest payment without pressure Cash flow ratio Cash flow to current debt ratio 1.2 1.1-1.3 Good cash flow situation Profit forecast Profit indicators Description Predictive value Prediction interval Remarks Revenue growth rate Expected annual revenue growth rate 0.05 4%-6% Stable business growth Return on equity Return on equity to shareholders 0.1 9%-11% Good investment return Total asset turnover rate The efficiency of generating revenue from total assets 0.8 0.7-0.9 Reasonable asset utilization efficiency Net profit attributable to shareholders of the parent company Actual net profit attributable to shareholders 32 million yuan 31 million to 33 million yuan Expected good profitability Operating cost ratio The proportion of operating costs to operating revenue 0.85 80%-90% Reasonable gross profit margin Earnings per share Earnings per share 1.5yuan 1.4-1.6 yuan Good earnings per share Profit Estimation Model and Financial Risk Prediction Combining … Informatica 48 (2024) 15 –32 29 In Table 7, the financial risk forecast showed a debt level of 45%, which was at the center of the forecast range of 43%-47%. This indicated that companies were cautious in the application of financial leverage, with lower risks, which were effectively managed. The current ratio and quick ratio were 2.1 and 1.5, respectively, both within their predicted ranges of 2.0-2.2 and 1.4-1.6. This indicated that both short-term liquidity and quick assets were in good condition, indicating that the company had sufficient ability to cope with short-term debt. In addition, the interest coverage ratio was 6 times, safely within the predicted range of 5.5-6.5. This meant that the enterprise could easily pay interest expenses with operating profits. The cash flow ratio was 1.2, which fell within the predicted range of 1.1-1.3. This indicated that the enterprise had healthy cash flow and was conducive to ensuring smooth daily operations and investment activities. The predicted revenue growth rate was 5% in the profit estimate, which was consistent with the predicted range of 4%-6%. This demonstrated stable business growth for the enterprise. The return on equity was 10%, accurately falling within the expected range of 9%-11%, proving that investors could expect a good return on investment. The total asset turnover rate was 0.8, which was within the predicted range of 0.7-0.9. This indicated that the enterprise had a reasonable asset utilization efficiency. The predicted value of 32 million reflected the expected good profitability of the company for the net profit attributable to shareholders of the parent company. This prediction was close to the edge of its prediction range of 31 million to 33 million. The operating cost rate was controlled at 85%, with a predicted range of 80%-90%, maintaining a reasonable gross profit margin. Finally, the earnings per share forecast was 1.5 yuan, within the forecast range of 1.4 to 1.6 yuan, indicating a good earnings per share. In summary, the financial forecasting model applied provided excellent and reliable predictions of financial risk and profitability. These predicted values matched the actual indicators and were mostly within a safe prediction range. This indicated that the model had high accuracy and reliability in understanding the financial health status of enterprises. Enterprises could adjust their strategies in a timely manner, manage potential risks, and ensure sustained positive growth through these data-driven insights. Table 8: Cost calculation and scalability analysis Characteristic Dataset size Lightweight transformer Model Training time Small (~10000 samples) 2 hours 1 hour Medium (~100000 samples) 12 hours 6 hours Large (>1000000 samples) >24 hours 12 hours Resource requirements CPU core 4-core 2-core RAM 8GB 4GB Recommended GPU NVIDIA GTX 1080 NVIDIA GTX 1060 From Table 8, the designed model had significant advantages in training time compared to the lightweight transformer. Meanwhile, this model achieved efficient operation on lower computational configurations, which was more advantageous. 4 Discussion Li X et al. used an optimized BPNN to predict the financial risks of listed companies. The optimized BPNN improved the accuracy of the prediction, with a prediction accuracy rate of over 80% [31]. This study is also based on neural networks. Moreover, more complex data can be handled using bidirectional gated recursive units to capture temporal relationships in time series data. Uddin MS et al. used an optimized random forest method to predict the credit risk of micro enterprises, which was more targeted but less common [32]. Meanwhile, this study combined multi-scale feature extraction methods to effectively extract and predict various types of enterprise information, which was more universal. Barra S et al. focused on the temporal nature of information similar to the approach of this study. Time-series encoding methods combined with CNN were used to transform time-series data into Gram angle field images for analysis [33]. The designed model had a simpler execution and stronger robustness. Uthayakumar J et al. focused on predicting financial crises by combining improved K-means clustering and chaotic genetic ant colony algorithm [34]. On the other hand, Bianchi D focused on bond risk by combining extreme tree and neural network models to analyze the likelihood of bond returns [35]. Both studies focused on one aspect of corporate risk, such as financial risk. On the side of bond risk, this study takes the perspective of enterprises as the main body to conduct integrated prediction analysis of enterprise risk and profit. The designed model performed better in classification and efficiency indicators than other models compared with the model of Li X et al. and Uddin MS et al. This demonstrated higher prediction accuracy, as well as more comprehensive and stable information processing. 5 Conclusion A novel financial risk prediction model based on graph network concept was designed, which combined multi-scale feature extraction technology with sequence analysis methods. A model based on multi-scale 30 Informatica 48 (2024) 15 –32 Y . Liu et al. advantage and attention mechanism was also proposed for profit estimate. These results confirmed that the prediction accuracy of these two models was significantly improved compared to traditional financial prediction models. The model's accuracy had significantly improved after 50 rounds of training, with the highest classification accuracy reaching 98.03%. The financial risk prediction model achieved an accuracy of 98.03% and an F1 score of 0.980 on dataset A. The error was close to 0 on the profit estimate model, and the MSE was 0.0232. This indicated that the model could more accurately identify potential risks and profit points when predicting risks and profits. In the case study, the debt level was 45% for the financial risk forecast, the current ratio and quick ratio were 2.1 and 1.5, respectively. The actual interest coverage ratio and cash flow ratio were within the forecast range. The net profit attributable to the shareholders of the parent company was expected to be 32 million in terms of profit estimate, which almost matched the actual indicators. Therefore, the designed comprehensive model is an efficient and feasible method. The designed model integrates risk prediction and profit prediction has strong predictive ability. A prediction system is formed for multi-feature data and their time characteristics. However, there are many differences in economic structure, market mechanisms, policy environment, etc. between different regions in practical applications. Therefore, the specific environment in which the model is applied may also vary, leading to prediction bias in universal models under specific conditions. Therefore, it is necessary to modularize the different functions of the prediction model. Meanwhile, targeted parameter setting and application solutions can be developed for different environments in the future. References [1] A. Lin, N. Manral, P. McElhinney, K. Aditya, H. Matsumoto, and J. Kwiecinski, "Deep learning-enabled coronary CT angiography for plaque and stenosis quantification and cardiac risk prediction: an international multicentre study," The Lancet Digital Health, vol. 4, no. 4, pp. e256-e265, 2022. https://doi.org/10.1016/S2589-7500(22)00022-X. [2] W. Gleißner, T. Günther, and C. Walkshäusl, "Financial sustainability: measurement and empirical evidence," Journal of Business Economics, vol. 92, no. 3, pp. 467-516, 2022. https://doi.org/10.1007/s11573-022-01081-0. [3] R. Patel, S. Tanwani, and C. Patidar, "Relation extraction between medical entities using deep learning approach," Informatica, vol. 45, no. 3, pp. 359-366, 2021.https://doi.org/10.31449/inf.v45i3.3056. [4] A. Kurani, P. Doshi, A. Vakharia, and M. Shah, "A comprehensive comparative study of artificial neural network (ANN) and support vector machines (SVM) on stock forecasting," Annals of Data Science, vol. 10, no. 1, pp. 183-208, 2023. https://doi.org/10.1007/s40745-021-00344-x. [5] J. Kong, S. Oh, B. O. Kang, and J. Jung. "Development of an incentive model for renewable energy resources using forecasting accuracy in South Korea," Energy Science & Engineering, vol. 10, no. 9, pp. 3250-3266, 2022. https://doi.org/10.1002/ese3.1020. [6] M. Salb, M. Zivkovic, N. Bacanin, A. Chhabra, and M. Suresh, "Support vector machine performance improvements for cryptocurrency value forecasting by enhanced sine cosine algorithm," Computer Vision and Robotics, pp. 527-536, 2022. https://doi.org/10.1007/978-981-16-8225-4_40. [7] F. Liu, Y. Wu, N. Almarri, M. Habibollahi, H. T. Lancashire, B. Bryson, L. Greensmith, D. Jiang, and A. Demosthenous, "A fully implantable opto-electro closed-loop neural interface for motor neuron disease studies," IEEE Transactions on Biomedical Circuits and Systems, vol. 16, no. 5, pp. 752-765, 2022. https://doi.org/10.1109/TBCAS.2022.3202026. [8] V. K. Tewari and A. Verma, "Design and application of double-gate MOSFET in two loops controlled multi-input DC-DC converter," Silicon, vol. 14, no. 12, pp. 7321-7333, 2022. https://doi.org/10.1007/s12633-021-01469-7. [9] P. Preethi and H. R. Mamatha, "Region-based convolutional neural network for segmenting text in epigraphical images," Artificial Intelligence and Applications, vol. 1, no. 2, pp. 119-127, 2023. https://doi.org/10.47852/bonviewAIA2202293. [10] X. Li, J. Wang, and C. Yang, "Risk prediction in financial management of listed companies based on optimized BP neural network under digital economy," Neural Computing and Applications, vol. 35, no. 3, pp. 2045-2058, 2023. https://doi.org/10.1007/s00521-022-07377-0. [11] S. Peng, S. Yang, and J. Yao, "Improving value-at-risk prediction under model uncertainty," Journal of Financial Econometrics, vol. 21, no. 1, pp. 228-259, 2023. https://doi.org/10.1093/jjfinec/nbaa022. [12] Y. Cao, Y. Shao, and H. Zhang, "Study on early warning of E-commerce enterprise financial risk based on deep learning algorithm," Electronic Commerce Research, vol. 22, no. 1, pp. 21-36, 2022. https://doi.org/10.1007/s10660-020-09454-9. [13] U. M. Sirisha, M. C. Belavagi, and G. Attigeri, "Profit prediction using Arima, Sarima and LSTM models in time series forecasting: A Comparison," IEEE Access, vol. 10, pp. 124715-124727, 2022. https://doi.org/10.1109/ACCESS.2022.3224938. [14] G. Kim, D. H. Shin, J. G. Choi, and S. Lim, "A deep learning-based cryptocurrency price prediction model that uses on-chain data," IEEE Access, vol. Profit Estimation Model and Financial Risk Prediction Combining … Informatica 48 (2024) 15 –32 31 10, pp. 56232-56248, 2022. https://doi.org/10.1109/ACCESS.2022.3177888. [15] M. Shajalal, P. Hajek, and M. Z. Abedin, "Product backorder prediction using deep neural network on imbalanced data," International Journal of Production Research, vol. 61, no. 1, pp. 302-319, 2023. https://doi.org/10.1080/00207543.2021.1901153. [16] S. He, X. Sang, J. Yin, Y. Zheng, and H. Chen, "Short-term runoff prediction optimization method based on bgru-bp and blstm-bp neural networks," Water Resources Management, vol. 37, no. 2, pp. 747-768, 2023. https://doi.org/10.1007/s11269-022-03401-z. [17] M. D. Dangut, I. K. Jennions, S. King, and Z. Skaf, "A rare failure detection model for aircraft predictive maintenance using a deep hybrid learning approach," Neural Computing and Applications, vol. 35, no. 4, pp. 2991-3009, 2023. https://doi.org/10.1007/s00521-022-07167-8. [18] H. Li, Q. He, and L. Wu, "Detection of brain abnormalities in parkinson ’s rats by combining deep learning and motion tracking," IEEE Transactions on Neural Systems and Rehabilitation Engineering, vol. 31, no. 1, pp. 1001-1007, 2023. https://doi.org/10.1109/TNSRE.2023.3237916. [19] P. Weston and M. Nnadi, "Evaluation of strategic and financial variables of corporate sustainability and ESG policies on corporate finance performance," Journal of Sustainable Finance & Investment, vol. 13, no. 2, pp. 1058-1074, 2023. https://doi.org/10.1080/20430795.2021.1883984. [20] S. Wahab, M. Imran, A. Safi, and D. Kirikkaleli, "Role of financial stability, technological innovation, and renewable energy in achieving sustainable development goals in BRICS countries," Environmental Science and Pollution Research, vol. 29, no. 32, pp. 48827-48838, 2022. https://doi.org/10.1007/s11356-022-18810-1. [21] H. N. H. Al-Hashimy, T. T. Y. Alabdullah, E. Ries, M. A. Ahmed, M. I. Nor, and K. A. M. Jamal, "The impact of financial management elements and behavioral intention on the financial performance," International Journal of Scientific and Management Research, vol. 5, no. 12, pp. 117-149, 2022. https://doi.org/10.37502/JSMR.2022.51210. [22] Y. Si, S. Heeringa, D. Johnson, R. J. A. Little, W. Liu, F. Pfeffer, and T. Raghunathan, "Multiple imputation with massive data: An application to the panel study of income dynamics," Journal of Survey Statistics and Methodology, vol. 11, no. 1, pp. 260-283, 2023. https://doi.org/10.1093/jssam/smab038. [23] Z. Zhao, L. Tang, M. Fang, X. Yang, C. Li, and Q. Li, "Toward urban traffic scenarios and more: a spatio-temporal analysis empowered low-rank tensor completion method for data imputation," International Journal of Geographical Information Science, vol. 37, no. 9, pp. 1936-1969, 2023. https://doi.org/10.1080/13658816.2023.2234434. [24] F. Conceição, C. H. Antunes, M. Gomes, V. Silva, and R. Dinis, "Max-min fairness optimization in uplink cell-free massive MIMO using meta-heuristics," IEEE Transactions on Communications, vol. 70, no. 3, pp. 1792-1807, 2022. https://doi.org/10.1109/TCOMM.2022.3144989. [25] Y. K. Saheed, A. I. Abiodun, S. Misra, A. S. Albahri, A. H. Alamoodi, B. B. Zaidan, S. Qahtan, H. A. Alsatar, M. S. Al-Samarraay, and A. N. Jasim, "A machine learning-based intrusion detection for detecting internet of things network attacks," Alexandria Engineering Journal, vol. 61, no. 12, pp. 9395-9409, 2022. https://doi.org/10.1109/JBHI.2022.3167256. [26] V. Gómez-Escalonilla, P. Martínez-Santos, and M. Martín-Loeches, "Preprocessing approaches in machine-learning-based groundwater potential map: an application to the Koulikoro and Bamako regions, Mali," Hydrology and Earth System Sciences, vol. 26, no. 2, pp. 221-243, 2022. https://doi.org/10.5194/hess-26-221-2022. [27] Y. Shi, X. Nie, X. Liu, L. Yang, and Y. Yin, "Zero-shot hashing via asymmetric ratio similarity matrix," IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 5, pp. 5426-5437, 2022. https://doi.org/10.1109/TKDE.2022.3150790. [28] R. Nirthika, S. Manivannan, A. Ramanan, and R. Wang, "Pooling in convolutional neural networks for medical image analysis: a survey and an empirical study," Neural Computing and Applications, vol. 34, no. 7, pp. 5321-5347, 2022. https://doi.org/10.1007/s00521-022-06953-8. [29] J. J. Liu, Q. Hou, Z. A. Liu, and M. M. Cheng, "Poolnet+: Exploring the potential of pooling for salient object detection," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 1, pp. 887-904, 2022. https://doi.org/10.1109/TPAMI.2021.3140168. [30] U. Nandi, A. Ghorai, M. M. Singh, C. Changdar, S. Bhakta, and R. K. Pal, "Indian sign language alphabet recognition system using CNN with diffGrad optimizer and stochastic pooling," Multimedia Tools and Applications, vol. 82, no. 7, pp. 9627-9648, 2023. https://doi.org/10.1007/s11042-021-11595-4. [31] X. Li, J. Wang, and C. Yang, "Risk prediction financial management of listed companies based on optimized BP neural network under digital economy," Neural Computing and Applications, vol. 35, no. 3, pp. 2045-2058, 2023. https://doi.org/10.1007/s00521-022-07377-0. [32] M. S. Uddin, G. Chi, M. A. M. Al Janabi, and T. Habib, "Leveraging random forest in micro-enterprises credit risk modelling for accuracy and interpretability," International Journal of 32 Informatica 48 (2024) 15 –32 Y . Liu et al. Finance & Economics, vol. 27, no. 3, pp. 3713-3729, 2020. https://doi.org/10.1002/ijfe.2346. [33] S. Barra, S. M. Carta, A. Corriga, A. S. Podda, and D. R. Recupero, "Deep learning and time series-to-image encoding for financial forecasting," IEEE/CAA Journal of Automatica Sinica, vol. 7, no. 3, pp. 683-692, 2020. https://doi.org/10.1109/JAS.2020.1003132. [34] J. Uthayakumar, N. Metawa, K. Shankar, and S. K. Lakshmanaprabu, "Intelligent hybrid model for financial crisis prediction using machine learning techniques," Information Systems and e-Business Management, vol. 18, pp. 617-645, 2018. https://doi.org/10.1007/s10257-018-0388-9. [35] D. Bianchi, M. Büchner, and A. Tamoni, "Bond risk premiums with machine learning," The Review of Financial Studies, vol. 34, no. 2, pp. 1046-1089, 2020. https://doi.org/10.1093/rfs/hhaa062.