Article

Analysis of the Composition of Ancient Glass and Its Identification Based on the Daen-LR, ARIMA-LSTM and MLR Combined Process

1 School of Information Engineering, Tianjin University of Commerce, Tianjin 300134, China
2 School of Chemical Engineering and Technology, National-Local Joint Engineering Laboratory for Energy Conservation in Chemical Process Integration and Resources Utilization, Hebei University of Technology, Tianjin 300130, China
* Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Appl. Sci. 2023, 13(11), 6639; https://doi.org/10.3390/app13116639
Submission received: 21 April 2023 / Revised: 16 May 2023 / Accepted: 29 May 2023 / Published: 30 May 2023

Abstract

Glass relics are precious material evidence of early trade and cultural exchange between the East and the West. To explore the cultural differences and trade development between early China and foreign countries, it is extremely important to classify glass cultural relics. Despite their similar appearances, Chinese glass contains more lead, while foreign glass contains more potassium. In view of this, this paper proposes a joint Daen-LR, ARIMA-LSTM, and MLR machine learning algorithm (JMLA) for the analysis and identification of the chemical composition of ancient glass. We separate the sampling points of ancient glass into two systems: lead-barium glass and high-potassium glass. Firstly, an improved logistic regression model based on a double adaptive elastic net (Daen-LR) is used to select variables with both oracle and adaptive classification properties. Secondly, the ARIMA-LSTM model is used to establish the correlation curve of chemical composition before and after weathering and to predict the change in chemical composition with weathering. Thirdly, combining the data processed by the above two methods, a multiple linear regression model (MLR) is used to classify unknown glass products. The samples obtained by this processing method show a very good fit. In comparison with similar models such as Decision Trees (DT), Random Forests (RF), Support Vector Machines (SVM), and Random Forests based on classification and regression trees (CART-RF), the classification accuracy of JMLA is 97.9% on the train set, and the accuracy on the test set reaches 97.6%. The results demonstrate that JMLA improves the accuracy of glass type classification, greatly enhances the research efficiency of archaeological staff, and yields more reliable results.

1. Introduction

Machine learning (ML) algorithms are a set of mathematical models and statistical [1] methods that can be used in computer systems to learn and make predictions or decisions based on patterns in data. In the field of archaeology, there are many examples of machine learning algorithms applied to conservation and restoration, provenance research, and the management of cultural heritage. In 1798, the German scientist M.H. Klaproth conducted the first quantitative chemical study of Roman-era glass [2], improving the procedure for gravimetric analysis and devising various procedures for the determination of non-metallic elements, accurately determining the composition of nearly 200 minerals and various industrial products. In 2003, Professor Fuxi Gan and his research team used the proton-induced X-ray emission (PIXE) technique to quantify the chemical composition of a batch of ancient glass excavated in Yangzhou and Hubei, with the goal of studying the origin, system, and preparation process of ancient Chinese glass [3]. As more and more ancient silicate artifacts were unearthed, some scholars began to classify them based on their chemical composition. In 1992, the Korean scientist Lee Chul applied a chemometric pattern recognition method to multivariate data to classify 94 ancient Korean glass pieces using neutron activation analysis and principal component analysis [4]. In 2010, El-Taher, an Egyptian scholar, used instrumental neutron activation analysis (INAA) and HPGe detector γ-spectroscopy to determine qualitatively and quantitatively, for the first time, a total of 16 elements in feldspar rock samples collected from Gabel El Dubb, Eastern Desert, Egypt, and to classify the rock samples [5]. In 2011, the Thai scholar Won-in K. and his team used Raman spectroscopy for the first time to characterize fragments of archaeological glass samples with the aim of obtaining information from laser scattering to identify and classify the glass samples [6]. In 2019, Nadine Schibille and her team established a temporal model that serves as a tool for dating archaeological glass assemblages, as well as a geographical model that allows for a clear classification of Levantine and Egyptian plant ash glasses [7]. However, it is worth noting that applications and extensions of machine learning algorithms to cultural heritage (CH) component analysis and category identification remain very limited [8].
In recent years, when studying the chemical composition of ancient glass objects, the classification of glass has mainly been determined by the weight ratio of oxides or by analyzing the mass fraction of compounds containing lead and potassium [9,10,11,12,13]. However, the percentage of lead and potassium compounds present varies with the region where the glass was produced and the degree of weathering, which interferes with the classification of the glass. This study is therefore based on the data related to ancient glassware provided by the official website of the 2022 China Student Mathematical Modeling Competition [14]. The weathering of glass over thousands of years can cause significant changes in its internal chemical composition. As a result, determining the type of glass from the amount of a certain chemical component alone is neither reliable nor scientific. Therefore, building on the double adaptive elastic net improved logistic regression model (Daen-LR), the ARIMA-LSTM model, and the multiple linear regression model (MLR) [15,16], and optimizing and combining these three algorithms, we propose a processing method suitable for classifying complex data that can be used to predict the unknown classification of glass. This processing method is used to analyze and model the data on the chemical composition and classification of a batch of ancient Chinese glass products, to find the correlation between their chemical composition and the basis of their classification, and to use this relationship to predict the category of unknown glass. The accuracy of the algorithm model was judged by testing for heteroskedasticity in the perturbation terms, testing for multicollinearity, and testing the fit between the values produced by the model and the actual values [17].
With the continuous development of machine learning technology, a variety of machine learning models have been proposed and widely used in classification research. These include Logistic Regression (LR), Naive Bayes (NB), Decision Tree (DT), Support Vector Machine (SVM), Random Forest (RF), Gradient Boosting Tree (GBT), and so on. However, traditional machine learning methods have some drawbacks in solving real-world problems, such as interference from external factors, failure to meet scientific standards, random results, and poor prediction accuracy. To address these problems, it is necessary to combine established machine learning methods with more advanced techniques. This paper presents a joint machine learning algorithm using Daen-LR, ARIMA-LSTM, and the MLR model (JMLA). We first use an improved logistic regression model based on a double adaptive elastic net (Daen-LR) to select variables that have both oracle and adaptive classification properties. Secondly, we use the ARIMA-LSTM model to balance the linear and nonlinear trends in the time series data of the chemical content of glass artifacts before and after weathering. Finally, a multiple linear regression model (MLR) is used to classify the experimental samples. By testing on the data set of the 2022 China Undergraduate Mathematical Contest in Modeling, this study demonstrates the correctness of the proposed method.
The main contributions of this study include:
  • Successfully established a classification model of ancient glass products with high accuracy.
  • This study combines three different algorithms reasonably and effectively and integrates the advantages of different algorithms into the JMLA algorithm.
  • We made a comprehensive comparison of multiple test sets on multiple models, and the test results show that the algorithm given in this study is superior to other algorithms.
  • In the future, this algorithm model will also be able to support component analysis in many fields, such as water flow pollution, food safety, and environmental protection.
This paper consists of six parts: the second part describes the algorithms and principles in detail; the third part explains the data preprocessing and experimental preparation; the fourth part discusses the experimental process and results; the fifth part gives the advantages and limitations of the JMLA model compared with other models; and the sixth part presents the conclusions, implications, and suggestions for future research.

2. Theory and Method

2.1. An Improved Logistic Regression Model with Double Adaptive Elastic Net

In this analysis of ancient glass artifacts, the relationship between glass weathering and its chemical composition was identified and statistically analyzed by using an improved logistic regression model based on a double adaptive elastic net, i.e., the Daen-Logistic regression (Daen-LR) model. Furthermore, we calculated the p-value of each correlation factor and counted whether each regression coefficient was significant at the 90% confidence level, extracted the strong correlation elements, and excluded the weak correlation elements. The characteristics of the model are as follows:
Logistic regression is an effective method to solve classification problems in which effective estimation of parameters and selection of variables are extremely important. The regularization method [18], which considers adding a penalty term to the optimized loss function to estimate parameters, can simultaneously solve the two key points of logistic regression. Elastic net [19] is one of the representatives of this method.
However, the elastic net has two major shortcomings for the estimation of parameters and the identification of important variables in the traditional logistic regression model: First, the selected variables may not be consistent, i.e., the method lacks the oracle property [20]. Second, the specific effects of strongly correlated variables on the independent variables are not considered, i.e., adaptive classification effects are missing [21,22].
To overcome the first deficiency of the elastic net, the adaptive elastic net was established by combining the adaptive lasso [23] and ridge penalties to achieve consistency in variable selection. However, the adaptive coefficient vector $W_1$, which gives the adaptive elastic net its oracle property, is not easy to set correctly. It is generally determined by initial parameter estimates and the constant $\delta$.
To address the second defect of the elastic net, Van et al. [24] proposed the generalized ridge, in which parameters are first divided into groups and a different ridge penalty is then given to each group. The generalized ridge has an adaptive grouping effect, and its adaptive ridge also enjoys that effect. However, the generalized ridge cannot select variables and is therefore of limited application.
Based on the existing remedies for the elastic net's deficiencies, it follows that the adaptive lasso and the adaptive ridge have the oracle property and the adaptive grouping effect, respectively, so they can be combined to avoid both disadvantages. This combination of penalties can be called the double adaptive elastic net.
It is assumed that in the composition analysis of glass artifacts there are m chemical composition influence factors, $X = (x_1, x_2, \ldots, x_m)$ is the vector of chemical composition content characteristic variables (i.e., independent variables), m is the number of variables, and the weathering status of the corresponding glass artifact is set as y (i.e., the dependent variable), where y is the dichotomous variable of weathering or not (0 means "unweathered" and 1 means "weathered"). To assess the probability that a particular glass artifact is weathered, it is necessary to compute the predicted outcome of the model as the probability of the event $y = 1$, which can be expressed as $P = f(y = 1 \mid x_1, x_2, \ldots, x_m)$; i.e., the mathematical expression of the traditional logistic regression model is:
$$\mathrm{Logit}(P) = \ln\frac{P(y=1)}{1 - P(y=1)} = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_m x_m$$
i.e.,
$$P(y=1) = \frac{\exp\left(\beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_m x_m\right)}{1 + \exp\left(\beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_m x_m\right)}$$
In the above equation, $\beta_0, \beta_1, \beta_2, \ldots, \beta_m$ are the regression coefficients to be determined. The maximum likelihood estimation method is used to find these coefficients:
$$P(y=1 \mid X) = 1 - \frac{1}{1 + \exp\left(\beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_m x_m\right)} = \pi$$
$$P(y=0 \mid X) = 1 - \pi$$
Combining these, the probability function of y is:
$$P(y_i) = \pi^{y_i}\left(1 - \pi\right)^{1 - y_i}, \quad y_i = 0, 1;\ i = 1, 2, \ldots, n$$
According to the Bernoulli distribution, the maximum likelihood function can be expressed as:
$$l(\beta; X) = \prod_{i=1}^{n} P(y_i) = \prod_{i=1}^{n} \pi_i^{y_i}\left(1 - \pi_i\right)^{1 - y_i}$$
The log-likelihood function is expressed as:
$$\ln(l(\beta; X)) = \sum_{i=1}^{n}\left[ y_i\left(\beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \cdots + \beta_m x_{im}\right) - \ln\left(1 + \exp\left(\beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \cdots + \beta_m x_{im}\right)\right)\right]$$
Since the log-likelihood function is concave, the point at which its first derivative equals zero is its maximizer. By taking the first derivative of Equation (6) with respect to the undetermined coefficients $\beta_0, \beta_1, \beta_2, \ldots, \beta_m$ and setting it equal to zero, all the unknown parameters can be solved for.
Considering the shortcomings and deficiencies of the traditional logistic regression model, the oracle effect and adaptive classification effect are integrated into the traditional logistic model to create a double adaptive elastic net model, which makes the identification results of glass cultural relics more relevant and persuasive [25,26].
Theorem 1 (Oracle Property).
In the logistic model, suppose the true parameter vector is $\beta_0 = (\beta_{01}, \beta_{02}, \ldots, \beta_{0m})^T$, $A = \{j : \beta_{0j} \neq 0\} = \{1, 2, \ldots, m_0\}$ with $m_0 < m$, the Fisher information matrix is $I(\beta_0) = \begin{pmatrix} I_{11} & I_{12} \\ I_{21} & I_{22} \end{pmatrix}$, where $I_{11}$ is a square matrix of order $m_0$, and $\phi(X^T\beta) = \ln(1 + e^{X^T\beta})$. Then the double adaptive elastic net logistic estimator has the oracle property under the following conditions:
  • $I(\beta_0)$ is a positive definite matrix.
  • There exists an open set $\Omega$ containing $\beta_0$ such that for any $\beta \in \Omega$ there exists a function $N(\cdot)$ satisfying $\phi(X^T\beta) \le N(X) < \infty$, and for any m-dimensional vector u, $E\left[N(X)\left|X^T u\right|^3\right] < \infty$;
  • $\lambda_1 = o(\sqrt{n})$, and there is a sequence $a_n$ such that $a_n\left(\hat{\beta}^* - \beta_0\right) = O_p(1)$ and $\lim_{n\to\infty} \frac{\lambda_1 a_n^{\delta}}{\sqrt{n}} = \infty$;
  • $\lambda_2 = o(\sqrt{n})$ and $\lim_{n\to\infty} \frac{\lambda_2}{\sqrt{n}} \sum_{j=1}^{m_0} \beta_{0j}^2 = 0$.
When conditions 1–4 hold, the double adaptive elastic net estimate $\hat{\beta}$ has the following properties:
  • $\sqrt{n}\left(\hat{\beta}_A - \beta_A\right) \xrightarrow{D} N\left(0, I_{11}^{-1}\right)$;
  • $\lim_{n\to\infty} P\left(\hat{\beta}_{A^c} = 0\right) = 1$.
Theorem 2 (Adaptive Classification Effect).
Given the binary data $\{(X_i, y_i)\}_{i=1}^{n}$, where $X_i = (x_{i1}, x_{i2}, \ldots, x_{im})^T$ and, for every $j \in \{1, 2, \ldots, m\}$, $\sum_{i=1}^{n} x_{ij} = 0$, $\sum_{i=1}^{n} x_{ij}^2 = 1$, $y_i \in \{0, 1\}$. Let $\hat{\beta}(\lambda_1, \lambda_2)$ be the estimate of the model and assume that $\hat{\beta}_k(\lambda_1, \lambda_2)\,\hat{\beta}_l(\lambda_1, \lambda_2) > 0$. Define $D_{\lambda_1, \lambda_2}(k, l) = \frac{1}{n}\left| w_{2k}\hat{\beta}_k(\lambda_1, \lambda_2) - w_{2l}\hat{\beta}_l(\lambda_1, \lambda_2) \right|$; then:
$$D_{\lambda_1, \lambda_2}(k, l) \le \frac{\sqrt{2\left(1 - \rho_{kl}\right)} + \frac{\lambda_1}{n}\left| w_{1k} - w_{1l} \right|}{2\lambda_2}$$
where
$$\rho_{kl} = \mathrm{corr}\left(x_k, x_l\right)$$
By combining the above two schemes to improve the logistic regression equation, with $X_i = (1, x_{i1}, x_{i2}, \ldots, x_{im})^T$, $\beta = (\beta_0, \beta_1, \beta_2, \ldots, \beta_m)^T$, $y_i \in \{0, 1\}$, $i = 1, 2, \ldots, n$, the estimate of $\beta$ is:
$$\hat{\beta}_{\mathrm{Daen}} = \arg\min_{\beta}\left\{ -\ln(l(\beta; X)) + \lambda_1 \sum_{j=1}^{m} w_{1j}\left|\beta_j\right| + \lambda_2 \sum_{j=1}^{m} w_{2j}\beta_j^2 \right\}$$
where
$$w_{1j} = \left|\hat{\beta}_j^*\right|^{-\delta}, \quad w_{2j} > 0, \quad \lambda_1, \lambda_2 > 0, \quad \delta > 0,$$
$$\hat{\beta}^* = \arg\min_{\beta}\left\{ -\ln(l(\beta; X)) + \lambda_2 \sum_{j=1}^{m} w_{2j}\beta_j^2 \right\}.$$
Since incorporating the correlation of variables into the regression model helps to improve the accuracy of parameter estimation and variable selection [27],
$$w_{2j} = \frac{\sum_{k=1, k \neq j}^{m} \left|\rho_{kj}\right|}{m - 1} + \varepsilon_j,$$
where $\rho_{kj} = \mathrm{corr}(x_k, x_j)$ is the correlation coefficient between variables $x_k$ and $x_j$, $(\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_m)^T$ is a vector chosen so that $w_{21}, w_{22}, \ldots, w_{2m}$ are unequal to each other, and $\frac{1}{m}\sum_{j=1}^{m} \varepsilon_j = 1$, $0.95 \le \varepsilon_j \le 1.25$.
Equation (9) is equivalent to:
$$\hat{\beta}_{\mathrm{Daen}} = \arg\min_{\beta}\left\{ -\ln(l(\beta; X)) \right\}, \quad \mathrm{s.t.}\ \alpha \sum_{j=1}^{m} w_{1j}\left|\beta_j\right| + (1 - \alpha) \sum_{j=1}^{m} w_{2j}\beta_j^2 \le t,$$
where
$$\alpha = \frac{\lambda_1}{\lambda_1 + \lambda_2}, \quad t > 0.$$
Using the coordinate gradient method and the Newton method to solve for $\beta$, Equation (9) can be rewritten as:
$$\hat{\beta}_{\mathrm{Daen}} = \arg\min_{\beta}\left\{ I(\beta) + \lambda_1 \sum_{j=1}^{m} w_{1j}\left|\beta_j\right| \right\}$$
where
$$I(\beta) = -\ln(l(\beta; X)) + \lambda_2 \sum_{j=1}^{m} w_{2j}\beta_j^2.$$
If $\beta^{(t)}$ is the solution for $\beta$ at step t, then $I(\beta)$ can be approximated as:
$$I(\beta) \approx I\left(\beta^{(t)}\right) + \left(\beta - \beta^{(t)}\right)^T g^{(t)} + \frac{1}{2}\left(\beta - \beta^{(t)}\right)^T h^{(t)}\left(\beta - \beta^{(t)}\right)$$
where $g^{(t)}$ and $h^{(t)}$ are, respectively, the gradient and the Hessian matrix of $I(\beta)$ at $\beta = \beta^{(t)}$. Adding $\lambda_1 \sum_{j=1}^{m} w_{1j}\left|\beta_j\right|$ to Equation (12) and setting $\frac{\partial I(\beta)}{\partial \beta} = 0$ yields:
$$\beta^{(t+1)} = K\left(\beta^{(t)} - h^{-1}(t)\,g(t),\ \lambda_1 h^{-1}(t)\,W_1\right)$$
where
$$W_1 = \left(0, w_{11}, w_{12}, \ldots, w_{1m}\right)^T,$$
$$K(Q, W) = \begin{cases} Q - W, & 0 \le W < Q \\ Q + W, & 0 \le W < -Q \\ 0, & |Q| \le W \end{cases}$$
Since $\lambda_1 h^{-1}(t) W_1$ may contain entries less than 0, the parameters of some irrelevant variables cannot become 0. Thus, it can be directly rewritten as $\lambda_1 W_1$. From the above inference, the solution process of $\hat{\beta}_{\mathrm{Daen}}$ can be derived: first generate an initial value of $\beta$, then repeatedly compute $g^{(t)}$, $h^{(t)}$, and $\beta^{(t+1)}$ until convergence [26].
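To make this iteration concrete, the following is a minimal numerical sketch in Python/NumPy (not the Matlab/SPSS implementation used in the study); the data matrix X, the adaptive weights w1 and w2, and the penalty parameters lam1 and lam2 are placeholders supplied by the caller, and the update uses the rewritten threshold $\lambda_1 W_1$ described above.

```python
import numpy as np

def soft_threshold(q, w):
    """Operator K(Q, W): shrink q toward zero by w (element-wise)."""
    return np.sign(q) * np.maximum(np.abs(q) - w, 0.0)

def daen_logistic(X, y, w1, w2, lam1, lam2, n_iter=100, tol=1e-6):
    """Sketch of the Daen-LR update: Newton steps on the ridge-penalized
    negative log-likelihood I(beta), followed by soft-thresholding with the
    adaptive lasso weights w1. X is assumed to include an intercept column,
    with w1[0] = w2[0] = 0 so the intercept is not penalized."""
    n, m = X.shape
    beta = np.zeros(m)
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))            # fitted probabilities
        g = X.T @ (p - y) + 2.0 * lam2 * w2 * beta      # gradient of I(beta)
        W = p * (1.0 - p)
        h = X.T @ (X * W[:, None]) + 2.0 * lam2 * np.diag(w2)  # Hessian of I(beta)
        beta_new = soft_threshold(beta - np.linalg.solve(h, g), lam1 * w1)
        if np.max(np.abs(beta_new - beta)) < tol:
            return beta_new
        beta = beta_new
    return beta
```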

2.2. Time Series Forecasting Model Based on ARIMA-LSTM

2.2.1. ARIMA(p,d,q) Model

By using the ARIMA(p,d,q) model, it is possible to analyze observations at past time points, depict the intrinsic links between them, and predict future values; this is achieved based on past values and linear error equations [28,29,30,31,32]. The ARIMA model is usually denoted as ARIMA(p,d,q), where p is the number of autoregressive terms, q is the number of moving average terms, and d is the number of differences needed to make the series stationary [33]. The correlogram, autocorrelation function (ACF), and partial autocorrelation function (PACF) of the time series provide information about the lags [34]. If the time series is found to be stationary, the model can be used for estimation and forecasting. However, if it is not stationary, it must be transformed by differencing before ARIMA can be applied. After identification, an ARIMA model is estimated for the specific stationary time series. The ARIMA model is selected based on the number of significant coefficients, the Bayesian information criterion (BIC), the Akaike information criterion (AIC), and the adjusted R2 [35]. After estimation, the selected ARIMA model needs to be diagnosed to check whether the residuals are white noise. If the residuals are not white noise, the model must be re-estimated; Q-tests and normality tests can be used to diagnose the residuals [36]. Typically, the ARIMA model is as follows:
$$y_t^* = \alpha_0 + \sum_{i=1}^{p} \alpha_i y_{t-i}^* + \varepsilon_t + \sum_{i=1}^{q} \beta_i \varepsilon_{t-i}$$
$$y_t^* = \Delta^d y_t = (1 - L)^d y_t$$
$$\left(1 - \sum_{i=1}^{p} \alpha_i L^i\right)(1 - L)^d y_t = \alpha_0 + \left(1 + \sum_{i=1}^{q} \beta_i L^i\right)\varepsilon_t$$
For this study, the more chemical content parameters are included, the better the model fits, but at the cost of increased model complexity; model selection should therefore seek the best balance between model complexity and the model's ability to explain the data. According to the Bayesian information criterion, the smallest BIC identifies the optimal trade-off between goodness of fit and complexity [37]:
$$\mathrm{BIC} = n\ln(T) - 2\ln(M)$$
T: number of samples;
n: number of unknown parameters, n = p + q + 1;
M: maximized likelihood value of the model.
The maximum likelihood estimation process for the ARIMA(p,d,q) model is [38]:
$$Y_t = c + \Phi_1 Y_{t-1} + \Phi_2 Y_{t-2} + \cdots + \Phi_p Y_{t-p} + \varepsilon_t + \theta_1 \varepsilon_{t-1} + \cdots + \theta_q \varepsilon_{t-q}$$
$\Phi_1, \Phi_2, \ldots, \Phi_p$ represent the linear autocorrelation coefficients between $Y_{t-1}, \ldots, Y_{t-p}$ and $Y_t$. By introducing the intermediate terms, the direct relationship between $Y_{t-1}, \ldots, Y_{t-p}$ and $Y_t$ can be isolated, and this relationship is linear. $\Phi_1, \Phi_2, \ldots, \Phi_p$ measure the size of this influence, i.e., the PACF ($\theta_1, \ldots, \theta_q$ have the analogous meaning for the moving average terms). $\varepsilon_t$ is the disturbance term, with $\varepsilon_t \sim \mathrm{iid}\ N(0, \sigma^2)$. The vector of all parameters is
$$\Theta = \left(c, \Phi_1, \Phi_2, \ldots, \Phi_p, \theta_1, \theta_2, \ldots, \theta_q, \sigma^2\right)$$
The estimation of the likelihood function for the autoregressive process is conditioned on the initial value of  y , and the estimation of the likelihood function for the moving average process is conditioned on the initial value of  ε . Then ARIMA(p,d,q) is conditioned on d as the difference order and the initial values of  y  and  ε .
Assume the initial values $y_0 = \left(y_0, y_{-1}, \ldots, y_{-p+1}\right)$ and $\varepsilon_0 = \left(\varepsilon_0, \varepsilon_{-1}, \ldots, \varepsilon_{-q+1}\right)$ are known; then, from $y_1, y_2, \ldots, y_T$, the following equation can be iterated:
$$\varepsilon_t = y_t - c - \Phi_1 y_{t-1} - \Phi_2 y_{t-2} - \cdots - \Phi_p y_{t-p} - \theta_1 \varepsilon_{t-1} - \theta_2 \varepsilon_{t-2} - \cdots - \theta_q \varepsilon_{t-q}$$
The sequence $\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_T$ for $t = 1, 2, \ldots, T$ can thus be obtained, and the conditional log-likelihood function is:
$$L(\Theta) = \ln f\left(y_T, y_{T-1}, \ldots, y_1 \mid y_0, \varepsilon_0, \Theta\right) = -\frac{T}{2}\ln(2\pi) - \frac{T}{2}\ln \sigma^2 - \sum_{t=1}^{T} \frac{\varepsilon_t^2}{2\sigma^2}$$
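As an illustration of the ARIMA estimation and BIC-based order selection described above, a hedged sketch using the statsmodels library follows (the study itself used Matlab and SPSS); the series y is a synthetic placeholder standing in for one chemical-component sequence.

```python
import numpy as np
import pandas as pd
from statsmodels.tsa.arima.model import ARIMA

def fit_best_arima(y, max_p=3, max_q=3, d=1):
    """Grid-search ARIMA(p, d, q) orders and keep the fit with the lowest BIC,
    mirroring the order-selection criterion described in the text."""
    best_fit, best_bic = None, np.inf
    for p in range(max_p + 1):
        for q in range(max_q + 1):
            try:
                fit = ARIMA(y, order=(p, d, q)).fit()
            except Exception:
                continue  # skip orders that fail to estimate
            if fit.bic < best_bic:
                best_fit, best_bic = fit, fit.bic
    return best_fit

# Placeholder series (e.g., one component's content ordered by sampling point)
rng = np.random.default_rng(0)
y = pd.Series(70 + np.cumsum(rng.normal(0.0, 1.0, 60)))

model = fit_best_arima(y)
print(model.summary())              # coefficients, BIC, residual diagnostics
linear_pred = model.fittedvalues    # linear component passed on as x_t^r
residuals = model.resid             # residual series handed to the LSTM stage
```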

2.2.2. LSTM Model

The ARIMA(p,d,q) model can handle the linear part of the chemical composition content in the time series well, but it has certain limitations: the resulting residual series has nonlinear characteristics, and the way the content of some chemical components changes with the degree of weathering is a nonlinear process. This calls for a deep learning model to capture the nonlinear trend of chemical composition changes [39]. The LSTM (Long Short-Term Memory) model is a deep learning model that is very good at modeling nonlinear data. Its nonlinear gate units adjust the information flowing into and out of the memory cells at each time point, so as to better fit the trend of nonlinear data changing over time.
LSTM is a special type of recurrent neural network (RNN) that performs very well on long sequences of data, mainly alleviating the vanishing gradient, exploding gradient, and overfitting problems that arise when training on long sequences [40,41,42,43]. An RNN is an artificial neural network that operates on time-series data and can use back-propagation algorithms to learn and adapt to the relationship between inputs and outputs. In contrast to a standard RNN, an LSTM has an input gate, a forget gate, and an output gate that control the way information flows through the network. These gates allow the LSTM to store past information and update the current state appropriately, providing a significant advantage when dealing with long sequences of data. The basic structure is shown in Figure 1 [44,45].
The basic unit of the LSTM network contains a forget gate, an input gate, and an output gate. The forget gate determines which part of the state storage cell is forgotten by combining the input $x_i$ with the state storage cell $C_{i-1}$ and the intermediate output $h_{i-1}$, while the input gate transforms $x_i$ by means of the $\sigma$ and tanh functions. The associated intermediate output $h_i$ is determined by the updated $C_i$ and the output $B_i$ [32]. The calculation formulas are shown in (22) to (27):
$$f_i = \sigma\left(W_f \cdot \left[h_{i-1}, x_i\right] + b_f\right)$$
$$e_i = \sigma\left(W_e \cdot \left[h_{i-1}, x_i\right] + b_e\right)$$
$$\tilde{C}_i = \tanh\left(W_c \cdot \left[h_{i-1}, x_i\right] + b_c\right)$$
$$C_i = f_i \ast C_{i-1} + e_i \ast \tilde{C}_i$$
$$B_i = \sigma\left(W_B \cdot \left[h_{i-1}, x_i\right] + b_B\right)$$
$$h_i = B_i \ast \tanh\left(C_i\right)$$
$f_i$, $e_i$, $\tilde{C}_i$, $C_i$, and $B_i$ are the forget gate, input gate, new candidate vector, updated cell state, and output gate, respectively; $W_f$ and $b_f$ (and the analogously subscripted matrices and vectors) are the corresponding weight coefficient matrices and bias terms; tanh and $\sigma$ denote the hyperbolic tangent activation function and the sigmoid activation function [45]:
$$\tanh(x) = \frac{1 - \exp(-2x)}{1 + \exp(-2x)}$$
$$\sigma(x) = \frac{1}{1 + \exp(-x)}$$
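For illustration only, the following PyTorch sketch uses the gate structure of Equations (22)–(27), as implemented internally by torch.nn.LSTM, in a small regressor for the nonlinear residual series; the layer sizes, window length, and training data are arbitrary placeholders rather than the configuration used in the study.

```python
import torch
import torch.nn as nn

class ResidualLSTM(nn.Module):
    """Small LSTM regressor for the nonlinear residual series."""
    def __init__(self, input_size=1, hidden_size=32, num_layers=1):
        super().__init__()
        self.lstm = nn.LSTM(input_size, hidden_size, num_layers, batch_first=True)
        self.head = nn.Linear(hidden_size, 1)

    def forward(self, x):             # x: (batch, seq_len, 1)
        out, _ = self.lstm(x)         # gate equations applied at each step
        return self.head(out[:, -1])  # predict the next residual value

# One training step on a toy batch of residual windows (placeholder shapes)
model = ResidualLSTM()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()
x = torch.randn(16, 10, 1)            # 16 windows of 10 past residuals each
target = torch.randn(16, 1)           # next residual for each window
loss = loss_fn(model(x), target)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```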

2.2.3. ARIMA-LSTM Model

In order to deal with the linear and nonlinear trends in the time series data of chemical composition content before and after weathering of the cultural relics, the unique advantages of the ARIMA model in handling linear data and the excellent performance of LSTM in handling nonlinearity were combined [46,47]. First, the artifact chemical content data were processed, and the linear prediction results and residual series were obtained with the help of the ARIMA model. Then, the nonlinear factors of the residual series were further analyzed by the LSTM model, and the nonlinear prediction results were obtained. Finally, the linear and nonlinear prediction results were superimposed to obtain the final prediction of the chemical composition content. According to the decomposition principle of the time series model, it is assumed that the time series $Y = \{y_t, t = 1, 2, \ldots, N\}$ consists of linear and nonlinear components, $y_t = x_t + b(x_t)$. Therefore, the one-dimensional chemical component data are first linearly predicted by the ARIMA model to obtain the linear component $x_t^r$ and the residual series $\delta_t = y_t - x_t^r$. Then, the residual series is subjected to further nonlinear prediction to obtain the nonlinear component $b(x_t)^r$. Finally, the linear and nonlinear components are combined to obtain the final prediction $y_t^r = x_t^r + b(x_t)^r$. Root mean square error (RMSE) [48], mean absolute percentage error (MAPE), and R2 are used to evaluate the performance of the model [49,50,51,52].
R2 is usually taken as [0,1]; the closer R2 is to 1, the better the fit is, and the equations are as follows:
$$\mathrm{RMSE} = \sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(x_i - y_i\right)^2}$$
$$\mathrm{MAPE} = \sum_{i=1}^{N}\left|\frac{x_i - y_i}{x_i}\right| \times \frac{100}{N}$$
$$R^2 = \frac{\sum_{i=1}^{N}\left(y_i - \bar{y}\right)^2 - \sum_{i=1}^{N}\left(y_i - \hat{y}_i\right)^2}{\sum_{i=1}^{N}\left(y_i - \bar{y}\right)^2}$$
In the above equations, $x_i$ is the observed value, $y_i$ is the predicted value, N is the sample size, $\bar{y}$ is the mean of $y_i$, and $\hat{y}_i$ is the regression fit [32].
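A minimal sketch of the combination step and the three evaluation metrics follows, assuming linear_pred is the ARIMA linear component and lstm_resid_pred is the LSTM prediction of the residual series (both names are illustrative):

```python
import numpy as np

def combine_forecasts(linear_pred, lstm_resid_pred):
    """Hybrid forecast: linear ARIMA component plus predicted nonlinear residual."""
    return np.asarray(linear_pred) + np.asarray(lstm_resid_pred)

def rmse(obs, pred):
    obs, pred = np.asarray(obs), np.asarray(pred)
    return np.sqrt(np.mean((obs - pred) ** 2))

def mape(obs, pred):
    obs, pred = np.asarray(obs), np.asarray(pred)
    return np.mean(np.abs((obs - pred) / obs)) * 100.0

def r2(obs, pred):
    obs, pred = np.asarray(obs), np.asarray(pred)
    ss_res = np.sum((obs - pred) ** 2)
    ss_tot = np.sum((obs - obs.mean()) ** 2)
    return 1.0 - ss_res / ss_tot
```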

2.3. Multiple Linear Regression Model

Multiple linear regression (MLR) is a statistical method that predicts the distribution of the dependent variable by using multiple independent factors [15]. The goal of the MLR model is to establish linear links between independent and dependent characteristics that influence a given event, and it is an extension of classical least squares regression because it employs multiple explanatory factors.
$$y = \alpha_0 + \alpha_1 x_1 + \cdots + \alpha_i x_i + \cdots + \alpha_n x_n + \mu_i$$
where $y$ is the dependent variable, $x_1, \ldots, x_n$ are the independent variables, $\alpha_0$ is the $y$-intercept, $\alpha_i$ is the regression coefficient of the i-th independent variable, and $\mu_i$ is the model error, also known as the residual. The coefficient of determination (R2) and the mean squared error (MSE) can be used to assess the predictive performance of the MLR model [15]:
$$\mathrm{MSE} = \frac{\sum_{j=1}^{n}\left(y_j - \hat{y}_j\right)^2}{n}$$
$$R^2 = 1 - \frac{\sum_{j=1}^{n}\left(y_j - \hat{y}_j\right)^2}{\sum_{j=1}^{n}\left(y_j - \bar{y}_j\right)^2}$$
$y_j$ is the j-th parameter after normalization, $\hat{y}_j$ is the j-th predicted parameter, $\bar{y}_j$ is the mean of the predicted parameters, and n is the number of samples.
We performed the BP test (Breusch–Pagan test) and the White test on the perturbation term $\mu_i$ to check for heteroskedasticity. If the perturbation term is correlated with the independent variables, the regression coefficients of the model may be inaccurate, leading to large errors in the results.
In the BP test, it is assumed that the regression model is $y_i = \beta_1 + \beta_2 x_{i2} + \cdots + \beta_K x_{iK} + \varepsilon_i$, and the following null hypothesis is tested:
$$H_0: E\left(\varepsilon_i^2 \mid x_2, \ldots, x_k\right) = \sigma^2$$
If $H_0$ is not true, then the conditional variance $E\left(\varepsilon_i^2 \mid x_2, \ldots, x_k\right)$ is a function of $x_2, \ldots, x_k$ and is called the conditional variance function. The BP test assumes that the conditional variance function is linear:
$$\varepsilon_i^2 = \delta_1 + \delta_2 x_{i2} + \cdots + \delta_K x_{iK} + u_i$$
The null hypothesis can then be simplified to:
$$H_0: \delta_2 = \cdots = \delta_K = 0$$
If $H_0$ is true, it can be shown that $\varepsilon_i^2$ is uncorrelated with the independent variables $x_{iK}$, i.e., the perturbation term has no heteroskedasticity. Since the perturbation term $\varepsilon_i$ is not observable, the squared residual $e_i^2$ is used in an auxiliary regression on the explanatory variables:
$$e_i^2 = \delta_1 + \delta_2 x_{i2} + \cdots + \delta_K x_{iK} + \mathrm{error}_i$$
The $nR^2$ statistic is used:
$$nR^2 \xrightarrow{d} \chi^2(K-1)$$
where $R^2$ is the $R^2$ of the auxiliary regression. The difference between the White test and the BP test is that the auxiliary regression of the White test also contains the squared and cross terms of $x_{iK}$ in Equation (37), so the BP test can be regarded as a special case of the White test.
In addition, we tested the model for multicollinearity; the variance inflation factor (VIF) was used to eliminate influence factors with multicollinearity, which improved the accuracy of the model.
Assuming there are k independent variables, the variance inflation factor of the n-th variable is $\mathrm{VIF}_n = \frac{1}{1 - R_{1 \ldots k \setminus n}^2}$, where $R_{1 \ldots k \setminus n}^2$ is the goodness of fit obtained by regressing the n-th independent variable, as the dependent variable, on the remaining k − 1 independent variables; the larger $\mathrm{VIF}_n$ is, the greater the correlation between the n-th variable and the other variables [53]. If $\mathrm{VIF}_n$ is greater than 10, there is severe multicollinearity between the variables.
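These diagnostics are available off the shelf; the sketch below reproduces the BP test, the White test, and the VIF computation with statsmodels (an assumption about tooling, since the study used Stata), applied to a placeholder design matrix:

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.diagnostic import het_breuschpagan, het_white
from statsmodels.stats.outliers_influence import variance_inflation_factor

# Placeholder design: chemical-content predictors plus the dummy variable Suw
df = pd.DataFrame(np.random.rand(66, 4), columns=["K2O", "Al2O3", "PbO", "Suw"])
y = np.random.rand(66)

X = sm.add_constant(df)              # add the intercept column
fit = sm.OLS(y, X).fit()

# Breusch-Pagan and White tests: H0 = homoskedastic perturbation term
bp_stat, bp_pvalue, _, _ = het_breuschpagan(fit.resid, X)
white_stat, white_pvalue, _, _ = het_white(fit.resid, X)
print(f"BP p-value: {bp_pvalue:.3f}, White p-value: {white_pvalue:.3f}")

# Variance inflation factors; VIF > 10 flags severe multicollinearity
vif = pd.Series(
    [variance_inflation_factor(X.values, i) for i in range(X.shape[1])],
    index=X.columns,
)
print(vif)
```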

3. Material and Experiment

3.1. Data Pre-Processing

This study is based on the data related to ancient glassware provided by the official website of the 2022 China Student Mathematical Modeling Competition [14]. The glass sampling points are discussed separately for two systems: lead-barium glass and high-potassium glass. The data give the proportions of the chemical components at the sampling points of this batch of artifacts. Because the data are compositional, the percentages of all chemical components at a sampling point should sum to 100%; however, owing to limitations of the detection methods and the presence of various impurities, the actual sums often deviate from 100%. Thus, in this study, samples whose component sums fall between 85% and 105% were kept as valid data, and the severely weathered glass data were excluded to eliminate the influence of outliers on the model results. The results are shown in Table A1.
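A sketch of this validity filter is given below, assuming the raw table has been loaded into a pandas DataFrame with one column per oxide and a "Glass Sampling Points" label column as in Table A1 (flagging severe weathering via an "SWP" label prefix is an assumption about the labeling scheme):

```python
import pandas as pd

OXIDES = ["SiO2", "Na2O", "K2O", "CaO", "MgO", "Al2O3", "Fe2O3",
          "CuO", "PbO", "BaO", "P2O5", "SrO", "SnO2", "SO2"]

def filter_valid_samples(df: pd.DataFrame) -> pd.DataFrame:
    """Keep sampling points whose component sum lies in [85%, 105%]
    and drop severely weathered points (assumed to carry an 'SWP'
    prefix in the sampling-point label)."""
    total = df[OXIDES].sum(axis=1)
    valid = df[(total >= 85.0) & (total <= 105.0)]
    return valid[~valid["Glass Sampling Points"].str.contains("SWP")]
```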

3.2. Experimental Procedure

This paper combines three improved algorithms in a joint model: Daen-LR, ARIMA-LSTM, and MLR. Matlab 2021b, SPSS, and Stata were used as the experimental software environment to analyze and identify the ancient glasses.
First, in this paper, the obtained pre-processed data set is used to find the relationship between the chemical composition content and weathering at its sampling points after glass classification by building an improved logistic regression model based on a double adaptive elastic net. Then, by using the ARIMA-LSTM model, we predict the content of chemical components contained in the two glasses before weathering and obtain the correlation curves of chemical components before and after weathering. Finally, based on the results obtained above, this paper uses a multiple linear regression model to predict the type of unknown glass and judges the accuracy and efficiency of the model by testing whether there is heteroskedasticity in the perturbation terms, multicollinearity, and the degree of fit between the experimental and actual values of the model. The flow chart is shown in Figure 2.

4. Process and Result

4.1. Relationship between Glass Weathering and Its Chemical Composition Based on an Improved Logistic Regression Model with Double Adaptive Elastic Net

Based on the data in Table A1, we use Matlab and SPSS to conduct modeling and calculation of the Daen-Logistic Regression model. The dependent variable here is a dichotomous variable (i.e., weathered and unweathered states), and the content of various chemical components is set as the independent variable. The double adaptive elastic net model can determine the classification results of weathered or unweathered glass under different conditions for each independent variable, which can avoid the variability of the results when the independent variables are selected differently, make the classification more adaptive, and reduce the influence of strongly correlated variables on other variables. The calculation gives the following in Table 1:
From the table for the high-potassium glass type, it can be seen that the values of two chemical components, SiO2 and K2O, are relatively large and their significance p-values are less than p = 0.1, so these two chemical components have the greatest influence on whether the surface of high-potassium glass is weathered or not. From the table for the lead-barium glass type, it can be seen that the values of three chemical components, SiO2, PbO, and P2O5, are relatively large and their significance p-values are less than p = 0.1, so these three chemical components have the greatest influence on whether the surface of lead-barium glass is weathered or not.

4.2. Prediction of the Chemical Content of Glass before Weathering Based on the ARIMA-LSTM Model

By solving Section 4.1, we obtained the relationship between glass weathering and chemical composition, and using this relationship we screened the 14 chemical components for each of the two types, high potassium and lead-barium. For high-potassium glasses, we chose to retain the SiO2 and K2O components. For lead-barium glasses, we chose to retain three components: SiO2, PbO, and P2O5; all of them have relatively complete data and strong correlations suitable for modeling analysis.
In order to help the model identify the patterns in the data, outliers with large deviations were first eliminated. SPSS 24 software was used to detect three abnormal data values of the additive or transient-change type; the existence of such outliers would produce spurious model results and lead to wrong conclusions. Taking the SiO2 content of high-potassium glass and lead-barium glass as examples, the outliers of both are shown in Table 2.
Through analysis and calculation in Matlab and SPSS, we tested and fitted the data values of all the chemical composition contents changing with the time series and established the ARIMA-LSTM prediction curve model. We found that the ARIMA(2,1,0)-LSTM parameters give the maximum likelihood value of the model, and the normalized BIC [54] values of 3.160 and 4.160 for the SiO2 content in high-potassium glass and lead-barium glass, for example, are the smallest among the candidate parameters. In addition, the stationary R2 values of the model are 0.960 and 0.934, both close to 1, and both p-values are 0.000, less than 0.05, so the results of the model can be considered significant and reasonable and to fit the prediction model well (Table 3).
After the initial completion of the estimated time series model based on the chemical composition content, a white noise test of the residuals is required. If the residuals are white noise, then it can indicate that the selected model can identify the laws of the time series data, that is, the model is acceptable; if the residuals are not white noise, then it means that there is still some information not identified; at this time, the model parameters need to be revised to continue to identify this part of the information. The study used Ljung and Box’s Q test to determine whether the residuals are white noise [55,56]:
Assuming the residual $\epsilon_t$ is a white-noise sequence, then $\rho_s = \begin{cases} 1, & s = 0 \\ 0, & s \neq 0 \end{cases}$; the sample autocorrelation coefficient is:
$$r_s = \hat{\rho}_s = \frac{\sum_{t=s+1}^{T}\left(x_t - \bar{x}\right)\left(x_{t-s} - \bar{x}\right)}{\sum_{t=1}^{T}\left(x_t - \bar{x}\right)^2}$$
The hypotheses are $H_0: \rho_1 = \rho_2 = \cdots = \rho_s = 0$ versus $H_1$: at least one $\rho_i\ (i = 1, 2, \ldots, s)$ is not 0. If $H_0$ holds, the statistic $Q = T(T+2)\sum_{k=1}^{s}\frac{r_k^2}{T - k} \sim \chi^2(s - n)$, from which the p-value can be calculated; if the p-value is less than 0.05, the null hypothesis is rejected, indicating that the model has not been fully identified and the model parameters need to be modified.
Through the model statistics, the p-values of the Ljung and Box’s Q test for SiO2 content of high potassium glass and lead-barium glass are 0.889 and 0.744, respectively, both of which are greater than 0.05, i.e., we cannot reject the original hypothesis, and we can assume that the residuals are white noise sequences and the model can be fully identified. Figure 3 shows that the autocorrelation coefficients and partial autocorrelation coefficients of all lag orders are not significantly different from 0 [57,58].
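The same residual white-noise check can be reproduced with the Ljung-Box test in statsmodels, as in the hedged sketch below (residuals stands in for the fitted model's residual series; the lag choices are illustrative):

```python
import numpy as np
from statsmodels.stats.diagnostic import acorr_ljungbox

residuals = np.random.normal(size=60)   # placeholder for the fitted model's residuals

# Ljung-Box Q test; H0: the residuals are a white-noise sequence.
# p-values above 0.05 (0.889 and 0.744 were reported for SiO2) mean H0 is retained.
lb = acorr_ljungbox(residuals, lags=[6, 12], return_df=True)
print(lb[["lb_stat", "lb_pvalue"]])
```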
By the same method, the fitting coefficients of all the mathematical models for the measured chemical composition contents were obtained. In the high-potassium glass category, the R2 values of SiO2 and K2O were 0.960 and 0.969, respectively. In the lead-barium glass category, the R2 values of SiO2, P2O5, and PbO were 0.934, 0.951, and 0.948, respectively. Finally, the corresponding prediction model curves were drawn, from which the correlation of chemical composition content before and after weathering can be clearly seen, as shown in Figure 4. The blue curve represents the actual value of the chemical content changing with time after weathering, while the yellow curve represents the fitted value; the agreement between the two reflects the quality of the model's performance. The red curve represents the predicted value of the component content over time before weathering. It can be seen that the ARIMA(2,1,0)-LSTM model captures the correlation of chemical composition contents before and after weathering, reduces the interference of the "weathering" factor in glass classification, and improves the accuracy of the subsequent classification.

4.3. Identifying Unknown Artifact Types Based on Multiple Linear Regression Model

Based on the results of Section 4.1 and Section 4.2, we conducted chemical content testing and analysis on a batch of newly excavated glass artifacts, as shown in Table 4, and judged the categories to which they belong from the correlation of the related elements and weathering effects. Firstly, the chemical elements with significant correlation in each category were screened out according to the statistical regularities of chemical element content, and a multiple linear regression equation between elements and categories was established to obtain the experimental (fitted) category values, compare them with the actual category values in terms of error and fit, and verify the accuracy of the model.
According to the classification rules of chemical content and surface weathering, it can be initially concluded that K2O, CaO, MgO, Al2O3, FeO, PbO, BaO, and P2O5 have strong correlations with surface weathering and category, while the remaining elements have weak correlations, so the remaining elements can be deleted. In addition, because the chemical element contents of the three heavily weathered glass artifacts differ greatly from the others, which would have a large impact on the analysis of the model, they are treated as outliers. In the multiple linear regression equation, qualitative data should be coded as dummy variables, so the qualitative levels (unweathered and weathered) of the surface weathering independent variable $S_{uw}$ can be set as quantitative values (0 and 1), and the qualitative levels (high-potassium and lead-barium) of the category dependent variable $y_i$ can be set as quantitative values (A and B); the following multiple linear regression equation can then be established:
$$y_i = \alpha_0 + \alpha_1 x_{\mathrm{Si},i} + \alpha_2 x_{\mathrm{K},i} + \alpha_3 x_{\mathrm{Ca},i} + \alpha_4 x_{\mathrm{Mg},i} + \alpha_5 x_{\mathrm{Al},i} + \alpha_6 x_{\mathrm{Fe},i} + \alpha_7 x_{\mathrm{Pb},i} + \alpha_8 x_{\mathrm{Ba},i} + \alpha_9 x_{\mathrm{P},i} + \beta S_{uw,i} + \mu_i$$
where $S_{uw,i} = 1$ denotes that the i-th sample is weathered and $S_{uw,i} = 0$ denotes that the i-th sample is unweathered, so that
$$E\left(y \mid S_{uw} = 1\ \text{and other independent variables}\right) = \beta \times 1 + m\ (\text{constants})$$
$$E\left(y \mid S_{uw} = 0\ \text{and other independent variables}\right) = \beta \times 0 + m\ (\text{constants})$$
The joint significance test indicators for the F-statistic [59,60] for the above model results are as follows:
F(10,55) denotes the joint F-statistic, whose test value is 51.41 at the 95% confidence level; the null hypothesis $H_0$ is $\alpha_1 = \alpha_2 = \alpha_3 = \cdots = \alpha_9 = \beta = 0$. From Table 5, we can see that the p-value is 0, which is less than 0.05, so the null hypothesis is rejected. We therefore have reason to believe that the coefficients are significantly different from 0, and the model can be considered useful. The regression coefficients and corresponding p-values for the variables of interest are given in Table 6. Only when the p-value is less than 0.05 do we consider a coefficient significant and credible, so we use the regression coefficients corresponding to K2O, Al2O3, PbO, BaO, and Suw (the dummy variable "weathering"); the larger the absolute value of a regression coefficient, the greater its effect on the dependent variable.
We can derive the multiple linear regression equation for glass artifact class, chemical element content, and weathering type as follows:
$$\hat{y}_i = 0.7458 + 0.0480\,x_{\mathrm{K}} - 0.0299\,x_{\mathrm{Al}} - 0.0165\,x_{\mathrm{Pb}} - 0.0161\,x_{\mathrm{Ba}} + 0.3517\,S_{uw}$$
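Applied to a new sampling point, the fitted equation yields a score that can be thresholded, here at 0.5 for illustration, to assign a category (a sketch; the composition values in the example are placeholders):

```python
def classify_glass(k2o, al2o3, pbo, bao, weathered):
    """Evaluate the fitted regression equation; scores near 1 indicate
    high-potassium glass and scores near 0 indicate lead-barium glass."""
    suw = 1.0 if weathered else 0.0
    score = (0.7458 + 0.0480 * k2o - 0.0299 * al2o3
             - 0.0165 * pbo - 0.0161 * bao + 0.3517 * suw)
    return ("high-potassium" if score >= 0.5 else "lead-barium"), score

# Placeholder composition (wt%) for an unweathered fragment
print(classify_glass(k2o=9.9, al2o3=3.9, pbo=0.0, bao=0.0, weathered=False))
```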

4.3.1. Testing for the Presence of Heteroskedasticity in the Perturbation Term

The perturbation term $\mu_i$ is unobservable and must satisfy certain conditions. Our model assumes a spherical perturbation term, which generally has to satisfy "no autocorrelation" and "homoskedasticity": if the perturbation term is correlated with the independent variables, i.e., endogenous, the estimated regression coefficients will be inaccurate; if there is heteroskedasticity, the hypothesis test statistics we constructed become invalid, and the OLS estimator can no longer be treated as the best linear unbiased estimator [61]. Therefore, we performed the BP test and the White test on the perturbation term to check for heteroskedasticity, as shown in Table 7 [62,63,64].
Both tests were carried out with the null hypothesis $H_0$ that there is no heteroskedasticity in the perturbation term. The p-values are greater than 0.05, so $H_0$ cannot be rejected, and we can assume that there is no heteroskedasticity in the perturbation term.

4.3.2. Testing for Multicollinearity

If the data matrix X is not of full column rank, i.e., one variable can be expressed as a linear combination of the other explanatory variables, then there is "strict multicollinearity". Stata was used to calculate the VIF of each variable, and the test results are shown in Table 8.
It is generally believed that when VIF > 10 the regression equation has severe multicollinearity. SiO2 and PbO both exceed 10, but the p-value of SiO2 is higher than 0.05, i.e., not significant, so its coefficient is not included in the equation model. For PbO, although its VIF exceeds 10, its p-value is lower than 0.05, so the coefficient is still significant despite the variance inflation; without multicollinearity, the regression coefficients would be even more significant.

4.3.3. Testing the Fit of the Experimental and Actual Values of the Model and Identifying the Unknown Artifact Types

Since the dependent variable is the category of the glass artifact and there are only two categories, high-potassium and lead-barium, it can be treated as a 1-0 variable. If the experimental value is close to 1, the sample is considered high-potassium; if the experimental value is close to 0, it is considered lead-barium. From Figure 5 and Table 9, it can be seen that the 66 samples fit very well, with almost no spurious results, and the identification results for the 8 unknown cultural relics are completely consistent with reality, so the predicted values of this multiple linear regression equation can be considered quite accurate.

4.4. Comparison of Different Models

To demonstrate the superiority of the proposed method, we compare the proposed joint algorithm with similar decision and classification algorithms such as Decision Trees (DT), Random Forests (RF) [65], Support Vector Machines (SVM), and Random Forests based on classification and regression trees (CART-RF) [66,67,68,69]. We did not use any pre-trained models but trained each model from scratch. When selecting the parameters of the traditional machine learning algorithms, we took into account the number of data features and avoided overfitting, as shown in Table 10. We then performed experimental simulations of these comparison models, as well as the model proposed in this paper, using Matlab. The results are presented in Table 11. For the classification results, this study uses common evaluation indicators to judge the superiority of the model: Train Acc, Test Acc, Precision, Recall, and F1 Score. TP, TN, FP, and FN are required to define these indicators, so the confusion matrix is introduced, as shown in Figure 6. The specific definitions are as follows:
  • TP (True Positive): The true value of the data is high potassium, and the predicted value is also high potassium.
  • TN (True Negative): The true value of the data is lead barium, and the predicted value is also lead barium.
  • FP (False Positive): The true value of the data is lead barium, but it is incorrectly predicted as high potassium.
  • FN (False Negative): The true value of the data is high potassium, but it is incorrectly predicted as lead barium.
Accuracy is the simplest and most clear index for evaluating classification models, but it is a good measurement standard only when the proportion of samples in each category of the data set is fairly balanced, as shown in Equation (41):
$$\mathrm{Accuracy} = \frac{TP + TN}{TP + FP + TN + FN}$$
Precision represents the proportion of predicted positive samples that are actually positive, as shown in Equation (42):
$$\mathrm{Precision} = \frac{TP}{TP + FP}$$
Recall represents the proportion of actual positive samples that are correctly predicted as positive, as shown in Equation (43):
$$\mathrm{Recall} = \frac{TP}{TP + FN}$$
The F1 Score is the harmonic mean of precision and recall, synthesizing both. The value of the F1 Score reflects the robustness of the model: the higher F1 is, the more stable the model can be considered. As shown in Equation (44):
$$F1 = \frac{2 \times \mathrm{Precision} \times \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}$$
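Computed from the confusion-matrix counts, these indicators reduce to a few lines, as in the sketch below (the counts in the example are placeholders, not the study's results):

```python
def classification_metrics(tp, tn, fp, fn):
    """Accuracy, precision, recall, and F1 from confusion-matrix counts
    (Equations (41)-(44))."""
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1

# Placeholder counts for a binary high-potassium vs. lead-barium test set
print(classification_metrics(tp=17, tn=45, fp=1, fn=1))
```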

5. Discussion

From the results of the comparison experiments, it is clear that the joint algorithm proposed in this paper shows notable advantages in all performance indicators. As shown in Table 11, for Train Acc, Test Acc, Precision, Recall, and F1 Score, the margins by which JMLA outperforms the previously best algorithm model are +0.017, +0.025, +0.046, +0.035, and +0.040, respectively. We improve common machine learning algorithms and combine them with a deep learning model to make the classification results more accurate, which provides a new idea for the study of the classification of ancient cultural relics.
Considering the accuracy of the JMLA algorithm in classification results and excellent evaluation indexes, this study believes that the model proposed in this paper is suitable for providing more in-depth research ideas for the classification of ancient cultural relics. The algorithm ideas in this paper can also be applied to other related fields, such as the data analysis of nutrient elements in food, the influence of air oxidation degree on nutrient elements, the classification of water pollution degree, etc. However, there is still room for improvement in the joint algorithm to address its high computational complexity and formula complexity. Compared with the existing algorithm, the calculation cost of JMLA is higher, and the formula is more complex. Further reducing algorithm complexity and better unifying the above three algorithms will be the focus of future research.

6. Conclusions

In this paper, we propose a joint Daen-LR, ARIMA-LSTM, and MLR machine learning algorithm (JMLA). Firstly, we combine a double adaptive elastic net with a traditional logistic model to select variables that have both the oracle property and the adaptive classification property. These two characteristics eliminate the inconsistency in selecting important independent variables and the interference of strongly correlated independent variables with weakly correlated ones. Secondly, we combine the deep learning model (LSTM) with the ARIMA time series model so that it can handle both linear and nonlinear trends. By computing the ARIMA-LSTM model, we establish the correlation curve of chemical composition before and after weathering and predict the change in chemical composition with weathering. Thirdly, we feed the data processed by the above two improved methods into the multiple linear regression model to classify the unknown glass relics.
The experimental results show that the accuracy of the JMLA model on the train set is 97.9%, and the accuracy of the JMLA model on the test set is 97.6%. In addition, we compared JMLA with similar classifiers, and the results were shown in Train Acc, Test Acc, Precision, Recall, and F1 Score indexes. The difference values of JMLA’s performance over the past optimal algorithm model are +0.017, +0.025, +0.046, +0.035, and +0.040, respectively. These data show that the JMLA model has better performance than other classification models without changing the structure of similar classification models and under the same experimental conditions. The classification accuracy of the JMLA model is higher than other models, especially for large glass relics with more chemical elements and a harsh environment.
This processing method is practical and reliable in the direction related to the composition analysis and identification of cultural heritage. The application of this method is expected to improve the accuracy of the classification of cultural relics by archaeologists and can effectively reduce the impact of identification difficulties caused by factors such as harsh burial environments. It helps us to have a deeper understanding of the exchange, penetration, and development of ancient Eastern and Western cultures.
In addition, the future research directions of this study can be summarized as follows:
  • Algorithm optimization. The processing method uses a variety of machine learning algorithms that effectively combine the advantages of each algorithm with high practicality and feasibility and a good fitting effect. However, this model is only combined with an LSTM deep learning neural network, which can be combined with more advanced deep learning models in the future so as to improve the accuracy and efficiency of classification.
  • Reduce model calculation costs and formula complexity. Although the classification accuracy of the JMLA model is very high, the calculation time is relatively long compared with other models, and the formula is relatively complex, which is also a pain point for the JMLA model. Therefore, reducing the calculation amount and better integrating the three models will be the focus of future research.
  • Application prospects of this data processing method. Beyond its application to heritage composition analysis and identification, it is expected to be applied in the areas of health, food safety, and environmental protection, for example: analysis and classification of the chemical constituents of tobacco; analysis of the nutritional composition of food; classification and monitoring of pollutant composition in air, etc.

Author Contributions

Conceptualization, Z.-X.L. and P.-S.L.; data curation, P.-S.L.; formal analysis, Z.-X.L.; investigation, J.-H.L.; methodology, Z.-X.L.; project administration, Z.-H.Y., Y.-P.M. and H.-H.W.; resources, G.-Y.W.; software, P.-S.L.; supervision, G.-Y.W. and H.-H.W.; validation, J.-H.L., Z.-H.Y. and Y.-P.M.; visualization, J.-H.L.; writing—original draft, Z.-X.L. and P.-S.L.; writing—review and editing, G.-Y.W., J.-H.L. and Z.-H.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (62203332/22008050) and the Natural Science Foundation of Hebei Province (B2022202008).

Institutional Review Board Statement

This study did not require ethical approval.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are provided in the 2022 China Undergraduate Mathematical Contest in Modeling. The authors do not have permission to share the data.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. The results after glass data pre-processing.
Glass Sampling Points | SiO2 | Na2O | K2O | CaO | MgO | Al2O3 | Fe2O3 | CuO | PbO | BaO | P2O5 | SrO | SnO2 | SO2
HPNP0169.3309.996.320.873.931.743.87001.17000.39
HPNP03(1)87.0505.192.0104.0600.780.2500.66000
HPNP03(2)61.71012.375.871.115.52.165.091.412.860.70.100
HPNP0465.8809.677.121.566.442.062.18000.79000.36
HPNP0561.58010.957.351.777.52.623.27000.940.0600.47
HPNP06(1)67.6507.3701.9811.152.392.510.21.384.180.1100
HPNP06(2)59.8107.685.411.7310.056.042.180.350.974.50.1200
HPWP0792.63001.0701.980.173.24000.61000
HPWP0995.0200.590.6201.320.321.55000.35000
HPWP1096.7700.920.2100.810.260.84000000
HPWP1294.2901.010.7201.460.291.65000.15000
HPNP1359.012.8612.538.706.162.884.73001.27000
HPNP1462.473.3812.288.230.669.230.50.471.6200.16000
HPNP1561.873.217.4401.023.151.041.290.1900.26000
HPNP1665.182.114.528.270.526.180.421.070.11000.0400
HPNP1760.712.125.7100.8501.041.090.1900.18000
HPNP1879.4609.4201.533.0500001.360.072.360
HPNP2176.68004.711.226.192.373.2811.971.1000
HPWP2792.72000.940.542.510.21.54000.36000
HPWP2292.3500.741.660.643.50.350.55000.21000
LBNP2037.3600.71005.451.514.789.323.555.75000
LBNP2353.797.9200.50.711.4202.9916.9811.8600.3300
LBNP2431.94000.4701.5908.4629.1426.230.140.9100
LBNP2550.612.3100.6301.91.551.1231.96.650.190.200
LBWP2619.79001.4400.7010.5729.5332.253.130.4501.96
LBWP0820.14001.4801.34010.4128.6831.233.590.3702.58
LBWP1929.64002.930.593.571.333.5142.825.358.830.1900
LBWP1133.5900.213.510.712.6904.9325.3914.619.380.3700
LBWP0236.2801.052.341.185.731.860.2647.4303.570.1900
LBNP2868.0800.261.3414.70.410.3317.144.041.040.120.230
LBNP2963.30.920.32.981.4914.340.810.7412.312.030.410.2500
LBNP30(1)34.3401.414.490.984.352.12039.2210.2900.350.40
LBNP30(2)36.93004.240.513.862.74037.7410.351.410.480.440
LBNP3165.91001.60.893.114.590.4416.553.421.620.300
LBNP3269.7100.210.4602.3610.1119.764.880.17000
LBNP3375.5100.150.6412.3500.4716.163.550.13000
LBWP3435.7800.250.7801.620.471.5146.55100.340.2200
LBNP3565.91000.3801.440.170.1622.055.680.42000
LBWP3639.572.220.140.3701.60.320.6841.6110.830.070.2200
LBNP3760.1200.230.8902.7203.0117.2410.341.460.3103.66
LBWP3832.931.3800.6802.570.290.7349.319.790.480.4100
LBWP3926.25001.1100.500.8861.037.221.160.6100
LBWP4016.71001.8700.450.19070.216.691.770.6800
LBWP4118.4600.444.962.733.331.790.1944.129.767.460.4700
LBNP42(1)51.265.740.150.791.093.5302.6721.8810.470.080.3500
LBNP42(2)51.335.680.3501.165.6602.7220.1210.880000
LBWP43(1)12.41005.240.892.250.765.3559.857.2900.6400
LBWP43(2)21.7006.40.953.411.391.5144.753.2612.830.4700
LBNP4460.743.060.22.14012.690.770.4313.615.2200.2600
LBNP4561.282.660.110.840.74500.5315.9910.9600.2300
LBNP4655.2100.2501.674.7900.7725.2510.060.20.4300
LBNP4751.544.660.290.870.613.0600.6525.49.230.10.8500
LBWP4853.330.80.322.821.5413.651.03015.717.311.10.251.310
LBNP4928.79004.581.475.382.740.734.186.111.10.4600
LBWP4954.6100.32.081.26.51.270.4523.024.194.320.300
LBWP5017.98003.190.471.870.331.134414.26.340.6600
LBNP5045.02003.120.544.1600.730.616.226.340.2300
LBWP51(1)24.61003.581.195.251.191.3740.248.948.10.390.470
LBWP51(2)21.35005.131.452.510.420.7551.3408.75000
LBWP5225.741.2202.270.551.160.230.747.428.645.710.4400
LBNP5363.663.040.110.781.146.0600.5413.668.9900.2700
LBWP5422.2800.323.191.284.1500.8355.467.044.240.8800
LBNP5549.012.7101.1301.4500.8632.927.950.35000
LBWP5629.15001.2101.8500.7941.2515.452.54000
LBWP5725.42001.3102.1801.1645.117.30000
LBWP5830.3900.343.490.793.520.863.1339.357.668.990.2400
Note: HPWP01(1) denotes part 1 of weathered sampling point 01 of high-potassium glass, HPNP01(1) denotes part 1 of unweathered sampling point 01 of high-potassium glass, LBWP01(1) denotes part 1 of weathered sampling point 01 of lead-barium glass, LBNP01(1) denotes part 1 of unweathered sampling point 01 of lead-barium glass, and LBSWP01(1) denotes part 1 of severely weathered sampling point 01 of lead-barium glass.

References

  1. Bzdok, D.; Altman, N.; Krzywinski, M. Points of Significance Statistics versus machine learning. Nat. Methods 2018, 15, 232–233. [Google Scholar] [CrossRef]
  2. Guo, Y.; Zhan, W.; Li, W. Application of Support Vector Machine Algorithm Incorporating Slime Mould Algorithm Strategy in Ancient Glass Classification. Appl. Sci. 2023, 13, 3718. [Google Scholar] [CrossRef]
  3. Li, F.; Li, Q.; Gan, F.; Zhang, B.; Cheng, H. Chemical Composition Analysis for Some Ancient Chinese Glasses by Proton Induced X-ray Emission Technique. J. Chin. Ceram. Soc. 2005, 33, 581–586. [Google Scholar]
  4. Chul, L.; Myungzoon, C.; Seungwon, K.; Kang, H.T.; Du, L.J. Classification of Korean Ancient Glass Pieces by Pattern Recognition Method. J. Korean Chem. Soc. 1992, 36, 113–124. [Google Scholar]
  5. El-Taher, A. Elemental content of feldspar from Eastern Desert, Egypt, determined by INAA and XRF. Appl. Radiat. Isot. 2010, 68, 1185–1188. [Google Scholar] [CrossRef]
  6. Won-in, K.; Thongkam, Y.; Pongkrapan, S.; Intarasiri, S.; Thongleurm, C.; Kamwanna, T.; Leelawathanasuk, T.; Dararutana, P. Raman spectroscopic study on archaeological glasses in Thailand: Ancient Thai Glass. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2011, 83, 231–235. [Google Scholar] [CrossRef] [PubMed]
  7. Schibille, N.; Gratuze, B.; Ollivier, E.; Blondeau, E. Chronology of early Islamic glass compositions from Egypt. J. Archaeol. Sci. 2019, 104, 10–18. [Google Scholar] [CrossRef]
  8. Fiorucci, M.; Khoroshiltseva, M.; Pontil, M.; Traviglia, A.; Del Bue, A.; James, S. Machine Learning for Cultural Heritage: A Survey. Pattern Recognit. Lett. 2020, 133, 102–108. [Google Scholar] [CrossRef]
  9. Wei, J.; Chu, X.; Sun, X.Y.; Xu, K.; Deng, H.X.; Chen, J.G.; Wei, Z.M.; Lei, M. Machine learning in materials science. Infomat 2019, 1, 338–358. [Google Scholar] [CrossRef]
  10. Zhou, Y.; Hu, Y.; Tao, Y.; Sun, J.; Cui, Y.; Wang, K.; Hu, D. Study on the microstructure of the multilayer glaze of the 16th–17th century export blue-and-white porcelain excavated from Nan’ao-I Shipwreck. Ceram. Int. 2016, 42, 17456–17465. [Google Scholar] [CrossRef]
  11. Han, M.S. Characteristic Analysis of Chemical Compositions for Ancient Glasses Excavated from the Sarira Hole of Mireuksaji Stone Pagoda, Iksan. J. Conserv. Sci. 2017, 33, 215–223. [Google Scholar] [CrossRef]
  12. Lin, Y.; Liu, T.; Toumazou, M.K.; Counts, D.B.; Kakoulli, I. Chemical analyses and production technology of archaeological glass from Athienou-Malloura, Cyprus. J. Archaeol. Sci. Rep. 2019, 23, 700–713. [Google Scholar] [CrossRef]
  13. Oikonomou, A.; Triantafyllidis, P. An archaeometric study of Archaic glass from Rhodes, Greece: Technological and provenance issues. J. Archaeol. Sci. Rep. 2018, 22, 493–505. [Google Scholar] [CrossRef]
  14. The Official Website of 2022 China Undergraduate Mathematical Contest in Modeling. Available online: http://www.mcm.edu.cn/html_cn/node/5267fe3e6a512bec793d71f2b2061497.html (accessed on 14 May 2023).
  15. Kouadri, S.; Pande, C.B.; Panneerselvam, B.; Moharir, K.N.; Elbeltagi, A. Prediction of irrigation groundwater quality parameters using ANN, LSTM, and MLR models. Environ. Sci. Pollut. Res. 2022, 29, 21067–21097. [Google Scholar] [CrossRef]
  16. Gomah, M.E.; Li, G.; Khan, N.M.; Sun, C.; Xu, J.; Omar, A.A.; Mousa, B.G.; Abdelhamid, M.M.A.; Zaki, M.M. Prediction of Strength Parameters of Thermally Treated Egyptian Granodiorite Using Multivariate Statistics and Machine Learning Techniques. Mathematics 2022, 10, 4523. [Google Scholar] [CrossRef]
  17. Leonardi, B.; Ajjarapu, V. Development of multilinear regression models for online voltage stability margin estimation. IEEE Trans. Power Syst. 2010, 26, 374–383. [Google Scholar] [CrossRef]
  18. Tihonov, A.N. Solution of incorrectly formulated problems and the regularization method. Sov. Math. Dokl. 1963, 5, 1035–1038. [Google Scholar]
  19. Zou, H.; Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B 2005, 67, 301–320. [Google Scholar]
  20. Ghosh, S. On the grouped selection and model complexity of the adaptive elastic net. Stat. Comput. 2011, 21, 451–462. [Google Scholar] [CrossRef]
  21. Li, J.T.; Jia, Y.M.; Zhao, Z.H. Partly adaptive elastic net and its application to microarray classification. Neural Comput. Appl. 2013, 22, 1193–1200. [Google Scholar] [CrossRef]
  22. Algamal, Z.Y.; Lee, M.H. Applying penalized binary logistic regression with correlation based elastic net for variables selection. J. Mod. Appl. Stat. Methods 2015, 14, 15. [Google Scholar] [CrossRef]
  23. Zou, H. The Adaptive Lasso and Its Oracle Properties. J. Am. Stat. Assoc. 2006, 101, 1418–1429. [Google Scholar]
  24. Van, D.; Lien, T.G.; Verlaat, W.; Wieringen, W.V.; Wilting, S.M. Better prediction by use of co-data: Adaptive group-regularized ridge regression. Stat. Med. 2016, 35, 368–381. [Google Scholar]
  25. Zhang, F. Combination Model of Enterprise Credit Evaluation Based on XGBoost and Logistic Regression and Its Application. Master’s Thesis, Hebei University of Engineering, Handan, China, 2021. (In Chinese) [Google Scholar] [CrossRef]
  26. Jiang, S.; Dai, J. An Improved Elastic Net Estimate for Logistic Regression Models. Math. Theory Appl. 2022, 42, 108–119. (In Chinese) [Google Scholar]
  27. Anbari, M.E.; Mkhadri, A. Penalized regression combining the L 1 norm and a correlation based penalty. Sankhya B 2014, 76, 82–102. [Google Scholar] [CrossRef]
  28. Wang, Q.; Li, S.; Li, R.; Ma, M. Forecasting US shale gas monthly production using a hybrid ARIMA and metabolic nonlinear grey model. Energy 2018, 160, 378–387. [Google Scholar] [CrossRef]
  29. Singh, S.; Mohapatra, A. Repeated wavelet transform based ARIMA model for very short-term wind speed forecasting. Renew. Energy 2019, 136, 758–768. [Google Scholar]
  30. Wang, C.-C.; Chien, C.-H.; Trappey, A.J. On the application of ARIMA and LSTM to predict order demand based on short lead time and on-time delivery requirements. Processes 2021, 9, 1157. [Google Scholar] [CrossRef]
  31. Fan, D.; Sun, H.; Yao, J.; Zhang, K.; Yan, X.; Sun, Z. Well production forecasting based on ARIMA-LSTM model considering manual operations. Energy 2021, 220, 119708. [Google Scholar] [CrossRef]
  32. Li, C.; Fang, X.; Yan, Z.; Huang, Y.; Liang, M. Research on Gas Concentration Prediction Based on the ARIMA-LSTM Combination Model. Processes 2023, 11, 174. [Google Scholar] [CrossRef]
  33. Jiang, S.; Yang, C.; Guo, J.; Ding, Z. ARIMA forecasting of China’s coal consumption, price and investment by 2030. Energy Sources Part B Econ. Plan. Policy 2018, 13, 190–195. [Google Scholar] [CrossRef]
  34. Ediger, V.Ş.; Akar, S. ARIMA forecasting of primary energy demand by fuel in Turkey. Energy Policy 2007, 35, 1701–1708. [Google Scholar] [CrossRef]
  35. Dey, B.; Roy, B.; Datta, S.; Ustun, T.S. Forecasting ethanol demand in India to meet future blending targets: A comparison of ARIMA and various regression models. Energy Rep. 2023, 9, 411–418. [Google Scholar] [CrossRef]
  36. De Gooijer, J.G. Partial sums of lagged cross-products of AR residuals and a test for white noise. Test 2008, 17, 567–584. [Google Scholar] [CrossRef]
  37. Wang, Y.; Liu, Q. Comparison of Akaike information criterion (AIC) and Bayesian information criterion (BIC) in selection of stock–recruitment relationships. Fish. Res. 2006, 77, 220–225. [Google Scholar] [CrossRef]
  38. Man, K. Maximum likelihood estimation for a nearly random walk model. Commun. Stat. Theory Methods 2000, 29, 677–697. [Google Scholar] [CrossRef]
  39. Qureshi, S.A.; Hsiao, W.W.-W.; Hussain, L.; Aman, H.; Le, T.-N.; Rafique, M. Recent Development of Fluorescent Nanodiamonds for Optical Biosensing and Disease Diagnosis. Biosensors 2022, 12, 1181. [Google Scholar] [CrossRef]
  40. Zheng, C.; Deng, J.; Hong, Z.; Wang, G. Prediction model of suspension density in the dense medium separation system based on LSTM. Processes 2020, 8, 976. [Google Scholar] [CrossRef]
  41. Lyu, P.; Chen, N.; Mao, S.; Li, M. LSTM based encoder-decoder for short-term predictions of gas concentration using multi-sensor fusion. Process Saf. Environ. Prot. 2020, 137, 93–105. [Google Scholar] [CrossRef]
  42. Al-Hajj, R.; Assi, A.; Fouad, M. Short-term prediction of global solar radiation energy using weather data and machine learning ensembles: A comparative study. J. Sol. Energy Eng. 2021, 143, 051003. [Google Scholar] [CrossRef]
  43. Zhu, X.; Li, L.; Liu, J.; Li, Z.; Peng, H.; Niu, X. Image captioning with triple-attention and stack parallel LSTM. Neurocomputing 2018, 319, 55–65. [Google Scholar] [CrossRef]
  44. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  45. Olah, C. Understanding LSTM Networks. 2015. Available online: https://colah.github.io/posts/2015-08-Understanding-LSTMs/ (accessed on 24 March 2023).
  46. Wu, X.; Zhou, J.; Yu, H.; Liu, D.; Xie, K.; Chen, Y.; Hu, J.; Sun, H.; Xing, F. The development of a hybrid wavelet-ARIMA-LSTM model for precipitation amounts and drought analysis. Atmosphere 2021, 12, 74. [Google Scholar] [CrossRef]
  47. Xu, D.; Zhang, Q.; Ding, Y.; Zhang, D. Application of a hybrid ARIMA-LSTM model based on the SPEI for drought forecasting. Environ. Sci. Pollut. Res. 2022, 29, 4128–4144. [Google Scholar] [CrossRef] [PubMed]
  48. Wei, X.; Shahani, N.M.; Zheng, X. Predictive Modeling of the Uniaxial Compressive Strength of Rocks Using an Artificial Neural Network Approach. Mathematics 2023, 11, 1650. [Google Scholar] [CrossRef]
  49. Xu, P. Prediction of Per Capita Ecological Carrying Capacity Based on ARIMA-LSTM in Tourism Ecological Footprint Big Data. Sci. Program. 2022, 2022, 6012998. [Google Scholar] [CrossRef]
  50. Manowska, A.; Rybak, A.; Dylong, A.; Pielot, J. Forecasting of Natural Gas Consumption in Poland Based on ARIMA-LSTM Hybrid Model. Energies 2021, 14, 8597. [Google Scholar] [CrossRef]
  51. Huang, Y.; Fan, J.; Yan, Z.; Li, S.; Wang, Y. Research on early warning for gas risks at a working face based on association rule mining. Energies 2021, 14, 6889. [Google Scholar] [CrossRef]
  52. Bukhari, A.H.; Raja, M.A.Z.; Sulaiman, M.; Islam, S.; Shoaib, M.; Kumam, P. Fractional neuro-sequential ARFIMA-LSTM for financial market forecasting. IEEE Access 2020, 8, 71326–71338. [Google Scholar] [CrossRef]
  53. Salmerón, R.; García, C.; García, J. Variance inflation factor and condition number in multiple linear regression. J. Stat. Comput. Simul. 2018, 88, 2365–2384. [Google Scholar] [CrossRef]
  54. Burnham, K.P.; Anderson, D.R. Multimodel inference: Understanding AIC and BIC in model selection. Sociol. Methods Res. 2004, 33, 261–304. [Google Scholar] [CrossRef]
  55. Hassani, H.; Yeganegi, M.R. Sum of squared ACF and the Ljung–Box statistics. Phys. A Stat. Mech. Appl. 2019, 520, 81–86. [Google Scholar] [CrossRef]
  56. Lee, T. Wild bootstrap Ljung–Box test for cross correlations of multivariate time series. Econ. Lett. 2016, 147, 59–62. [Google Scholar] [CrossRef]
  57. Hollas, B. An analysis of the autocorrelation descriptor for molecules. J. Math. Chem. 2003, 33, 91–101. [Google Scholar] [CrossRef]
  58. Angel, E.; Zissimopoulos, V. Autocorrelation coefficient for the graph bipartitioning problem. Theor. Comput. Sci. 1998, 191, 229–243. [Google Scholar] [CrossRef]
  59. Goudet, J. FSTAT (version 1.2): A computer program to calculate F-statistics. J. Hered. 1995, 86, 485–486. [Google Scholar] [CrossRef]
  60. Weir, B.S.; Hill, W.G. Estimating F-statistics. Annu. Rev. Genet. 2002, 36, 721–750. [Google Scholar] [CrossRef]
  61. Weaver, B.; Wuensch, K.L. SPSS and SAS programs for comparing Pearson correlations and OLS regression coefficients. Behav. Res. Methods 2013, 45, 880–895. [Google Scholar] [CrossRef]
  62. Halunga, A.G.; Orme, C.D.; Yamagata, T. A heteroskedasticity robust Breusch–Pagan test for Contemporaneous correlation in dynamic panel data models. J. Econom. 2017, 198, 209–230. [Google Scholar] [CrossRef]
  63. Jeong, J.; Lee, K. Bootstrapped White’s test for heteroskedasticity in regression models. Econ. Lett. 1999, 63, 261–267. [Google Scholar] [CrossRef]
  64. Baum, C.; Cox, N. WHITETST: Stata Module to Perform White’s Test for Heteroskedasticity. 2002. Available online: https://econpapers.repec.org/software/bocbocode/s390601.htm (accessed on 25 March 2023).
  65. Koklu, M.; Taspinar, Y.S. Determining the Extinguishing Status of Fuel Flames With Sound Wave by Machine Learning Methods. IEEE Access 2021, 9, 86207–86216. [Google Scholar] [CrossRef]
  66. Su, C.; Wang, J. Research on composition analysis and type identification of ancient glass products based on data mining. Autom. Mach. Learn. 2022, 3, 63–71. [Google Scholar] [CrossRef]
  67. Sun, C.; Li, Z. Analysis and Identification of the Composition of Ancient Glass Objects based on Statistical Research and Machine Learning Algorithms. Highlights Sci. Eng. Technol. 2023, 39, 1412–1418. [Google Scholar] [CrossRef]
  68. Pu, Q.; Jiang, L.; Liu, Z.; Wang, X.; Liu, Z. Research on Classification of Ancient Glass Products Based on Machine Learning. In Proceedings of the 2022 International Conference on Information Technology, Communication Ecosystem and Management (ITCEM), Bangkok, Thailand, 19–21 December 2022. [Google Scholar]
  69. Bai, D. Comparative study on chemical composition of ancient glass based on machine learning and deep learning. Highlights Sci. Eng. Technol. 2022, 22, 234–240. [Google Scholar] [CrossRef]
Figure 1. Basic structure diagram of the LSTM model.
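
Figure 1 sketches the gating structure of a single LSTM cell. As a generic illustration only (the authors' network size, framework, and training configuration are not given in this section), a minimal one-layer LSTM regressor of the kind the figure depicts could be assembled in Keras as follows; the window length, feature count, and layer width are assumed values.

```python
import tensorflow as tf

TIMESTEPS, FEATURES = 10, 1  # assumed input window length and feature count

# Minimal LSTM regressor: one recurrent layer followed by a dense output,
# mirroring the generic cell structure sketched in Figure 1.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(TIMESTEPS, FEATURES)),
    tf.keras.layers.LSTM(32),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")
model.summary()
```
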
Figure 2. Processing flow chart.
Figure 3. (a) ACF and PACF patterns of SiO2 content in high-potassium glasses; (b) ACF and PACF patterns of SiO2 content in lead-barium glasses.
Figure 4. Correlation curve of chemical element content before and after weathering: (a) (High potassium) SiO2; (b) (lead barium) SiO2; (c) (High potassium) K2O; (d) (lead barium) P2O5; (e) (lead barium) PbO.
Figure 5. Fitting curve of the multiple linear regression equation.
Figure 6. Confusion matrix (1: high potassium, 0: lead barium).
Table 1. Regression coefficients β and significance p-values of the chemical components in high-potassium and lead-barium glasses.
Coefficient | High-Potassium Glass β | High-Potassium p-Value (P > |t|) | Lead-Barium Glass β | Lead-Barium p-Value (P > |t|)
β0 | 15.301 | 0.000 | 31.077 | 0.000
β1 | 2.718 | 0.099 | 7.788 | 0.005
β2 | 12.318 | 0.000 | 1.016 | 0.313
β3 | 5.410 | 0.020 | 8.320 | 0.004
β4 | 7.199 | 0.007 | 0.039 | 0.843
β5 | 7.051 | 0.008 | 2.678 | 0.102
β6 | 4.751 | 0.029 | 0.222 | 0.637
β7 | 1.072 | 0.300 | 1.293 | 0.255
β8 | 2.629 | 0.101 | 25.165 | 0.000
β9 | 1.792 | 0.181 | 0.831 | 0.362
β10 | 2.629 | 0.105 | 13.764 | 0.000
β11 | 3.142 | 0.076 | 3.702 | 0.054
β12 | 0.451 | 0.502 | 0.161 | 0.688
β13 | 1.490 | 0.222 | 0.942 | 0.332
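
Table 1 reports, for each chemical component, the regression coefficient and its p-value from the penalized logistic regression used for variable selection. The Daen-LR estimator itself is a double adaptive elastic net; as a rough stand-in for that step, the sketch below fits a plain elastic-net penalized logistic regression with scikit-learn. The composition matrix, labels, l1_ratio, and regularization strength C are placeholders rather than the authors' settings.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Placeholder data: rows = sampling points, columns = the 14 oxide contents
# of Table A1; y = 1 for high-potassium glass, 0 for lead-barium glass.
rng = np.random.default_rng(0)
X = rng.uniform(0, 100, size=(60, 14))
y = rng.integers(0, 2, size=60)

# Elastic-net penalized logistic regression (a simplified stand-in for Daen-LR).
clf = make_pipeline(
    StandardScaler(),
    LogisticRegression(penalty="elasticnet", solver="saga",
                       l1_ratio=0.5, C=1.0, max_iter=10000),
)
clf.fit(X, y)

# Components whose coefficients are shrunk exactly to zero are dropped.
coefs = clf.named_steps["logisticregression"].coef_.ravel()
print("retained component indices:", np.where(coefs != 0)[0])
```
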
Table 2. Outliers of SiO2 component content.
Relic Number | Outlier Type | Estimates | S.E. | t | Significance
High potassium 02 | Additive | 23.850 | 3.478 | 6.858 | 0.000
High potassium 13 | Transient, magnitude | 16.260 | 3.478 | 4.675 | 0.001
High potassium 13 | Transient, decay factor | 0.829 | 0.266 | 3.114 | 0.011
Lead barium 05 | Transient, magnitude | 31.926 | 4.569 | 6.988 | 0.000
Lead barium 05 | Transient, decay factor | 0.964 | 0.013 | 76.495 | 0.000
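
Table 2 lists additive and transient outliers detected in the SiO2 series, together with their estimated magnitudes and decay factors. The detection procedure itself is not shown here; a much simpler stand-in, sketched below under that caveat, is to fit a low-order ARIMA model with statsmodels and treat observations with unusually large standardized residuals as additive-outlier candidates. The series values and the AR order are placeholders.

```python
import numpy as np
from statsmodels.tsa.arima.model import ARIMA

# Placeholder SiO2 sequence ordered by sampling point (values are illustrative only).
sio2 = np.array([69.3, 65.9, 61.6, 67.7, 59.8, 23.9, 62.5, 61.9, 65.2, 60.7,
                 63.0, 64.1, 66.3, 62.8, 61.2, 60.4, 65.5, 63.7, 62.2, 64.8])

# Fit a simple AR(1) model and flag large standardized residuals as
# additive-outlier candidates (a crude stand-in for the model-based
# additive/transient outlier estimation summarized in Table 2).
fit = ARIMA(sio2, order=(1, 0, 0)).fit()
z = (fit.resid - fit.resid.mean()) / fit.resid.std()
print("candidate outlier indices:", np.where(np.abs(z) > 2.5)[0])
```
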
Table 3. Parameters of the ARIMA-LSTM model for SiO2 component content in two types of glass.
Type | Stationary R2 | R2 | RMSE | MAPE | MaxAPE | MAE | MaxAE | Normalized BIC
High-potassium glass | 0.960 | 0.960 | 1.330 | 1.411 | 8.842 | 2.177 | 6.130 | 3.160
Lead-barium glass | 0.934 | 0.934 | 1.523 | 2.105 | 10.624 | 4.001 | 11.250 | 4.160
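
Table 3 summarizes goodness-of-fit statistics (stationary R2, R2, RMSE, MAPE, MAE, BIC) for the SiO2 series. The sketch below shows how the ARIMA part and such error metrics can be obtained, using the order (p, d, q) = (2, 1, 0) listed for JMLA in Table 10 and ACF/PACF diagnostics of the kind plotted in Figure 3. The series is a placeholder, and the statsmodels BIC is not the normalized BIC reported in the table.

```python
import numpy as np
import matplotlib.pyplot as plt
from statsmodels.tsa.arima.model import ARIMA
from statsmodels.graphics.tsaplots import plot_acf, plot_pacf

# Placeholder SiO2 series; the real input is the pre-processed data of Table A1.
rng = np.random.default_rng(1)
series = 60.0 + np.cumsum(rng.normal(0, 1, 80))

# ACF/PACF diagnostics of the kind shown in Figure 3.
fig, axes = plt.subplots(2, 1)
plot_acf(series, ax=axes[0])
plot_pacf(series, ax=axes[1])

# ARIMA(2, 1, 0): the order listed for JMLA in Table 10.
fit = ARIMA(series, order=(2, 1, 0)).fit()
pred = fit.predict(start=1, end=len(series) - 1)  # one-step in-sample predictions
actual = series[1:]

rmse = np.sqrt(np.mean((actual - pred) ** 2))
mae = np.mean(np.abs(actual - pred))
mape = 100 * np.mean(np.abs((actual - pred) / actual))
print(rmse, mae, mape, fit.bic)  # cf. the RMSE/MAE/MAPE/BIC columns of Table 3
```
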
Table 4. Chemical Composition of Unclassified Cultural Relics.
Relic Number | A1 | A2 | A3 | A4 | A5 | A6 | A7 | A8
Surface Weathering | No | Yes | No | No | Yes | Yes | Yes | No
SiO2 | 78.45 | 37.75 | 31.95 | 35.47 | 64.29 | 93.17 | 90.83 | 51.12
Na2O | 0.00 | 0.00 | 0.00 | 0.00 | 1.20 | 0.00 | 0.00 | 0.00
K2O | 0.00 | 0.00 | 1.36 | 0.79 | 0.37 | 1.35 | 0.98 | 0.23
CaO | 6.08 | 7.63 | 7.19 | 2.89 | 1.64 | 0.64 | 1.12 | 0.89
MgO | 1.86 | 0.00 | 0.81 | 1.05 | 2.34 | 0.21 | 0.00 | 0.00
Al2O3 | 7.23 | 2.33 | 2.93 | 7.07 | 12.75 | 1.52 | 5.06 | 2.12
Fe2O3 | 2.15 | 0.00 | 7.06 | 6.45 | 0.81 | 0.27 | 0.24 | 0.00
CuO | 2.11 | 0.00 | 0.21 | 0.96 | 0.94 | 1.73 | 1.17 | 9.01
PbO | 0.00 | 34.30 | 39.58 | 24.28 | 12.23 | 0.00 | 0.00 | 21.24
BaO | 0.00 | 0.00 | 4.69 | 8.31 | 2.16 | 0.00 | 0.00 | 11.34
P2O5 | 1.06 | 14.27 | 2.68 | 8.45 | 0.19 | 0.21 | 0.13 | 1.46
SrO | 0.03 | 0.00 | 0.52 | 0.28 | 0.21 | 0.00 | 0.00 | 0.31
SnO2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.49 | 0.00 | 0.00 | 0.00
SO2 | 0.51 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.11 | 2.26
Table 5. Indicators for joint significance testing of F-statistics.
F(10, 55) | 51.41
Prob > F | 0.0000
R-squared | 0.9633
Adj R-squared | 0.9558
Table 6. Regression coefficient β and significance p-value.
Type | SiO2 | K2O | CaO | MgO | Al2O3 | Fe2O3
Coef. | −0.014 | 0.048 | −0.001 | 0.004 | −0.299 | 0.035
P > |t| | 0.749 | 0.000 | 0.965 | 0.919 | 0.001 | 0.105
Type | PbO | BaO | P2O5 | SUW | yi | _cons
Coef. | −0.166 | −0.161 | −0.018 | 0.352 | 0.000 | 0.746
P > |t| | 0.001 | 0.021 | 0.078 | 0.000 | 0.000 | 0.082
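
The F statistic, R-squared, and adjusted R-squared of Table 5 and the coefficients and p-values of Table 6 are the standard outputs of an ordinary least squares fit of glass type on composition. The sketch below reproduces that kind of output with statsmodels; the composition matrix, the binary labels, and the reading of SUW as a surface-weathering dummy are assumptions. In the sketch, 66 observations and 10 regressors give an F statistic with (10, 55) degrees of freedom, the same as Table 5.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm

# Placeholder design matrix: nine oxide contents plus an assumed
# surface-weathering dummy (SUW); y = 1 high potassium, 0 lead barium.
rng = np.random.default_rng(2)
df = pd.DataFrame(rng.uniform(0, 50, size=(66, 9)),
                  columns=["SiO2", "K2O", "CaO", "MgO", "Al2O3",
                           "Fe2O3", "PbO", "BaO", "P2O5"])
df["SUW"] = rng.integers(0, 2, size=66)
y = rng.integers(0, 2, size=66)

X = sm.add_constant(df)          # adds the _cons term of Table 6
ols = sm.OLS(y, X).fit()

print(ols.fvalue, ols.f_pvalue)        # F statistic and Prob > F (Table 5)
print(ols.rsquared, ols.rsquared_adj)  # R-squared and Adj R-squared (Table 5)
print(ols.params)                      # coefficients (Table 6)
print(ols.pvalues)                     # p-values P > |t| (Table 6)
```
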
Table 7. Results of the BP test and White test.
BP test | Prob > chi2 | 0.1725
White test | Prob > chi2 | 0.4095
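
Table 7 reports the Breusch-Pagan and White tests on the regression residuals; p-values above 0.05 mean that homoskedasticity is not rejected. A minimal sketch with the corresponding statsmodels diagnostics, run on placeholder data, is shown below.

```python
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.diagnostic import het_breuschpagan, het_white

# Placeholder regression; in the paper the residuals come from the MLR of Table 6.
rng = np.random.default_rng(3)
X = sm.add_constant(rng.normal(size=(66, 3)))
y = X @ np.array([1.0, 0.5, -0.2, 0.3]) + rng.normal(size=66)
res = sm.OLS(y, X).fit()

bp_stat, bp_pval, _, _ = het_breuschpagan(res.resid, res.model.exog)
w_stat, w_pval, _, _ = het_white(res.resid, res.model.exog)
print("BP test    Prob > chi2:", bp_pval)
print("White test Prob > chi2:", w_pval)
```
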
Table 8. Results of the variance inflation factor analysis.
Variable | VIF
SiO2 | 27.28
PbO | 21.06
BaO | 6.36
K2O | 5.11
P2O5 | 2.75
CaO | 2.68
MgO | 1.83
Al2O3 | 1.82
Fe2O3 | 1.63
Mean Value | 7.84
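
Each variance inflation factor in Table 8 equals 1/(1 − R_j²), where R_j² is obtained by regressing component j on the other regressors; values above 10 (here SiO2 and PbO) are commonly read as a sign of strong multicollinearity. A sketch of the computation with statsmodels, on a placeholder composition matrix, follows.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

# Placeholder oxide-content matrix for the nine components of Table 8.
rng = np.random.default_rng(4)
X = pd.DataFrame(rng.uniform(0, 50, size=(66, 9)),
                 columns=["SiO2", "PbO", "BaO", "K2O", "P2O5",
                          "CaO", "MgO", "Al2O3", "Fe2O3"])
Xc = sm.add_constant(X)  # VIFs are computed with an intercept in the model

vifs = pd.Series(
    [variance_inflation_factor(Xc.values, i) for i in range(1, Xc.shape[1])],
    index=X.columns,
)
print(vifs.sort_values(ascending=False))
print("Mean VIF:", vifs.mean())
```
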
Table 9. Identification results.
Relic Number | A1 | A2 | A3 | A4 | A5 | A6 | A7 | A8
Identification Type | High potassium | Lead barium | Lead barium | Lead barium | Lead barium | High potassium | High potassium | Lead barium
Table 10. Selected parameters of each algorithm.
Algorithm | Parameters
DT | Node splitting evaluation criterion = gini; feature division point selection criterion = random; minimum samples for internal node splitting = 2; minimum samples in leaf nodes = 1; maximum leaf nodes = 2; maximum depth of the tree = 15
RF | Node splitting evaluation criterion = gini; number of decision trees = 5; minimum samples in leaf nodes = 1; maximum depth of the tree = 15; maximum leaf nodes = 2
CART-RF | Node splitting evaluation criterion = gini; number of decision trees = 6; minimum samples in leaf nodes = 3; maximum depth of the tree = 15; maximum leaf nodes = 2
SVM | kernel = ‘rbf’; C = 20; γ = 2.00
JMLA | Qualitative variable (weathered) = 1; qualitative variable (unweathered) = 0; number of autoregressive terms (p) = 2; number of moving-average terms (q) = 0; number of differences needed for a stationary series (d) = 1
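
Table 10 gives the hyperparameters of the compared algorithms. A minimal sketch of how the DT, RF, CART-RF, and SVM baselines could be configured with scikit-learn under those settings is shown below; the class choices and any defaults not listed in the table are assumptions, and the ARIMA component of JMLA uses the order (p, d, q) = (2, 1, 0) from the last rows of the table.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Baseline classifiers configured with the hyperparameters listed in Table 10.
dt = DecisionTreeClassifier(criterion="gini", splitter="random",
                            min_samples_split=2, min_samples_leaf=1,
                            max_leaf_nodes=2, max_depth=15)

rf = RandomForestClassifier(criterion="gini", n_estimators=5,
                            min_samples_leaf=1, max_leaf_nodes=2, max_depth=15)

cart_rf = RandomForestClassifier(criterion="gini", n_estimators=6,
                                 min_samples_leaf=3, max_leaf_nodes=2, max_depth=15)

svm = SVC(kernel="rbf", C=20, gamma=2.0)
```
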
Table 11. Results of model experiments.
Algorithm | Train Acc | Test Acc | Precision | Recall | F1 Score
DT | 0.862 | 0.873 | 0.806 | 0.791 | 0.798
RF | 0.958 | 0.924 | 0.872 | 0.866 | 0.869
CART-RF | 0.962 | 0.951 | 0.929 | 0.941 | 0.935
SVM | 0.850 | 0.909 | 0.869 | 0.830 | 0.849
JMLA | 0.979 | 0.976 | 0.975 | 0.976 | 0.975
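
The train/test accuracy, precision, recall, and F1 score of Table 11, and the confusion matrix of Figure 6, follow directly from the predicted and true class labels. A brief sketch of the computation with scikit-learn on placeholder labels is given below.

```python
from sklearn.metrics import (accuracy_score, confusion_matrix, f1_score,
                             precision_score, recall_score)

# Placeholder labels: 1 = high potassium, 0 = lead barium (as in Figure 6).
y_true = [1, 0, 0, 1, 1, 0, 0, 1, 0, 1]
y_pred = [1, 0, 0, 1, 0, 0, 0, 1, 0, 1]

print("Accuracy :", accuracy_score(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred))
print("Recall   :", recall_score(y_true, y_pred))
print("F1 score :", f1_score(y_true, y_pred))
print(confusion_matrix(y_true, y_pred))
```
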
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
