Novel algorithm for identifying kinematics characteristic data under road conditions in Xi’an

Wensheng Wang; Xindong Hu; Shendun Zhao; Liying Jin; Yangfeng Cao; Yang Xie

doi:10.1117/12.2661176

2 February 2023 Novel algorithm for identifying kinematics characteristic data under road conditions in Xi’an

Wensheng Wang, Xindong Hu, Shendun Zhao, Liying Jin, Yangfeng Cao, Yang Xie

Author Affiliations +

Proceedings Volume 12462, Third International Symposium on Computer Engineering and Intelligent Communications (ISCEIC 2022); 124620V (2023) https://doi.org/10.1117/12.2661176
Event: International Symposium on Computer Engineering and Intelligent Communications (ISCEIC 2022), 2022, Xi'an, China

Abstract

Aiming at the problem that the kinematics feature dataset of different traffic flow, average speed, acceleration, and traffic time are affected by high-dimensional, irrelevant, and redundant factors, and the kinematics feature dataset is a multi-objective, multi-constrained and complex nonlinear optimization system, the improved Multi-Objective evolutionary Soft Subspace Clustering algorithm (iMOSSC) is proposed to mine the micro-stroke segments with different kinematic characteristics and realize data classification. The algorithm uses iNSGA-II as the base algorithm and performs local search operator and repair operator operation in the feature space to accelerate convergence and improve the accuracy of the solution. The feasibility and effectiveness of the algorithm are verified by 12 sets of UCI standard dataset. The classified kinematics characteristic data is used to construct the Xi'an urban road trajectory database. Compared with the iMWK-HD algorithm in the collected kinematics feature data of circulation condition, the feature importance degree of the iMOSSC algorithm is more reasonable, the stability is better, the accuracy is higher, and the classification effectiveness is more obvious than the iMWK-HD algorithm. The excavated kinematics data is imported into the Optimumlap simulation software to construct the actual road circulation condition trajectory database. Based on the ADVISOR commercial software platform for the simulation module.

1. INTRODUCTION

The kinematic features dataset of the dual-motor & dual-mode power-split hybrid electric vehicle is a multi-objective, multi-constraint, and complex nonlinear optimization system. Hence, each sample in the kinematic features dataset is high-dimensional, irrelevant, and redundant in the feature space. The direct use of these kinematic features data to simulate the dynamic performance of the target vehicle will result in large randomness, which cannot truly reflect the dynamic performance of the target vehicle. Therefore, it is necessary to use the dimensionality reduction technology to process the circulation condition characteristic data to eliminate irrelevant and redundant features, and to mine the micro-stroke segments with different kinematic features to realize the classification of data.

As some kinematic features of driving conditions can better reflect a certain type of working conditions but not sensitive to another type of working conditions, this characteristic is called subspace. If the subspace is not considered, the kinematic information of different working conditions is classified under the same feature subset, which will affect the dimensionality reduction and also affect the accuracy of the classification. Even if certain features are effective for the recognition of multiple working conditions, there will be cases where each feature is sensitive to different working conditions. Therefore, a more appropriate weighting method is used to express the relationship between the features and various working conditions. The subspace under the weight representation is also called the soft subspace [1].

The soft subspace clustering technology can realize pattern recognition of high-dimensional data. The kinematic features data is different under the driving conditions, the characteristics of different driving conditions, and the distribution of each driving condition. In the high-dimensional data space, the irrelevant, redundant, and noise features make the distribution characteristics of each driving condition covered up. In addition to different feature values of each driving condition, different driving conditions also have a different number of features. Therefore, each driving condition is generally in its own unique subspace. The soft subspace clustering algorithm can measure each feature in the high-dimensional space according to the importance degree of each driving condition, i.e. the weight value significantly removes the irrelevant and redundant features, restore the kinematic features distribution law, and effectively improve the distance metrics affected by the sparsity of samples in high-dimensional space.

The soft subspace clustering technology can not only effectively deal with the dimensionality reduction of high-dimensional data, but also explore the subspace corresponding to each working condition and give sensitive features of different working conditions. However, the existing soft subspace clustering technique has the disadvantages of being easy to fall into local extremum and slow convergence, while the evolutionary soft subspace clustering algorithm has a good global optimization ability and does not require the differentiability and continuity of the objective function, and therefore, it is used by many scholars for data clustering [2-4]. In the fields of artificial intelligence, pattern recognition, and statistical analysis, research scholars attach great importance to and extensively study cluster analysis [5-7].

In practical engineering application, the kinematic features data face multiple problems: Firstly, the data is affected by irrelevant and redundant features, the sample distribution is not regular enough, and it is difficult to divide it reasonably, thereby affecting the recognition rate; Secondly, the time cost caused by directly processing high-dimensional data is too high, which affects the operational efficiency of the algorithm; Thirdly, the external environment is changeable, and the test signal is disturbed, which increases the workload of data collection. Therefore, the single-objective evolutionary soft subspace clustering algorithm is not suitable for data recognition in complex situations. The multi-objective evolutionary soft subspace clustering algorithm can effectively solve this problem [8,9], because it optimizes multiple objective functions at the same time, taking into account the optimization direction of each objective function, and evaluating the feature subset and clustering quality from different angles. A multi-objective optimal solution is a set of solutions that are weighed against each other, which is also called a Pareto optimal solution set [10,11], such as improved multi-objective particle swarm algorithm [12], multi-objective evolutionary soft subspace clustering algorithm [13], intelligent Minkowski metric feature weights soft subspace clustering algorithm through hybrid dissimilarity measure (iMWK-HD) [14], improved multi-objective sine and cosine optimization algorithm [15], adaptive soft subspace clustering combining within-cluster and between-cluster information [16], and so on.

However, these algorithms are limited by design mechanisms, resulting in poor performance when processing large-scale high-dimensional data. In view of this, the improved multi-objective evolutionary soft subspace clustering algorithm (iMOSSC) was proposed. This algorithm inherits the excellent performance of the original iMWK-HD algorithm [14], adds an objective function, can accurately mine the micro-stroke segments with different kinematic features, and realize data classification. During the algorithm execution, real number coding is used. A local search of motion information is performed and operator operations are repaired in the feature space to accelerate convergence and improve the accuracy of the solution. The iNSGA-II as the base algorithm [17], the redundancy is used as the target during mutation, and features having a high correlation with selected features are excluded.

2. RELATED WORKS

Tsinghua University, Jilin University, Shanghai Jiao Tong University, Harbin Institute of Technology, and other scientific research institutions [18-21], as well as China’s FAW Group Corporation, Dong Feng Motor Corporation, Chery Automobile Co., Ltd., and Changan Automobile Co., Ltd. [22], are conducting research on hybrid vehicles, and have achieved fruitful results in their research, besides, some related results have also been applied to urban buses, passenger cars, and other vehicles. The research and development of hybrid power systems provide a new platform for the comprehensive development of automotive technology, which is of great significance to promote electric drive in China’s automotive industry [23-26].

Hybrid Electric Vehicle (HEV) provides vehicle power requirements by distributing engine or motor power or torque, which can ensure fuel economy without considering the problem of “mileage anxiety” [27,28]. A hybrid electric vehicle has two power sources, an internal combustion engine and an electric motor. It is an energy management strategy to improve the fuel economy of the hybrid system to achieve energy conservation and emission reduction [29, 30]. The hybrid type adopts the planetary row structure to realize the output power coupling of the engine and the two motors, to achieve the complete decoupling of the torque-speed and the wheels, promote the engine to operate with high efficiency, and effectively simplify the power system structure [31, 32].

Reference [17] analyzed the structural parameter optimization of key components of the novel dual-mode&dual-motor hybrid drive system and proposed an improved NSGA-II algorithm (iNSGA-II). The combination of ADVISOR and Simulink software was used to simulate the performance of the novel system. The simulation results showed that the battery is basically maintained in the expected working range, which meets the vehicle’s dynamic performance requirements. The novel dual-mode&dual-motor hybrid drive system rod model is shown in Fig. 1(a), and the cross-sectional view is shown in Fig. 1(b).

Figure 1.

Schematic diagram of the novel dual-mode&dual-motor hybrid drive system

For the construction of the Xi’an urban road trajectory database, the kinematic information of the automobile roads in the urban area of Xi’an city was collected by GPS and CAN-Analyzer equipment recorder. As the collected kinematic information such as different traffic flow, average speed, acceleration, and traffic time contain high-dimensional, irrelevant, and redundant data, the resulting kinematic features data of driving conditions is multi-variable, multi-constraint, complex, and nonlinear. Using this kinematic data directly to simulate the vehicle’s dynamic performance of the target model will cause it to be very random, and thus cannot truly reflect the dynamic vehicle performance of the target model. In practical engineering applications, decision-makers must simultaneously consider the low cost, high performance, and high efficiency of components. However, there is usually a conflict between low cost, high performance, and high efficiency. Promoting one of them may cause a reduction or degradation of the other two. Problems like this are called multi-objective optimization problems [33]. The multi-objective evolutionary soft subspace clustering algorithm is an integration of multi-objective optimization problems and a soft subspace clustering algorithm. Based on the iMWK-HD algorithm [14], to solve the dimensionality reduction problem of high-dimensional data of cyclic kinematic features, an objective function was added to construct a multi-objective evolutionary soft subspace clustering algorithm (iMOSSC) to solve the kinematic features data of road driving conditions in urban areas of Xi’an city in Shaanxi Province, and the corresponding software module which can effectively realize the dimensionality reduction of the kinematic information and significantly remove redundant data in the kinematic information was developed.

3. IMOSSC ALGORITHM

The intelligent identification of the cycle conditions of the novel dual-mode&dual-motor hybrid drive system is built on data mining based on multi-objective evolutionary soft subspace clustering model analysis. It aims to find potentially searchable rules in the disordered sample data and extract different features.

3.1

Multi-objective mathematical model

where C ---- Number of clusters; N ---- Total number of samples; M ---- Total number of features; U = [u_ik] ---- The degree of membership of the k-th class on the i-th sample; V = [v_kj] ---- The center value of the k-th class on the j-th dimension; W =[w_kj] ---- Feature weight value of the k-th class on the j-th dimension feature; x_ij ---- The value of the i-th sample on the j-th feature; k_σ = exp(- (v_kj - v_oj)² / σ²) ---- Gaussian kernel function, which changes the original measurement method to a certain extent; σ ---- Parameter, the value of this article is 2, (σ = [2, 5, 10]) [34]; γ ---- Information entropy coefficient, which is used to coordinate the influence of entropy on the clustering results, the value range is (0,1); η ---- Reciprocal of sample data variance, such as ; v_oj ---- The average value of all sample points in the class center of the j-th feature dataset.

In Matlab software, the real number coding method is used to mix the class centers and feature weights. For example, a dataset with a sample number of N and a dimension of M is divided into C classes, and its coding method is {v₁₁, …, v_1M, v₂₁, …, v_2M, …, v_C1, …, v_CM, w₁₁, …, w_1M, w₂₁, …, w_2M, …, w_C1, …, w_CM}, with a length equal to 2 × C × M. The initial value of each gene locus of the class center in the coding is randomly selected in the dataset, and the initial value of each gene locus of feature weight is equal to the reciprocal of the dimension. Because the dataset has been normalized before running, their the value range is [0, 1].

3.2

Search operator and repair operator

According to the iterative idea of the clustering algorithm, a local search operator is used to improve the class center and feature weight accuracy. Their calculation formulas are shown in formula (2) and formula (3).

where D_kj = (x_ij - v_kj)^β − η × [(v_kj × x_ij) / (||v_k|| × ||x_i||) − (1 / M)], When u_ik = 1, it means that the i-th sample x_i is assigned to the k-th category v_k. When the value of Σ_p=1^C |v_pj - v_kj| is larger, it means that the k-th category and the remaining categories are easier to distinguish in the j-th dimensional feature, and then brought into equation (3), the feature weight value will also increase, and vice versa.

In the clustering process, there are two kinds of random clustering division. The first type of division is that the number of classes represented in the encoding is greater than the actual number of classes; the second type of division is that the samples with different encodings refer to the same division after decoding. Therefore, the results of these two situations need to be repaired. The operation flow of the first repair operator is: If there is no sample in one or more classes after each class is divided, then randomly select a sample from the class with the largest number of samples as the class center of the first empty class, then, under the constraint condition that the feature weights satisfy Σ_j=1^M w_kj = 1, the feature weights of this class are randomly generated within the range of [0, 1]. Then randomly select a sample different from the selected sample as the class center of the second empty class. The feature weight value is also randomly generated in the range of [0, 1]; Repeat the operation by analogy until there is no empty class repair. In the case of repair operators, each class contains at least one sample. The second repair operator is solved based on the crowding distance calculation formula in environment selection. In order to maintain the diversity of the population, priority is given to giving up samples with small crowding distances.

3.3

iMOSSC algorithm flow

Step 1: Normalize all sample values in the dataset [35], and set the required parameter values. The initial value of evolution algebra ite is set to 1, and the maximum algebra MaxGen is 500;
Step 2: Initialize class center V₀ and feature weight W₀. V₀ is randomly selected from the normalized sample, and W₀ is the reciprocal of the individual’s total dimension. Then bring V₀ and W₀ into equation (1) to calculate the objective function value;
Step 3: Select the parent individual from the parent population P according to the binary league rule, and perform the crossover operation with probability P_c to obtain the child population Q₁. The comparison principle is: Pareto dominates the individual first, and the two do not dominate each other individuals with a large congestion distance are preferred, and they are randomly selected when the two have the same congestion distance;
Step 4: Select progeny individuals from progeny population Q₁ with probability P_s, and perform mutation operation with probability P_m to obtain progeny population Q₂;
Step 5: According to the formulas of (2) and (3), perform local search and repair operations on all descendants of Q₂ in turn to obtain the Q₃ population;
Step 6: Calculate the objective function values of all individuals in Q₃, and merge the parent populations P and Q₃. Then, according to the non-inferior sorting method in NSGA-II, the environment selection operation is performed, and the highest-level |P| individuals are selected to form a new parent population P;
Step 7: If the maximum evolution algebra MaxGen is reached, the evolution will stop, otherwise ite = ite + 1, and proceed to step 3 to continue execution.

4. IMOSSC ALGORITHM PERFORMANCE TEST AND ANALYSIS

In this paper, 12 sets of UCI standard dataset [36] are used to test the performance of the iMOSSC algorithm, and the Entropy Weighting Fuzzy Clustering in Composite Kernel Space (CKS-EWFC) [37], entropy weight Soft subspace clustering algorithm (Entropy Weighting K-Means algorithm, EWKM) [38] and fuzzy subspace clustering algorithm (Fuzzy Weighted Soft Subspace Clustering Algorithm, FSC) [39, 40] Three excellent soft subspace clustering algorithms Compared. Table 1 lists the relevant information of the UCI standard dataset. The dataset is represented by the data matrix of “number of samples × features”.

Table 1.

UCI dataset related information

Name	Number of samples	Feature dimension	Class number
Australian	690	15	2
Breast tissue	106	9	6
Bupa	345	6	2
Pima Indians diabetes	768	8	2
Vehicle	846	18	4
Wdbc	569	30	2
Letter-ABCD	3096	16	4
Letter-UVWX	3116	16	4
Heart	270	13	2
Iris	150	4	3
Parkinsons	195	23	2
Wine	178	13	3

Each of the four algorithms is executed 10 times using different initial parameters. Each algorithm runs independently 10 times under different parameter combinations, and then uses the average of 10 times RI index [41] and normalized mutual information index (NMI) [42] to evaluate the quality of the clustering results, The value range of the two index is [0,1], and the value is proportional to the cluster quality. The calculation formulas of NMI and RI are equations (4) and (5), respectively. Table 2 shows the values of various parameters of the algorithm during the execution of the experiment.

Table 2

Parameter setting of each algorithm in the experiment

Algorithm	Parameter setting
iMOSSC	γ = [0, 1], σ = [2, 5, 10] m = [1.05, 1.2];
CKS-EWFC [37]	η = [1, 5, 10, 100, 1000], γ = [1, 5, 10, 100, 1000, 10000]
EWKM [38]	γ = [1e-3, 1e-2, 1e-1, 1e0, 1e1, 1e2, 1e3, 1e4, 1e5, 1e6]
FSC [39]	β = [1.05∼4] γ = [0.0001, 0.001, 0.01, 0.1]

Where C ---- The number of sample classifications; N ---- The number of samples; d_ik ---- The number of samples belonging to the i-th category, and also the number of samples belonging to the k-th category; d_i ---- The number of samples in the i-th category only; d_k ---- The number of samples in class k only.

Where d₀₀ ---- After the algorithm is run, the samples are divided into different classes, and the real class markers are also divided into the number of sample pairs of different classes; d₁₁ ---- After the algorithm is run, the samples are divided into the same class, and the real class labels are also divided into the logarithm of the samples of the same class; N ---- the total number of samples.

4.1

UCI dataset test results and analysis

The best results achieved by the four algorithms of iMOSSC, CKS-EWFC, EWKM, and FSC are shown in Table 3, and the bold results indicate the best results of the four algorithms.

Table 3

The best results of the four algorithms on 12 UCI dataset

Dataset	iMOSSC	CKS-EWFC[37]	EWKM[38]	FSC[39]
RI	NMI	RI	NMI	RI	NMI	RI	NMI
Australian	m = 1, η = 3	m = 1.2,η = 1, γ = 10	γ = 50	β = 1.05, γ = 5
Mean	0.7563	0.4394	0.7204	0.3686	0.7122	0.3621	0.7518	0.4279
Variance	0	0	0.0888	0.1345	0	0	0	0
Wine	m = 1, η = 4		γ = 2	β = 2.5, γ = 10-5
Mean	0.9400	0.8648	0.8964	0.8464	0.9331	0.8336	0.7241	0.3621
Variance	0	0	0.0649	0	0	0	0	0
Heart	m = 1, η = 0		γ = 100	β = 1.05, γ = 0
Mean	0.6941	0.3061	0.6816	0.3062	0.6788	0.2795	0.6788	0.2795
Variance	0.0001	0.2100	0.0088	0	0	0	0	0
Vehicle	m = 1, η = 8		γ = 100	β = 1.5, γ= 10-3
Mean	0.6840	0.200	0.6641	0.1848	0.6378	0.1354	0.6505	0.0943
Variance	0.0001	0	0.0019	0.0097	0.0100	0.0233	0	0
Breast tissue	m = 1, η = 7		γ = 10	β = 2.3, γ = 10-3
Mean	0.8001	0.5013	0.7457	0.4856	0.7242	0.3189	0.7353	0.3179
Variance	0.0120	0.0021	0.0258	0.0218	0.0240	0.0173	0.0235	0.0118
Bupa	m = 1, η = 1		γ = 1000	β = 1.05, γ = 0.1
Mean	0.5512	0.0381	0.5047	0.0196	0.5107	0.0105	0.5052	0.0102
Variance	0.1001	0	0.0008	0	0.0004	0	0.0017	0.0011
Iris	m = 1, η = 4		γ = 2	β = 4, γ = 10-2
Mean	0.9514	0.8525	0.8737	0.8513	0.8667	0.7416	0.9381	0.8525
Variance	0.0101	0	0	0	0.0112	0.0291	0.0241	0.0246
Parkinsons	m = 1, η = 2		γ = 1	β = 2.4, γ = 0.1
Mean	0.9688	0.9859	0.9799	1	0.6606	0.3206	0.6280	0.3059
Variance	0	0	0.0636	0	0.1193	0.0142	0.0022	0
Pima Indians diabetes	m = 1, η = 7		γ = 50	β = 2, γ= 10-2
Mean	0.6120	0.1251	0.5574	0.1306	0.5507	0.0297	0.5390	0.0204
Variance	0	0	0.0050	0.0021	0	0	0	0
Wdbc	m = 1, η = 8		γ = 1000	β = 3.1, γ = 10-3
Mean	0.9121	0.6840	0.8605	0.6833	0.8365	0.5944	0.7515	0.3932
Variance	0	0	0.0031	0	0	0.0031	0.1170	0.2006
Letter-ABCD	m = 1, η = 4		γ = 1000	β = 1.05, γ = 0.1
Mean	0.8001	0.4703	0.7623	0.4656	0.7707	0.4480	0.7287	0.3897
Variance	0.0100	0.0214	0.0188	0.1638	0.0374	0.0785	0.0187	0.0830
Letter-UVWX	m = 1, η = 5		γ = 1000	β = 2.1, γ = 10-3
Mean	0.3104	0.3220	0.2613	0.3111	0.2836	0.2836	0.2567	0.2567
Variance	0	0	0.0282	0.0471	0.0318	0.0318	0.0833	0.0833

It can be seen from Table 3 that most of the solutions obtained by the iMOSSC algorithm proposed in this study are better than or close to the best results of the three traditional soft subspace clustering algorithms of CKS-EWFC, EWKM, and FSC. The iMOSSC algorithm has a more stable performance than the CKS-EWFC, EWKM, and FSC on various datasets. However, on the Heart and Pima Indians diabetes dataset, the iMOSSC algorithm has the best RI evaluation index, and the NMI evaluation index is almost the same as the CKS-EWFC algorithm, which is superior to the EWKM algorithm and FSC algorithm. On the Parkinsons dataset, the RI evaluation index and NMI evaluation index of the iMOSSC algorithm are slightly lower than the CKS-EWFC algorithm but higher than the EWKM algorithm and FSC algorithm. This shows that the iMOSSC algorithm has strong robustness, and also shows that no algorithm has better clustering results for all the datasets than other algorithms of the same type. In addition, when the RI evaluation index achieves the best result in a set of data, the NMI evaluation index may not necessarily achieve the optimal result in the same set of data. In the same set of data, these two evaluation indicators cannot achieve the optimal clustering result at the same time. Therefore, it is necessary to evaluate the performance of the same clustering algorithm under different evaluation indexes. Finally, the performance of the clustering algorithm was comprehensively evaluated. As shown in the variance in Table 3, the iMOSSC algorithm shows better robustness on each dataset than the other three algorithms.

4.2

Cluster Analysis of Kinematics Data of Driving Conditions in Xi’an City

While performing kinematic features processing on the collected data, the data can be divided into segments of driving conditions that are independent of each other, which are called kinematic micro-travel segments. The kinematic micro-travel segments describe the vehicle’s driving process, starting from the idle speed, after accelerating, cruising, and decelerating to a complete single-stroke kinematic process. According to the kinematic features and parameter indexes of driving conditions, and based on the analysis of the influence of kinematic features on fuel economy and power of hybrid vehicles, the kinematic features parameters of driving conditions used in this study are shown in Table 4.

Table 4

Kinematic parameters of driving conditions

Kinematic characteristic parameters	Symbol	Unit
Total distance traveled	L	km
Total travel time	T	s
Maximum speed	Vmax	km/h
Maximum/Min acceleration	a	m/s2
Maximum/Min deceleration	d	m/s2

Accuracy and effectiveness are the multi-objective optimization soft subspace clustering algorithms for hybrid electric passenger vehicle driving conditions, which can well identify the purpose and requirements of the kinematic features of the urban area in Xi’an. The importance of kinematic features of the multi-objective optimization soft subspace clustering algorithm in the high-dimensional space in each driving condition is measured by the weight value. This method can significantly remove irrelevant and redundant features, restore the distribution of kinematic features, and effectively reduce the effect of the sample’s sparseness in a high-dimensional space on a distance metric. It can be used for the construction of the city circular track database in Xi’an. However, in the test under a complex environment, the kinematic parameters are easily interfered with by the outside world, and the dimensions of these parameters are different. Therefore, the original test data collected usually contains noise, and its accuracy and effectiveness will be relatively poor. It needs to be standardized before analyzing the data. Mapped to the range of [0, 1] and converted into the corresponding dimensionless form, the formula for non-quantization is:

where i ---- The i-th sample data; j ---- The j-th dimension feature; data(:, j)_min ---- The minimum value of all samples on the j-th dimension feature; data(:, j)_max ---- The maximum value of all samples on the j-th dimension feature.

The iMOSSC algorithm was used to perform cluster analysis on the kinematic features (12000 samples, 4000 samples of smooth, congested, and slow movement) in driving conditions, and compared with the single-objective iMWK-HD algorithm [14]. The feature importance ranking method is to count the number of occurrences of each feature in the Pareto solution after running on the single-objective iMWK-HD and multi-objective iMOSSC algorithms. The importance of the features is positively correlated with the number of occurrences. The serial numbers corresponding to the important features in the kinematic features of driving conditions are indicated in bold black font, as shown in Table 5. The clustering performance of the two algorithms was evaluated, and the optimal results are indicated in bold, as shown in Table 6. The feature subsets of the two algorithms are shown in Fig. 2.

Fig. 2

Comparison between the two algorithms for distinguishing kinematics data

Table 5

The two algorithms are important to the kinematics characteristics of driving conditions in Xi’an

Dataset	iMWK-HD algorithm	iMOSSC algorithm
Kinematics data of driving conditions in Xi’an	4, 1, 3, 2	1, 4, 3, 2

Table 6

Clustering results of the two algorithms on kinematic feature data

Algorithm	ACC (%)	RI (%)	Time (s)
iMWK-HD	95.8140	93.4139	2.1896
iMOSSC	98.0122	95.9530	2.2406

It can be seen from Table 5 that both algorithms can correctly select features. It can be seen from Table 6 that the multi-objective iMOSSC algorithm proposed in this study is superior to the single-objective iMWK-HD algorithm in the ACC evaluation index and RI evaluation index in the 12000 driving conditions kinematic dataset. Compared with the single-objective iMWK-HD algorithm, the multi-objective iMOSSC algorithm’s ACC evaluation index results increased by 2.1982%, and the evaluation index RI value increased by 2.5391%. However, the running time of the multi-objective iMOSSC algorithm only increased by 0.051 seconds.

It can be seen from Fig. 2 that the feature subset corresponding to the iMOSSC algorithm can basically separate the overlapped kinematic features data, so as to better distinguish the data collected in congestion, slow movement, and unobstructed driving conditions, respectively. This means that the introduction of the objective function between classes is very important for the ranking of feature importance, which can lengthen the distance between feature subsets. Therefore, the robustness of the iMOSSC algorithm and the quality of the clustering results are superior to the single-objective iMWK-HD algorithm.

5. ANALYSIS OF VEHICLE FUEL PERFORMANCE AND POWER PERFORMANCE UNDER ROAD CONDITIONS IN XI’AN

The excavated kinematic features data was imported into Optimumlap simulation software to construct a trajectory database of vehicles operating in the cycling conditions in Xi’an’s urban area. Based on the ADVISOR commercial software platform and the above-mentioned Xi’an urban road trajectory database, a co-simulation model including the novel dual-mode&dual-motor hybrid drive system, a gasoline-powered vehicle of the same specifications and size, and the Japanese Toyota Prius 2012 hybrid car on the urban roads of Xi’an was established. The results are shown in Table 7.

Table 7

Comparison of different driving methods under roads in Xi’an urban area

	Performance	Road	Gasoline vehicle	Prius	dual-mode&dual-motor hybrid system
1.8-liter gasoline engine	Fuel consumption (100km)	Urban	8.82-liter	4.06-liter	3.64-liter
High	7.628-liter	4.271-liter	3.50-liter
Comprehensive	7.56-liter	4.102-liter	3.57-liter
Dynamic performance	Acceleration (s)	10.3	10.15	7.55
Climbing (%)	63.5	64.5	73.7

It can be seen from Table 7 that under the urban roads of Xi’an, the new hybrid electric vehicle system has more advantages than the traditional 1.8-liter standard gasoline vehicle and the Prius 2012 hybrid electric vehicle produced by Toyota Japan. With good fuel economy and power, fuel consumption per 100 kilometers is reduced by 52.8% and 13.0%, acceleration time per 100 kilometers is reduced by 26.7% and 25.6%, and the maximum gradeability is increased by 13.8% and 12.5%, respectively.

6. CONCLUSIONS

Considering that the collected kinematic data is subject to multi-objective, multi-constraint, and complex nonlinear characteristics due to the changeable external environment and test signal interference, and based on the iMWK-HD algorithm to solve the problem of dimensionality reduction of high-dimensional data in the kinematic information under cyclic conditions, a new multi-objective evolution soft subspace clustering algorithm(iMOSSC) was proposed to dig out micro-stroke segments of driving conditions with different kinematics, achieving data clustering and classification. The algorithm used iNSGA-II as the base algorithm and performed local search operator and repair operator operation in the feature space to accelerate convergence and improve the accuracy of the solution. The feasibility and effectiveness of the algorithm were verified by 12 sets of UCI standard datasets.

The classified kinematic features data was used to construct the Xi’an urban road trajectory database. Based on the ADVISOR commercial software platform and the Xi’an urban road trajectory database, a co-simulation model, including the novel dual-mode&dual-motor hybrid drive system, a gasoline-powered vehicle of the same specifications and size, and the Japanese Toyota Prius 2012 hybrid car on the urban roads of Xi’an, was developed. The simulation results showed that, compared with the standard 1.8 liters gasoline-powered vehicle and the hybrid system of the Japanese Toyota Prius 2012, the new hybrid-electric system has better fuel economy, acceleration performance, and gradability when running under the urban road conditions of Xi’an. The fuel consumption per 100 kilometers is reduced by 52.8%、 13.0 %, respectively, the acceleration time per 100 kilometers is reduced by 26.7%、25.6% respectively, and the maximum grade is increased by 13.8%、12.5% respectively..

FUNDING

This work was supported by the Key Research and Development Program of Hubei Province (Grant no. 2021BID001) and the Research on Multiple Regression and Fitting Technology of Simulation Data for Dynamic Umbrella Opening of Lifesaving Umbrella (Grant No. HX2021157).

REFERENCES

[1]

Hu Xia, Research on Processing High-Dimensional Dataset Based on an Intelligent Diagnosis Approach with Multi-objective Evolutionary Optimization and Soft Subspace Clustering [D], Xi’an Jiaotong University, Xi’an (2013). Google Scholar

[2]

Hu Xia, Jian Zhuang, Dehong Yu, “Novel soft subspace clustering with multi-objective evolutionary approach for high-dimensional data [J],” Pattern Recognition, 46 2562 –2575 (2013). https://doi.org/10.1016/j.patcog.2013.02.005 Google Scholar

[3]

Weimin Zhong, Dayu Tan, Xin Peng, et al., “Fuzzy high-order hybrid clustering algorithm for swarm intelligence sets [J],” Neurocomputing, 314 347 –359 (2018). https://doi.org/10.1016/j.neucom.2018.03.019 Google Scholar

[4]

Chao Liu, Jing Xie, Qi Zhao, et al., “Novel evolutionary multi-objective soft subspace clustering algorithm for credit risk assessment [J],” Expert Systems With Applications, 138 1 –11 (2019). https://doi.org/10.1016/j.eswa.2019.112827 Google Scholar

[5]

Erzhou Zhu, Yuanxiang Zhang, Peng Wen, et al., “Fast and stable clustering analysis based on Grid-mapping K-means algorithm and new clustering validity index [J],” Neurocomputing, 363 149 –170 (2019). https://doi.org/10.1016/j.neucom.2019.07.048 Google Scholar

[6]

Luque-Casado Antonio, Fogelson Noa, Iglesias-Soler Eliseo, et al., “Exploring the effects of Transcranial Direct Current Stimulation over the prefrontal cortex on working memory: A cluster analysis approach [J],” Behavioural brain research, 375 112144 (2019). https://doi.org/10.1016/j.bbr.2019.112144 Google Scholar

[7]

Yongping Tang; Zizhen Peng, “A Customer Group Mining Method Based on Cluster Analysis [J],” Advanced Multimedia and Ubiquitous Engineering. MUE/FutureTech 2019, 351 –357 (2020). https://doi.org/10.1007/978-981-32-9244-4 Google Scholar

[8]

Deb K., “Multi-objective optimization,” Search methodologies, 403 –449 Springer,2014). https://doi.org/10.1007/978-1-4614-6940-7 Google Scholar

[9]

Guanci Yang, Ansi Zhang, Shaobo Li, et al., “Multi-objective evolutionary algorithm based on decision space partition and its application in hybrid power system optimization [J],” Appl Intell, 46 827 –844 (2017). https://doi.org/10.1007/s10489-016-0864-1 Google Scholar

[10]

Cruz-Ramírez M, Hervás-Martínez C, Sánchez-Monedero J, et al., “Metrics to guide a multi-objective evolutionary algorithmfor ordinal classification [J],” Neurocomputing, 135 21 –31 (2014). https://doi.org/10.1016/j.neucom.2013.05.058 Google Scholar

[11]

Zhaomin Chen, KiatYeo Chai, BuSung Lee, et al., “Evolutionary multi-objective optimization based ensemble autoencoders for image outlier detection [J],” Neurocomputing, 309 192 –200 (2018). https://doi.org/10.1016/j.neucom.2018.05.012 Google Scholar

[12]

Jiaxing You, Jili Chen, Minggang Dong, “An improved multistage multi-objective particle swarm optimization algorithm [J],” Journal of Chinese Computer Systems, 36 (4), 792 –796 (2015). Google Scholar

[13]

Shengdun Zhao, Liying Jin, Yuehui Wang, et al., “Soft Subspace Clustering with a Multi-objective Evolutionary Approach [J],” Association for Computing Machinery, (2018). https://doi.org/10.1145/3271553.3271610 Google Scholar

[14]

Liying Jin, Xiaobin Zhi, Shengdun Zhao, “Enhanced subspace clustering through combining Minkowski distance and Cosine dissimilarity [J],” Journal of Intelligent & Fuzzy Systems, 35 (5), 5541 –5556 (2018). https://doi.org/10.3233/JIFS-18563 Google Scholar

[15]

Wanliang Wang, Weikun Li, Yule Wang, et al., “Improved Multi-objective Sine Cosine Optimization Algorithm [J],” Journal of Chinese Computer Systems, 40 (10), 2102 –2108 (2019). Google Scholar

[16]

Liying Jin, Shengdun Zhao*, Congcong Zhang, “Adaptive Soft Subspace Clustering Combining Within-cluster and Between-cluster Information [J],” Journal of Intelligent & Fuzzy Systems, 38 (3), 3319 –3330 (2020). https://doi.org/10.3233/JIFS-190146 Google Scholar

[17]

Liying Jin, Shengdun Zhao, Wei Du, et al., “Research on Multi-objective Optimization of Switched Flux Motor Based on Improved NSGA-II Algorithm [J],” PART E-JOURNAL OF PROCESS MECHANICAL ENGINEERING, 233 (6), 1268 –1279 (2019). https://doi.org/10.1177/0954408919864185 Google Scholar

[18]

Huiyong Liu, Yeping Xiong and Qing Zhao, “Research Status and Development Trends of Petrol-Electric Hybrid Construction Machinery [J],” MACHINE TOOL & HYDRAULICS, 45 (15), 166 –171 (2017). Google Scholar

[19]

Fei Jiang, Research on Design and Control Strategy of the Hydraulic Hybrid Excavator Slewing [D], Harbin Institute of Technology, Harbin (2015). Google Scholar

[20]

Hong Wang, Research on Dynamic Modeling and Control Strategy for Hybrid Tracked Bulldozer [D], Beijing Institute of Technology, Bei Jing (2015). Google Scholar

[21]

Peng Li, Research on Control Strategy and Simulation of Hybrid in Crawler Bulldozer [D], Chang’an University, Xi’an (2015). Google Scholar

[22]

Miao Xu, Research on parameter design and control strategy of hybrid excavator power assembly [D], Jilin University, Changchun (2017). Google Scholar

[23]

Kanghui LÜ, Ping Liao, Chen Wang, et al., “Research on fuel consumption of vehicle’s power split hybrid powertrain at high-speed operation [J],” Modern Electronies Technique, 42 (9), 7280 –75 (2019). Google Scholar

[24]

Hamut HS, Dincer I, Naterer GF, “Analysis and optimization of hybrid electric vehicle thermal management systems [J],” Journal of power sources, 247 643 –654 (2014). https://doi.org/10.1016/j.jpowsour.2013.08.131 Google Scholar

[25]

Kaur J, Gaur P, Saxena, et al., “Speed control of hybrid electric vehicle using artificial intelligence techniques [J],” International journal of computing and network technology, 2 (1), 33 –39 (2014). https://doi.org/10.12785/IJCNT/020105 Google Scholar

[26]

Bing Han, Single-mode Compound Power Split Hybrid Power System Development and Heat Balance Technology Research [D], Jiangsu University, Zhenjiang (2011). Google Scholar

[27]

Li Feng, Simulate Investigation of the Power Matching for the Parallel Hybrid Electric Vehicle [D], Xi’an University of Technology, Xi’an (2006). Google Scholar

[28]

Weimin Li, Research of Hybrid Electric Vehicle Control System and Energy Management Strategy [D], Shanghai Jiao Tong University, Shanghai (2008). Google Scholar

[29]

Yong Wang, Jianyong Li, Xin Zhang, et, al., “Research on Control Strategy of Hybrid Electric Vehicle [J],” Beijing Auto, (2), 11 –14 (2004). Google Scholar

[30]

Xiaoqi Tang, Fawen Liu, Guoqing Zhao, et, al., “Research on Control Strategy of Hybrid Electric Vehicle [J],” Automotive Technology, (2), 21 –24 (2004). Google Scholar

[31]

Shaohua Wang, Jiaxin Li, Dehua Shi, et, al., “Control strategy of power-split HEV based on optimal transmission efficiency [J],” Journal of Jiangsu University, 39 (6), 621 –627 (2018). Google Scholar

[32]

Andaloro L, Arista A, Agnello G, et al., “Study and design of a hybrid electric (lithium batteries-PEMFC) [J],” International Journal of Hydrogen Energy, 42 (5), 3166 –3184 (2017). https://doi.org/10.1016/j.ijhydene.2016.12.082 Google Scholar

[33]

Xia H, Zhuang J, Yu DH, “Novel Soft Subspace Clustering with Multi-objective Evolutionary Approach for High-dimensional Data[J],” Pattern Recognition, 46 (9), 2562 –2575 (2013). https://doi.org/10.1016/j.patcog.2013.02.005 Google Scholar

[34]

Yajun Xu, Research on Subspace Clustering Algorithms and its Applications [D], Jiangnan University, Zhenjiang (2016). Google Scholar

[35]

Amorim RC, Mirkin B, “Minkowski metric, feature weighting and anomalous cluster initializing in K-Means clustering[J],” Pattern Recognition, 45 (3), 1061 –1075 (2012). https://doi.org/10.1016/j.patcog.2011.08.012 Google Scholar

[36]

MUR PP, AHA DW, UCI repository of machine learning databases, (2014) http://mlearn.ics.uci.edu/MLRepository.html - 09 ). 2014). Google Scholar

[37]

Wang J, Deng Z, Choi KS, et al., “Distance metric learning for soft subspace clustering in composite kernel space[J],” Pattern Recognition, 52 113 –134 (2016). https://doi.org/10.1016/j.patcog.2015.10.018 Google Scholar

[38]

Jing L, Ng MK, Huang JZ, “An Entropy Weighting K-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data[J],” IEEE Transactions on Knowledge & Data Engineering, 19 1026 –1041 (2007). https://doi.org/10.1109/TKDE.2007.1048 Google Scholar

[39]

Gan GJ, Wu JH, Yang ZJ, Advanced Data Mining and Applications[M], 271 –278 Springer, Berlin (2006). Google Scholar

[40]

Gan G, Wu J, “A convergence theorem for the fuzzy subspace clustering (FSC) algorithm [J],” Pattern Recognition, 41 1939 –1947 (2008). https://doi.org/10.1016/j.patcog.2007.11.011 Google Scholar

[41]

Rand WM, “Objective Criteria for the Evaluation of Clustering Methods[J],” Journal of the American Statistical Association, 66 (336), 846 –850 (1971). https://doi.org/10.1080/01621459.1971.10482356 Google Scholar

[42]

Strehl A, Ghosh J, “Cluster ensembles: a knowledge reuse framework for combining multiple partitions[J],” Journal of Machine Learning Research, 3 (3), 583 –617 (2008). Google Scholar

Citation Download Citation

Wensheng Wang, Xindong Hu, Shendun Zhao, Liying Jin, Yangfeng Cao, and Yang Xie "Novel algorithm for identifying kinematics characteristic data under road conditions in Xi’an", Proc. SPIE 12462, Third International Symposium on Computer Engineering and Intelligent Communications (ISCEIC 2022), 124620V (2 February 2023); https://doi.org/10.1117/12.2661176

Access the abstract

PROCEEDINGS
11 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Kinematics

Roads

Evolutionary algorithms

Databases

Optimization (mathematics)

Data modeling

Detection and tracking algorithms

1.

INTRODUCTION

2.

RELATED WORKS

Figure 1.

3.

IMOSSC ALGORITHM

3.1

Multi-objective mathematical model

3.2

Search operator and repair operator

3.3

iMOSSC algorithm flow

4.

IMOSSC ALGORITHM PERFORMANCE TEST AND ANALYSIS

Table 1.

Table 2

4.1

UCI dataset test results and analysis

Table 3

4.2

Cluster Analysis of Kinematics Data of Driving Conditions in Xi’an City

Table 4

Fig. 2

Table 5

Table 6

5.

ANALYSIS OF VEHICLE FUEL PERFORMANCE AND POWER PERFORMANCE UNDER ROAD CONDITIONS IN XI’AN

Table 7

6.

CONCLUSIONS

FUNDING

REFERENCES

Show All Keywords

Keywords/Phrases

Search In:

Publication Years