Sammon Projected Feature Selection Based Linear Support Vector Regression for Big Data Predictive Analytics
Main Article Content
Abstract
In big data analytics, classification is a key problem to be resolved for providing efficient prediction results. Recently, many research works have been designed for big data classification. However, accuracy was not increased with minimal time complexity when considering large dataset as input. In order to handle such issues, Sammon Projected Feature Selection based Linear Support Vector Regression (SPFS-LSVR) Method is introduced. The SPFS-LSVR method is developed to minimize the time and space complexity involved in the classification with better accuracy. At first, SPFS-LSVR method takes big dataset as input where it includes number of features and data for executing predictive analytics. Then theSammon Projection is employed in SPFS-LSVR methodfor performing the feature selection in high dimensional data. Sammon Projection helps in finding the relevant features by mapping the high-dimensional space to the lower-dimensionality space. This in turns, the time and space complexity occurred during the classification is reduced. After that, SPFS-LSVR method uses a Linear Support Vector Regression (LSVR) model for examining the selected features of input data with higher accuracy. The designed model uses hyperplane for producing the exact prediction result with higher accuracy. The LSVR model determines the relationship between independent data (i.e., features of input data) and the dependent data (i.e., prediction outcomes) with the help of Laplace kernel function. From that, the maximum relationship between the input features of data is classified into different classes. This helps in SPFS-LSVR method to enhance the performance of classification with maximum accuracy and minimal error rate. Through the efficient classification performance of big data, proposed SPFS-LSVR method improves the big data predictive analytics process as compared to state-of-the-art works. Experimental evaluation is carried out using big dataset on factors such as prediction time, prediction accuracy, false positive rate and space complexity with respect to number of data.
Downloads
Metrics
Article Details
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.