FS2RNN: Feature Selection Scheme for Web Spam Detection Using Recurrent Neural Networks
Conference paper
Authors | Makkar, A., Obaidat, M.S. and Kumar, N. |
---|---|
Type | Conference paper |
Abstract | In modern era, Internet plays a key role in accessing and fetching web information and web resources from World Wide Web (WWW). The websites act as a medium for retrieving information from the web. Although it increases the data retrieval and users interactions, it also opens the gate for various types of attacks. For example, spams in the websites attract various Internet users. It has been observed from the literature that many authors attempted to detect the web spam using various machine learning techniques. However, none of these techniques used deep learning architecture for detection of hidden patterns. Hence, in this paper, a deep learning algorithm, i.e., Recurrent Neural Networks (RNN), has been used for the classification of spam nodes. We devise here a framework called FS2RNN: Feature Selection Scheme using Recurrent Neural Networks. In this framework, the dataset is preprocessed before applying RNN in which principal component analysis (PCA) is used for dimension reduction on the data set and recursive feature elimination (RFE) is used for feature selection. The accuracy of the proposed framework, when compared before and after preprocessing, is improved by 24.2 %, which is excellent result. |
Keywords | Deep learning; Feature extraction; Learning algorithms; Principal component analysis; Web browsers; Websites |
Year | 2018 |
Conference | 2018 IEEE Global Communications Conference, GLOBECOM 2018; Abu Dhabi National Exhibition Centre (ADNEC) Abu Dhabi; United Arab Emirates; 9 December 2018 through 13 December 2018; Category number CFP18GLO-ART; Code 145422 |
Journal | 2018 IEEE Global Communications Conference, GLOBECOM 2018 - Proceedings |
Publisher | IEEE |
ISSN | 2576-6813 |
Digital Object Identifier (DOI) | https://doi.org/10.1109/glocom.2018.8647294 |
Web address (URL) | http://www.scopus.com/inward/record.url?eid=2-s2.0-85063463649&partnerID=MN8TOARS |
Journal citation | 2018 (Article: 8647294) |
ISBN | 978-1-5386-4727-1 |
Web address (URL) of conference proceedings | https://ieeexplore.ieee.org/xpl/conhome/8634808/proceeding |
Output status | Published |
Publication dates | |
Online | 21 Feb 2019 |
Publication process dates | |
Deposited | 22 May 2023 |
https://repository.derby.ac.uk/item/9yx62/fs2rnn-feature-selection-scheme-for-web-spam-detection-using-recurrent-neural-networks
23
total views0
total downloads3
views this month0
downloads this month