Abstract:
Phishing ranks among the most prevalent cybercrimes, according to many cybercrime awareness organizations.
“Exploratory data analysis of phishing sites to identify most important features to detect a
phishing site” is a research project that explores which features of a website are most
significant for detecting phishing. To explore these features, data were collected from an
open-source machine learning data repository. Correlation and univariate selection methods
were then applied to identify the most significant features for detecting a phishing site.
Finally, a system based on the top five selected features was built to test whether it can
identify phishing sites.
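
As an illustration of the correlation-based ranking step mentioned above, the following is a minimal sketch and not the project's actual implementation. It assumes the data are held as numeric columns with a binary label (1 = phishing, 0 = legitimate). The feature names, the toy values, and the helper functions pearson and top_k_by_correlation are all hypothetical. The univariate selection step is not shown.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def top_k_by_correlation(columns, labels, k=5):
    """Return the k feature names with the largest |correlation| to the label."""
    scores = {name: abs(pearson(values, labels)) for name, values in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Toy data invented for illustration only (hypothetical feature names).
columns = {
    "has_ip_address": [1, 1, 0, 0, 1, 0],
    "uses_https": [0, 0, 1, 1, 0, 1],
    "url_length_flag": [1, 0, 1, 0, 1, 0],
}
labels = [1, 1, 0, 0, 1, 0]  # 1 = phishing, 0 = legitimate

print(top_k_by_correlation(columns, labels, k=2))
```

The absolute value is used so that a strongly negative correlation (a feature that signals a legitimate site) ranks as highly as a strongly positive one.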