DYNAMIC PHISHING WEBSITE DATASET COLLECTION
Abstract: The paper lists deficiencies in publicly available datasets of phishing websites and proposes a method that mitigates them. A prototype implementing the method is described, along with the results of using it to create a dataset of phishing site archives. The resulting dataset is free of the described deficiencies.
Keywords: Phishing site detection, machine learning, website archiving.