Fuzzy-import hashing: A static analysis technique for malware detection

Abstract

The advent of new malware types and their attack vectors poses serious challenges for security experts in discovering effective malware detection and analysis techniques. The preliminary step in malware analysis is filtering out samples of counterfeit malware from the suspicious samples by classifying them into most likely and unlikely malware categories. This will enable effective utilisation of resources and expertise for the most likely category of samples in subsequent stages and avoid nugatory effort. This process requires a very fast and resource-optimised method as it is applied on a large sample size. Fuzzy hashing and import hashing methods satisfy these requirements of malware analysis, though, with some limitations. Therefore, the proper integration of these methods, may overcome some of the limitations and improve the detection accuracy without affecting the overall performance of analysis. Hence, this paper proposes a fuzzy-import hashing technique, which is the integration of two methods, namely, fuzzy hashing and import hashing. This integration can offer several benefits such as an improved detection rate by complementing each other when one method cannot detect malware, then the other method can; and the generation of fuzzfied results for subsequent clustering or classification, as the import hashing result can be easily merged with the fuzzy hashing result. The success of this proposed fuzzy-import hashing method is demonstrated through several experiments namely: on the collected malware and goodware corpus; a comparative evaluation against the established YARA rules and application in fuzzy c-means clustering.

Publication DOI: https://doi.org/10.1016/j.fsidi.2021.301139
Divisions: College of Engineering & Physical Sciences > School of Informatics and Digital Engineering > Computer Science
College of Engineering & Physical Sciences
Additional Information: © 2021, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/
Uncontrolled Keywords: Malware Analysis,Fuzzy-Import Hashing,Fuzzy Hashing,YARA Rules,Fuzzy C-Means Clustering,Ransomware
Full Text Link:
Related URLs: https://www.sci ... 0378?via%3Dihub (Publisher URL)
PURE Output Type: Article
Published Date: 2021-06-01
Published Online Date: 2021-04-01
Accepted Date: 2021-02-27
Authors: Naik, Nitin (ORCID Profile 0000-0002-0659-9646)
Jenkins, Paul
Savage, Nick
Yang, Longzhi
Boongoen, Tossapon
Iam-On, Natthakan

Download

[img]

Version: Published Version

Access Restriction: Restricted to Repository staff only


[img]

Version: Accepted Version

Access Restriction: Restricted to Repository staff only until 1 April 2022.

License: Creative Commons Attribution Non-commercial No Derivatives


Export / Share Citation


Statistics

Additional statistics for this record