PhilaeX: Explaining the Failure and Success of AI Models in Malware Detection

Lu, Zhi; Thing, Vrizlynn L. L.

doi:10.5220/0010986700003194

Computer Science > Cryptography and Security

arXiv:2207.00740 (cs)

[Submitted on 2 Jul 2022]

Title:PhilaeX: Explaining the Failure and Success of AI Models in Malware Detection

Authors:Zhi Lu, Vrizlynn L. L. Thing

View PDF

Abstract:The explanation to an AI model's prediction used to support decision making in cyber security, is of critical importance. It is especially so when the model's incorrect prediction can lead to severe damages or even losses to lives and critical assets. However, most existing AI models lack the ability to provide explanations on their prediction results, despite their strong performance in most scenarios. In this work, we propose a novel explainable AI method, called PhilaeX, that provides the heuristic means to identify the optimized subset of features to form the complete explanations of AI models' predictions. It identifies the features that lead to the model's borderline prediction, and those with positive individual contributions are extracted. The feature attributions are then quantified through the optimization of a Ridge regression model. We verify the explanation fidelity through two experiments. First, we assess our method's capability in correctly identifying the activated features in the adversarial samples of Android malwares, through the features attribution values from PhilaeX. Second, the deduction and augmentation tests, are used to assess the fidelity of the explanations. The results show that PhilaeX is able to explain different types of classifiers correctly, with higher fidelity explanations, compared to the state-of-the-arts methods such as LIME and SHAP.

Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2207.00740 [cs.CR]
	(or arXiv:2207.00740v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2207.00740
Journal reference:	7th International Conference on Internet of Things, Big Data and Security, ISBN 978-989-758-564-7; ISSN 2184-4976, pp 37-46, 2022
Related DOI:	https://doi.org/10.5220/0010986700003194

Submission history

From: Zhi Lu [view email]
[v1] Sat, 2 Jul 2022 05:06:24 UTC (261 KB)

Computer Science > Cryptography and Security

Title:PhilaeX: Explaining the Failure and Success of AI Models in Malware Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:PhilaeX: Explaining the Failure and Success of AI Models in Malware Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators