KSII Transactions on Internet and Information Systems
Monthly Online Journal (eISSN: 1976-7277)

Cross-architecture Binary Function Similarity Detection based on Composite Feature Model


Abstract

Recent studies have shown that neural network-based binary code similarity detection performs well in vulnerability mining, plagiarism detection, and malicious code analysis. However, existing cross-architecture methods still suffer from insufficient feature characterization and low discrimination accuracy. To address these issues, this paper proposes a cross-architecture binary function similarity detection method based on a composite feature model (SDCFM). First, the binary function is converted into a vector representation according to the proposed composite feature model, which is composed of instruction statistical features, control flow graph structural features, and application program interface calling behavioral features. Then, the composite features are embedded by the proposed hierarchical embedding network, which is based on a graph neural network. In this network, the block-level features and the function-level features are processed separately and finally fused into the embedding. In addition, to make the trained model more accurate and stable, our method uses the embeddings of predecessor nodes to modify the node embedding in the iterative updating process of the graph neural network. To assess the effectiveness of the composite feature model, we compare SDCFM with the state-of-the-art method on benchmark datasets. The experimental results show that SDCFM performs well on both the area under the curve in the binary function similarity detection task and the vulnerable candidate function ranking in the vulnerability search task.
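
As a rough illustration of the pipeline described above, the following Python sketch embeds a function by iteratively updating each basic block's embedding from its own features and the embeddings of its predecessor blocks, then fuses the result with function-level features and compares two functions by cosine similarity. It is not the authors' implementation. The weight matrices W1, W2, and W3, the tanh nonlinearity, sum aggregation, fusion by concatenation, the use of cosine similarity, and all toy feature values are assumptions made only for illustration; in a real system the weights would be learned.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def embed_function(block_feats, edges, func_feats, W1, W2, W3, rounds=3):
    """Toy hierarchical embedding of one function.

    block_feats: one feature vector per basic block (block level)
    edges:       (src, dst) control flow edges between block indices
    func_feats:  function-level feature vector (e.g. API call counts)
    W1, W2, W3:  hypothetical weight matrices, untrained in this sketch
    """
    n, dim = len(block_feats), len(W1)
    preds = [[] for _ in range(n)]
    for src, dst in edges:
        preds[dst].append(src)

    mu = [[0.0] * dim for _ in range(n)]
    for _ in range(rounds):
        new_mu = []
        for v in range(n):
            # Sum the current embeddings of the predecessor blocks of v.
            agg = [sum(mu[u][k] for u in preds[v]) for k in range(dim)]
            own = matvec(W1, block_feats[v])
            msg = matvec(W2, agg)
            new_mu.append([math.tanh(a + b) for a, b in zip(own, msg)])
        mu = new_mu

    # Block-level part: sum of all block embeddings.
    graph_vec = [sum(mu[v][k] for v in range(n)) for k in range(dim)]
    # Fuse with the function-level part by concatenation (an assumption).
    return graph_vec + matvec(W3, func_feats)


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


if __name__ == "__main__":
    rng = random.Random(0)

    def rand_matrix(rows, cols):
        return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

    W1, W2, W3 = rand_matrix(4, 3), rand_matrix(4, 4), rand_matrix(2, 2)

    # Two made-up functions with the same control flow shape.
    f_a = ([[3, 1, 0], [2, 0, 1], [1, 1, 1]], [(0, 1), (0, 2), (1, 2)], [2.0, 1.0])
    f_b = ([[4, 1, 0], [2, 0, 2], [1, 1, 1]], [(0, 1), (0, 2), (1, 2)], [2.0, 1.0])

    e_a = embed_function(*f_a, W1, W2, W3)
    e_b = embed_function(*f_b, W1, W2, W3)
    print("similarity:", cosine(e_a, e_b))
```

In this sketch the same weights are applied to functions from any architecture, so cross-architecture comparison would depend entirely on the input features being architecture-neutral, which is the role the abstract assigns to the composite feature model.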


Cite this article

[IEEE Style]
X. Li, G. Zhang, Q. Li, P. Zhang, Z. Chen, J. Liu, and S. Yue, "Cross-architecture Binary Function Similarity Detection based on Composite Feature Model," KSII Transactions on Internet and Information Systems, vol. 17, no. 8, pp. 2101-2123, 2023. DOI: 10.3837/tiis.2023.08.008.

[ACM Style]
Xiaonan Li, Guimin Zhang, Qingbao Li, Ping Zhang, Zhifeng Chen, Jinjin Liu, and Shudan Yue. 2023. Cross-architecture Binary Function Similarity Detection based on Composite Feature Model. KSII Transactions on Internet and Information Systems, 17, 8, (2023), 2101-2123. DOI: 10.3837/tiis.2023.08.008.

[BibTeX Style]
@article{tiis:55877, title="Cross-architecture Binary Function Similarity Detection based on Composite Feature Model", author="Xiaonan Li and Guimin Zhang and Qingbao Li and Ping Zhang and Zhifeng Chen and Jinjin Liu and Shudan Yue", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2023.08.008}, volume={17}, number={8}, year="2023", month={August}, pages={2101-2123}}