From Local to Global Semantic Clone Detection

Yuan Yuan, Weiqiang Kong, Gang Hou, Yan Hu, Masahiko Watanabe, Akira Fukuda

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Clone detection identifies similar code fragments (referred to as code clones) in software products, which can help with software optimization and maintenance. Code clone detection can be divided into textual, lexical, syntactic, and semantic levels. Existing techniques have achieved good results at the first three levels, but no significant results have been obtained for semantic clone detection. In this paper, we propose a novel semantic-level clone detection approach. We use the control flow graph (CFG) as an intermediate representation of a program method, and combine the classical dynamic time warping (DTW) algorithm from the field of speech recognition with two deep neural network models, a bidirectional RNN autoencoder and a graph convolutional network (GCN), to detect semantic-level clones from local to global. We experimented with a dataset consisting of five large-scale real-world systems and a code corpus containing a large number of programming problems. The experimental results show that our approach achieves good results in detecting both local and global semantic clones.
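
The abstract names dynamic time warping (DTW) but does not say how it is applied to CFG-derived features. As a rough illustration of the classical algorithm only, the sketch below aligns two sequences of scalar values; the function name `dtw_distance`, the scalar inputs, and the absolute-difference cost are assumptions made for this example and are not taken from the paper.

```python
from typing import Sequence


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Classical dynamic time warping distance between two numeric sequences.

    Illustrative sketch only: the scalar inputs and the absolute-difference
    local cost are assumptions, not the paper's feature representation.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = local + min(
                cost[i - 1][j],      # a[i-1] also matches the current b element
                cost[i][j - 1],      # b[j-1] also matches the current a element
                cost[i - 1][j - 1],  # both sequences advance together
            )
    return cost[n][m]


if __name__ == "__main__":
    # A repeated element in one sequence is absorbed by the warping.
    print(dtw_distance([1, 2, 3], [1, 2, 2, 3]))
    print(dtw_distance([1, 2, 3], [2, 3, 4]))
```

Because DTW tolerates stretched or repeated elements, two sequences that differ only in the repetition of a step can still align at zero cost, which is one plausible reason it suits comparing code fragments of different lengths.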

Original language: English
Title of host publication: Proceedings - 2019 6th International Conference on Dependable Systems and Their Applications, DSA 2019
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 13-24
Number of pages: 12
ISBN (Electronic): 9781728160573
Publication status: Published - Jan 2020
Event: 6th International Conference on Dependable Systems and Their Applications, DSA 2019 - Harbin, China
Duration: Jan 3 2020 → Jan 6 2020

Publication series

Name: Proceedings - 2019 6th International Conference on Dependable Systems and Their Applications, DSA 2019

Conference

Conference: 6th International Conference on Dependable Systems and Their Applications, DSA 2019
Country: China
City: Harbin
Period: 1/3/20 → 1/6/20

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
