Code Clones Literature
Papers on clone detection techniques are listed first, followed by papers on other aspects of code clones. These include:
- Categorization and visualization of detected clones to aid in the analysis of the clones
- Possible maintenance of clones
- Survey of the research field in general and evaluation of different clone detection tools
- Searching for crosscutting concerns
- Development of reference data for tool evaluation
- Evaluation of copying and pasting by programmers
In addition, links to clone detection tools (standalone and Eclipse plugins), related events, and research groups are also listed.
If any of the information below is incorrect or out of date, please send a correction by email. Suggestions of other papers are also welcome. The papers are sorted by year of publication (most recent first).
Last updated: 02/01/2010
Process
Detection
Shinji Kawaguchi, Takanobu Yamashina, Hidetake Uwano, Kyohei Fushida, Yasutaka Kamei, Masataka Nagura, Hajimu Iida – Working Conference on Reverse Engineering (WCRE) – 2009
Yoshiki Higo, Shinji Kusumoto – Working Conference on Reverse Engineering (WCRE) – 2009
Tung Nguyen, Hoan Nguyen, Jafar Al-Kofahi, Nam Pham, Tien Nguyen – International Conference on Software Maintenance (ICSM) – 2009
Michel Chilowicz, Etienne Duris, Gilles Roussel – International Conference on Program Comprehension (ICPC) – 2009
[ PDF ] Scott Grant, James R. Cordy – International Conference on Program Comprehension (ICPC) – 2009
Nam Pham, Hoan Nguyen, Jafar Al-Kofahi, Tung Nguyen, Tien Nguyen – International Conference on Software Engineering (ICSE) – 2009
[ PDF ] Elmar Juergens, Florian Deissenboeck, Benjamin Hummel – International Conference on Software Engineering (ICSE) – 2009
Hoan Nguyen, Tung Nguyen, Nam Pham, Jafar Al-Kofahi, Tien Nguyen – Fundamental Approaches to Software Engineering (FASE) – 2009
[ PDF ] Yue Jia, David Binkley, Mark Harman, Jens Krinke, Makoto Matsushita – International Workshop on Software Clones (IWSC) – 2009
[ DOI ] Raimar Falke, Pierre Frenzel, Rainer Koschke – Empirical Software Engineering – 2008
[ PDF ] Rainer Koschke, Raimar Falke, Pierre Frenzel – Working Conference on Reverse Engineering – 2006
[ PDF ] Nils Gode – Diploma Thesis – 2008
Nils Gode, Rainer Koschke – European Conference on Software Maintenance and Reengineering – 2009
Nicholas Kraft, Brandon Bonds, Randy Smith – International Conference on Software Engineering and Knowledge Engineering (SEKE) – 2008
NICAD: Accurate Detection of Near-Miss Intentional Clones Using Flexible Pretty-Printing and Code Normalization
[ PDF ] Chanchal K. Roy, James R. Cordy – International Conference on Program Comprehension (ICPC) – 2008
[ PDF ] Florian Deissenboeck, Benjamin Hummel, Elmar Juergens, Bernhard Schaetz, Stefan Wagner, Stefan Teuchert, Jean-Francois Girard – International Conference on Software Engineering (ICSE) – 2008
[ PDF ] Mark Gabel, Lingxiao Jiang, Zhendong Su – International Conference on Software Engineering (ICSE) – 2008
[ PDF ] Peter Bulychev, Marius Minea – Spring Young Researchers Colloquium on Software Engineering (SYRCoSE) – 2008
[ DOI ] William Evans, Christopher Fraser, Fei Ma – Working Conference on Reverse Engineering (WCRE) – 2007
[ DOI ] Hamid Basit, Simon Pugliesi, William Smyth, Andrei Turpin, Stan Jarzabek – European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) – 2007
[ PDF ] Lingxiao Jiang, Ghassan Misherghi, Zhendong Su, Stephane Glondu – International Conference on Software Engineering (ICSE) – 2007
[ DOI ] Stephane Ducasse, Oscar Nierstrasz, Matthias Rieger – International Journal on Software Maintenance and Evolution: Research and Practice – 2006
[ DOI ] Seunghak Lee, Iryoung Jeong – Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA) – 2005
[ PDF ] Hamid Basit, Stan Jarzabek – European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) – 2005
[ PDF ] Richard Wettel, Radu Marinescu – International Symposium on Symbolic and Numeric Algorithms for Scientific Computing – 2005
[ PDF ] Matthias Rieger – Ph.D. Thesis – 2005
Method-Level Code Clone Detection on Transformed Abstract Syntax Trees Using Sequence Matching Algorithms
[ PDF ] Kevin Greenan – Student Project Report – 2005
[ PDF ] Zhenmin Li, Shan Lu, Suvda Myagmar, Yuanyuan Zhou – Symposium on Operating Systems Design and Implementation (OSDI) – 2004
[ PDF ] James R. Cordy, Thomas Dean, Nikita Synytskyy – IBM Centre for Advanced Studies Conference (CASCON) – 2004
[ DOI ] Vera Wahler, Dietmar Seipel, Gregor Fischer – International Workshop on Source Code Analysis and Manipulation (SCAM) – 2004
[ PDF ] Richard Wettel – Diploma Thesis – 2004
[ PDF ] Keith Gallagher, Lucas Layman – International Workshop on Program Comprehension (IWPC) – 2003
[ DOI ] Yasushi Ueda, Toshihiro Kamiya, Shinji Kusumoto, Katsuro Inoue – Asia-Pacific Software Engineering Conference (APSEC) – 2002
[ PDF ] Toshihiro Kamiya, Shinji Kusumoto, Katsuro Inoue – IEEE Transactions on Software Engineering – 2002
[ PDF ] Andrian Marcus, Jonathan Maletic – International Conference on Automated Software Engineering (ASE) – 2001
[ PDF ] Jens Krinke – Working Conference on Reverse Engineering (WCRE) – 2001
[ PDF ] Stephane Ducasse, Matthias Rieger, Serge Demeyer – International Conference on Software Maintenance (ICSM) – 1999
[ PDF ] Ira Baxter, Andrew Yahin, Leonardo Moura, Marcelo Sant'Anna, Lorraine Bier – International Conference on Software Maintenance (ICSM) – 1998
[ PDF ] Kostas Kontogiannis – Working Conference on Reverse Engineering (WCRE) – 1997
[ DOI ] Jean Mayrand, Claude Leblanc, Ettore Merlo – International Conference on Software Maintenance (ICSM) – 1996
[ PDF ] Kostas Kontogiannis – Automated Software Engineering – 1996
[ PDF ] Neil Davey, Paul Barson, Simon Field, Ray Frank, Stewart Tansley – International Journal of Applied Software Technology – 1995
[ DOI ] John Johnson – International Conference on Software Maintenance (ICSM) – 1994
Analysis
Elmar Juergens, Florian Deissenboeck, Christoph Domann, Martin Feilkas, Benjamin Hummel, Bernhard Schaetz, Jonathan Streit, Stefan Wagner – International Conference on Software Engineering (ICSE) – 2010
Elmar Juergens, Florian Deissenboeck, Benjamin Hummel – European Conference on Software Maintenance and Reengineering (CSMR) – 2010
Y. Fukushima, R. Kula, Shinji Kawaguchi, Kyohei Fushida, Masataka Nagura, Hajimu Iida – Asia-Pacific Software Engineering Conference (APSEC) – 2009
Yoshiki Higo, Ken-ichi Sawa, Shinji Kusumoto – Asia-Pacific Software Engineering Conference (APSEC) – 2009
Tung Nguyen, Hoan Nguyen, Nam Pham, Jafar Al-Kofahi, Tien Nguyen – International Conference on Automated Software Engineering (ASE) – 2009
[ PDF ] Christoph Domann, Elmar Juergens, Jonathan Streit – International Symposium on Empirical Software Engineering and Measurement (ESEM) – 2009
[ DOI ] Journal of Software Maintenance: Research and Practice – 2009
[ PDF ] Nicolas Bettenburg, Weiyi Shang, Walid Ibrahim, Bram Adams, Ying Zou, Ahmed Hassan – Working Conference on Reverse Engineering (WCRE) – 2009
Ettore Merlo, Thierry Lavoie – Working Conference on Reverse Engineering (WCRE) – 2009
Nils Gode – International Working Conference on Source Code Analysis and Manipulation (SCAM) – 2009
[ PDF ] Elmar Juergens, Florian Deissenboeck, Benjamin Hummel, Stefan Wagner – International Conference on Software Engineering (ICSE) – 2009
[ PDF ] Wiebe Hordijk, Maria Ponisio, Roel Wieringa – International Conference on Evaluation and Assessment in Software Engineering (EASE) – 2009
[ PDF ] Toshihiro Kamiya – International Workshop on Software Clones (IWSC) – 2009
[ PDF ] Jan Harder, Nils Gode – International Workshop on Software Clones (IWSC) – 2009
Angela Lozano, Michel Wermelinger – International Conference on Software Maintenance (ICSM) – 2008
Stan Jarzabek – International Conference on Software Maintenance (ICSM) – 2008
Toshihiro Kamiya – International Conference on Software Maintenance (ICSM) – 2008
[ PDF ] Jens Krinke – International Working Conference on Source Code Analysis and Manipulation (SCAM) – 2008
[ DOI ] Cory Kapser, Michael Godfrey – Empirical Software Engineering – 2008
[ PDF ] Cory Kapser, Michael Godfrey – Working Conference on Reverse Engineering – 2006
[ PDF ] Elmar Juergens, Benjamin Hummel, Florian Deissenboeck, Martin Feilkas – Testmethoden fur Software (TESO) – 2008
[ DOI ] Yuseung Ma, Dukkuyn Woo – Asia-Pacific Software Engineering Conference (APSEC) – 2007
[ DOI ] Tibor Bakota, Rudolf Ferenc, Tibor Gyimothy – International Conference on Software Maintenance (ICSM) – 2007
[ PDF ] Jens Krinke – Working Conference on Reverse Engineering (WCRE) – 2007
[ PDF ] Lingxiao Jiang, Zhendong Su, Edwin Chiu – European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) – 2007
[ DOI ] Brenda Baker – IEEE Transactions on Software Engineering – 2007
[ DOI ] Yoshiki Higo, Toshihiro Kamiya, Shinji Kusumoto, Katsuro Inoue – Information and Software Technology – 2007
[ PDF ] Zhenming Jiang, Ahmed Hassan – International Working Conference on Source Code Analysis and Manipulation (SCAM) – 2007
[ PDF ] Damith Rajapakse, Stan Jarzabek – International Conference on Software Engineering (ICSE) – 2007
[ DOI ] Simone Livieri, Yoshiki Higo, Makoto Matsushita, Katsuro Inoue – International Conference on Software Engineering (ICSE) – 2007
[ DOI ] Simone Livieri, Yoshiki Higo, Makoto Matsushita, Katsuro Inoue – International Workshop on Mining Software Repositories (MSR) – 2007
[ DOI ] Angela Lozano, Michel Wermelinger, Bashar Nuseibeh – International Workshop on Mining Software Repositories (MSR) – 2007
[ DOI ] Lerina Aversano, Luigi Cerulo, Massimiliano Di Penta – European Conference on Software Maintenance and Reengineering (CSMR) – 2007
[ PDF ] Zhenming Jiang – Masters Thesis – 2006
[ PDF ] Cory Kapser, Michael Godfrey – International Journal on Software Maintenance and Evolution: Research and Practice – 2006
[ PDF ] Raihan Al-Ekram, Cory Kapser, Richard Holt, Michael Godfrey – International Symposium on Empirical Software Engineering – 2005
[ DOI ] Miryung Kim, Vibha Sazawal, David Notkin, Gail Murphy – European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) – 2005
[ PDF ] Cory Kapser, Michael Godfrey – International Conference on Software Maintenance (ICSM) – 2005
[ PDF ] Hamid Basit, Damith Rajapakse, Stan Jarzabek – International Conference on Software Engineering and Knowledge Engineering (SEKE) – 2005
[ PDF ] Hamid Basit, Damith Rajapakse, Stan Jarzabek – International Conference on Web Engineering – 2005
[ PDF ] Hamid Basit, Damith Rajapakse, Stan Jarzabek – International Conference on Software Engineering (ICSE) – 2005
[ DOI ] Miryung Kim, David Notkin – International Workshop on Mining Software Repositories (MSR) – 2005
[ PDF ] Matthias Rieger, Stephane Ducasse, Michele Lanza – Working Conference on Reverse Engineering (WCRE) – 2004
[ PDF ] Cory Kapser – International Workshop on Principles of Software Evolution – 2004
Filip Van Rysselberghe, Serge Demeyer – International Workshop on Object-Oriented Reengineering – 2004
[ PDF ] Giuliano Antoniol, Umberto Villano, Ettore Merlo, Massimiliano Di Penta – Information and Software Technology – 2002
[ PDF ] Akito Monden, Daikai Nakae, Toshihiro Kamiya, Shin-ichi Sato, Ken-ichi Matsumoto – Symposium on Software Metrics (METRICS) – 2002
[ PDF ] Giuliano Antoniol, Gerardo Casazza, Massimiliano Di Penta, Ettore Merlo – International Conference on Software Maintenance (ICSM) – 2001
[ PDF | PS ] Magdalena Balazinska, Ettore Merlo, Michel Dagenais, Bruno Lague, Kostas Kontogiannis – International Software Metrics Symposium – 1999
John Johnson – IBM Centre for Advanced Studies Conference (CASCON) – 1994
Maintenance
Ekwa Duala-Ekoko, Martin Robillard – ACM Transactions on Software Engineering and Methodology (TOSEM) – 2010
[ PDF ] Ekwa Duala-Ekoko, Martin Robillard – International Conference on Software Engineering – 2008
[ PDF ] Ekwa Duala-Ekoko, Martin Robillard – International Conference on Software Engineering – 2007
Michiel de Wit, Andy Zaidman – International Conference on Software Maintenance (ICSM) – 2009
Daqing Hou, Patricia Jablonski, Ferosh Jacob – International Conference on Program Comprehension (ICPC) – 2009
Thilo Mende, Rainer Koschke, Felix Beckwermert – Journal of Software Maintenance: Research and Practice – 2009
[ PDF ] Thilo Mende, Felix Beckwermert, Rainer Koschke, Gerald Meier – European Conference on Software Maintenance and Reengineering – 2008
[ PDF ] Huiqing Li, Simon Thompson – Workshop on Partial Evaluation and Semantics-Based Program Manipulation (PEPM) – 2009
[ PDF ] Sandro Schulze, Martin Kuhlemann, Marko Rosenmuller – Workshop on Refactoring Tools (WRT) – 2008
[ DOI ] Tung Nguyen, Hoan Nguyen, Nam Pham, Jafar Al-Kofahi, Tien Nguyen – International Conference on Automated Software Engineering (ASE) – 2008
[ PDF ] Valentin Weckerle – Diploma Thesis – 2008
[ PDF ] Yoshiki Higo, Yasushi Ueda, Shinji Kusumoto, Katsuro Inoue – Asia-Pacific Software Engineering Conference (APSEC) – 2007
[ PDF ] Andy Chiu, David Hirtle – Student Project Report – 2007
[ URL ] 2007
[ PDF ] Yoshiki Higo – Ph.D. Thesis – 2006
[ PDF ] Nicolas Juillerat, Beat Hirsbrunner – Workshop on Software Evolution through Transformations (SeTra) – 2006
[ DOI ] Salah Bouktif, Giuliano Antoniol, Ettore Merlo, Markus Neteler – Genetic and Evolutionary Computation Conference – 2006
[ PDF ] Stan Jarzabek, Shubiao Li – International Journal on Software Maintenance and Evolution: Research and Practice – 2006
[ PDF ] Michael Toomim, Andrew Begel, Susan Graham – Symposium on Visual Languages - Human Centric Computing – 2004
[ PDF ] Yidong Liu – Diploma Thesis – 2004
[ PDF ] Yoshiki Higo, Toshihiro Kamiya, Shinji Kusumoto, Katsuro Inoue – International Conference on Software Engineering and Applications – 2004
[ PDF ] Raghavan Komondoor – Ph.D. Thesis – 2003
[ PDF ] Georges Koni N'Sapu – Diploma Thesis – 2001
[ PDF | PS ] Magdalena Balazinska, Ettore Merlo, Michel Dagenais, Bruno Lague, Kostas Kontogiannis – Working Conference on Reverse Engineering (WCRE) – 2000
[ PDF | PS ] Magdalena Balazinska, Ettore Merlo, Michel Dagenais, Bruno Lague, Kostas Kontogiannis – Working Conference on Reverse Engineering (WCRE) – 1999
[ PDF ] Richard Fanta, Vaclav Rajlich – Journal of Software Maintenance: Research and Practice – 1999
Surveys and Evaluations
Survey of Overall Research
[ PDF ] Rainer Koschke – Dagstuhl Seminar: Duplication, Redundancy, and Similarity in Software – 2006
Evaluation of Clone Detection Tools
[ PDF ] Chanchal K. Roy, James R. Cordy – International Workshop on Mutation Analysis (Mutation) – 2009
[ PDF ] Chanchal K. Roy, James R. Cordy, Rainer Koschke – Science of Computer Programming (SCP) – 2009
[ PDF ] Chanchal K. Roy, James R. Cordy – Canadian Conference on Computer Science and Software Engineering (C3S2E) – 2008
[ DOI ] Stefan Bellon, Rainer Koschke, Giuliano Antoniol, Jens Krinke, Ettore Merlo – IEEE Transactions on Software Engineering – 2007
[ PDF ] Filip Van Rysselberghe, Serge Demeyer – International Workshop on Evolution of Large Scale Industrial Applications (ELISA) – 2003
[ PDF ] Stefan Bellon – International Workshop on Source Code Analysis and Manipulation (SCAM) – 2002
[ DOI ] Elizabeth Burd, John Bailey – International Workshop on Source Code Analysis and Manipulation (SCAM) – 2002
Tools
Standalone Tools
Axivion Bauhaus Suite [ URL ]
CCFinder [ URL ]
CloneDR [ URL ]
Clone Digger [ URL ]
Clone Detective (part of ConQAT) [ URL ]
Copy Paste Detector [ URL ]
Duplo [ URL ]
Scorpio [ URL ]
Simian [ URL ]
Eclipse Plug-ins
CloneBoard [ URL ]
CloneTracker [ URL ]
Consistent Renaming Tool (CReN) [ URL ]
CopyPasteChange (CPC) [ URL ]
Duplication Management Framework [ URL ]
SDD [ URL ]
SimScan [ URL ]
In Visual Studio
Clone Detective (part of ConQAT) [ URL ]
Related Topics
Reference Data
[ PDF ] Andrew Walenstein, Nitin Jyoti, Junwei Li, Yun Yang, Arun Lakhotia – Working Conference on Reverse Engineering (WCRE) – 2003
Copy and Paste Practices
[ DOI ] Zoltan Mann – IEEE Computer – 2006
[ PDF ] Mihai Balint, Tudor Girba, Radu Marinescu – International Conference on Program Comprehension (ICPC) – 2006
[ DOI ] Miryung Kim, Lawrence Bergman, Tessa Lau, David Notkin – Symposium on Empirical Software Engineering – 2004
Aspect Mining
Takashi Ishio, Hironori Date, Tatsuya Miyake, Katsuro Inoue – Working Conference on Reverse Engineering (WCRE) – 2008
[ PDF ] Kim Mens, Andy Kellens, Jens Krinke – Working Conference on Reverse Engineering (WCRE) – 2008
[ DOI ] Jing Zhang, Yuehua Lin, Jeff Gray, Robert Tairas – International Journal of Computer Applications in Technology – 2008
[ DOI ] Chanchal K. Roy, Mohammad Gias Uddin, Banani Roy, Thomas Dean – International Conference on Program Comprehension (ICPC) – 2007
[ PDF ] Silvia Breu, Thomas Zimmermann, Christian Lindig – Eclipse Technology Exchange Workshop (ETX) – 2006
[ PDF ] Silvia Breu, Thomas Zimmermann – International Conference on Automated Software Engineering (ASE) – 2006
[ DOI ] Magiel Bruntink, Arie van Deursen, Remco van Engelen, Tom Tourwe – IEEE Transactions on Software Engineering – 2005
[ PDF ] Silvia Breu – Position Paper – 2004
[ PDF ] Magiel Bruntink – Workshop on Aspect Reverse Engineering – 2004
[ PDF ] Magiel Bruntink, Arie van Deursen, Remco van Engelen, Tom Tourwe – International Conference on Software Maintenance (ICSM) – 2004
Related Links
Events
Fourth International Workshop on Detection of Software Clones [ URL ]
Held in conjunction with ICSE 2010 - May 8, 2010
Third International Workshop on Detection of Software Clones [ URL ]
Held in conjunction with CSMR 2009 - March 24, 2009
Duplication, Redundancy, and Similarity in Software [ URL ]
Dagstuhl Seminar - July 23-26, 2006
Towards Evaluation of Aspect Mining Workshop
Held in conjunction with ECOOP 2006 - July 4, 2006
Second International Workshop on Detection of Software Clones [ URL ]
Held in conjunction with WCRE 2003 - November 13, 2003
First International Workshop on Detection of Software Clones [ URL ]
Held before ICSM 2002 - October 2, 2002
Research Groups
Software Architecture Group (SWAG)
Software Composition Group (SCG)
Software Engineering Laboratory
Software Evolution Research Group (SWEVO)
Software Research Laboratory
This project is supported by NSF grant CPA-0702764