Journal article icon

Journal article

Understanding plagiarism linguistic patterns, textual features, and detection methods

Abstract:
Plagiarism can be of many different natures, ranging from copying texts to adopting ideas, without giving credit to its originator. This paper presents a new taxonomy of plagiarism that highlights differences between literal plagiarism and intelligent plagiarism, from the plagiarist’s behavioral point of view. The taxonomy supports deep understanding of different linguistic patterns in committing plagiarism, for example, changing texts into semantically equivalent but with different words and organization, shortening texts with concept generalization and specification, and adopting ideas and important contributions of others. Different textual features that characterize different plagiarism types are discussed. Systematic frameworks and methods of monolingual, extrinsic, intrinsic, and cross-lingual plagiarism detection are surveyed and correlated with plagiarism types, which are listed in the taxonomy. We conduct extensive study of state-of-the-art techniques for plagiarism detection, including character n-gram-based (CNG), vector-based (VEC), syntax-based (SYN), semantic-based (SEM), fuzzy-based (FUZZY), structuralbased (STRUC), stylometric-based (STYLE), and cross-lingual techniques (CROSS).Our study corroborates that existing systems for plagiarism detection focus on copying text but fail to detect intelligent plagiarism when ideas are presented in different words.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Publisher copy:
10.1109/TSMCC.2011.2134847

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Role:
Author

Contributors

Institution:
Universiti Teknologi Malaysia
Role:
Contributor
Institution:
MIRLabs
Role:
Contributor


Journal:
IEEE Transactions on systems, man, and cybernetics - Part C: Applications and reviews More from this journal
Volume:
PP
Issue:
99
Pages:
1-17
Publication date:
2011-05-12
Edition:
Accepted Manuscript
DOI:
ISSN:
1094-6977


Language:
English
Keywords:
Subjects:
UUID:
uuid:cc0ef51c-001c-4c48-bb4c-43f64e94895d
Local pid:
ora:9878
Deposit date:
2015-02-03

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP