This paper presents a reference study of the available algorithms for plagiarism
detection and develops a semantic plagiarism detection algorithm for medical
research papers that employs the medical ontologies available on the World Wide Web.
Detecting plagiarism in medical research written in natural language is a complex
problem that is closely tied to the specific domain of medical research.
Many algorithms are used for plagiarism detection in natural language. They are
generally divided into two main categories: algorithms that compare files by their
fingerprints, and algorithms that compare file content, which include string
matching algorithms and text and tree matching algorithms.
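As a rough illustration of the first category, the sketch below compares two texts by their fingerprints. It assumes that a fingerprint is the set of hashed word k-grams of a text and that two fingerprints are compared by Jaccard overlap; the function names, the choice of SHA-1, and the default k = 3 are illustrative assumptions and are not taken from the algorithms surveyed here.

```python
import hashlib
import re


def fingerprint(text: str, k: int = 3) -> set[str]:
    """Return the set of hashed word k-grams of a text.

    Assumption: word k-grams hashed with SHA-1 stand in for whatever
    fingerprinting scheme a real detector would use.
    """
    words = re.findall(r"\w+", text.lower())
    grams = (" ".join(words[i:i + k]) for i in range(len(words) - k + 1))
    return {hashlib.sha1(g.encode("utf-8")).hexdigest()[:16] for g in grams}


def fingerprint_similarity(a: str, b: str, k: int = 3) -> float:
    """Jaccard similarity between the fingerprints of two texts, in [0, 1]."""
    fa, fb = fingerprint(a, k), fingerprint(b, k)
    if not fa and not fb:
        return 0.0  # nothing to compare (texts shorter than k words)
    return len(fa & fb) / len(fa | fb)
```

A score close to 1 indicates that the two texts share most of their k-grams; a content comparison algorithm from the second category would instead inspect the matching passages themselves.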
Recently, considerable research has been carried out on semantic plagiarism
detection, and semantic plagiarism detection algorithms have been developed on the
basis of citation analysis models in scientific research.
In this research, a plagiarism detection system was developed using the “Bing” search
engine. Two types of ontologies are used in this system: a public ontology, such as
WordNet, and several standard international ontologies in the medical domain, such
as the Diseases ontology, which contains descriptions and definitions of diseases
and the derivation relations between them.
This paper deals with the automatic detection of plagiarism in Arabic documents. We present a new idea based on experimentation with lexical chains. The proposed method extracts these chains from the original document and uses a search engine
to verify whether such chains occur in other documents. The second step of our method uses an automatic translation system to translate the lexical chains and verifies, using a search engine, whether those chains occur in documents in other languages. We then compute a correlation ratio between the lexical chains of the original document and the lexical chains extracted from the documents returned by the search engine, in order to detect plagiarism in the original document.
At the end of this paper, we present our prototype, called « Alkachef », developed to detect plagiarism in Arabic documents.
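The comparison of lexical chains described above can be sketched as follows. The abstract does not define the correlation ratio, so this sketch makes its own assumptions: a lexical chain is modeled as a set of related words, two chains match when their Jaccard overlap reaches a threshold, and the ratio is the share of the original document's chains that match some chain of the candidate document. The function names and the threshold of 0.5 are hypothetical, and the search engine and translation steps are left out.

```python
def chain_overlap(chain_a: frozenset[str], chain_b: frozenset[str]) -> float:
    """Jaccard overlap between two lexical chains modeled as word sets."""
    if not chain_a or not chain_b:
        return 0.0
    return len(chain_a & chain_b) / len(chain_a | chain_b)


def chain_match_ratio(
    original: list[frozenset[str]],
    candidate: list[frozenset[str]],
    threshold: float = 0.5,  # assumed cut-off, not taken from the paper
) -> float:
    """Share of the original chains that match at least one candidate chain."""
    if not original:
        return 0.0
    matched = sum(
        1
        for chain in original
        if any(chain_overlap(chain, other) >= threshold for other in candidate)
    )
    return matched / len(original)


# Usage: chains would come from the original document and from a document
# returned by the search engine; here they are written by hand.
original_chains = [
    frozenset({"disease", "illness", "infection"}),
    frozenset({"doctor", "nurse"}),
]
candidate_chains = [frozenset({"disease", "illness", "virus"})]
ratio = chain_match_ratio(original_chains, candidate_chains)
```

A higher ratio means that more of the original document's chains reappear in the retrieved document, which under these assumptions would make it a stronger plagiarism candidate.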