International Journal of Emerging Trends & Technology in Computer Science
A Motivation for Recent Innovation & Research
ISSN 2278-6856
www.ijettcs.org
Title: |
Authorship Identification: Naïve Bayes with XGBoost Approach
|
Author Name: |
Dr. B. S. Daga, Jason Dsouza, Ryan Furtado, Manupendra Tiwari |
Abstract: |
In today’s world, electronic text is used for
communication on a large scale. Most of this content is
provided anonymously or under unverified names. For
forensic applications, it is important to group texts that
may have been written by the same individual under
different aliases. Copyright disputes, in which several
people claim ownership of the same content, are also
common. Authorship identification along with
mathematical or statistical analysis of texts could be the
key to solving this problem. The fundamental assumption
of authorship identification is that each individual
subconsciously favors certain words, writing patterns and
sentiments, which together make their writing style
unique. Extracting these individual features from text
makes it possible to distinguish one author from another.
The problem statement for our system is as follows:
build a system that can be trained to recognize an
individual from their writing style, i.e. the set of
words (features) that the individual uses frequently. This
is also known as generating a writeprint (analogous to a
fingerprint). With the help of this writeprint, the system
will be able to identify other documents or texts that
were written by the same individual. This should help
reduce plagiarism of authors' work and can also be used in
forensics to identify criminals based on their writing.
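
As an illustration only, the idea of a word-frequency writeprint can be
sketched with a small multinomial Naive Bayes classifier. This is a
minimal sketch under stated assumptions, not the system described in
the paper: the function names (tokenize, train_writeprints,
predict_author), the toy texts and the author labels are all
hypothetical, the features are plain word counts with add-one
smoothing, and the XGBoost stage named in the title is not shown.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train_writeprints(samples):
    """Build per-author word counts (a toy 'writeprint') from (author, text) pairs."""
    word_counts = defaultdict(Counter)  # author -> word -> count
    doc_counts = Counter()              # author -> number of training texts
    vocabulary = set()
    for author, text in samples:
        tokens = tokenize(text)
        word_counts[author].update(tokens)
        doc_counts[author] += 1
        vocabulary.update(tokens)
    return word_counts, doc_counts, vocabulary


def predict_author(text, word_counts, doc_counts, vocabulary):
    """Return the author with the highest Naive Bayes log-posterior."""
    total_docs = sum(doc_counts.values())
    vocab_size = len(vocabulary)
    tokens = [t for t in tokenize(text) if t in vocabulary]  # ignore unseen words
    best_author, best_score = None, float("-inf")
    for author in doc_counts:
        # Log prior: share of training texts written by this author.
        score = math.log(doc_counts[author] / total_docs)
        total_words = sum(word_counts[author].values())
        for token in tokens:
            # Add-one (Laplace) smoothing avoids log(0) for words
            # this author never used in the training texts.
            count = word_counts[author][token]
            score += math.log((count + 1) / (total_words + vocab_size))
        if score > best_score:
            best_author, best_score = author, score
    return best_author


# Hypothetical toy data: two authors with different habitual words.
samples = [
    ("alice", "indeed the evidence is rather compelling indeed"),
    ("alice", "rather than guess we should indeed measure"),
    ("bob", "honestly the results look kinda weird honestly"),
    ("bob", "kinda hard to say honestly but it works"),
]
model = train_writeprints(samples)
print(predict_author("indeed this is rather odd", *model))
```

On real data the word counts would be replaced by a richer feature set,
and far more text per author would be needed before such a writeprint
becomes reliable.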
Keywords: Authorship Identification, Handwriting
Analysis, Plagiarism Detection, Writeprint, Feature
Extraction. |
Cite this article: |
Dr. B. S. Daga, Jason Dsouza, Ryan Furtado, Manupendra Tiwari,
"Authorship Identification: Naïve Bayes with XGBoost Approach", International Journal of Emerging Trends & Technology in Computer Science (IJETTCS),
Volume 8, Issue 3, May - June 2019, pp. 001-007, ISSN 2278-6856.
|