Abstract
Detecting the security level of a confidential document is a vital task for organizations that need to protect the information it contains. Human experts currently apply diverse classification rules and techniques. The growing volume of confidential information in organizations makes it difficult to classify every document carefully by human effort alone. This study proposes a hybrid approach that combines a support vector classifier with an adaptive neuro-fuzzy classifier. It also describes the preprocessing tasks required for document classification with natural language processing. To represent term-document relations, the commonly recommended TF-IDF metric was chosen to construct a weight matrix. The agglutinative nature of Turkish is handled with Turkish stemming algorithms. The article concludes with experimental results and success metrics, reported as accuracy rates.
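As an illustration of the TF-IDF weight matrix mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes the common variant of the formula (term frequency normalized by document length, multiplied by the natural log of the number of documents over the document frequency), whitespace tokenization, and input that has already been stemmed. The function name `tfidf_matrix` and the sample documents are hypothetical.

```python
import math
from collections import Counter


def tfidf_matrix(docs):
    """Build a document-by-term TF-IDF weight matrix.

    Assumes each document is a string of already-stemmed tokens
    separated by whitespace (an assumption, not taken from the paper).
    Returns (vocab, matrix), where matrix[i][j] is the weight of
    vocab[j] in docs[i].
    """
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({term for tokens in tokenized for term in tokens})
    n_docs = len(docs)

    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))

    matrix = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # guard against empty documents
        row = [
            (counts[term] / total) * math.log(n_docs / doc_freq[term])
            for term in vocab
        ]
        matrix.append(row)
    return vocab, matrix


# Hypothetical toy corpus of stemmed Turkish tokens.
docs = ["gizli rapor gizli", "genel rapor", "genel duyuru"]
vocab, weights = tfidf_matrix(docs)
```

Each row of `weights` could then serve as the feature vector passed to a classifier; terms that occur in every document receive a weight of zero under this variant.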
Original language | English (US) |
---|---|
Pages (from-to) | 1412-1417 |
Number of pages | 6 |
Journal | Procedia Computer Science |
Volume | 3 |
DOIs | |
State | Published - 2011 |
Externally published | Yes |
Event | 1st World Conference on Information Technology, WCIT-2010, Istanbul, Turkey. Duration: Oct 6 2010 → Oct 10 2010 |
Keywords
- ANFIS
- Document classification
- Expert systems
- SVM
- Turkish NLP
ASJC Scopus subject areas
- General Computer Science