Formation of categories in document classification systems

Sanjiv K. Bhatia, Jitender S Deogun, Vijay V. Raghavan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Information retrieval systems employ the classification of documents into various categories to facilitate retrieval. The problem of categorization depends on the successful solution to three subproblems: creation of categories, determining the relationship between categories, and maintenance of the categorization system. In existing document categorization systems, the categories are formed by using hit and trial methods. This increases the initial setup period for the system. The initial setup time is further affected by an empirical assignment of relationships between categories. In this paper, we propose a solution to the problem of developing categories by the application of techniques originating in knowledge acquisition. The approach is based on capturing the knowledge of a user to ensure continuity with the existing categorization system. The use of Personal Construct Theory for knowledge elicitation helps in making explicit the subconscious hierarchical relationships between various categories as perceived by the user.

Original languageEnglish (US)
Title of host publicationComputing in the 1990's - 1st Great Lakes Computer Science Conference, Proceedings
PublisherSpringer Verlag
Pages91-97
Number of pages7
Volume507 LNCS
ISBN (Print)9780387976280
DOIs
StatePublished - 1991
Event1st Great Lakes Computer Science Conference, 1989 - Kalamazoo, United States
Duration: Oct 18 1989Oct 20 1989

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume507 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other1st Great Lakes Computer Science Conference, 1989
CountryUnited States
CityKalamazoo
Period10/18/8910/20/89

Fingerprint

Document Classification
Knowledge acquisition
Information retrieval systems
Categorization
Knowledge Elicitation
Setup Times
Knowledge Acquisition
Hits
Information Retrieval
Maintenance
Retrieval
Assignment

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Bhatia, S. K., Deogun, J. S., & Raghavan, V. V. (1991). Formation of categories in document classification systems. In Computing in the 1990's - 1st Great Lakes Computer Science Conference, Proceedings (Vol. 507 LNCS, pp. 91-97). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 507 LNCS). Springer Verlag. https://doi.org/10.1007/BFb0038478

Formation of categories in document classification systems. / Bhatia, Sanjiv K.; Deogun, Jitender S; Raghavan, Vijay V.

Computing in the 1990's - 1st Great Lakes Computer Science Conference, Proceedings. Vol. 507 LNCS Springer Verlag, 1991. p. 91-97 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 507 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Bhatia, SK, Deogun, JS & Raghavan, VV 1991, Formation of categories in document classification systems. in Computing in the 1990's - 1st Great Lakes Computer Science Conference, Proceedings. vol. 507 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 507 LNCS, Springer Verlag, pp. 91-97, 1st Great Lakes Computer Science Conference, 1989, Kalamazoo, United States, 10/18/89. https://doi.org/10.1007/BFb0038478
Bhatia SK, Deogun JS, Raghavan VV. Formation of categories in document classification systems. In Computing in the 1990's - 1st Great Lakes Computer Science Conference, Proceedings. Vol. 507 LNCS. Springer Verlag. 1991. p. 91-97. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/BFb0038478
Bhatia, Sanjiv K. ; Deogun, Jitender S ; Raghavan, Vijay V. / Formation of categories in document classification systems. Computing in the 1990's - 1st Great Lakes Computer Science Conference, Proceedings. Vol. 507 LNCS Springer Verlag, 1991. pp. 91-97 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{91803560979e4ad98484bcafd12d71a2,
title = "Formation of categories in document classification systems",
abstract = "Information retrieval systems employ the classification of documents into various categories to facilitate retrieval. The problem of categorization depends on the successful solution to three subproblems: creation of categories, determining the relationship between categories, and maintenance of the categorization system. In existing document categorization systems, the categories are formed by using hit and trial methods. This increases the initial setup period for the system. The initial setup time is further affected by an empirical assignment of relationships between categories. In this paper, we propose a solution to the problem of developing categories by the application of techniques originating in knowledge acquisition. The approach is based on capturing the knowledge of a user to ensure continuity with the existing categorization system. The use of Personal Construct Theory for knowledge elicitation helps in making explicit the subconscious hierarchical relationships between various categories as perceived by the user.",
author = "Bhatia, {Sanjiv K.} and Deogun, {Jitender S} and Raghavan, {Vijay V.}",
year = "1991",
doi = "10.1007/BFb0038478",
language = "English (US)",
isbn = "9780387976280",
volume = "507 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "91--97",
booktitle = "Computing in the 1990's - 1st Great Lakes Computer Science Conference, Proceedings",
address = "Germany",

}

TY - GEN

T1 - Formation of categories in document classification systems

AU - Bhatia, Sanjiv K.

AU - Deogun, Jitender S

AU - Raghavan, Vijay V.

PY - 1991

Y1 - 1991

N2 - Information retrieval systems employ the classification of documents into various categories to facilitate retrieval. The problem of categorization depends on the successful solution to three subproblems: creation of categories, determining the relationship between categories, and maintenance of the categorization system. In existing document categorization systems, the categories are formed by using hit and trial methods. This increases the initial setup period for the system. The initial setup time is further affected by an empirical assignment of relationships between categories. In this paper, we propose a solution to the problem of developing categories by the application of techniques originating in knowledge acquisition. The approach is based on capturing the knowledge of a user to ensure continuity with the existing categorization system. The use of Personal Construct Theory for knowledge elicitation helps in making explicit the subconscious hierarchical relationships between various categories as perceived by the user.

AB - Information retrieval systems employ the classification of documents into various categories to facilitate retrieval. The problem of categorization depends on the successful solution to three subproblems: creation of categories, determining the relationship between categories, and maintenance of the categorization system. In existing document categorization systems, the categories are formed by using hit and trial methods. This increases the initial setup period for the system. The initial setup time is further affected by an empirical assignment of relationships between categories. In this paper, we propose a solution to the problem of developing categories by the application of techniques originating in knowledge acquisition. The approach is based on capturing the knowledge of a user to ensure continuity with the existing categorization system. The use of Personal Construct Theory for knowledge elicitation helps in making explicit the subconscious hierarchical relationships between various categories as perceived by the user.

UR - http://www.scopus.com/inward/record.url?scp=85030854678&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85030854678&partnerID=8YFLogxK

U2 - 10.1007/BFb0038478

DO - 10.1007/BFb0038478

M3 - Conference contribution

SN - 9780387976280

VL - 507 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 91

EP - 97

BT - Computing in the 1990's - 1st Great Lakes Computer Science Conference, Proceedings

PB - Springer Verlag

ER -