Title
Document Classification and Routing
Publication Date
1999
Document Type
Book Chapter
Abstract
This chapter describes a document classification and routing system that uses a probabilistic approach to determine the “flavor” of a text. The necessary probabilities are determined from the relevant training documents. The chapter discusses the development, refinement, and testing of the system’s ability to route 120,000 documents into 50 topics, as well as the mathematical model on which the system is based.
Comments
Guthrie L., Guthrie J., Leistensnider J. (1999) Document Classification and Routing. In: Strzalkowski T. (eds) Natural Language Information Retrieval. Text, Speech and Language Technology, vol 7. Springer, Dordrecht