Reference: Julian Kupiec, Jan Pedersen, and Francine Chen, “A Trainable Document Summarizer”, SIGIR ’95, Seattle, WA, USA, 1995.
Julian Kupiec, Jan Pedersen, and Francine Chen, Xerox Palo Alto Research Center
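which can be expressed using Bayes’ rule as follows:
$$P(s \in S \mid F_1, \ldots, F_k) = \frac{P(F_1, \ldots, F_k \mid s \in S)\, P(s \in S)}{P(F_1, \ldots, F_k)}$$
Assuming statistical independence of the features:
$$P(s \in S \mid F_1, \ldots, F_k) = \frac{\prod_{j=1}^{k} P(F_j \mid s \in S)\, P(s \in S)}{\prod_{j=1}^{k} P(F_j)}$$
$P(s \in S)$ is a constant, and $P(F_j \mid s \in S)$ and $P(F_j)$ can be estimated directly from the training set by “counting occurrences”.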
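As a rough illustration of the counting-based estimate, the sketch below scores a sentence by multiplying a prior by per-feature ratios, each obtained from raw counts over a labelled training set. It is a minimal sketch, not the authors' implementation: the function names `train` and `score`, the dictionary layout, the toy binary features, and the choice to return 0 for unseen feature values (no smoothing) are all assumptions made for this example.

```python
from collections import Counter


def train(examples):
    """Count feature occurrences over a labelled training set.

    `examples` is a list of (features, in_summary) pairs, where `features`
    is a tuple of discrete feature values and `in_summary` is a bool.
    (Hypothetical data layout chosen for this sketch.)
    """
    counts = Counter()          # (feature index, value) -> count over all sentences
    summary_counts = Counter()  # same key -> count over summary sentences only
    n_summary = 0
    for features, in_summary in examples:
        if in_summary:
            n_summary += 1
        for j, value in enumerate(features):
            counts[(j, value)] += 1
            if in_summary:
                summary_counts[(j, value)] += 1
    return {
        "n": len(examples),
        "n_summary": n_summary,
        "counts": counts,
        "summary_counts": summary_counts,
    }


def score(model, features):
    """Prior times the product over features of P(F_j | summary) / P(F_j)."""
    result = model["n_summary"] / model["n"]  # prior, constant for all sentences
    for j, value in enumerate(features):
        seen = model["counts"][(j, value)]
        if seen == 0:
            return 0.0  # unseen feature value; this sketch applies no smoothing
        p_f_given_summary = model["summary_counts"][(j, value)] / model["n_summary"]
        p_f = seen / model["n"]
        result *= p_f_given_summary / p_f
    return result


# Toy training set with two binary features (made-up data).
training = [
    ((1, 1), True),
    ((1, 0), True),
    ((0, 1), False),
    ((0, 0), False),
    ((1, 0), False),
    ((0, 0), False),
]
model = train(training)

# Rank candidate sentences by score, highest first.
candidates = [(0, 0), (1, 0), (1, 1)]
ranked = sorted(candidates, key=lambda f: score(model, f), reverse=True)
```

Because the independence assumption is only an approximation, the resulting value is best read as a ranking score rather than a calibrated probability; in this toy data it is used only to order the candidate sentences.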
Xiaodan Zhu and Gerald Penn, Department of Computer Science, University of Toronto