Web Classification Experiments
n
WebKB dataset
n
Four CS department websites
n
Bag of words on each page
n
Links between pages
n
Anchor text for links
n
Experimental setup
n
Trained on three universities
n
Tested on fourth
n
Repeated for all four combinations