Web Classification Experiments
nWebKB dataset
nFour CS department websites
nBag of words on each page
nLinks between pages
nAnchor text for links
nExperimental setup
nTrained on three universities
nTested on fourth
nRepeated for all four combinations