The MATLAB file contains the following variables:

wl - cell array with the list of words (more than 14K entries)
docname - cell array with the names of the documents (2486 docs)
authors_names - cell array with the authors' names, in the format 'Sejnowski_T'
counts - n_words x n_docs count matrix
aw_counts - n_words x n_authors count matrix

To build aw_counts, the word counts of each document were divided by the number of authors of that document, and the result was assigned to each of those authors.
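
As an illustration of how an author-by-word matrix like aw_counts could be derived from counts, here is a minimal Python sketch. It is one reading of the description, not the code used to produce the file. The doc_authors mapping (which authors wrote which document) is hypothetical: it is not among the variables listed for the file, and the function name and toy numbers are made up for the example.

```python
def author_word_counts(counts, doc_authors, n_authors):
    """Split each document's word counts evenly among its authors.

    counts      -- n_words x n_docs nested list of word counts
    doc_authors -- hypothetical mapping: doc_authors[d] lists the author
                   indices of document d (not stored in the data file)
    n_authors   -- total number of authors
    Returns an n_words x n_authors nested list of fractional counts.
    """
    n_words = len(counts)
    aw = [[0.0] * n_authors for _ in range(n_words)]
    for d, authors in enumerate(doc_authors):
        if not authors:
            continue  # a document with no listed authors contributes nothing
        share = 1.0 / len(authors)  # divide by the number of authors
        for w in range(n_words):
            c = counts[w][d]
            if c:
                for a in authors:
                    aw[w][a] += c * share
    return aw


# Toy data: 2 words, 3 documents, 2 authors.
counts = [[2, 0, 4],
          [1, 3, 0]]
doc_authors = [[0], [0, 1], [1]]  # document 1 has two authors

aw_counts = author_word_counts(counts, doc_authors, n_authors=2)
print(aw_counts)  # [[2.0, 4.0], [2.5, 1.5]]
```

Under this reading the total count mass is preserved: the entries of aw_counts sum to the same value as the entries of counts for every document that has at least one author.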