Collection 36: Articles containing the word "humanities" but that have been classified as not being about the humanties from U.S. top-circulating newspapers and student newspapers, c. 1998-2018

A collection of word-frequency and other data representing 27,362 unique articles (no duplicate or close-variant documents) that contain the word "humanities" but that have not been classified as being about the humanities published from 1998-2018 in 545 U.S. top-circulating and student newspapers and their associated blogs. The collection includes 13,309 articles from U.S. top-circulating newspapers and 14,053 articles from student newspapers. Supervised classification models have classified these articles as not being about the humanities; this collection therefore helps WE1S understand what articles that contain the word "humanities" but that aren't about the humanities per se are like.

News sources in Collection 36 include 15 top-circulation U.S newspapers: Boston Globe, Chicago Tribune, Daily News (New York), Dallas Morning News, Denver Post, Houston Chronicle, Los Angeles Times, New York Post, New York Times (and its blogs), Newsday (New York), Seattle Times, Star Tribune (Minneapolis, MN), Tampa Bay Times, USA Today, Washington Post. Also included are documents from 530 U.S. campus newspapers, among which the top 15 sources in the collection are: The Stanford Daily (Stanford University), The Tartan (Carnegie Mellon), The Dartmouth (Dartmouth), The Harvard Crimson (Harvard), The Daily Princetonian (Princeton), The Brown Daily Herald (Brown), The Daily Cardinal (UW Madison), The Daily Titan (California State University, Fullerton), The Daily Bruin (UC Los Angeles), The Daily Universe (Brigham Young University), Badger Herald (UW Madison), Daily Eastern News (Eastern Illinois University), The Columbia Spectator (Columbia), Indiana Daily Student (Indiana University), Cornell Daily Sun (Cornell).

Kinds of Sources (by Tags)

Sources in Collection 36 are associated with the following non-exclusive metadata categories, which describe the kinds of sources in the collection. Of the 27,362 total articles: 11,576 are from publications located in the North East, 5,192 in the South, 4,997 in the Midwest, 3,886 in the West Coast, 1,407 in the Rockies and Southwest. Of the 14,053 student newspaper articles: 9,064 are from publications located at doctoral universities, 7,327 are from public schools, 6,623 are from private schools, 1,780 are from liberal arts institutions, 1,497 are from Hispanic-serving institutions, 1,410 are from the Ivy League, 661 are from the Cal State system, 577 are from Catholic institutions, 569 are from community colleges, 522 are from the UC system, 497 are from science, tech, and/or ag schools, 403 are from Christian institutions, 202 are from institutions associated with the Church of Latter-Day Saints, 107 are from Jewish institutions, 101 are from Historically Black Colleges or Universities, 82 are from women's colleges. Sources are assigned to categories based solely on explicit publication information and/or self-identification.

Suggested Citation

WhatEvery1Says (WE1S) Project. (May 22, 2020). Collection 36: Articles containing the word "humanities" but that are not about the humanties from U.S. top-circulating newspapers and student newspapers. Zenodo. DOI 10.5281/zenodo.4948902.


Collection Metadata


Topic Models of This Collection

Model Family 1 (created May 22, 2020): models for 25, 50, 100, 150, 200, 250 topics

Visualizations for this model family:

25 topics 50 topics 100 topics 150 topics 200 topics 250 topics
Dfr-browser
TopicBubbles
pyLDAvis
DendrogramViewer
Diagnostics

WE1S Developers Only

This start page for the collection last revised: June 13, 2021