HathiTrust Statistics (as of Jan. 4, 2012)
9,992,316 total volumes
5,304,534 book titles
263,333 serial titles
2,711,662 volumes (~27% of total) in the public domain
The public domain is generally defined as consisting of works that are either ineligible for copyright protection or whose copyrights have expired. No permission is needed to copy or use public domain works. A work published in the United States prior to 1923 is in the public domain. U.S. Government documents are also considered public domain. Rules on what is in the public domain vary from country to country. Definition adapted from University of California Copyright.
Public domain materials are available in full text in HathiTrust and can be downloaded by WU users.
The HathiTrust Research Center (HTRC) enables computational access to published works in the public domain and, in the future, will offer access on limited terms to in-copyright works from the HathiTrust. The HTRC offers a suite of tools for computational text analysis. For example, Author Search can provide text analysis and visualization of the words in a text.
Below is a word cloud of terms from The prose works of Jonathan Swift. London: G. Bell and sons, 1897-1908.
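A word cloud is typically driven by simple term counts, with more frequent words drawn larger. As a rough illustration only, and not a description of how the HTRC tools work internally, the following Python sketch counts terms in a text. The function name `term_counts`, the short stop-word list, and the sample sentence are all hypothetical and chosen for this example; the sample is placeholder text, not a quotation from Swift.

```python
import re
from collections import Counter

# Hypothetical, deliberately short stop-word list (an assumption for this
# sketch); real text-analysis tools use much longer lists.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "it", "that"}


def term_counts(text, top_n=10):
    """Return the top_n most frequent terms in text, ignoring stop words."""
    # Lowercase the text and pull out runs of letters (and apostrophes).
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(word for word in words if word not in STOP_WORDS)
    return counts.most_common(top_n)


# Placeholder sample text, invented for this example.
sample = "The library preserves books. The library shares books and serials."

for term, count in term_counts(sample, top_n=3):
    print(term, count)
```

To try this on a public domain volume downloaded from HathiTrust, the sample string could be replaced with the contents of a plain-text file read from disk.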
The HathiTrust is a partnership of major academic and research libraries collaborating in an extraordinary digital library initiative to preserve and provide access to the published record in digital form. Established in 2008 by 13 universities, it has grown to over 60 partners.
Washington University became a member of the HathiTrust on January 1, 2012. With our membership, WU students, faculty, and staff can now download materials in the public domain (+2.9 million volumes) in addition to conducting full-text searches of all materials in the HathiTrust Catalog.