Thurs 18 Nov (rescheduled from Oct). 6:30pm
The Internet Archive is one of the best things about the Web, and it deserves a careful
examination and
appreciation. This talk will look at the history, purpose, and amazingly complete features
of the Internet Archive.
I promise you will learn a lot of cool new things about the Internet Archive that you can
immediately!
by Scott Granneman
The Internet Archive celebrated its 25th Anniversary 1 month ago on Oct 21.
So, Scott Granneman of SLUUG will take us down memory lane.
Scott has pulled various gems from the Archieve for classes he's taught at Wash U
& Webster U.
Scott will give us a flavor of what goodies are available.
The Internet Archive is a non-profit digital library in the heart of San Francisco, CA.
It convenes the Decentralized Web Summit (
https://decentralizedweb.net/
<https://www.google.com/url?q=https://decentralizedweb.net/&sa=D&source=calendar&usd=2&usg=AOvVaw0FBjAqgAsxIeB6KZrDc8Nx>)
and other events.
================================
What's in The Archive...
Official Internet Archive Statistics for 2020
GENERAL
• 70+ unique petabytes of data (that’s 70,000,000 gigabytes!)
• 65+ million public media items (texts, movies, audio, images, etc.)
• 1.5 million unique web site users per day
• 17,000 uploads from users per day
WAYBACK MACHINE
• 588 billion web pages
• 750 million pages captured per day
• 550,000 users per day
ARCHIVE-IT
• 49 billion curated URLs captured in 14,000 collections
• 800+ partner organizations creating collections
BOOKS
• 28 million texts in the collections (books, documents, magazines, etc.)
• 4.6 million books digitized by Internet Archive
• 3,500 books digitized per day in 18 digitization centers worldwide
TELEVISION
• 2 million news programs available for closed caption search
• 3,000 broadcast hours from the events of 9/11/01
MOVIES
• 4 million movie items (not including television programs)
AUDIO
• 14 million audio items, including
• 220,000 live music concerts from 8,000 bands
• 200,000 78rpm sides digitized
SOFTWARE
• 580,000 software titles, many emulatable through a web browser
IMAGES
• 3.5 million images