18th Century British parliamentary papers(to
go online)
27 November 2003
See also Eighteenth
Century
Collections Online(below)
A2A Access to Archives(UK)
catalogues archives held locally through England &
Wales
20 March 2007
dating from C8; updated January 2006; 10m records/9.2m items in
410 local record offices, libraries, universities, museums,
national & specialist institutions; Global
Search of A2A with 10 other resources based at TNA(The
National Archives);
Access to Archival
Databases(AAD) from National Archives of US
18 December 2005
free meta or federated searching of 85m.documents in 475 files
from 30+ federal agencies; freetext searching across all series
& files;
AgEcon Search (after 10 May 2008)
free, open access repository of fulltext scholarly literature in
agricultural & applied economics, including: working papers,
conference papers, journal articles;
Alex Catalogue of
Electronic Texts
3
December 2002
14000 public domain documents from American & English
literature, Western philosophy
Alan Lomax Archive
11 June
2004
audio & videotape, 16mm film, photos, published recordings,
papers documenting folk music, world dance & ritual; Audio
Archive 5000+ hours field recordings of folk music,
interviews, oral histories, folk tales from US, Caribbean, Europe;
Film &
Videotape Archive dance & movement libraries 115000+
feet 16mm & 35mm ethnographic film from 400+ cultures. Indexed
by culture, area with emphasis on dance & movement styles of
indigenous peoples; Paper
Archive field diaries, notes, correspondence; discographies
& filmographies; Photograph
Collection of field recording expeditions in Spain, Italy,
Caribbean, American South;
Amazon.com Search Inside the Book
authors,
titles, short snippets, hyperlinked title of book 29
October 2003
scanned pages with search terms highlighted; OCR technology; find
page, read, browse back & forth; cant download, copy,
read beginning to end, link directly to any page of a book;
extensive excerpts require physical volume, purchasable from
Amazon by credit card; view less than 1000p/month & 20% of any
book (See The
Great Library of Amazonia Wired, 11.12 -
December 2003 ); Amazon
Shorts short-form literature for 49 cents delivered
electronically; no printed editions; save or print;
Anvil Academic
fully digital, non-profit publisher for the
humanities
17
February 2012
Council on Library & Information
Resources(CLIR) & National
Institute forTechnology in Liberal Education(NITLE);
Publishes under Creative Commons licenses on Web & via apps on
portable devices, conforms to DPLA standards & protocols,
works closely with OAPEN guidelines, allows academic &
cultural institutions, including CLIR & NITLE members, to
publish under their own imprints;
Ara Irititja Archive
20 December 2003
identifies, copies & electronically records historical
materials from museums, libraries & private collections about
Anangu(Pitjantjatjara/Yankunytjatjara people). Where possible,
Pitjantjatjara language is used. Innovative software protects
&/or restricts access to private, sensitive & offensive
materials
Archive CD Books
Project(International)
4
January 2005
reproduces
old
books, documents & maps on CD for genealogists & historians in cooperation with local
libraries, societies & record offices, providing money to
renovate old books, donate to their collections; began in UK,
March 2000, now worldwide, with each country scanning & producing its own books; Archive
CD Books Australia formed mid 2003; working with
societies, libraries & individuals to provide range of
Australian historical resources; Post Office & Trade
Directories, Government Gazettes, Early Electoral Rolls,
Local, Family, Military & Church Histories, Public
Service Lists, Gazetteers & Road Guides, Cyclopedias, Early
Telephone Directories, School & University Calendars & Yearbooks, Almanacs;
ArchiveGrid
2
December 2005
family histories, political papers, & historical records from
archives, libraries & museums worldwide; 1m record
descriptions & growing;
Archive-It(subscription
service created by Internet Archive)
22
November 2005
designed for needs of organizations & individuals, including
state archives, libraries, academic institutions, non profits,
museums, historians, & independent researchers; institutions
build, manage, search their web archive through user friendly
application, without technical expertise or hosting facilities
Archives Hub national
gateway to descriptions of archives in 180+ UK universities &
colleges 11 January 2011
access by keyword, subject, title, name of person, place, or
corporate entity, to primary source materials; Most not available
online, but access information included in descriptions; Intwine, a
subject guide to Collections of
the Month mailing list; FAQ;
ARCHON
Directory
24 September 2004
contact details for repositories in UK & overseas
with substantial collections of manuscripts noted in indexes to
National Register of Archives; central contacts directory for
major UK networking projects including National Register of
Archives(NRA), Access to Archives, AIM25, Archives Network Wales
& Archives Hub; includes 275 repositories in US, 53 in Canada,
30 in Australia, along with repositories in other countries 21
Mar 2012: National Archives(UK): ARCHON directory of
archives released as Linked Data;
Arctic
Blue Books online Andrew Taylor's index to British
Parliamentary Papers, 1818-1878
concerned with Canadian Arctic; digitized, searchable, 6000+pages;
native communities, living conditions on ships, expeditions,
meteorological observations, status of commercial whaling,
botanical discoveries
ARROW: Australian Research
Repositories Online to World now Australian Research Online
24 August 2009
National Discovery Service for searching through Libraries
Australia as a free target database of Australian Academic
research deposited in university repositories, National Library of
Australia & research agencies such as Australian Policy
Online; find all works by a researcher, topic of research,
or combined output of a consortium of universities; ARROW
&
RQF: Meeting needs of Research Quality Framework using an
institutional research repository; See also AuseSearch
meta-search facility on all open access repositories in Australian
& New Zealand universities containing refereed articles &
PhD theses; (1/5/2008)ARROW
Discovery Service - 143587 records harvested from 23
university repositories & 12 research collections, including
Australasian Digital Theses program, & e-journals, most have
fulltext online, many unpublished theses or preprints, journal
articles, images, working papers & technical reports, sound
files & multimedia presentations; relevance-ranked results;
Faceted search narrowed by institution, subject, resource type,
date or creator; 14000 resources from research & university
repositories now indexed by Google;
arXiv eprint archive (Cornell U;
National Science Foundation) 584760 eprints
30 January 2010
eprint service in fields of physics, mathematics, non-linear
science, computer science, quantitative biology & statistics;
relies on annual payments by member libraries to support ongoing
maintenance & upkeep; moved to Cornell University Library in
2001 & now has mirror sites worldwide; March 2010 The future
of arXiv: a crucial repository for high energy physics,
cosmology, & related areas of physics & mathematics;
TheAtlantic.com site
free to all visitors
25 January 2008
blogs, author dispatches, slideshows, interviews, videos, browse
issues from 1995, articles back to year of foundation,1857;
digitized 151 year backfile with OA
only to last15 years' worth, toll access(TA) to the rest;
Audubon's
Birds
of America
6 May 2005
Online version of John James Audubon's Birds of America; from 1840
First Octavo Edition of Audubon's 7 volume text
AuseSearch
(Google Custom Search)
Open Access(OA) repositories in Australia & New Zealand 4
November 2006
fulltext articles & theses available from research
repositories listed in The
state of the nation: A snapshot of Australian institutional
repositories(First Monday, Feb 2009);
AUSTLII
7 April 2005
provides browse access by year to numbered acts Commonwealth
numbered
Acts (1973-); ACT
numbered Acts(1989-); AustLII Point-in-Time (PiT)
Legislation System launched 7 April 2005; see an Act or
section as for any date covered by the system & visually
compare sections at different dates; covers most NSW legislation,
part of legislation of South Australia & Queensland; fulltext
Australasian, Indigenous & Human Rights law journals; Update
status for case law AustLII case law databases for each
Australian jurisdiction, update date, most recent decision,
approximate frequency of updates &, links to Court or
Tribunal's website; full text of Macquarie
University Australian Journal of Legal History from 2003; AustLII
Toolbar for Netscape/Mozilla, Mozilla Firefox & Microsoft
Internet Explorer web browsers; links to major category
materials for Australasian jurisdictions with links to other Legal
Information Institutes;
Australasian Digital Theses
Program Database
3 June 2006
comprehensive metadata repository of 150000 Australian PhD &
Masters theses; 5000+ fulltext online, link direct to home
institutions to access non-digitised theses; augmented soon
by New Zealand theses;
Australian Data Archive(ADA)
ADA Indigenous
datasets
5
March 2012
national service for collection & preservation of computer
readable data relating to social, political & economic affairs
& make data available for further analysis; consortium of
leading national Australian universities, managed by Australian
National U(ANU) &including nodes around Australia, at U of
Melbourne, U of Queensland, U of Technology Sydney & U of
Western Australia; visitors can browse & search catalogue,
view study & variable documentation(including frequencies),
download related material(questionnaires, codebooks, etc);
Registered users can analyse & visualise most data online
& users who have completed relevant undertaking form(s) can
download entire studies or subsets of variables in a range of
formats; Data Access;
Australian government
public information datasets
4 January 2010
mash-up government information to create something new; licence
attached to government agency datasets makes clear what you can
& can’t do with the data or contact the contributing agency;
Australian National Data
Service(ANDS)
3
August 2009
aims to influence national data management policy in Australian
research community, inform best practice for curation of data,
transform disparate collections of research data around Australia
into a cohesive collection; funded by Australian Commonwealth
Government's Department of Innovation, Industry, Science &
Research(DIISR) through National
Collaborative Research Infrastructure Strategy(NCRIS) as
part of Platforms
for Collaboration Investment Plan; 22 July 2010
Australian
Newspapers: Historic Australian Newspapers,
1803 to 1954 migrated to Trove in 2010
78000 pages South
Australian
Advertiser(1901-1919) available in July 2009 &
remainder of Argus
up to 1945(1857-1915, 1933 -1945); public
availability delivery schedule; currently scanning Queenslander(1866-1939),
Courier Mail(1934-1954),
Launceston
Examiner(1880-1954), SA Register(1836-1846), SA Advertiser
& subsequent titles(1864-1949) with different contractors; scanning
schedule; Many
hands make light work: public collaborative OCR text correction
in Australian Historic Newspapers; 14 July 2010: "There is a
standalone version of Australian Newspapers (blue original
interface) & a Trove version
(green header). In 2009 we received positive feedback from users
on having the ability to search across other content at the same
time as newspapers. Therefore a plan to migrate the Australian
Newspapers fully into Trove was developed. Trove searches millions
of resources simultaneously. Users can also have the option to
search across newspapers only. It was originally anticipated that
the migration of Australian Newspapers into Trove would be fully
complete by July 2010 and my previous messages on progress noted
this. The migration is almost complete but final switch over and
automated redirection from the blue version to the Trove version
is not expected to happen until the end of this year now rather
than next week. Up until very recently the 2 versions have had the
same functionality, so it didn’t really matter which version a
user was in. However because of the July migration plan the recent
enhancement work to Australian Newspapers has been undertaken in
the Trove version only. Enhancements are mainly things that
users requested in 2008 & 2009. There are now significant
differences between the Trove version & the original
standalone blue version which I am sure many users will want to be
aware of. Many users may wish to transfer from using the
blue version to the Trove version now. At the end of the
year an automatic redirect will be in place & the blue version
will no longer be accessible"; completion date for South
Australian titles(Advertiser 1858-1954, Register 1836-1931) is now
November 2010; 18 Nov 2010: 30m+articles now, 4m+ pages containing
40m articles will be online by mid-2011; 25 Mar 2011: now includes
Sunday mail(Adelaide), 1912-1917,
1925-1929, 1931-1954;
Australian Research Online
formerly ARROW
Discovery
Service(see above)
24 August 2009
searches simultaneously across Australian university &
government research repositories & collections of Australian
research including Australian Policy Online, Australasian Digital
Theses Program & various e-journals. Many items have a digital
fulltext copy available, others may have restricted access, as
determined by policies of contributing research institution.;
cross-searched by Heriot-Watt’s TechXtra service; search
313384 Australian research outputs including theses; preprints;
postprints; journal articles; book chapters; music recordings
& pictures; harvested from hosting repositories via Open
Archives Initiative-Protocol for Metadata Harvesting(OAI-PMH)
& linked to original repository; indexed by Google
Australian Social Science Data
Archive(ASSDA)
12
February 2010
consortium of national Australian universities, managed by
Australian National University(ANU) established at the ANU in 1981
to provide a national service for collection & preservation of
computer readable data relating to social, political &
economic affairs for analysis;
Australian
Women's Weekly Digitisation Project(10 June 1933 - 15
December 1982)
21 February 2010
National Library of Australia in association with publisher,
Australian Consolidated Press, & State Library of New South
Wales, will digitise the iconic magazine – from its first issue on
10 June 1933 to when it changed into a monthly, on 15 December
1982; See also Australian Women's Weekly 1946-71 Index:
detailed index of years 1946, 1951, 1956, 1961, 1966 & 1971;
decision to adopt a slice
approach, 1 year in every 5e for the period 1946 to 1971,
placed priority on a detailed index of 6 years over the 25 year
period rather than a much less thorough index of all 25 years;
years chosen have their highlights - first Women's Weekly Paris
Fashion Parades in 1946, Communist Party Referendum Bill &
50th anniversary of Federation celebrations in 1951, introduction
of television & Melbourne Olympics in 1956; cf Who
Was
That Woman? The Australian Women's Weekly in the Postwar Years(U
of NSW Press, 2002);
AWS Hosted Public
Data Sets(Amazon Web Services)
25 November 2008
enables you to use public data within your Amazon EC2 environment;
Select public data sets hosted on AWS free as an Amazon EBS
snapshot; Current data sets Amazon are working on include:
annotated Human Genome data, PubChem & UGI Virtual Conformer
libraries, US Census, various labor statistics, economic &
transportation databases; access, modify, & perform
computation on data sets directly using an Amazon EC2 instance
& pay for compute & storage resources used;
Bartleby.com:
Nonfiction
8 September 2007
100+ free popular nonfiction works; World's Famous Orations edited
by William Jennings Bryan; collected works of Francis Bacon, John
Stuart Mill's Autobiography, Thomas à Kempis'
devotional work The Imitation of Christ;
BASE:
Bielefeld
Academic Search Engine
16 November 2005
Identification & selection of high-quality scientific
repositories; Contact & negotiations with content
providers(universities, libraries, commercial content providers);
Data aggregation, preprocessing & dataprocessing of
internationally distributed, heterogeneous resources; Data
production(e.g.German enlightment, JADE); Delivering indexes in
standardised formats(XML) for platform-independent reuse by other
providers; Integration of BASE within meta search environments(eg
SISIS -Elektra); Providing additional content in OPAC
environments; searches 2.7m+ documents from 189 sources; academic
full text archives accessible through international Open
Archive Initiative(OAI). Most documents freely accessible &
searchable by metadata or fulltext; registered
OAI service provider & contributes to Digital Repository
Infrastructure Vision for European Research(DRIVER);
BBC
Creative Archive all radio & television programs to be
online free
24 August 2003
Streaming
audio of all BBC's science stories; 20
Mar 2012: Project
Barcelona seeks approval from the BBC Trust classic so BBC
programmes from Television Centre archives canbe download for a
"relatively modest" fee; launch of Global BBC iPlayer in foreign
countries to licence-fee payers & non-payers; catch-up service
is free in Britain, overseas users have to pay;
Bibliotheca
Alexandrina Digital Library
3
May 2002
Internet Archive digital library of Internet sites & cultural
artifacts; free access; web, tv & movie; 100 terabytes; Scopus®, world’s
largest abstract & citation database of research information
& quality web sources;
BiblioVault
1 February 2006
older, recently published, new books from scholarly presses;
fulltext searches; from Chicago Digital Distribution
Center(CDDC) which includes a digital printing center adjacent to
Chicago Distribution Center; digital files for 11000 books from
nearly 45 university presses;
Big Data:
Big
Data era is here
31 March 2012
27 Apr 2012: IBM
buys Vivisimo allegedly for its big data prowess;
BioModels Database
European Bioinformatics Institute
30 April 2005
data resource for biologist to store, search & retrieve
published mathematical models of biological interests; annotated
& linked to data resources such as publications, databases of
compounds & pathways, controlled vocabularies, etc; browse
& search
BOPCRIS
27 November 2003
British Official Publications Collaborative Reader Information
Service; 1688-1995; abstracts, & subject indexing of key
documents; some fulltext
British
Library Archives Email
19 October 2004
library appointed world's first digital manuscripts curator to
collect email messages authored by nation's top authors &
scientists
British
Library Newspaper Digitisation Programme with brightsolid
21 May 2010
Digitised material will include extensive coverage of local,
regional & national press across 3 1/2 centuries, will focus
on specific geographic areas, periods such as census years between
1841 & 1911. Additional categories will be developed looking
at key events & themes such as Crimean War, Boer War &
suffragette movement. The aim will be to build a 'critical mass'
of material for researchers - particularly in the fields of family
history & genealogy; This resource will be available for free
to users on-site at the British Library and copies of all scanned
materials will be deposited with the Library to be held in the
national collection in perpetuity; brightsolid
access through their brands findmypast.co.uk &
genesreunited.co.uk;
British
Library Online Newspaper Archive
21 July 2007
Manchester
Guardian & Weekly Dispatch et al for a limited
range of years, view the images, & navigate through each paper
by column or complete page; British Newspapers,
1800-1900 2m+pages of 19th century newspapers, many of them
require payment of a fee; browse complete articles from Penny Illustrated
Paper
& The
Graphic free of charge; thematic essays includes access
to relevant articles from newspapers of the day; 29 Nov 2011: 4m
pages of historical C19th newspapers from UK & Ireland now
online via BL: Fulltext search & snippets free to all(British
Newspaper Archive);
British
Newspaper Archive BL partnership with brightsolid
- fulltext search & snippets
free 29 November 2011
29 Feb 2012: plans to digitise 40m pages from BL collection over
next 10 years, currently has 3m;
CAB
Abstracts Archive
18 March 2005
1910 - 1972 literature; 1860000+ records; agricultural science,
veterinary medicine, nutrition & natural resources; 17 printed
abstract journals(600 volumes); fully searchable &
completely re-indexed; Obsolete terms replaced with modern
equivalents; (23 April 2009) CABI, owned by governments of 42
countries, abstracts material in 58 languages & translate 98%
to English. Now include fulltext for hard-to-find articles from
learned societies in Central or Eastern Europe, etc; Environmental
Impact Database, Biofuels Information Exchange, CAB Abstracts,
VetMedResource; CAB Direct 2 launches in Spring 2009 with Web
2.0 concepts & tools, now has 9m life post-1973 science
records(Nov 2010) adding 300000abstracts/yr;
Cambridge
Journals Digital Archive
18 August 2009
pay once for perpetual access in entirety or Humanities &
Social Science/Science, Technology & Medicine packages; 3m
scanned pp. 350000 articles from 20000 journals in searchable
PDFwith Digital Object Identifiers(DOI)
CAMEO; Conservation
& Art Materials Encyclopedia Online
18
August 2007
10000+ historical & contemporary materials used in
conservation, preservation, & production of artistic,
architectural, & archaeological materials; keyword &
material search for description, boiling point, melting point,
molecular weight; description, synonyms, & materials
properties; images & authority;
Canada
Year Book Historical Collection
19 April 2008
annual Canada Year Book,1867 to 1967; browse by year or topic,
browse tables, charts, & maps as they see fit; links to
related sites;
Canadian Broadcasting
Corporation Archives
17
October 2002
search & access clips of CBC Radio & Television
programming since 1936; browse 8 categories &/or timeline;
negotiating Internet rights with organizations, including trade
unions, artists & writers for excerpts from Canada's best
radio dramas, television serials & specials
Canadian
Pamphlets & Broadsides Thomas Fisher Rare Book Library,
University of Toronto 19 August 2005
Pamphlets & Broadsides Collection; 20000+ page images
include 1763 prospectus for Quebec Gazette; ephemeral documents
from different political campaigns to company reports; search
engine based on author, document language;
Canberra
times, 1926 - 1995
7 March 2012
free
access NLA Digitisation of Canberra times, 1955-1995 for
Centenary of Canberra, 1913 (complementing Trove
digitisation of 1926-1954);
Century of Lawmaking
for a New Nation
25 January 2003
additions include 25 volume Letters of Delegates to Congress (1774-89)
published by Library of Congress, 38 volume American
State Papers (1789-1838), 6 volume Revolutionary
Diplomatic Correspondence, Joint Resolutions of the Senate
(1824-73), volumes 1-3 of the Congressional Record
(1873-75); 700 volumes selected by Law Library of Congress; 26 Feb
2011: LC collection of US Congressional Documents & Debates
from Continental Congress in 1774 to 43rd Congress in 1875
digitized for students, scholars, & public; browse by
categories including Continental
Congress & Constitutional Convention, Journals of Congress, Debates of Congress, Statutes & Documents; Special Presentations for
non-researcher or law student; Timeline: American History as Seen in
Congressional Documents includes drawings & links to
illustrative documents such as Debates
of Congress; maps & history of Native Americans’
dealings with US government: Indian
Land Cessions in the United States 1784-1894; maps
browsed by Date,
Tribe & State/Territory; debates
& documents from Senate impeachment trial of Andrew Johnson
are also available;
CHEMnetBASE(annual
subscription)
4 July 2007
web-based compilation of chemistry reference books produced by
Chapman & Hall/CRC Press; chemical & physical property
data; Combined
Chemical Dictionary(CCD), Dictionary of Commonly Cited
Compounds, Dictionary of Drugs, Dictionary of Inorganic &
Organometallic Compounds, Dictionary of Natural Products,
Dictionary of
Organic Compounds, Handbook of Chemistry & Physics,
Polymers: A Property Database, & Properties of
Organic Compounds(POC); subscribe to entire package or
part; IP authenticated access;160000+ CCD entries on compounds
& their derivatives, uses, & properties from 5 chemical
dictionaries; 87th edition Handbook of Chemistry and Physics("the
CRC") compilations of chemical data; Polymers: A Property Database
wide range of physical properties & commercial information;
POC property information & searchable spectral data on 29000
organic compounds;
Chronicling
America(National Endowment for the Humanities & Library
of Congress)
31 March 2009
National Digital Newspaper Program added 112000+ additional
historic newspaper pages; free & open access to 977440 pages
from 112 titles published between 1880 & 1910 in 9 states (CA,
FL, KY, MN, NE, NY, TX, UT, VA) & District of Columbia;
Arizona, Hawaii, Missouri, Ohio, Pennsylvania &
Washington–will contribute in 2009; 18 Sept 09: now provides free
access to 1442000 pages from 171 titlese published between 1880
& 1922 in 15 states & District of Columbia;
Cinderella
Collections: University museums & collections in Australia
24
December 2008
See also Australian
University Museums Information System(AUMIS); Access to
AUMIS has been discontinued & all AUMIS content was archived
in July 2005; Transforming
cinderella collections: University Museums & Collections in
Australia report on government sponsored review of
Australian University museums & collections
Classics in History of
Psychology
29 September 2005
full text historically significant public domain documents from
scholarly literature of psychology & allied disciplines; 25+
books/200+ articles & chapters, links to 200+ works at other
sites; Index
by author, Index by topic;
CLOCKSS(Controlled
Lots of Copies Keep Stuff Safe) See LOCKSS
below
Collection of Last Resort(US GPO, Discussion Draft ) 6 April 2004
Collections of
the Month (Archives Hub, UK)
2 December
2007
Common Crawl Foundation’s
repository of openly & freely accessible web crawl
data
24 March 2012
about to go live as a Public Data Set on
Amazon Web Services; Our work; FAQ; CC produces &
maintains a repository of web crawl data openly accessible
& currently covering 5b pages & valuable metadata stored
by Amazon’s S3 service, allowing bulk downloads & directly
accessed for map-reduce processing in EC2 making wholesale
extraction, transformation, & analysis of web data cheap &
easy so that small startups or even individuals can access high
quality crawl data previously only available to large search
engine corporations;
Confederation of Open
Access Repositories(COAR) launched in October
2009
12 November 2011
uniting 59 institutions in 23 countries from throughout Europe,
Latin America, Asia, & North America; promotes infrastructure
interoperability& joint global data store of Open Access
repositories to enable & support re-use of data by service
& portal providers;
C-SPAN:
American Political Archive
20 Aug 2005
created in 1979 by cable television industry as a public service
providing public access to the political process; no government
funding; operations funded by fees paid by cable & satellite
affiliates who carry C-SPAN programming; audio programs from
National Archives, presidential libraries, Smithsonian, Library of
Congress; oral histories recorded by members of Congress who
served in WWII or Vietnam; archived programs;
Danger
of web crawled datasets (First Monday, Vol.15, No.2, 1
February 2010)
11 February 2010
Data Australia(beta) portal of open Australian government datasets See also Mashup Australia 3 October 2009
DataCite Repositories
16 June
2011
working document collaboration between DataCite, BioMed Central & Digital Curation Centre to
capture repositories for research data. It is provided for
information purposes only: DataCite provides no endorsements as to
the quality or suitability of the repositories listed. We
encourage community participation in developing this resource
Data.gov US Federal
Government datasets See also National
Data Catalog
25 May 2009
increases ability of public to find, download, & use datasets
generated & held by US Federal Government; provides
descriptions of Federal datasets(metadata); Federal, Executive
Branch data included in first version; descriptions, how to
access, leveraging tools; Search raw data catalog by category or
federal agency to get machine-readable, platform-independent data
sets; links to tools for mining data sets & a tutorial; See We need publishing
standards for datasets & data tables; 8 April 2011: Data.gov
& 6 other open government sites to shut down because of
Obama government budgetary crisis; 21
May 2011: shifting from data repository to cloud-based
platform for creating new applications & services; development
& service-delivery platform, providing new ways for public,
developers, & agency to plug into the site;
Data
searching
See Infochimps, Wolfram Alpha, Comprehensive Knowledge Archive
Network(CKAN) registry of open knowledge packages &
projects, Public
Data Sets on AWS centralized repository of public data sets
integrated into Amazon Web Services(AWS) cloud-based applications;
Wikipedia
for data tables; DBpedia,
one of the largest sources of Linked Data on the Web, extracts
structured information from Wikipedia & makes it available:
2.9m+ things including at least 282000 persons, 339000
places, 88000 music albums, 44000 films, 1,000 video games, 119000
organizations, 130000 species & 4400 diseases; Factual making data
accountable: platform for sharing & mashing open
data on any subject, an open data repository with
collaborative tools, data accountability sorcing &
improvement; EbookBrowse free
engine searches for PDF, DOC, XLS, & other data files;
crawlers harvest 3m+ PDF & DOC files through open Internet
resources such as blogs, forums, BBS etc & regularly check
file validity; Zanran(Beta)
finds ‘semi-structured’ data on web( formatted numerical data, not
text), numerical & graphical data, numerical data presented as
graphs, tables & charts in a graph image or table in an HTML
file, as part of a PDF report, or in an Excel spreadsheet; EbookBrowse free
searches for PDF, DOC, XLS, & other data files; 27 Oct 2011: How to cite datasets
& link to publications(A Digital Curation Centre
‘working level’ guide);
Data
Visualisation
(Info Graphics) See also Visualisation tools
4
January 2011
Gapminder unveiling
the beauty of statistics for a fact based world view: non-profit
venture to promote sustainable global development &
achievement of UN Millennium Development Goals by increased use
& understanding of statistics using Trendanalyzer
software visualizations; Keeping our tools’ statistical content
up-to-date & making time series freely available in Gapminder World
& Gapminder
Countries, producing videos, Flash presentations &
PDF charts showing major global development trends with animated
statistics in colorful graphics; Trendanalyzer
was sold to Google in 2007 but Gapminder has a license in order to
make data freely available to the public, Google have continued
the technology & launched Motion
Chart free of charge. A Motion Chart takes your
Analytics graphs for a multi-dimensional analysis of metrics
report. Plotting your data into 4 dimensions to spot opportunities
or anomalies faster & easier; Begin using Motion Chart as
follows: Click Visualize
icon from any report with a table displaying segmented data, e.g
In the Keyword
Report - Once the Motion Chart loads, you'll see an array of
bubbles, each representing a different keyword from the
report.Select 4 dimensions to plot your data: X-axis, Y-axis,
Color, Size. Click on each control to see a menu of metrics. Point
your mouse over any bubble to see numeric value for each. Press Play at bottom of chart to
see how your keywords perform over time. If you click on a bubble
and check the Trails box
underneath the Size control, you can map out the bubble's movement
over time; Freebase entity
graph of people, places & things, built by a community with
all types of publicly available open data to create entities connected &
manipulated into a graph format; access to 13m entities; compatible with all
operating systems including those running Linux; Wiki, Blog &
Discussion List; Google
Public Data Explorer visualizes
your data: new data format, Dataset
Publishing Language(DSPL), openly available as an interface;
GPDE Dataset
Directory; 22
free tools for data visualization & analysis(Gary D.
Price): Data cleaning, Statistical analysis, Visualization
applications & services, Code help: Wizards, libraries, APIs,
GIS/mapping on the desktop, Web-based GIS/mapping, Temporal data
analysis, Text/word clouds, Social & other network analysis; The
beauty of data visualization: David McCandless
turns complex data sets (like worldwide military
spending, media buzz, Facebook status updates) into
beautiful, simple diagrams that tease out unseen
patterns and connections. Good design, he suggests, is
the best way to navigate information glut -- and it may
just change the way we see the world; Interesting
data visualizations: Australian
Bureau of Statistics Presents series of videos with a
statistical literacy focus; World
Factbook Dashboard graphical representation of C.I.A
Factbook data worldwide using IBM ILOG Elixir & Adobe
Flex technology includes synchronised interaction between maps,
gauges, treemaps, radars & 3D chart display; Geocommons open, accessible
repository of data & maps of world allows users to create maps
for visualising & analysing their data;
10
Tools for creating great Info-Graphics; What
is a Data Scientist? - someone who can obtain, scrub,
explore, model & interpret data, blending hacking, statistics
& machine learning;
DeepDyve
15
January 2010
online rental service for scientific, technical & medical
research with over 30m articles from authoritative journals;
“Rented” articles can only be viewed & cannot be downloaded,
printed or shared; Search
for scholarly journals on CiteULike with DeepDyve;
Dictionary of
Canadian Biography 14 Vol.
16 February 2004
persons who died (or last date of activity) between 1000 &
1920
Dictionary
of National Biography ed. Leslie Stephen & Sidney Lee
(UK) also
at
Internet Archive 27 August 2009
biographical reference for deceased persons notable in British
history; current edition published online by Oxford University
Press since 2004; earlier editions are freely available online,
& remain of historic interest; 1st & 2nd ed &
supplementary vols free;
Digging into Data:
List
of Data Repositories digital libraries, data archives,
& data repositories 10 August
2010
international researcher teams exploit new web tools &
computing power to eplore large bodies of data; funded by
JISC(UK), National Endowment for the Humanities, National Science
Foundation & Social Sciences & Humanities Research Council
(Canada);
Directory of
Folklore & Mythology Electronic Texts
8 December 2003
also Germanic Myths, Legends, & Sagas(short annotated link
list) & related link list for folk & fairy tales.
Directories
of Data Repositories & repositories for scientific data sets
2 December 2010
Public Data
Sets on Amazon Web Services; Oceanographic Data
Repositories; Distributed
Data
Curation Center: Other Data Repositories; Gene Expression Omnibus;
Global Change
Master Directory; MIT
Data Management & Publishing: Sharing Your Data; Open
Access Directory: Data Repositories;
Directories
of Institutional Repositories
8 July 2006
See Table 2, pages 9-12 of Directories
of Institutional Repositories: Research Results &
Recommendations;
Directory of Open Access
Repositories: DOAR Open Access research archives around
world. 4 March 2005
DocumentCloud public
access to news reporters’ original source materials to debut in
late 2009 1 October 2009
open standards make original source documents easy to find, share,
read & collaborate on;
DOE Green Energy
portal (US Department of Energy)
13 May
2010
OSTI
More-Like-This(MLT)
search power aka Term vector similarity analysis
new semantic search tool on Information Bridge
flagship collection of DOE research reports; 2-stage search
process -conventional search, then select an anchor document & MLT
button to re-search entire database for similar documents;
Dreambank
11 January 2009
16000+ dream reports in English & 6000 in German people
ages 7 to 74 analyzed using the search engine & statistical
programs; Dreambank
Search Engine;
Drudge report ( New
media vs old media: portrait of the Drudge report
2002-2008 )
7 July 2009
powerful agenda setter in the US national political sector; DrudgeReportArchives.com
snapshot every 2 minutes 24/7 of special reports filed by Matt
Drudge since Saturday November 18, 2001;
Early
Journal Content on JSTOR free worldwide
21
February 2012
journal content in JSTOR published prior to 1923 in US & prior
to 1870 elsewhere released free on a rolling basis since September
6, 2011; includes discourse & scholarship in arts &
humanities, economics & politics, mathematics & other
sciences; nearly 500000 articles from 200+ journals represents 6%
of JSTOR content; JSTOR currently provides access to scholarly
content to people through a growing network of 7000+ institutions
in 153 countries & now to independent scholars & others to
provide more access options to the content on JSTOR for these
individuals; video
tutorial about how to access content; for broad use
including ability to reuse for non-commercial purposes
acknowledging JSTOR as source & providing a link back to site,
cf Terms
& Conditions of Use; Available content; FAQ;
E book providers
19 October 2004
aggregators have grown in past 2 years, with netLibrary up from 40000 to
65000+ titles & Ebrary
offering 25000+ books & 20000 other documents from 5000; New
book series from Elsevier via Science
Direct, Gale Virtual
Reference Library, Safari Books Online
& Oxford
Scholarship
Online; eBooks on
Demand(EOD) European libraries are hosting millions of books
published from 1500 to 1900 often only accessible to users present
at these libraries. Users will order via the common library
catalogues; libraries digitise the requested item & send it to
the user via EOD service network; books digitised will
simultaneously be incorporated into the digital libraries of the
participating libraries & thus accessible on the Internet;
funded in part by the European Union; Ebook
&d Texts Archive: Free Books: Free Texts: Download &
Streaming: Internet Archive; Some places to
obtain free e-books; 30
Oct 2011: ebrary patrons may now download content to
computers & other devices including iPad, iPhone, Kindle,
Kobo, Nook, & Sony Reader! at no additional charge; 6 Jan
2012: Comparing the e-book providers(Tennessee libraries, Vol.61
No.4);
Economist
Historical
Archive subscription through National Library of Australia's
eResources
16 December 2008
8000 editions of Economist magazine from 1843–2003 fully
digitised & keyword searchable witf full-colour images,
exportable financial tables, & a gallery of front covers;
Eighteenth
Century Collections Online(Thomson/Gale)
19 March 2005
significant English-language & foreign-language titles printed
in Great Britain during C18, with important works from America; available
for public access at National Library of Australia
complemented by NLA set
of Early English Newspapers; See also 18th Century
British parliamentary papers(above)
Electronic
Biologica
Centrali-Americana
18 September 2004
58 volumes of natural history created & composed during C19 to
identify, categorize, & document flora & fauna of
Meso-America; model for biodiversity informatics worldwide;
excellent descriptions, elegant plates & illustrations; ample
documentation online
Electronic Records
Archives(ERA) National Archives & Records
Administration(US)
4 January 2006
NARA preserve & provide long-term access to uniquely valuable
electronic records of US Government & transition
government-wide management of lifecycle of all records into realm
of e-government;
E-LIS
4
February 2003
open access archive for scientific/technical documents, published
or unpublished, in Librarianship, Information Science &
Technology, & related applications; deposit preprints,
postprints, other LIS publications; find & download documents
in electronic format; free service;
Encyclopedia
Britannica: 1911 Edition classic edition. Still an
outstanding source for historical subjects. 30 March 2002
Energy & Environment Data Reference Bank Statistical data from Open Sources 29 November 2008
ENUMERATE project (2007-9)
helped European Library identify digitised collections across
Europe 24 April 2012
funded by ICT Policy Support Programme of European Commission,
newly-developed content strategy supports strategic aims of
national libraries of Europe, represented in European Library;
3-year, EC-funded project to create a reliable baseline of
statistical data about digitisation, digital preservation &
online access to cultural heritage in Europe; led by Collections
Trust in UK;
eScribe.com Search for a
mailing list archive
21
September 2004
ESDS International
(Economic
& Social Data Service)
7 September
2004
free access to regularly updated international macro datasets;
user guides, International data FAQs, case studies &
exemplars; OECD, UNIDO, IMF, World Bank, UN Common Database,
Eurostat New Cronos.;
EuroDocs:
Primary Historical Documents From Western Europe
1
July 2006
key historical happenings within respective countries transcribed,
reproduced in facsimile, or translated; political, economic,
social & cultural history;
European
Union Bookshop Historical Database(Free)
15 October 2009
110000+ publications including speeches, treaties &
publications from EU institutions, agencies & other bodies
dating back to 1952; 14m+PDF pages; 5% of documents are fee-based
but PDF will remain free;
Fact Sheets(National Archives of Australia) 23 June 2006
Festival
Books Digitisation Project 253 books from late C15 to
C18
16
August 2005
official accounts of events in life of a princely dynasty;
marriage, birth of an heir, christenings, coronation or funeral
celebrated by a public festival; Religious festivals on saints'
days & significant dates in Church calendars; theatrical,
operatic or ballet performances
FindFiles.net: search,
find & download 300m+ publicly available data files
17 March 2011
all existing mime
types: mp3 audio & wav sound files, midi musical
instrument interfaces, mp4, avi & quicktime videos, jpeg, gif,
png & tiff images, Microsoft doc & Excel documents &
exe executables, pdf & plain text documents, dwg AutoCAD &
wrl virtual reality data files, archives like zip, gzip & jar;
Food &
Agriculture Organization of the United Nations: Corporate
Document Repository
23 July 2011
FAO works on 4 main areas that inform their mission: access to
information, sharing policy expertise, meeting space for nations,
& bringing knowledge to the field; FAO has concentrated their
efforts on rural areas, i.e.where majority of poor & hungry
people reside; "The State of..." links to download publication on
subject, see a table of contents & basic overview; new
document releases;
Footage.net archived stock
footage since 1994
June 2004
ABCNEWS VideoSource; Action Sports - Scott Dittrich Films;
AlwaysHD; AniStock; Archive Films by Getty Images; BBC Motion
Gallery; Budget Films; CONUS Archive; eFootage; F.I.L.M Archives;
Film Images(Paris); FRAMEPOOL; Global Image Works; HBO Archives;
HBO March of Time Series; Historic Films; Ina - Institut national
de l’audiovisuel; MrFootage; National Film Board of Canada;
Natural History New Zealand (NHNZ); NBC News Archives; Producers
Library; Reelin’ In The Years; Silverman Stock Footage; SPPN
Images (Grinberg); StormStock; The Sir David Frost Archive; WGBH
Stock Sales; WPA Film Library; CNN, National Geographic, network
TV news(StormStock featuring tornadoes, lightning, hurricanes,
storm clouds, flash floods, giant hail, microbursts); Boolean,
global search or individual database; sample clips; Zap Request -
free, instant e-mail pipeline to footage companies, archives &
footage researchers to find exact shot; See also ITN Archive footage from
UK archives, including Reuters
Footnote: history for the
people
3
August 2007
14 m+ digitized documents, subscription-based; unaltered view of
events, places & people that shaped the American nation &
the world;
Ford
opens manufacturing archives in Australia
28 November 2011
Ford Motor Company has opened Ford
Australia Archives, 1 of only 4 historical archive centres
in the company worldwide; located at Ford Australia's head office
in Campbellfield, Melbourne, to house 80+ years of Ford history in
Australia; important Ford documents & photos from the
company’s early manufacturing days in Australia; it continues to
acquire & preserve significant company materials, including
paper, audio-visual & electronic records; not directly
accessible to the public; submit enquiries to Ford Ford Customer
Relationship Centre Phone: 133673; Fax: 03 9929 3175; Email:
customers@fordcrc.com.au;
FreeFullText.com links
to 7000+ scholarly periodicals with some or all content online
free 10
February 2006
Free
FullText
Journals
in Chemistry
1
March 2004
permanently & temporarily available free fulltext journals in
chemistry, biochemistry & related subjects. Updates &
additions twice a month.
freely
accessible
archives of serials magazines, journals, newspapers; Serials
archives & indexes listings; 5 January 2007
Free Online Scholarship(FOS: see Peter Suber's FOS below)
Fulltext
classics: find & download books(LibrarySpot.com feature)
25 July 2001
complete & downloadable titles mainly in public
domain(copyright expired )
Global
Open Access Portal(GOAP) UNESCO
12 November 2011
presents a snapshot of the status of Open Acces (OA) to scientific
information worldwide; country reports from 148+ countries with
weblinks to 2000+ initiatives/projects in Member States;
Gnosis Archive
25
March 2006
mystical sects or groups primarily active around Mediterranean in
first few centuries of Common Era; historical coverage &
exploration of faith as currently practiced ; Library section
includes complete writings of G.R.S. Mead, noted scholar of
tradition, transcriptions of Gnostic scriptures & fragments.
Google Books
11 October 2004
digitally
formatted extracts & descriptive material contributed by
selected publishers; Publishers with ISBN
can sign on; link to booksellers for
purchase; Google
Print for Libraries – ALPSP position statement(July 2005);
Google
Publisher project to scan books publishers volunteer;
Google
Library project to scan books in libraries with
consent of libraries but not necessarily consent of
publishers; larger Google Print project to scan fulltext
print materials for adding to Google search index;
participating books in Google Publisher & public- domain
books in Google Library will display fulltext pages;
copyrighted books in Google Library display fair-use
snippets to match user searchstring; sample
screen
shots show difference; see also Open Content Alliance;
(30 November 2006)Select search results now show page or pages in
book on which key terms appear; (Review
Dec '06); Booksearch
x 3 searches A9, Google Book Search, Windows Live Search(for
books); Google
Book Debate: Scholar vs. Public; reviews added May 2007; Inheritance
& loss: a brief survey of Google Books(First Monday,
August 2007); 1923–1963: Google
Book Search targeting more books for Public Domain?(Who's
using it); Embeddable
Previews; End
of Snippet View: Google settles lawsuit with book publishers
- readers now can preview 20% of book; Libraries, universities,
& other organizations to purchase an institutional
subscription giving users access to fulltext of titles in Google
Books index; Google
Book Search Settlement Agreement(29 Oct 08); Google
Book Search Settlement: ‘The Devil’s in the Details’; Share
this Clip feature in Book Search; 50
Years of American
Motorcyclist Online(American Motorcyclists
Association); Ebony
magazine, 1945-; Barcode
your bookshelf with Google Books use a USB-powered barcode
scanner or type in by hand, rate & view these titles in My
Library on GB & use GB-powered search to browse your
home library; New
features(19 June 2009): Embeds
&
links toolbar option to embed preview of a full view or
partner book in any website or blog with a simple html snippet, Search shows more context
around term including an image from part of the page where it
appears, thumbnail overview of all pages in a public domain book
or magazine, Contents dropdown
to jump to chapters within book or articles within magazine, plain
text versions of public domain books for visually impaired, Page Turn Button & Animation
keep track of location in text, Overview
Page data about book including reviews, ratings,
summaries, related books, key words & phrases, references from
web, places mentioned in book, publisher information;Google
Books settlement & privacy FAQ; GB
offers Creative Commons licensing; G prints
full versions of out-of-copyright books for its Library Project;
G
opens up its EPUB Archive: download 1m books for free; 5
ways Google Book Settlement will change future of reading; Google
funding scholars in humanities to textmine Google Books corpus;
Download
GB(ghacks.net); 3D
viewing mode on GB via a special URL parameter. To see a
book in 3D, just add &edge=3d to the book’s URL (Note: be sure
to add this parameter before the # in URL); GB
partners
British
Library
to digitize 250000 books from between 1700 & 1870;
Google Earth
Engine planetary-scale platform for environmental data &
analysis
4
December 2010
for scientists to monitor & measure global changes & build
applications based on archive of past 25 years of satellite
imagery; Google’s cloud infrastructure reduces analysis time &
new tools will pre-process images to removing haze & cloud
cover.
GreyText:
Archive of Grey Literature
6 April 2007
compilation(organized by last name of author) of papers &
presentations dealing with grey literature; fee-based content
(first page is free) along with large collection of PowerPoint
presentations (free);
Guardian
& Observer Digital Archive
6 November 2007
Guardian(1821-1975),
Observer(1900-1975);
in early 2008 will extend both to 2003; free searching; viewable
in full-page & individual-article levels on timed passes;
£7.95/24-hour & £49.95/m; Original
copies since 1900 from Remember
When; free trial soon;
Guardian Archive(since 1899) 12 December 2003
Harvesters
- national & international lists services which harvest
institutional repositories;
8 May 2009
See also Harvesters
- subject- or discipline-based;
HEARTH
29 July
2003
core books & journals in Home Economics published, 1850 -
1950; fulltext, bibliographies & essays
Historic
Australian Newspapers, 1803 to 1954(Australian Newspapers
Digitisation Program) 19 November 2009
Progress
Report; Cumulative
usage statistics; Integration
of Australian Newspapers search service with Trove; Sydney
Morning Herald Progress; Digitisation
Progress Chart; Titles
coming soon updated; Expanded
FAQ; Draft
Tagging Guidelines; Tagging
functionality & enhancements for Trove; Updated
METS/ALTO doc; Titles
completed; In
the
media; Publications;
Links to the open source code for delivery system & functional
spec for Australian Newspapers v1.0 are now on the project page;
18 Nov 2010: 30m+articles now, 4m+ pages containing 40m articles
will be online by mid-2011;
History
of Gems, Gemology & Mining Library(Farlang Gem &
Diamond Foundation )
8 September 2007
25000 pages, covering gems, diamonds, mining, gemology & gold
rushes on 5 continents; including Herbert Hoover translation of
Agricola's De
re metallica; WWW
Diamond & Fine Jewelry Virtual Library;
History of Medieval & Renaissance Europe: Primary Documents 1 July 2006
Home Economics
Archive: Research, Tradition, History
16 November 2003
core books & journals in Home Economics & related
disciplines published 1850 - 1950; 838 books/7 journals/331695
pages; brief essays about sub-disciplines such as clothing,
textiles & home management, an extensive bibliography; See
also "Make It
Yourself": Home Sewing, Gender, and Culture, 1890-1930 home
sewing laden with multiple meanings about femininity, labor,
family, creativity, sexuality, identity, & economics;
established & more unusual source materials, including
dresses, sewing workbooks & paper dolls; illustrations, audio
clips of interviews,slide show of someone sewing a skirt;
HTTP Archive now part of Internet Archive
16 June 2011
permanent repository of web performance information such as size
of pages, failed requests, technologies utilized allowing us to
see trends in how the Web is built & provides a common data
set from which to conduct web performance research;
Human Genome Project, Chromosome 01 to 24(Project Gutenberg Etext) 26 February 2011
Humanities
Network(project HuNI) infrastructure for unlocking &
uniting Australia's cultural data 2 March 2012
awarded Aust$1.3m by NeCTAR(National
eResearch Collaboration Tools & Resources) will
allow "arts & humanities researchers to access &,
through appropriate tools & services, work with combined
resources of nation's major cultural datasets &
information assets. for new scholarly outcomes & create an
enduring exemplar of national cultural infrastructure to
suit needs of future generations of researchers; names &
links to all the datasets;
ibiblio: public's library &
digital archive
6 February 2002
free information, including software, music, literature, art,
history, science, politics, cultural studies. Sorted by UDC
number. FTP archives; Linux archives
ICE Virtual Library(
Institution of Civil Engineers)
28
October 2010
Institution journal & book content since 1836; premium
service; free keyword search; advanced search has field searching
options, years to search by, content typ (books or journals),
subjects, etc; civil engineering & construction market books,
journals, recruitment & training, best practice, news &
networking opportunities around NEC & Eurocodes;
Illustrated London News
(private ongoing digitisation project)
10 October 2008
first printed 1842; 3000 original copies from estimated 8000
printed; Ilustrated London News 1870; June 6th 1944; 1876; 24th
February 1923 Tutankhamen Illustrated London News 1875;
1874; 1880; 1890; Illustrated London
News, Shipping & Emigration extracts (some pictures, used to illustrate, may
not be original to articles); Civil War
in America from Illustrated London News 10 volume digital archive
of ILN during Civil War years; click on Articles to view
accompanying illustrative material; search fulltext or
illustrations;
Institutional
Repository & ETD Bibliography 2011
review
1 September 2011
600+ English-language articles, books, technical reports, &
other works that are useful in understanding institutional
repositories & ETDs(digital versions of graduate students'
theses & dissertation)s & covers IR country & regional
surveys, multiple-institution repositories, specific IRs, IR
digital preservation issues, IR library issues , IR metadata
strategies, institutional open access mandates & policies, IR
R&D projects, IR research studies, IR open source software,
& electronic theses & dissertations;
Institution
Archives Registry
15
August 2005
registry of institutional repositories; browse by Country, Archive Type,
or Archive Software; Institutional
Repository Bibliography, Version 3: XHTML website with live
links primarily includes published articles, books, &
technical reports wit some conference papers & unpublished
e-prints; All in English & under a Creative Commons
Attribution-Noncommercial 3.0 United States License;
Institution of Civil Engineers ICE Virtual Library
premium
service
4 August 2009
Institution journal & book content since 1836; keyword search,
advanced search field searching options, years to search by,
content type(books or journals), subjects; Results include title
of article, author, & source; click on title for more details
including keywords & abstract; most popular articles(Introduction of steel columns in US
buildings, 1862–1920) & latest news from ICE
(irregular updates); products & services for civil engineering
& construction markets: books, journals, recruitment &
training, best practice, news & networking opportunities
around NEC & Eurocodes;
International archives 7 September 2007
International Repositories
Infrastructure wiki
8 May
2009
Harvesters
- national & international lists services which harvest
institutional repositories; Harvesters
- subject- or discipline-based;
International
Tracing Service at Bad Arolsen(ITS)
30 November 2007
serves victims of Nazi persecutions & their families
preserving these records for research; alphabetically &
phonetically arranged Central Name Index contains 50m+ reference
cards for 17.5m+ people; under International
Commission for the International Tracing Service (ICITS)
auspices;
Internet Archive launches Library for the Visually Impaired with 1m books See Accessible Book 7 May 2010
Internet Archive Wayback
Machine (Surf web as it was)
7 November 2002
search for archived Webpages by existing or former URL; limit
search results by archived date or file type; eliminate duplicates
or mergealiases; 100 terabytes+/10 b+ web pages from 1996;
DocuComp capability identifies differences between 2 historical
web page versions in website archive; keyword search added Sept
2003; request for archived versions of page:
http://web.archive.org/web/*/http://www.domain.com -
asterisk is wildcard; to find versions from 2002 enter:
http://web.archive.org/web/2002*/http://www.domain.com; all
versions from September 2002 enter: http://web.archive.org/web/200209*/http://www.domain.com
; Internet
Archive &10 universities produce Text Archive of
digitized books; see also The Memory Hole; (9 Feb
06) 55b archived web pages now; 2004
Presidential
Term Web Harvest - special collection of 100m items
collected/harvested/captured for National Archives(NARA) NOW
keyword searchable - largest
public single text searchable collection to date created
using NUTCH & NUTCHWAX extensions open source software; National Security Archive
independent nongovernmental research institute & library at
George Washington University un-archiving of documents US
government will not; (9 Dec 2006) 86 b web pages/ 1.5 petabytes
(1.5m megabytes); See Turn
of the century magazines (below); Archive-It: Internet
Archive subscription service: archive special collections &
make them searchable, publicly or privately; fulltext search
feature queries public special collections & collections in
Internet Archive; search archived pages as well as other archived
file formats by keyword free; 3
petabytes now stored in a Modular Datacenter (Sun MD) equipped
with Sun's Open Storage technology; Dewey Music: a
tool to browse & Search tracks in Internet Archive Music
Library results pages are organized by Album, Song, Venue;
browse by: Top Rated, Most Played, Newest, Genre; connect your to
your Facebook account; suggestions about bands to listen to;
listen to a track(s); create a playlist; 21 Jan 2011: Wayback Machine Beta 2
homepage buttons take you directly to latest available archived
version of URL entered & Calendar Interface to browse for
Archived Pages; FAQ;
BETA version of Internet Archive
homepage also available; IA
provides access to AdViews database of vintage, 1950s-1980s, TV
commercials; 16
June 2011: Steve Souders’ HTTP
Archive joins IA to add data about the performance of 18000
web sites captured & archived since October 2010; 1
Sept 2011: See also British Library's UK Web Archive, UK
Government Web Archive, Library
of Congress Web Archives(LCWA) & Web
Archives: The Future(s);
Internet
Medieval Sourcebook
1 July 2006
Internet Movie Script
Database(IMSDb)
20 September 2005
Internet Sacred
Text Archive Open Source for human soul; world religions;
traditions; mysteries 5 October 2002
Irish
Newspaper Archives via National Library of Australia's eResources
portal
11 April 2011
Access for Australian residents is via National Library of
Australia's eResources
portal; largest online database of Irish newspapers in the world
dates from 1763 to present & includes out-of-print &
current titles; Titles include Irish Independent(1905-), Freeman's Journal(1763-1924),
Connacht
Tribune(1909-2007), Leitrim Observer(1924- Jan 1976), Meath Chronicle(1897-
April 2005) & Southern Star(1892-2006);
Irish Traditional Music Archive
2 June 2007
multi-media reference archive & resource centre for
traditional song, instrumental music & dance of Ireland
(Dublin);
Islamic Philosophy
Online: Philosophia Islamica
25 March 2006
full-length books, articles: classical texts in canon of Islamic
philosophy to modern Muslim philosophy; Dictionary of Islamic
Philosophy; Map of Islamic Philosophy & relation to
world philosophies; Major Islamic Philosophers, thought &
works; Islamic Philosophy Forum (E-discussion board); Journal of
Islamic philosophy;
Jewish
Encyclopedia.com
30 August 2002
12-volumes published 1901-1906; 15000+ articles &
illustrations searchable; Jewish history, law, theology,
philosophy, literature, biography;
JSTOR
Launches XML Gateway better facilitated
metasearching (federated/cross-database searching)
1 September 2005
21 May 2010: current
participants,
available collections, 176 titles from 19 publishers; 8
Sept 2011: Free
"Early Journal Content" from JSTOR; journal content in
JSTOR published prior to 1923 in US & prior to 1870
elsewhere freely available includes discourse & scholarship
in arts & humanities, economics & politics, & in
mathematics(tutorial);15
Jan 2012: Research
archive JStor moves toward Open Access view articles from
70 of their 1400 journals after registering & view up to 3
documents at a time but no download or print, privileges
reserved for people who buy articles or are affiliated with
schools & libraries that pay for JStor subscriptions;
21 Feb 2012: Early
Journal Content on JSTOR free worldwide - journal content in
JSTOR published prior to 1923 in US & prior to 1870 elsewhere
released free on a rolling basis since September 6, 2011; includes
discourse & scholarship in arts & humanities, economics
& politics, mathematics & other sciences; nearly 500000
articles from 200+ journals represents 6% of JSTOR content; JSTOR
currently provides access to scholarly content to people through a
growing network of 7000+ institutions in 153 countries & now
to independent scholars & others to provide more access
options to the content on JSTOR for these individuals; video
tutorial about how to access content; for broad use
including ability to reuse for non-commercial purposes
acknowledging JSTOR as source & providing a link back to site,
cf Terms
& Conditions of Use; Available content; FAQ;
JTA: The Global News Service of
the Jewish People, 1923- formerly Jewish Telegraphic Agency
9 May 2011
Jewish hisory from 1917 free from not-for-profit media company
similar to Associated Press; 7000+ contemporaneous articles
reported from Europe between 1937-1945 documenting the Holocaust
on a daily basis, another 7000+ documenting experience of Russian
Jews through reign of Communism, coverage of life in Palestine
before Israel inaugurated in 1948 See also
JURN Directory of
1500+ Open Access journals in arts & humanities
6 June 2009
Journals listed are either free, or offer significant free content
Karpeles Library
8 July 2006
world's largest private holding of important original manuscripts
& documents includes Literature, Science, Religion, History
& Art;
Koorie Heritage
Archive
20 December 2003
interactive database of cultural & historical information on
Koorie people of Victoria; historical photographs, oral history
recordings, manuscripts, film & video footage, artefacts &
artwork from Victorian Koorie communities
Life
Magazine Archive (Browse
via Google Books) & “search
inside” specific issues
24 September
2009
Life
Science data repositories in publications of scientists &
librarians
14 June 2011
Each data repository web site manually reviewed for content
describing its scope, year of establishment, descriptive
statistics, available data visualization, analysis & search
features, & Web 2.0 community support features such as blogs,
wikis, & RSS feeds;
Listener
Historical Archive, 1929-1991(BBC) free
trial
12 April 2011
collaboration between BBC Worldwide & Gale, part of Cengage
Learning to search 125000 pages of the publication digitised from
originals in full colour; accessing transcripts of many early BBC
broadcast recordings & commentary;
Linked Open Copac
Archives Hub(LOCAH) Project test datasets
14
May 2011
project on modelling complex archival data & transforming into
RDF Linked Data, available in a variety of forms via
data.archiveshub.ac.uk home page; working on visualisation
prototype showing how to link Hub Data with Linked Data sources on
Web using enhanced dataset to provide a useful graphical resource
for researchers;
Literature for Children
14 June 2004
digitized children's literature treasures published in US & UK
from before 1850 to beyond 1950; 550+ titles by 300 authors.
Browse by author or title; search by keyword, author, title,
subject; section on color in literature for children & color
management strategies for reproduction of children's literature
LOCKSS(Lots
of Copies Keep Stuff Safe) technology
28 February 2006
open source software for libraries to
collect, store, preserve & provide acess to their authorized
content; CLOCKSS(Controlled LOCKSS) initiative began 2006 as a
group of publishers & librarians to access content in backup
nodes & monitor access to the broader LOCKSS community; See also CLOCKSS,
LOCKSS, SOCKSS, FROCKSS?; Meet
More Digitization Pioneers: The LOCKSS Team, Vicky Reich and
David Rosenthal; 4 Dec
2009 Royal Society of Chemistry & Royal Scociety join CLOCKSS
London Datastore:
Greater London Authority’s data for all Londoners to see & use
free
launching 21 January 2010
Releasing GLA data - Data
Packages
for Launch; encouraging other public sector organisations to
releasing data here; 4iP
Developers’ Fund to encourage clients to transform rows of
text & numbers into Facebook apps, websites or mobile
products;
London
gazette
5 September 2007
Official newspaper of record in recording & disseminating
official, regulatory & legal information; Used by professional
researchers & authors for military & social history, as
well as family history hobbyists; 350 years, last 200 years
indexed & last 2 years indexes available in print form from TSO Bookshop; older
indexes from local main reference library or regional library; The
London, Edinburgh & Belfast Gazette archives comprise around
2m pages;
MacTutor History of Mathematics archive topics, biographies, Famous Curves Index; 10 October 2005
Mail Archive free
archiving service without warranty for electronic mailing lists;
1990 lists currently; 21 September 2004
Million
Books Project(aka Universal Library Project) Internet
Archive
12 July 2009
Governments of India, China, & Egypt are helping fund this
effort through scanning facilities & personnel; Internet
Archive has contributed 100k books from Kansas City Public Library
along with servers to India & has automated conversion of
scans into collection;
Million
Song Dataset, official website 300GB downloadable(A-Z) HDF5 file format
15 March 2011
freely-available audio features & metadata for contemporary
popular music tracks from Echo
Nest API encompasses metadata & audio analysis features,
1file/track -1 song, 1 release & 1 artist with information
about track, song, release, artist; dataset does not include
audio, only derived features, but sample audio can be fetched from
services like 7digital; subset
of
10K songs(1%, 1.8 GB compressed); Subsets available on UCI Machine Learning
Repository in a simple text file format;cf Thierry
Bertin-Mahieux, Daniel P.W. Ellis, Brian Whitman, & Paul
Lamere. The million song dataset. In Proceedings of the 11th International
Society for Music Information Retrieval Conference (ISMIR
2011), 2011;
MoOM (Museum of Online
Museums)
16 May 2009
See also Museum Sites Online
links to 1700+ international museum & museum-related WWW
sites;
Mother
Earth News Archive Fulltext of articles from first issue in
1970 through 2003
14 June
2004
mydigitalnewspaper
search engine for international, national & regional digital
newspapers 15
August 2009
search tool highlights key terms or phrases segmented by country,
publication, date & type of title with supplements, archived,
daily, weekly, monthly, free & paid titles;
National Archives(UK)
23
Juen 2006
subject classification based on extensive user consultation(sort
of folksonomy) links subject-based information guides with
descriptions of records.Global Search for 28 m+ records: 10m at National
Archives Catalogue; Access
to
Archives (A2A) &10m from local archive holders in
England & Wales; Documents
Online; National
Register of Archives(NRA); ARCHON; Moving Here; Electronic
Records Online; Family Records (&); TNA's Bookshop(The
National Archive), Research Guides & website; Advanced
Search option allows flexible searching by database & subject;
National
Digital Archive of Datasets (NDAD) preserves & provides
online access to archived digital datasets & documents from UK
central government departments; collection spans 40 years of
recent history, with earliest available dataset dating back to
about 1963; NDAD datasets can be viewed, queried & downloaded;
downloads up to 10MB are free of charge; National
Archives: Ancient Petitions 17000+ images from petitions
presented to kings, Parliament, chancellors, & officers of
state in 2 primary categories: redress of grievances which could
not be resolved at common law; requests for a grant of favour;
from reign of Henry III (1216-1272 to James I (1603-1625);
search by places mentioned, petitioner name, occupation, or
subject; UK
National Archives photostream on Flickr; 24 Jan 2012: Launch of our
new online catalogue Discovery
in Feb 2012 (Beta);
National Archives of Australia
(RecordSearch)
6
June 2009
Check other Australian archival repositories through Register of Australian
Archives & Manuscripts (RAAM), Australian Historic Records
Register(AHRR) & Directory
of Archives(Australian
Society of Archivists); In April 2011, National Archives
Adelaide office & reading room will co-locate with State Records of South
Australia City Research Centre, in Leigh Street , Adelaide;
National Data Catalog(US)
open platform for government data sets and APIs (review)
6 May 2010
find datasets by & about government at all levels (federal,
state, local) & branches (executive, legislative, judicial);
data is cataloged from Data.gov, District of Columbia, Utah, &
Sunlight Foundation currently; all formats: XML, MAP, CSV, ATOM,
XLS, ESRI, etc; Playing
with the NDC;
National Security
Archive
29
July 2003
research institute on international affairs; declassified US
documents(Freedom of Information Act); 2m+ pages in 200+
collections; Document
of the Month
Nature Archive
2 February 2008
Subscribe for fulltext, free searching; every article(1869-1949)
of the
world's foremost weekly scientific journal; Personal subscriptions
& institutional site licenses include access back to 1997;
4000 issues, 180000 articles in PDF with HTML abstracts on
nature.com platform; access to 1987-1996 archive, or 1950-1986 by
site licences or purchase individual articles; history of the
journal; Nature
Online Video Streaming Archive complements selected articles
& letters featuring analysis & commentary from Nature
editors & selected scientists; honeybee genome, smoking &
lung cancer genes, evolution of language; Gateways &
databases; (19 June 2009)Nature
Publishing
Group(NPG) permit academic reuse of archived author manuscripts:
data-mine & text-mine author manuscripts from NPG journals
archived in PubMed Central & other academic repositories;
NewspaperArchive.com
27 April 2007
63.6m pages/676 cities/2577 titles; historic newspapers, 1759-; 9
free special interest archives fromn 15-50 thousand digitized
pages as originally printed; papers from US, Canada, Great
Britain, Ireland, Jamaica, Denmark, South Africa, & US Virgin
Islands;
New
York Times complete article archive from
September 1851 to December 1995
19 December 2003
Search fulltext, author, or headline free; specify date range or
include advertisements & other listings in your search; Search
listings include title, date, author, icons offer sample portion
or purchase article; individual article costs US$2.99, buy article
packs & pay less (25- article pack costs equivalent of
US$1.05; 180 days access); New York
Times & LinkedIn
US social network for professionals content partnership for
personalized news targeting industry verticals on Business &
Technology sections of NYTimes.com; Times
Extra: opens webfront page to outside content gathered &
ranked by their Blogrunner news aggregator; Article
Skimmer streamlined interface scans top headlines in every
section of the Times; Times
Wire: New
York Times experiments with real time news, FriendFeed style
desktop app, notable for swapping out Microsoft's Silverlight
technology for Adobe's AIR platform - a significant win for Adobe
over its RIA (Rich Internet Apps) rival; real time 'river of news'
in reverse chronological order updated every minute; view full
stream of content from across the site or just Business &
Technology section; customize view from favorite sections and
blogs; features a photo gallery displaying the latest news in
pictures; first NYT product built with its Newswire API; NYT
to publish index as Linked Data; Custom Feeds
tool to create keyword-based RSS feeds for NYT content,
keyword/topic search gets snippets & sometimes images; 5000
(of 30000) subject headings to data clouds for browsing or
download under Creative Commons license;
New York Times Book
Review Archive Free (for reviews from past 10 years),
registration required 6 May 2005
New Zealand Electronic Text
Centre(NZETC)
11 September
2009
1000+ electronic books about, by, or relating to New Zealand &
New Zealanders; simple or advanced search, browse by author,
works, or subject(Contemporary & Historical Māori &
Pacific Islands, Language, Literature, New Zealand History); books
available in ePUB format or TEI XML; Kindle wont read ePub, but
Calibre open source program converts to/from a variety of e-book
formats including from ePub to MOBI;
NISCAIR Online Periodicals
Repository(NOPR)
12 February 2012
Noel
Butlin Archives Centre(Australian National
University)
14 May 2003
national collection of primary source material; archives of
industrial organisations, businesses, professional associations,
industry bodies and the labour movement
Northern New York Historical
Newspapers free; 900000+ pages of archives from 27
newspapers 4 January 2008
Notes
and Queries, November 1849 - June 1922 (Online Index with
links to some fulltext)
24 May 2009
a sort of 19th Century Wikipedia in which anyone could contribute
in a weekly published paper on a wide range of topics; index links
to all volumes in public domain sorted by date; All links default
to Internet Archive collection, where available.
Alternate sources are Google Books collection & U of
Michigan collection;
OAIster digital
resources union catalogue (metasearch)
24 September 2002
collections from institutions include description, number of
records from each collection; (20 May 2005) Open Archives
Initiative (OAI) Protocol for Metadata Harvesting (PMH) 4.8m
metadata records from 400+ organisations worldwide; some hidden
behind search forms or CGI script inaccessible to search engines;
text, images, audio & video files - repositories carrying
mainly bibliographic records are removed or asked to isolate
digital content for harvesting; monthly updates; duplication
& search via variable date formats problem ; metadata included
in Yahoo! search index since March 2004; Google use only URLs; If
you use Firefox, you can add OAIster as a search engine plugin in
your browser toolbar; (13 April 09) 20m+ rcords from 1000+
institutions; Search
Entire Record, Title, Author/ Creator, Subject & filter on
Language; Boolean search & wildcards; Filter results on types
like text, images, audio, video & data sets; Sort criteria;
repositories & number of hits; works with SRU
(Search/Retrieval via URL) & supports CQL (Contextual Query
Language)
OECD
iLibrary initially available to subscribers via link from SourceOECD
18 August 2009
collective brand name for OECD Publishing's online services;
access to OECD textual publications including those of
International Energy Agency(IEA), Nuclear Energy Agency(NEA),
International Transport Forum(ITF) - in PDF or HTML form &
access to OECD statistical databases in which data can be selected
and downloaded in HTML, XLS or CSV formats; bundled with print
versions or purchased separately as online-only; includes 17
thematic book collections(50 books/year), 14 journals, 24 Working
Paper Series, & 22 OECD statistical databases;
Official
history of Australia in the war of 1914–1918
24 March 2004
directed & edited by Charles Bean, 1920 - 1942; single
greatest source of interpretation of Australia’s part in First
World War.
Old Car Manual Project
10 June
2004
volunteers scan hard-to-find documents - manuals, brochures,
wiring diagrams since 1911. The site is slow-loading
Online
archive of UK science(BL project): audio library of 200
British scientists' recollections
25 February 2010
Online Books Page
(Penn U) 35000+ free books
24 May 2009
Open
Access News
3 January
2005
Putting peer-reviewed scientific & scholarly literature on
internet, free of charge & of most copyright & licensing
restrictions; Removing barriers to serious research; See Timeline
of Open Access Movement(below); Open
Access Overview, 31 Dec 2004; Registry of Open Access
Repositories: ROAR (Custom
Google Search) ; On
the Verge of Revolution - Open-access Publishing;
Open Archives Forum(OAI)
21
November 2003
interoperable repositories for metadata sharing, publishing &
archiving; OAI Online
Tutorial - Open Archives
Initiative Protocol for Metadata Harvesting; Explore
Open Archives lists & comments on other lists of
individual open archives giving a broad overview of
structure, size & progress of fulltext open access eprint
archives maintained & updated to assist quantitative research
on open access eprint phenomenon;
Open Content
Alliance building a digital archive of global content
for universal access
6 January 2008
Open Culture: best free
cultural & educationa media on web
28 August 2010
Free audio books, free online courses, free movies, free language
lessons, free ebooks;
Open
Data Portal Pilot part of Government of Canada's commitment
to open government
29 March 2011
3 streams: open data, open information & open dialogue, aims
to drive innovation & economic opportunities for all
Canadians; seeks to improve ability of public to find, download
& use Government of Canada data; search catalogue, download
datasets & explore possibilities of Open Data; for both
non-commercial & commercial use; formats should be
machine-readable & typically XML-based;
Open Data Search
metasearch tool: find & access Open Government data sets
23 March 2011
global version of prototype publicdata.eu site; aggregator for
datasets, providing simple, unified search interface to all
catalogues contained including known instances of CKAN software,
Sunlight Foundation’s National Data Catalog(with many US-based
data sources), World Bank data catalogue, Sweden’s
DCat-enabled OpenGov.se & Nexedi’s Data Publica portal,
search.ckan.net with access to combined index of all CKANs; how
it works;
Open source
Comparing
Open
Source Indexers
12 April 2008
ourmedia(formerly
Open Media): global
home for grassroots media
27 November 2004
open-source initiative undertaken to create global library for
grassroots media of all kinds; foster & promulgate open
standards(APIs, schemas & user interfaces) to bring personal
media to desktop; DNS registration of CC content; Simple upload
APIs; Jukeboxes & Photo Albums with built-in Content; Shared
Jukeboxes & Shared Galleries; Standardized schemas & APIs
for shared server; Personal Media Server - hosted or downloadable;
Multiple Directory viewpoints: words, images, net maps - sorted in
various ways: alpha, category, by library location, author, time,
by keywords (support multiple ontologies); Retraction or changing
of license; Provenance; Creator Authentication; Brewster Kahle
& Internet Archive are supporting project with free storage
& bandwidth for grassroots media; ourmedia.org will serve as
central resource to bring grassroots video, audio & photos
together; learning toolkit on how to create rich & compelling
works; community space; archive..For those who grant permission,
serves as clearinghouse that allows others to search for video,
download & reuse or remix it, with proper attribution
PANDORA(Preserving &
Accessing Networked Documentary Resources of
Australia)
12 November 2008
Australia's Web Archive is a growing collection of Australian
online publications established initially by National Library of
Australia in 1996, now built in collaboration with 9 other
Australian libraries & cultural collecting organisations;
Paper of
Record's Searchable Archive of Historical
Newspapers
2 January 2007
Papers Past
15 June 2006
selected C19 New Zealand newspapers & periodicals; 1m+ pages
from 41 publications; viewONE
software for viewing large image files; in Māori too.
Peter Suber's
Free Online Scholarship site newsletter, blog,
timeline of FOS movement
15 November 2002
guide to scholarly open archives; 5 July 2003 - open access =
barrier-free online availability of scientific & scholarly
literature; FOS aka open access since the launch of Budapest Open
Access Initiative in February 2002; Newsletter &
Forum home page (SPARC).
PLoS
Biology(Public Library of
Science)
3
November 2003
works of exceptional significance in biological science; molecules
to ecosystems; works at interface with other disciplines; PLoS
Pubget
links to all journal sites, linking PLoS readers to fulltext
when the articles are OA, & when the articles are TA(toll
access), either to the publisher site or to a special PubGet site
where more than 200+ institutions support subsidized access;
Popular
Mechanics(PM) magazine Archives: Tools
7 August 2005
hand & power tool topics, including purchase & use of
specific tools, product reviews, & tool tests - circular saws,
table saws, routers, tape measures, clamps, chisels, wire
strippers, & screwdrivers; photos;
Popular Science 137-year backfile free browsing 5 April 2010
Portal to Public
Records(BRB Publications)
14 September 2007
directory of free public record sites, reference material, &
public record vendors available;
Portico electronic archiving
service(2005-)
16 March 2006
funded by JSTOR, Andrew W. Mellon Foundation, Ithaka, & Library of Congress;
signed archiving agreement with Oxford Journals division of Oxford
University Press; formerly Electronic-Archiving Initiative(JSTOR
2002-); 18 June 2010: 110 publishers representing 2000+
professional & scholarly societies e-books(33000+),
e-journals(11000+), & d-collection; 15m articles; publishers
in JSTOR’s Current Scholarship Program have their current content
preserved in Portico; Facts
& Figures; list of titles & participating
publishers;
Print News
archives(JournalismNet) includes Historical Archives
& Non-print Archives
9 September 2008
Project
Gutenberg first & largest single collection of free
electronic books, Michael Hart founded 1971 22
April 2011
Project
Gutenberg Mission Statement by Michael Hart; Tag cloud
visualization & search system;
PRONOM
5 May 2006
on-line information system about data file formats &
supporting software products originally developed to support
accession & long-term preservation of electronic records held
by National Archives but now generally available;
ProQuest
Historical Newspapers
2 April 2005
NYT -
Issue 1, 1851 - ; Wall Street Journal, 1899-; Washington Post,
1877-; Christian
Science Monitor, LATimes, Chicago Tribune, Atlanta
Constitution, Boston Globe; news, editorials,
letters to editor, obituaries, birth & marriage announcements,
historical photos, illustrations & advertisements ; enhanced
digital reproductions of every page from every issue in PDF format
Pubget: search engine for
life-science PDFs
5 April 2010
20m life science research documents including those in
PubMed®; free to scientific community, funded by
marketing(ads) & intranet services to lab equipment,
biotechnology, pharmaceutical, & other research industries
& through premium services; PLoS
Pubget
links to all journal sites, linking PLoS readers to fulltext
when articles are OA, & when the articles are TA(toll access),
either to the publisher site or to a special PubGet site where
more than 200+ institutions support subsidized access;
Public Data Set
on Amazon Web Services
24 March 2012
centralized repository of
public data sets that can be seamlessly integrated
into AWS cloud-based applications; hosted at no charge
for community &, like all AWS services, users pay
only for compute & storage they use; large data
sets such as mapping of Human Genome & US Census
data required hours or days to locate, download,
customize, & analyze. Now, anyone can access these
data sets from their Amazon Elastic Compute
Cloud(Amazon EC2) instances & start computing on
the data within minutes; Users can also leverage
entire AWS ecosystem & collaborate with other AWS
users; Public
Data Sets forum;
Public domain review: Guide to finding interesting public domain works online 27 January 2012
Public
Library Complete(elibrary)
9 February 2004
5100+ full-text books, maps & documents from 90 publishers…8
individual databases may be purchased separately;
Public
Records Directory Australia
15 May 2006
PubMed
Central: Archive of Life Science Journals
3 April 2004
free access to 50 peer-reviewed journals with search engine;
sequence databases and other factual databases available to
scientists, clinicians
Questia - Online Library
7 March 2004
48000+ books, 390000+ journal, magazine, newspaper articles;
US$19.95/m; $44.95/qtr; $119.95/yr
Safari
Rough Cuts Service
27 January 2006
searchable fulltext access technology books; peak at manuscripts
yet to be published online or downloaded as PDF; initial book not
fully edited, subjected to final technical review, or completely
formatted; PDF updated every time author & editor make
changes; built-in Notes feature, readers send feedback,
suggestions, bug fixes, & comments directly to author &
editor;
SA-NT DataLink
13 November 2010
supporting health, social & economic research, education &
policy in South Australia & the Northern Territory;
Administrative, clinical & service datasets are linked using a
minimum number of variables. De-identified, linked variables are
then made available to researchers by Data Custodians(the agency
from which the data originates) for ethically approved statistical
linkage projects; Useful
resources; Consumer Reference Group provides independent
advice on issues of consumer & communiity interest; SA-NT Data
Linkage Anmimation;
Scopus
abstracting & indexing database
4
November 2004
Elsevier; access to 14000 peer-reviewed titles from 4000+
international publishers; multidisciplinary records since mid
1960’s; Abstracts, 1966-; fulltext links;
ScraperWiki: all the tools
you need for screen scraping, data mining &
visualisation
20 March 2011
Make bad data good, collaborate & discover new datasets; centralised location for custom-built
scrapers which turn web pages into usable data such as RSS
feed or database;
SearchSystems.net Public
Records Directory
28 May 2006
subscription access to worldwide business information, corporate
filings, property records, deeds, mortgages, criminal & civil
court filings, inmates, offenders, births, deaths, marriages,
unclaimed property, professional licenses;
Sears Archives 100+ years of stories, product & brand histories, photographs, catalog images 19 October 2004
Serials
archives & indexes listings (Online Books Page)
24 May 2009
some of the major sources & indexes of free online texts, in
all languages, both general & specialized; Large-scale
repositories; Significant
indexes & search aids; Significant
smaller-scale archives; Serials(Online Books Page) lists freely
accessible archives of serials( magazines, journals, newspapers,
& other periodicals) listed according to these
criteria;
Slave Narratives: folk history of Slavery in US from interviews with former slaves. V.1-16 25 February 2011
Slave
Trade Archives
September 2005
clearinghouse for documents related to transatlantic slave trade
& slavery across both hemispheres; external websites &
multimedia archives from other institutions; African Studies
Collection at Indiana University & Amistad Research Center at
Tulane University.
SmallTownPapers(US)
2 June 2005
free digitized newspaper archives; archived as printed; search
through articles & advertisements, look for photos..weekly
updates; .
Spectator
Online Archive freemium model, browse for free,
register(& pay) for fulltext (Spring
2012) 18 March 2012
weekly magazine with conservative leaning continually-published
since 6 July, 1828, currently owned by David & Frederick
Barclay who own conservative Daily Telegraph newspaper; iOS app for iPad, iPhone
& iTouch;
Spectator Text
Project(Center for Electronic Texts in Humanities )
27 March
2004
Published by Joseph Addison & Richard Steele; innovative C18
periodical, format & style (influenced by The Tatler,
published 1709-11) imitated throughout Europe & Americas;
compare formats & text passages in split-screen comparison
page; download DJVU plug-in free; The Tatler(1709-1711); The Spectator (1711-14);
The Female
Spectator ; Le Spectateur Français; Bailey's
Dictionary; Der Biedermann (1727-1728); Johnson's Rambler
(1812); Steele's Plays (1894); Biography of Addison; Predictions of
1708; The Spectator, March - December 1711 also
available in XML format;
State Records of South
Australia
7 January
2011
In April 2011, National Archives Adelaide office & reading
room will co-locate with State
Records
of South Australia City Research Centre, in Leigh Street ,
Adelaide; As part of collocation with National Archives of Australia's
South Australian Office, South Australian Colonial records moved
interstate will return to Adelaide & post, railway &
customs records held in Sydney since 2000, will be returned to
mark the State's 175th
anniversary of settlement this year; The move will allow
academics, researchers, genealogists & students to access
federal, state & local government archival records from the
State Records Centre on Leigh St in Adelaide;
Sydney Morning Herald
Archives trialling
13 July 2007
subscribe for access to every edition of Sydney Morning
Herald & Sun-Herald between 1955-1990; Sydney Morning
Herald (1831 - 1954) digitisation by NLA to be completed
in 2009; .
Technical
Reports & Archive Image Library(TRAIL) US
15 May 2011
20000+ at-risk US Gov Technical Reports digitized including
documents by US Atomic Energy Commission(AEC) responsible for
peacetime use of atomic science & technology from 1946 until
1974; U of Arizona in collaboration with Center for Research
Libraries(CRL) & other interested agencies to identify,
digitize, archive, & provide access to federal technical
reports issued prior to 1975;
Time Archive Some free
online
23 December 2004
266000+ articles since inaugural issue in March 1923; browse by
topic, search by keyword. Full-text articles free of charge to
TIME subscribers; subscribe online, but access is not immediate; Time Magazine
Covers Archive browsable, searchable archive from 1923;
Timeline
of
Open Access Movement
3 January 2005
Times Digital
Archive
12 December 2003
subscription; search & access full-page facsimiles of every
page of Times,
1785-1985
Trove(National Library of
Australia) replaces free Libraries Australia service &
other resources 24 February 2010
integrated access to 45m+ items from NLA's collaborative services
& elsewhere; Metadata from Australian sources including Australian
National Bibliographic Database containing location
information from 1200+ Australian libraries; Picture Australia;
Australian
Research Online; Music Australia; Register of
Australian Archives & Manuscripts; People Australia
program including biographies & relationships from Australia Dancing,
Australian
Dictionary of Biography Online, Australian Mines
Atlas, Australian Women's Register, Collections
Australia Network (CAN), Libraries Australia, Music Australia;
Overseas sources including, OAIster, PubMed Central, Project MUSE,
NASA
Technical Report Server, CiteBase, Nature Publishing Group,
etc; Open
Library - online public domain books; Hathi Trust
- online public domain books; Wikipedia - keywords (tags)
associated with books; Features access to National
Bibliographic Database for checking holdings in Libraries
Australia wide; Increased online content with access to fulltext
online resources; free account for users to search for holdings in
their preferred libraries, add comments & tags to resources,
& (soon) save lists of favourite resources in their profiles;
Improved search functionality with relevance ranking, facets for
narrowing results & ability to search across multiple formats;
Trove:
innovation in access to information in Australia; 4 Nov
2010: 61 individual newspaper titles available with aim to make
115 available by July 2011; 1.5 – 2m lines corrected by
community each month & running total of lines of newspaper
text corrected is 22.6m; Migration of Australian Newspapers
(blue interface) into Trove newspapers zone: There is a
standalone version of Australian Newspapers
http://newspapers.nla.gov.au/ndp/del/home(blue original interface)
and a Trove version http://trove.nla.gov.au/newspaper(green
header). In 2009 we received positive feedback from users on
having the ability to search across other content at the same time
as newspapers. Therefore a plan to migrate the Australian
Newspapers fully into Trove was developed for completion in 2010.
Users will still have the option to search across newspapers only
in Trove. All work to replicate the functionality of the original
blue Australian Newspapers service has now been completed in
Trove(ongoing over last 6 months). In addition the Trove
newspapers zone has been enhanced based on user feedback &
there are now 35 new features that will help newspaper users in
Trove. A complete list of the new features available in Trove that
are not available in the standalone version are here:
http://www.nla.gov.au/ndp/project_details/documents/ANDP_Differencesandsimilaritiesbetweenblueandgreen1Nov2010.pdf;
The
three
most
significant
items
are:
1.
The
ability
to
add
newspaper
articles
into
a
list
(public
or
private);
2.
The
RSS
alerts
for
new
content
(articles,
issues
&
titles);
3.
The
user
forum
to
contact
other
users
and
discuss
newspaper
topics.
Soon
an
automatic
re-direct
will
be
in
place
on
the
standalone
version
of
newspaper (blue) so that all newspaper users are directed into
Trove. This will happen before the end of 2010 & a message
will be posted closer to the time on the standalone interface. The
blue standalone version will not be available in 2011; 4 Nov 2010:
Climate
history newspaper tagging project - sources in Trove show
how weather events have affected society, with eye witness
accounts of floods & bushfires, e.g. 1851 Black Thursday
bushfires in Victoria; To search within tags only:
use publictag:<tag>
in your search, combine the use of publictag: with Boolean searches, wildcard
searches(only match characters at start of tag), only exact
matches for the tag entered will be found(search for tag "raymond"
will not match tag "Alfred John Raymond"), It is not currently
possible to search for private tags, it is now possible to specify
an exact match search & turn off fuzzy searching - In
Newspapers, syntax for this is fulltext:,
syntax for other zones is text:
(To perform a search across all zones using only an exact search
term, use OR to capture both) , cite button now includes Wikipedia
citation format; RSS feeds: set up RSS feed for a
specific search. To do this, scroll to the bottom of the search
results (this can be the results from any single zone, or the
results across all zones, and click on the link ‘Subscribe to this
web feed’. You will then be notified as new items that are
relevant to your search are added to Trove. Be aware that this
will only notify you of new items added, not items that have tags,
comments or corrections added that mean the item then meets your
search terms; 44m articles, 4.4m pages including 225000 pages from
Australian
Women’s Weekly are now available through Trove; Text
correction: In January 2011 2m+ lines of text corrected
in a month for the first time, total of corrected lines has now
reached 31m; 25 Mar 2011: now includes Sunday mail(Adelaide), 1912-1917,
1925-1929, 1931-1954;30 May 2011: forthcoming new Journals, articles & datasets
zone in conjunction with eResource vendors Gale &
Informit have loaded millions of new article level records,
simplifying access to Australian library subscription databases
from those vendors: initial titles included are
From Gale: Academic OneFile, Health & Wellness Resource
Center, Literature Resource Center; From Informit: Australian
Public Affairs Full Text, Business Collection, Engineering
Collection, Families & Society Collection, Health Collection,
Humanities & Social Sciences Collection, Meanjin, Media
International Australia, New Zealand Collection; If your library
has a subscription to these resources you can click through, even
if you’re offsite, & authenticate with your library card
barcode, proxy server or IP authentication to see articles;
7 March 2012: Canberra times, 1926 - 1995 free
access NLA Digitisation of Canberra times, 1955-1995 for
Centenary of Canberra, 1913 (complementing Trove
digitisation of 1926-1954); 8 March 2012: The Dawn, a
journal written & printed by women (NLA digitisation). One of
Australia's first feminist journals now in digital collection of
National Library of Australia(NLA) to mark International Women's
Day; published monthly in Sydney from May 1888
until final
issue in July 1905 by Louisa Lawson, mother of poet Henry
Lawson covering issues from campaign for equal pay & women's
suffrage to evils of corsetry; closed in 1905 when Lawson became
ill - 3 years after most Australian women won right to vote; There
is one sole surviving copy of entire publication, held by State
Library of NSW;
Turn
of the century magazines(Internet Archive)
21 July 2007
Munsey’s
Magazine(1898), Argosy
Magazine(April 1891), Scribner’s,
vol 1 (1887), The
Mentor(Multiple volumes, 1913-1914); See also the Prelinger
Library's representative magazines from the period & Project
Gutenberg ;
TVO Public Archive
online (Ontario, Canada)
23 February 2011
public/educational television from Ontario first 40 years of
broadcasting; 325+ programs & segments so far; keyword
search for shows, browse collections, most viewed programs OR
programs new to archive, featured shows; browse by Program
Name, Subjects, People;
UC Irvine Machine
Learning Repository Browse
datasets
15
March 2011
collection of databases, domain theories, & data generators
that are used by the machine learning community for the empirical
analysis of machine learning algorithms; includes simple text file
subset of Million
Song Dataset;
UK Data Archive
7 September 2004
internationally-renowned centre of expertise in data acquisition,
preservation, dissemination & promotion; UK digital data in
social sciences & humanities
UK Documents
Online(PRO)
27 March 2004
full cost recovery service, cost/document £3.50 from 1 April
2004; pilot free access at National Archives & Family Records
Centre (FRC)
UK
Government
Web Archive
12 May 2005
UK Hansard 1804 -
2005 online by 2009
17 July
2008
Contemporary
Hansard can be found at UK Parliament site, including
content going back to 1988 in House of Commons & 1995 in House
of Lords. Hansard is available on the site about three hours after
debates; 18th
Century
Official Parliamentary Publications Portal includes
information from Hansard prior to 1803 up to 1834. Access to the
site is only available to the higher and further education
academic communities within the UK & other selected
institutions.
[UK] National Digital Archive of
Datasets(NDAD)
4
July 2004
free archived datasets; earliest types of digital record produced
by Government departments since 1963; appraised & selected by
National Archives e.g. Ancient Woodland Inventory; North Sea
Geographical Information System; Public Health Common Data set;
Digest of Museum Statistics; British Crime Survey
UK web archive
26 February 2010
national web archiving project by UK Web Archiving
Consortium(UKWAC) for research community - social, historic &
culturally significant material from UK - British Library, Joint
Information Systems Committee of Higher & Further Education
Councils(JISC), National Archives, National Library of Wales,
National Library of Scotland & Wellcome Trust; Browse by Topic
or Search (Searching Metadata/Cataloging); mission to archive a
record of major cultural & social issues being discussed
online working with copyright holders to capture & preserve
6000+ carefully selected websites, helping to avoid creation of a
‘digital black hole’ in the nation’s memory; Search by Title of
website, fulltext or URL, or browse by Subject, Special collection
or Alphabetical list;
UNdata: updates
several
web-accessible databases
3
February 2011
United Nations Industrial Development Organization: UNIDO's INDSTAT4
2010 Database; World Tourism
Organization: Tourism Data; Human Development
Indices on UNdata; International Labour Organization: Tables on Youth
Unemployment;3 Dec 2011: United
Nations Development Programme(UNDP): Open Data Exploring the Data section,
look over information organized by country or project, visual
representation of where UNDP directs its resources; country or
project data sets in visual format, export, filter;
Universal
Library
Project(aka Million Books Project) Internet
Archive
12 July 2009
Governments of India, China, & Egypt are helping fund this
effort through scanning facilities & personnel; Internet
Archive has contributed 100k books from Kansas City Public Library
along with servers to India & has automated conversion of
scans into collection;
UNOG Registry, Records
and Archives Unit(1870-)
2
February 2006
archival research catalogue of the League of Nations and of the
United Nations Office at Geneva; ten fonds incompliance with
ISAD-G standard; sub-fonds, series, sub-series, files and
documents; keyword search, browse; allows attaching a digitized
document to a unit of description; .
US
Army Field Manuals, Training Circulars,
Technical Manuals, War Department/Department of the Army
Pamphlets
11 June 2009
full text selected US Army Field Manuals(FMs), Training
Circulars(TCs), Technical Manuals(TMs), War Department
Pamphlets(WD PAMs) & Department of the Army Pamphlets(DA
PAMs) range in date from early C20 through early C21;
intelligence interrogation, land warfare, prisoners of war;
content added regularly;
Usenet FAQ
Archive Search
2 September 2003
See also Harley Hahn's
Master List of Usenet Newsgroups & Triceron.com; See also BinSearch above; (Oct
2009)Google’s
abandoned library of 700m titles; (8 Oct 09)Google
fixes Usenet Archive; GrabIt
Described for Noobs(open
source tool for accessing Usenet content & assembling split
files) 5
Jan 2010; 3 June 2010: Duke U
shuts down its Usenet Server, Electronic newsgroups began at
Duke in 1979; Usenet Archives
collected by Norman Yarvin;
Usenet search See also Forum
Search(Newsgroups, mailing & discussion lists,
Listserv, Usenet) 25 October 2008
US War Department
Papers, 1784-1800 (Center for History & New Media,
George Mason U) 10 January 2009
decade long project to locate all records thought to have been
lost in an 1800 fire; Indian affairs, veteran affairs & naval
affairs records from 200+ repositories consulting 3000+
collections in US, Canada, England, France, & Scotland; browse
55000 documents by year or person of interest or search;
Video archives BBC Motion Gallery – BBC Archive & NewsFilm Online 25 August 2010
Wayback
Machine
13 January 2002
launched 24 Oct 2001, Internet Archive's public access to 100+
terrabytes of monthly snapshots of entire WWW from 1996; past
contents of a given URL; 10b pages(7 June 2003 ); Recall (in Beta)
keyword search on 35% of database; graphs returned pages by date
with options for narrowing your search using categories &
topics; limit your search by date the page was captured by Wayback
Machine; 27 Jan 2011: Inside
the Wayback Machine with George Oates; 31 Jan 2011: In-depth
overview of new beta release; Archive-It fee- based
service from Internet Archive allows users to specify which URLs
they want crawled & frequency to be recrawled; Archive-It
provides access to 1300 collections(as selected by the client) of
archived web pages from a collection of multiple social media
sites from NASA to subject based collections from U of Toronto
keyword searchable;
Web
Archives: The Future(s)
5 September 2011
report by researchers at Oxford Internet Institute for
International Internet Preservation Consortium(IIPC)
WebDataCommons.org joint
project Freie Universität Berlin & Karlsruhe Institute of
Technology 24 March 2012
extracts Microformat, Microdata
& RDFa data(structured data describing for instance products,
people, organizations, places, events, resumes, & cooking
recipes) from Common
Crawl web corpus, largest, most up-to-data web corpus
available; provides data for download in form of RDF-quads with
basic statistics; Pages in Common Crawl corpora are included based
on their PageRank
score making crawls snapshots of popular Web;
WebWorld
- Archives around the World(UNESCO)
27 February 2010
We need
publishing standards for datasets & data tables (OECD
White Paper)
18 August 2009
cf Data.gov US data in machine
readable format & OECD
iLibrary;
Whistleblowers See Wikileaks.org
Wikileaks.org publishes
classified, confidential, censored or otherwise secret documents
26 November 2008
anonymous international network of activists; host company has
ties to Pirate Bay; uncensorable Wikipedia for untraceable mass
document leaking & analysis; exposing oppressive regimes in
Asia, former Soviet bloc, Sub-Saharan Africa & Middle East;
revealing unethical behavior in governments & corporations;
opens leaked documents up to stronger scrutiny than any media
organization or intelligence agency can provide;
Women Working, 1870-1930
Open Collections Program Women Working project
13 June 2004
will provide access to digitized books(2000+), manuscripts(10000
pages) & images(1000) from collections of Harvard University
Libraries & Museums on topic of women in US economy,
1870-1930. Search, browse by topic, individual, dates &
events, organization
World
academic repository search engine, 2012 version
27 January 2012
Search 2756 academic repositories lists from ROAR, OpenDOAR, BASE,
Open Archives, etc combined in Excel, comma delimited on / &
URL path trimmed, de-duplicated, & cleaned manually with the
useful Sobolsoft addins for Excel; special OAI-PMH query/harvester
URLs were excluded; search results may default to being drawn from
main Google database; usual Google Search modifiers such as
filetype:pdf or filetype:doc;
World Intellectual Property
Organization(WIPO) On-line data
collections hosted by WIPO;
12 February 2010
World Memory Project
information about individual victims of the Holocaust & Nazi
persecution 8 May 2011
US Holocaust Memorial Museum
historical documents made freely name searchable online with help
from Ancestry.com; 170m
pages of documentation featuring information on 17m+ individuals
with names, dates, locations, conditions, & physical
descriptions of victims; See also Yad
Vashem(The Holocaust Martyrs' & Heroes' Remembrance
Authority) Resource Centre & Archives; See also JTA: The Global News Service of
the Jewish People, 1923-(formerly Jewish Telegraphic Agency)
Jewish history from 1917 7000+ contemporaneous articles reported
from Europe between 1937-1945 documenting the Holocaust on a daily
basis, another 7000+ documenting experience of Russian Jews
through reign of Communism, coverage of life in Palestine before
Israel inaugurated in 1948 free from not-for-profit media company
similar to Associated Press; Jewish
Encyclopedia.com 12-volumes published 1901-1906; 15000+
articles & illustrations searchable; Jewish history, law,
theology, philosophy, literature, biography;
World War I Document
Archive
25 June 2004
Primary documents mostly in English from volunteers of World War I
Military History List(WWI-L); conventions, treaties,
official papers, documents by year, personal reminiscences,
biographical dictionary. From library at Brigham Young
University(BYU); See also Encyclopaedia
of the First World War;
Year book
Australia, 1908 - 2008
27 July
2008
All past issues of ABS publication 1301.0 have been digitised and
can be found at the final edition Year
Book Australia, 2008 by clicking on the Past
& Future Releases tab
Yearbook of the United
Nations, 1946 to 2005
18 October 2008
Power
Search across set, decade or annual volume; browse, read
chapters, look for specific topics;