A prototype app for browsing and searching Association for Diplomatic Studies and Training (ADST) Oral History Interviews.
From the ADST Oral History homepage:
Since 1986, the Foreign Affairs Oral History Program of the Association for Diplomatic Studies and Training (ADST) has recorded more than 2500 interviews with former participants in the U.S. foreign affairs process. Collectively, these oral histories span over 80 years. About 60 new interviews are added annually. The series also contains some significant oral histories dealing with American diplomacy, which were provided by universities and presidential libraries.
The oral history collection has become one of the largest in the country on any subject and the most significant collection on foreign affairs.
The Oral History Collection is a part of the Library of Congress American Memory collection. It is unclassified and available to the public and can be found at Library’s Front Line Diplomacy website. It is also available on ADST’s site under Oral History Interviews.
The Library of Congress (LOC) hosts the ADST interview transcripts at https://www.loc.gov/collections/foreign-affairs-oral-history/about-this-collection/. ADST's site has these, as well as some newer interviews not yet posted on LOC's site, in PDF form, at http://adst.org/oral-history/oral-history-interviews/. Some of the ADST PDFs contain tables of contents not found in the LOC edition. The LOC edition is available in basic TEI (P4-era?) XML and PDF, and is enriched with subject headings available in JSON-encoded metadata.
For example, the LOC makes the ADST Interview with L. Bruce Laingen available in TEI XML and JSON-encoded metadata. These were adapted and enriched from the original ADST manuscript (PDF).
- Obtain all 1,724 interviews (text and metadata) from LOC link above.
- Download source XML and JSON data. The XQuery scripts used to download the data are in the
import-scripts
directory. - Place JSON-encoded metadata in
data/metadata
- Place full text XML-encoded interviews in
data/interviews
- Convert LOC's P4(?)-era TEI XML to P5-conformant TEI XML (referencing American Memory DTD documentation at https://memory.loc.gov/ammem/amdtd.html; see XQuery scripts in
import-scripts
)
- Download source XML and JSON data. The XQuery scripts used to download the data are in the
- Generate eXist app with TEI Publisher
- Make customizations to
modules/config.xqm
, all supported by TEI Publisher: set$config:address-by-id
totrue()
, set$config:search-default
totei:body
, set$config:login-domain
toorg.adst
, set$config:data-root
to$config:app-root || "/data/interviews"
- Make customizations to
- Extract subject headings and other useful metadata from
data/metadata
- Remove unneeded filters and columns (author, title)
- Show columns with relevant info: interviewee name, date, subject
- Add filtering and sorting by interviewee name, date, subject
- eXist-db 3.1.0+ (required for TEI Publisher libraries)
- Apache Ant (required to build a package from source code)
- Download the latest release via https://github.com/joewiz/adst/releases, or clone this repository and build an installer package by calling
ant
to build an application package (deposited in thebuild
directory) - Install the package (the .xar file that you download or build) via eXist Dashboard > Package Manager
- Access via http://localhost:8080/exist/apps/adst/
-
Landing Page
-
Document View
-
Search Autosuggest
-
Keyword Search Results
-
App-generated PDF (via XSL-FO) (Download PDF)