Project Description: Phase 2 - WaxCylinderRevival/frus-dates-project GitHub Wiki

Phase 2 Summary

  • Timeframe
    • October 2017 to May 2018
  • Staff
    • Metadata Specialist: Amanda Ross (@WaxCylinderRevival)
    • Search Product Owner: Joe Wicentowski (@joewiz)
    • Backlog Publishers: Virginia Kinniburgh, Stephanie Eckroth
  • By the Numbers
    • Between October 21, 2016* and June 1, 2018, we've added at least one dateline to 275,704 historical documents (a 65.32% increase).
    • As of May 17, 2018, 100% of historical documents (excluding attachments) in the FRUS digital archive have at least one dated dateline.
    • As of May 17, 2018, 303,711 (99.97%) tei:div have been assigned machine-readable dateTime values and are date-searchable and -sortable.
Category October 12, 2016 May 17, 2018 Change August 30, 2017 May 17, 2018 Change
Total Documents 192,930 284,088 +91,158 documents (47.25% increase) --- --- ---
I. Editorial Notes 7,765 (4.02%) 8,080 (2.84%) +315 editorial notes (0.04% increase) --- --- ---
II. Historical Documents 185,165 (95.98%) 276,000 (97.15%) +90,835 historical documents (49.06% increase) --- --- ---
IIa. Historical Documents w/at least 1 dateline 166,774 (90.00%) 275,704 (99.89%) +108,930 (65.32% increase) --- --- ---
IIb. Historical Documents (excluding attachments) w/at least 1 dateline --- --- --- 225,113 (99.71%) 276,137 (100%) 51,024 (22.67% increase)
IIc. Historical Documents (excluding attachments) w/at least 1 dateline//date --- --- --- 224,751 (99.55%) 276,136 (100%) +51,385 (22.86% increase)
Parent Subchapters, Chapters, Compilations (etc.) w/dateTime-min/max 0 19,629 (99.58%) +19,629 increase --- --- ---
Front and Back Matter w/dateTime-min/max 0 2,539 (100%) +2,539 increase --- --- ---

[* October 21, 2016 is the date of the first FRUS-dates-project commit. October 12, 2016 is the date of the first query-based analysis of the FRUS corpus.]

Project Brief

As of May 2018, we completed the work detailed in Phase 1 for the existing digital corpus and newly integrated quarterly releases. 100% of historical documents have been assigned a date/date range, including those “undated” by FRUS compilers past.

In December 2017, the Office of the Historian launched its new search function, which allows date searching and sorting: https://history.state.gov/search

We developed logic for applying comprehensive dateTime ranges as div/@frus:doc-dateTime-min and div/@frus:doc-dateTime-max for 19,629 (99.58%) parent subchapters, chapters, and compilations. (Exceptions are in volumes frus1902app1 and frus1902app2, which need restructuring work.)

We analyzed front and back matter sections, applying div/@subtype as well as div/@frus:doc-dateTime-min and div/@frus:doc-dateTime-max. 2,539 (100%) sections in front and back matter now have searchable dates.

(For more on completed work and future development, please visit Issue Tracking)


Previous: Phase 1 | Next: Date Encoding Practices