Project Description: Phase 2 - WaxCylinderRevival/frus-dates-project GitHub Wiki
Phase 2 Summary
- Timeframe
  - October 2017 to May 2018
- Staff
  - Metadata Specialist: Amanda Ross (@WaxCylinderRevival)
  - Search Product Owner: Joe Wicentowski (@joewiz)
  - Backlog Publishers: Virginia Kinniburgh, Stephanie Eckroth
- By the Numbers
  - Between October 21, 2016* and June 1, 2018, the number of historical documents with at least one dateline grew by 108,930 to 275,704 (a 65.32% increase).
  - As of May 17, 2018, 100% of historical documents (excluding attachments) in the FRUS digital archive have at least one dated dateline.
  - As of May 17, 2018, 303,711 (99.97%) `tei:div` elements have been assigned machine-readable dateTime values and are date-searchable and -sortable.
Category | October 12, 2016 | May 17, 2018 | Change | August 30, 2017 | May 17, 2018 | Change |
---|---|---|---|---|---|---|
Total Documents | 192,930 | 284,088 | +91,158 documents (47.25% increase) | --- | --- | --- |
I. Editorial Notes | 7,765 (4.02%) | 8,080 (2.84%) | +315 editorial notes (4.06% increase) | --- | --- | --- |
II. Historical Documents | 185,165 (95.98%) | 276,000 (97.15%) | +90,835 historical documents (49.06% increase) | --- | --- | --- |
IIa. Historical Documents w/at least 1 dateline | 166,774 (90.00%) | 275,704 (99.89%) | +108,930 (65.32% increase) | --- | --- | --- |
IIb. Historical Documents (excluding attachments) w/at least 1 dateline | --- | --- | --- | 225,113 (99.71%) | 276,137 (100%) | +51,024 (22.67% increase) |
IIc. Historical Documents (excluding attachments) w/at least 1 dateline//date | --- | --- | --- | 224,751 (99.55%) | 276,136 (100%) | +51,385 (22.86% increase) |
Parent Subchapters, Chapters, Compilations (etc.) w/dateTime-min/max | 0 | 19,629 (99.58%) | +19,629 increase | --- | --- | --- |
Front and Back Matter w/dateTime-min/max | 0 | 2,539 (100%) | +2,539 increase | --- | --- | --- |
[* October 21, 2016 is the date of the first FRUS-dates-project commit. October 12, 2016 is the date of the first query-based analysis of the FRUS corpus.]
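The "Change" columns follow ordinary percent-change arithmetic: the difference between the two counts, divided by the earlier count. As an illustrative sketch only (the helper name `percent_increase` is hypothetical and not part of the project), a few of the table's figures can be rechecked like this:

```python
def percent_increase(before: int, after: int) -> float:
    """Percent change from `before` to `after`, rounded to two decimals."""
    return round((after - before) / before * 100, 2)


# (earlier count, later count) pairs copied from the table above.
rows = {
    "Total Documents": (192_930, 284_088),
    "IIa. w/at least 1 dateline": (166_774, 275_704),
    "IIb. (excluding attachments) w/at least 1 dateline": (225_113, 276_137),
}

for label, (before, after) in rows.items():
    delta = after - before
    pct = percent_increase(before, after)
    print(f"{label}: +{delta:,} ({pct:.2f}% increase)")
```

Note that the percentages in parentheses after each count (for example, 99.89%) are shares of the relevant total on that date, not changes over time, so they are computed differently.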
Project Brief
As of May 2018, we have completed the work detailed in Phase 1 for both the existing digital corpus and newly integrated quarterly releases. 100% of historical documents have been assigned a date or date range, including those left “undated” by past FRUS compilers.
In December 2017, the Office of the Historian launched its new search function, which allows date searching and sorting: https://history.state.gov/search
We developed logic for applying comprehensive dateTime ranges as `div/@frus:doc-dateTime-min` and `div/@frus:doc-dateTime-max` for 19,629 (99.58%) parent subchapters, chapters, and compilations. (Exceptions are in volumes frus1902app1 and frus1902app2, which need restructuring work.)
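One way to picture a comprehensive range for a parent section is as the earliest minimum and the latest maximum among the dated documents it contains. The following is a minimal sketch under stated assumptions, not the project's actual implementation, which is not shown on this page. The sample XML, the namespace URI bound to `frus`, and the function name `apply_parent_ranges` are all hypothetical, and values are assumed to be ISO 8601 timestamps with explicit UTC offsets.

```python
import xml.etree.ElementTree as ET
from datetime import datetime

TEI = "http://www.tei-c.org/ns/1.0"
FRUS = "http://history.state.gov/frus/ns/1.0"  # assumed namespace URI
DIV = f"{{{TEI}}}div"
MIN_ATTR = f"{{{FRUS}}}doc-dateTime-min"
MAX_ATTR = f"{{{FRUS}}}doc-dateTime-max"

# Hypothetical sample: one chapter holding two dated documents.
SAMPLE = f"""
<div xmlns="{TEI}" xmlns:frus="{FRUS}" type="chapter">
  <div type="document"
       frus:doc-dateTime-min="1945-01-03T00:00:00-05:00"
       frus:doc-dateTime-max="1945-01-03T23:59:59-05:00"/>
  <div type="document"
       frus:doc-dateTime-min="1945-02-10T09:00:00-05:00"
       frus:doc-dateTime-max="1945-02-12T17:00:00-05:00"/>
</div>
""".strip()


def apply_parent_ranges(div):
    """Return (min, max) for a div, setting both attributes on undated parents.

    Divs that already carry both attributes are treated as dated documents
    and left unchanged. Divs with no dated descendants return None.
    """
    lo, hi = div.get(MIN_ATTR), div.get(MAX_ATTR)
    if lo is not None and hi is not None:
        return datetime.fromisoformat(lo), datetime.fromisoformat(hi)
    ranges = [r for r in map(apply_parent_ranges, div.findall(DIV)) if r]
    if not ranges:
        return None
    earliest = min(r[0] for r in ranges)
    latest = max(r[1] for r in ranges)
    div.set(MIN_ATTR, earliest.isoformat())
    div.set(MAX_ATTR, latest.isoformat())
    return earliest, latest


chapter = ET.fromstring(SAMPLE)
apply_parent_ranges(chapter)
print(chapter.get(MIN_ATTR), chapter.get(MAX_ATTR))
```

Because the function recurses, a compilation would inherit its range from its chapters, which in turn inherit from their subchapters and documents. Sections with no dated descendants are simply skipped in this sketch.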
We analyzed front and back matter sections, applying `div/@subtype` as well as `div/@frus:doc-dateTime-min` and `div/@frus:doc-dateTime-max`. 2,539 (100%) sections in front and back matter now have searchable dates.
(For more on completed work and future development, please visit Issue Tracking.)
Previous: Phase 1 | Next: Date Encoding Practices