XmsDescDataCreation - chigc/documentation GitHub Wiki
The date that appears in this field comes from the following sources
- A readily available online catalogue (i.e., manuscripta.at for Austria, archivesetmanuscrits.bnf.fr for France, manuscripta-mediaevalia.de for Germany). The date is copy and pasted exactly as it appears on the web page. If there are multiple dates for different parts of the manuscript, ''only'' the date for the folios containing the compendium is taken. If there are multiple unspecified dates then all dates are recorded.
- If there is no online catalogue, the date comes from the most recent printed catalogue (often digitized, linked in the Erfassungstabelle), also copied exactly as it appears in the catalogue entry.
- If there is no readily available catalogue entry the date comes from other literature. The author of the source is indicated in square brackets [ ], for example: XIII century first third [Moore]. The full title of the source appears in the notes.
- Piggin is the last resort if there is no other source. The date is copy and pasted from the table on this website: https://www.piggin.net/stemmahist/petercatalog.htm from the column "Notes", followed in our table by [Piggin]
- If the information in the literature is clearly wrong or very vague (e.g., 12th or 13th century) than we may give a date based on our own assessment, indicated in brackets with our initials [EG]
In this field we give regularised/normalised values in the form [yyyy] or [yyyy-yyyy]. The rules of transformation are
1. Quarters
- First quarter of the 13th century; 1. Viertel 13. Jahrhundert; 13. Jh.¹ = 1200-1224
- Second quarter of the 13th century; 2. Viertel 13. Jahrhundert; 13. Jh.² = 1225-1249
- Third quarter of the 13th century; 3. Viertel 13. Jahrhundert; 13. Jh.³ = 1250-1274
- Fourth quarter of the 13th century; 4. Viertel 13. Jahrhundert; 13. Jh.⁴ = 1275-1299
2. Halfs
- First half of the thirteenth century; 1. Hälfte 13. Jahrhundert = 1200-1249
- Second half of the 13th century; 2. Hälfte 13. Jahrhundert = 1250-1299
3. Thirds
- First third of the 13th century; 1. Drittel 13. Jh. = 1200-1233
- Second third of the 13th century; 2. Drittel 13. Jh. = 1234-1266
- Last third of the 13th century; 3. Drittel 13. Jh. = 1267-1299
4. Beginning / End
- Beginning of the 13th century; Anfang 13. Jahrhundert; début XIIIe; s. xiii in. = 1200-1224
- End of the thirteenth century; Ende 13. Jahrhundert; fin XIIIe; s. xiii ex. = 1275-1299
5. Middle
- Middle of the thirteenth century; Mitte 13. Jahrhundert = 1225-1274
6. Approximate Dates
- um 1300 = 1290-1309
- c. 1300 = 1290-1309
- ca. 1300 = 1290-1309
- um 1350 = 1340-1359
The place of origin comes from the following sources
- A readily available online catalogue (i.e., manuscripta.at for Austria, archivesetmanuscrits.bnf.fr for France, manuscripta-mediaevalia.de for Germany).
- If there is no online catalogue, the place of origin comes from the most recent printed catalogue (often digitized, linked in the Erfassungstabelle), also copied exactly as it appears in the catalogue entry.
- If there is no readily available catalogue entry the place of origin comes from other literature.
- If there is no literature, or if the place of origin is clearly wrong, we may give a place of origin based on our own assessment
The normalised place of origin gives city, region, and country information
- If only a city is given, the normalised entry also gives country
- If only a region is given, the normalised entry also gives country
- Names are given in English according to present-day regions
- Tegernsee --> Tegernsee, Germany
- Bayern --> Bavaria, Germany
- Westdeutschland --> Western Germany
- Nordfrankreich --> Northern France
- Böhmen --> Czech Republic
-
ISO 639-1 (2 letter abbr., see ISO 639-1) is used for language abbreviations.
-
If ISO 639-1 has no language-code for a language, the abbr. - as long at it is not already taken for another language - consists of the first two letters of the language (e.g. al. for alemannic).
-
If a Manuscript holds more than one language, all languages are to be separated with commas in the Erfassungstabelle and the main language should be entered first (will be rendered to mainLang).
- The Abbrevation of countrynames follows the ISO 3166-1 Alpha 3 standard, which consists of 3 letters.
The PIDs of the manuscripts consist of two parts:
[place][institution]-[signature of manuscript in this location]
The rules for the creation are as follows:
- [place] = first letter of placename, e.g. Graz --> g
- [institution] = two letters of institution, e.g. Universitätsbibliothek --> ub
- [signature]:
- ommit "ms", "cod" and the like;
- normalize all special characters to hyphen
- keep collection if necessary (number of letters as seems appropriate);
- keep "fol" as fol and 2° as 2;
- -nd for manuscripts with no signature
- multiple manuscripts in one library without a signature - -nd + consecutive number --> -nd-1, -nd-2, etc.
- 3 letter abbrevation of [place][instituion] already taken? Workarounds can be as follows:
- three letter for city and none for library --> Alencon bibliotheque municipal & Auxerre bibliotheque municipal = ale & aux
- other letters for [institution] --> Melk Benediktinerstift = mbs BUT München Bayerische Staatsbibliothek = mbb
- use three letters for [institution] and none for place --> Rijsmuseum Meermanno-Westreenianum in Den Haag = rmw
- etc.
A list of all already used [place][institution] abbrevations can be found here.
In order to visualise the amount of Manuscripts at specific locations on a map, the location part of the PID has to be unique
The initial data collection was carried out using a Google Spreadsheet. The following transformation workflow was created to convert the data semi-automatically into XML/TEI, which serves as the source data for the edition.
- Download of Table from Google Drive
- Delete Unnecessary rows (not! columns) and sheets, e.g. header-row, legend of color-coding, etc.)
- Conversion of the Spreadheet table to a TEI-Table with TEIGarage.
- Transformation of TEI-Table to TEI-Files with chigc.xsl
- all information of Spreadsheet represented in the TEIHeader
- One TEI-File per row ( = One TEI-File per manuscript)
- check on correct transformation of dates (esp. when manuscript has multiple dates)
- check on correct transformation of Version / Extension
- Different levels of data collection
- Erfassungstabelle --> "basis metadata collection"
- Erfassungstabelle + Info from Form --> "rich metadata collection"
- Github Account & writing rights to the data repository
- OxyGen Editor
- Github Desktop (Introduction to Github Desktop can be found here)
- Clone software and data repository from Github.
- Associate the framework in the Oxygen Editor (see Wiki, on how to do this)
- Configure OxyGen Editor:
Create Oxygen Project vor easier access to the files (see here)
Set "Author" as default view (see here)
- Decide, who is working on which manuscript to minimize the risk of merge conflicts
- Open Github Desktop
- Fetch (and Pull) Software Repository, to keep it up to date
- Fetch (and Pull) Data Repository, to have access to the newest changes
- Open OxyGen Editor and select the manuscript in the project view
- Fill the data into the Form (Framework)
- Change the “Status” on the bottom of the Page, if a phase is done
Data Recording of a new Manuscript (DONE)
Review of Content (DONE)
Publicationready
- Open Github Desktop
- Commit and Push your changes to the Repository on Github
!! Commit and Push your changes regularly, at least at the end of the workday!!!
- Send us (Team Graz) an e-mail with the following information on the manuscript:
- City
- Name of Institution, Library or Archive
- Signature
- Team Graz will create the initial TEI for you on GH and inform you, that the file is fetchable and ready to be worked on.
- They need to send us their GitHub credentials in order to give them the necessary permissions.
- Add ID's everywhere - even if the item is not in the manuscript and you chose 'no' in the dropdown;
- Not every field has it's own note field, sometimes only a notefield for the whole section is provided - if you want to take notes to a specific item in those sections, not them down as follows --> {ItemName1}: observation; {ItemName2}: observation; e.g.
Color Coding: somethingsomething; Letter Coding: somethingsomething
Some ID's might not be so easy to find, because the name in the ID is written differently. Below is a list of those special cases:
- Zedekiah : #p-sedechia