December 20, 2023

See time zone conversion
Meeting norms
Present: Crystal, Deborah, Adam, Benjamin, Laura, Sofia, Sita, Theo, Cypress, Junghae, Ebe, Jian
Notes: Theo
Time: Crystal

Announcements (5)

No meeting next week. Happy Holidays!

Aggregates (85)

Presentation from Deborah

Pre-meeting Notes:

areas we plan to address:
- Examples from MARC records
- Identification markers
- Mapping element values
- Questions that have come up in the process
Moving forward with aggregates
- Where to track questions
  - The existing Aggregates discussion?
  - A new discussion for each question?
    - 📢 Decision: We will add discussions for each aggregates question, and track them using a new label for questions about aggregates.
    - Sample discussion
  - The Decisions Index for tracking decisions + link to relevant discussion
- When should we discuss questions that have already been raised: Immediately, later, start on some today?
- Start on own transformation application profiles for:
  - SES, e.g., AAPs; subject headings
  - VES, e.g., Extent of manifestation
- Special kinds of resource descriptions
  - Aggregate manifestations
  - Single expression manifestations
  - Reproduction manifestations
  - Collection manifestations
  - Diachronic works embodied in any of those manifestations

Meeting discussion included the following

Deborah's question: is this approach to aggregates generally useful?
- Theo: yes
- Sofia: yes, for more than just the transformation logic:
  - start a list of MARC21 fields related to aggregates;
  - National Library of Greece is about to release guidelines on cataloging aggregates and Deborah's presentations make strong contributions
  - can translate Deborah's presentation and share with the NLG team
    - Deborah: not ready to share; it's a quick-and-dirty presentation riddled with errors; better to work toward a more polished presentation; however, it could be shared with internal colleagues only at NLG but it should include a disclaimer
- Laura: would like to see more RDF triples represented not just as visualized graphs or as entity/relationship diagrams
  - Deborah notes the presentation currently represents the triples using an Excel table; it is possible, using RIMMF, to enter the data and output some RDF serialization, which can be done for some version of the presentation
Slides from Dec 13 have not yet been loaded to Google Drive; that will happen soon
Tracking questions: where and how
- Where: Github
- Use discussions
- Use multiple discussions, preferably one question per discussion
  - Possible shortcoming of discussions: may not be able to add to project boards
- Use a Github label to distinguish these discussions as "Aggregate Question"
  - Confirmed: can add a label to a discussion
- To get started, create a template question
Presentation begins at 12:36 of the meeting recording
Reviewed previous week's example: single expression manifestation (not an aggregate) using the MARC record for Emma, White's Books, 2009
- Assumptions include:
  - single main entry (MARC 100), resource is textual (MARC LDR), assume person author
Next example (15:16): Augmentation aggregate: aggregate manifestation + 1 aggregated (augmented) expression/work + 2 agents using the MARC record for Emma, Airmont Pub., 1966
- Scenario: cataloger opts to describe the augmented work but not the introduction separately
- Entities anticipated: 2 persons (author and contributor/agent), aggregated/augmented work, aggregated/augmented expression, aggregate manifestation
- How do we know this MARC record describes an aggregate?
  - 1 augmentation marker found: MARC 700 $e "writer of"
- Also noted:
  - 1 person in main entry
  - no collection markers
  - no parallel markers
- After deciding this must be an augmentation, what distinguishes treatment of augmentations? What assumptions can we make? This will be logic passed to the programmers: IF [something in MARC record] THEN [something in RDA/RDF]
  - In this case, note it is a single expression and only 1 augmented work
    - The 008/35-37 value will be part of the expression description
    - if MARC LRD/06 = a = text;
      - that is, content type for the work is text
  - Subject headings
    - In general, for aggregates, we do not know what they apply to. Aggregated work or aggregating work?
      - Some or all might apply to aggregating work, not to each aggregated work
    - However, as an augmentation, we can assume the MARC record is only describing the primary augmented work and, therefore, the subject headings are for that one work
  - Consider: 700 $e states specifically that Duffy is the writer of introduction; RDA output from this, if we only describe the augmentation by choice, only allows us to use the relation contributorPersonOfText.
    - NLG uses $i and $4 alongside $e; they think, someday, there may be narrower elements to contributorPerson; even if not, $e values can be useful in descriptions of Works and Expressions.
    - "writer of" allows us to assume a writer of text and so use contributorPersonOfText rather than contributorPersonOfAggregate
  - Additional mappings that are possible:
    - 245$c --> statementOfResponsibilityRelatingToTitleProper
    - (260$a) or (008/15-17) or (008/17) --> placeOfPublication
    - (LDR/07 = m) and (260$c not '-') --> extensionPlan = static
    - (see 27:25 for more)
Next example (31:50): Aggregate manifestation + aggregating work + 1 agent from MARC record for Understanding FRBR, Libraries Unlimited, 2007
- Has collection markers
  - 505$a or 505$r with '/'
- Has augmentation markers
  - 504 bibliography note
    - safe to assume bib notes are not part of the work?
- MARC record describes the aggregating work (and expression) as well as the aggregate manifestation
- Cannot describe any of the aggregated works: there are no analytical added entries
- (Many MARC subfields mapped to RDA element; see slides)
- Can create a aggregating work; use representative expression (RE) elements
- Subject for the whole (since we're describing aggregating work)
- For any aggregating work, any person involved in the realization of the plan (not the creation of the plan, however) can be the value of relatedPersonOfWork
- Additional elements mapped (see 40:17) include:
  - 245$b --> otherTitleInformation
    - uses the colon as the marker
  - 260$c contains 'c' -->copyrightDate
  - 300$a contains 'p.' --> modeOfIssuance = 'single unit'
  - 300$a --> extentStatement = '1 volume'
  - 300$c --> dimensions
Next example (41:14): Aggregate manifestation + aggregating work + 2 agents from MARC record for My green hills of Jamaica amd five Jamaican short stories, Heinemann, 1979
- Collection markers
  - 245 contains the collection term "short stories"
    - recommended: compile list of collection terms
- Augmentation markers
  - 245$c contains augmentation term "with a " or "with an "
  - 504 bibliography
- What can we safely assume in the MARC?
  - aggregate manifestation
    - relate people using contributorPersonOfText
  - 2 contributors
  - aggregating work
    - not an augmentation and not a parallel
      - this is a method used to interpret MARC records: MARC tells me nothing about this, therefore that must be true
    - all subject headings describe the aggregating work
    - representative expression elements for contentType and language
    - relate people using relatedPersonOfWork
- Cannot create any descriptions of aggregated works as there is a lack of MARC 700 with analytical entries
- No matter what, if subject headings exist for aggregated works, we won't be able to determine which heading goes with which aggregated work; as a result, subject headings for aggregates are only useful in the descriptions of the aggregating works
- Additional elements (48:12) include:
  - both agents
  - same as seen above
Next example (49:53): aggregate manifestation + aggregating work + agent from MARC record for Speechless, True North, 2005.
- Collection markers
  - LDR/06 + 008/18-19 + 505$a contains a list of titles
  - An album is a collection of musical works
- No parallel markers
- No augmentation markers
- So we can say this MARC record describes an aggregating work, what can we pull out?
  - aggregate manifestation
  - Bruce Cockburn: may be the aggregator, but it does nt seem safe to always assume the performer/songwriter is always the aggregator; safer to map to contributorPersonToAggregate
    - Can also be in work description as relatedPersonOfWork
  - aggregating work
  - additional elements (58:25)
    - values for things like carrierType are likely taken from a VES; do we want to convert to another VES?
    - do we want to tidy-up abbreviations?
    - do we want to clean up 650 values that are genre headings?
Next example (1:00:03): aggregate manifestation + aggregating work + 2 agents for MARC record for English prose, 1600-1660, Holt, Rinehart and Winston, 1965
- collection markers
  - 100 $e contains 'ed.'
    - one of many possible $e qualifiers safe to use as collection markers, which include: writer of, editor of, etc.
      - 'editor' is not safe; however, under old rules, would we ever enter 'editor' in any way as a main entry for only an editor of text? Probably not!
        
        Note: definitely not safe in a MARC 700 unless it says 'joint ed.': joint is only used as an accompaniment to main entry author; also, it is a pre-AACR2 entry
  - given the above on editors, we can assume, in this MARC record, that Harris and Husain play the same role
- no parallel markers
- augmentation markers
  - 504 bibliography note again.
- What cannot be assumed about this MARC record:
  - There are 9 authors; we cannot relate them all to the appropriate RDA bibliographic entities, they're trapped in a 505 note; were they entered in 7XX fields, we might be able to
  - The subject heading, for this MARC record, analyzed by a human, applies to all aggregated works; however, that cannot be assumed by the machine, so, as usual, we can only apply the subject heading to the aggregating work
- What should be considered for inclusion in our MARC-to-RDA output:
  - 504$a --> noteOnManifestation should include boilerplate; Deborah uses "Includes: ..."
- What we can assume about this MARC record is nothing we haven't already seen
Next example (1:05:29): aggregate manifestation + 3 aggregated/collected expressions/works with a single author from MARC record for Sense and Sensibility ; Emma ; and Persuasion, Thomas Nelson, 1903.
- collection markers
  - another iffy one: 245$a contains ' ;'
  - something we used to do: 300$a contains '[2-9] v. in 1 v.'
  - If we do not use these collection markers, we will create RDA that says this item is a single expression manifestation
- no parallel markers
- no augmentation markers
- If this is output to RDA as an aggregate/collection:
  - Austen will relate to the work as relatedPersonOfWork
  - contentType and language will be in the work description using representative expression elements
  - Note from note taker: note that there is nothing in the MARC that allows us to describe the 3 aggregated works
Next example (1:08:38) aggregate manifestation + aggregating work + 2 agents + aggregated work + 2 aggregated expressions
- note: Deborah just started looking at this, it will be incomplete
- this would be described as a parallel aggregate; however, note that all parallel aggregates should have collection markers, as a parallel aggregate is a type of collection aggregate: it means more than one expression of a common work is collected
- collection markers
  - 7XX 12 _2 (analytical added entries)
    - These can be used for aggregated work markers too
- parallel markers
  - (> 1 041$a) and (245 contains '= $b')
  - 250$a = 'Bilingual edition'
- augmentation markers
  - 504 bibliography
- recommendation: for all parallel aggregates
  - describe the aggregating work
  - if 700 analytical entries exist, describe the aggregated works and expressions
    - note that, for this MARC record, description of the French language expression of En attendant Godot will be difficult/impossible to create due to the absence of the $1 in the first 700 field (this was common LC practice).
      - Is there any way to find that information elsewhere so that we can add representative expression elements to the work description? Maybe some logic using the 041$h?
- What can we assume:
  - aggregating work (like any other collection aggregating work)
    - includes representative expression elements for language and contentType
  - 2 aggregated expressions
  - 1 aggregated work
  - aggregate manifestation
Some chat spilled into meeting discussion; chat:
- Ebe: What NLNZ has decided to do when we implement is to include language for Manifestation: expression manifested 758 ‡4 http://rdaregistry.info/Elements/m/P30139 ‡i Expression manifested: ‡a Mahy, Margaret. Man whose mother was a pirate. English.
  - Sofia: Interesting! Do you have URIs for all works and expressions? Is this why you use 758 and not 700$t$l?
  - Adam: 758 is particularly intended for linked data
  - Ebe: we were looking at 700s, it got horribly complicated. Discovered 758 through OCLC, allows entry of all the data, and what's not wanted can be stripped (like for National Bibliography)
  - Deb: is this in addition to or instead of the 700?
  - Ebe: There will be some 700s but to be mostly clear about what we've got mostly for aggregates (or expressions)
  - Deb: so in the last example (Beckett), you would have 758s instead of those two 700s?
  - Ebe: yes, I think instead, but not totally sure, will take a look
  - Adam: 758 not intended as access points, just labels; can configure systems as you like however
Presentation ends here
What's the way forward now with aggregates?
- simply incorporate what we've learned into the mapping now and carry on?
- go back to identifying aggregates? As well as what we pull out for aggregates.
  - In RDA, aggregates are just one kind of resource
  - RDA is not about "books" "audio" "video" but, rather, is it an aggregate? Is it a reproduction?
    - Reproductions will be a bear for the transformation; for a brutal example, think about reproductions of diachronic works and how they must and new work and manifestation, not just a new manifestation.
      - Diachronics: we'll have to think about identifying those. And how does that effect whether it's an aggregate or a single expression.
  - in this case we'd be doing concurrently:
    - how we pull data out of MARC fields
    - how do we wrap it
- How about we just continue the discussion on January 3 (next meeting)
Chat included:
- Sofia: We are currently oranizing RDA seminars for Greek librarians and we upload them on youtube. But it is in Greek :) Please share the email for (National Library of New Zealand] training to me.
- Sofia: For extension plan we can also consider 008 position 6, right?
- Laura: “If we mint an AAP for a person” - are there reasons to mint rather than use LCNAF authority….?
- Sofia: ... we are thinking something like this 700 1# $i contributor of text $a name $e writer of introduction $4 RDA property for contributor of text
- Laura: AAP for Manifestation is a new thing to me. Can you point to RDA guidance
- Laura: I think adding the subjects to ag work is something Gordon said not to do but it was awhile ago.
- Sofia: I do not think that RDA dictates how exactly this AAP must be. I think it is a matter of policy about the SES a library chooses.
- Laura: Sh’s should be replaced by genre terms. This is old cataloging right?
  - Adam: No. LCSH is still applied for music in addition to genre/form. These headings include medium of performance and those are not genre/form. Until 382 is added, LCSH couldn't be deleted.
- Ebe: I wish we could have an AAP for Work as: Speechless (Bruce Cockburn) rather than the usual card catalogue version of Cockburn, Bruce, Speechless
  - Adam: That's sort of what FAST does: Speechless (Cockburn, Bruce)
- Laura: If I ruled the world, there would be a class of Theme of content for works of literature including plays, poems and novels…. Perhaps art as well… with a different tag - it might use LCSH but the relation would be different
- Ebe: What NLNZ has decided to do when we implement is to include language for Manifestation: expression manifested 758 ‡4 http://rdaregistry.info/Elements/m/P30139 ‡i Expression manifested: ‡a Mahy, Margaret. Man whose mother was a pirate. English.
  - Sofia: Interesting! Do you have URIs for all works and expressions? Is this why you use 758 and not 700$t$l?
Notetaker thinks this is noteworthy (continued from Dec 20 notes) :
- how Deborah formulated authorized access points (discussion required still about AAPs):
  - Works:
    - AP + PToW
    - PToW + RPoW
  - Expressions: AAPfW + CtoE + LoE
    - The AAPfW need to be formulated before creating an AAPfE
  - Manifestations: TP + NoP + DoP + CT
    - This AAP introduced by RDA
    - RDA makes recommendation on what to use for the base (title proper) plus what can be added as qualifiers
    - elemnts used, their order, and how they're separated will be determined bt local SES's
  - Person:
    - PNoP + DoB
    - PNoP

Action items

Start Github aggregate discussion by
- Crystal: create label "Aggregate Question" or something similar
- Crystal: create template question that can be used as a model

December 13, 2023

See time zone conversion
Meeting norms
Present: Junghae Theo Adam Penny Sita Gordon Crystal Laura Deborah Jian Sofia Ebe
Notes: Theo
Time: Ebe

Announcements (5)

Laura is largely finished with the 533 mapping, but needs consultation about 008 field to go further. There's a special subfield containing 008 values that need to be mapped appropriately; however, the logic of some of the 008 mappings could use some clarification or simplification; not the whole 008, just parts; not format-specific fields except for serial frequency.
- CY and LA exchanged email messages about this. Maybe complete that email exchange -- maybe Crystal could take a look. She already replied, she thinks.
Ebe sent a spreadsheet: National Library of New Zealand subgroup on aggregates produced a document with guidance on aggregates. OK to share with our group: Crystal posted to our Google Drive, "Non-Mapping Materials". However, there are still issues in the document under discussion; it's not ok to share outside this group.

Meeting recordings (5)

Proposal: Keep them all in Google Drive without deleting until we run up against storage limits (could take more than 3 years based on current file sizes), then re-evaluate and store them locally at UW on OneDrive so they can be retrieved when needed
- 📢 AGREED; proposal accepted by group.

Classification numbers (10)

Discussion

Some clarification needed to assign classification mapping to Penny
We know: subject part of class number uses the RDA element hasSubject (rdaw:P10256)
- That gets weird for maps; non-subject stuff may go into $a
What are classification numbers?
- Identifiers for Manifestations? Identifiers for subjects?
- If you classify something, it will almost always be subject information. Sub-classifications can get complex. However, you capture a number as a string and bring it into relation with a Work.
- If you have a map, in the MARC 050, you enter a publication date in the $a.
  - That's subject information: tells you the time period when that map was appropriate
Main problem: you don't know if the subject is an LRM entity (like a person) or a Nomen or Place unless you know the classification scheme.
- So: determine the classification number, detect the scheme; the date is important; note that the meanings of subjects drift over the years, resulting in semantics drift of the same number.
- So: what is the given classification about?
  - It's an identifier, but for what? Place? Person? Manifestation? Work?
    - RDA and LRM say it can be an identifier for anything
    - LRM sidesteps the issue, saying any Work hasSubject with range=Res
    - RDA closes the entities; range becomes RDA Entity; Place can be subject of Work
      - RDA choice:
        
        no range; Dewey URI would work fine here
        
        if entity type of value is known, use appropriate RDA element, like hasSubjectPerson (rdaw:P10261)
      - RDA features a hierarchy of subjects, with hasSubject at the top
Mapping recommendation: use default hasSubject (rdaw:P10256) without concern for range; leave it to somebody else to do the semantic parsing
- this is in part due to the distinction between subject analysis and descriptive cataloging
  - Remember Gordon addressed, in discussion 434, that RDA doesn't deal with subjects or classification
- So: WORK hasSubject "classification number as identifier/nomenString" -OR- WORK hasSubject
  - We would not mint a Nomen here
  - And the classification scheme? That's data provenance.
    - Contrast out treatment of ISBNs where we decided to mint Nomens
      - ISBNs have their own Nomen scheme, with IRIs assigned
        
        Remember there is a DOI template for ISBNs that generates IRIs for ISBNs
    - So do we have to reify the statement?
      - Perhaps; alternative: sub-divide the hasSubject relationship (which RDA will not do): in the transform use a property called something like hasDeweySubject od hasDdcClassificationNumber
        
        UW Libraries did this as a cohort member of LD4P2
  - We can't reasonable mint a Nomen for every classification number in every data store
    - It would be about 9.5 Nomens for every 10 records; it's too much
    - But the class numbers are very useful!
Discussion ends here; should be discussed asynchronously going forward

Aggregates (40)

DescribingAggregates.TransformExamples.202312012.pptx

Topics:

The identified entities and RDA description choices in the examples covered last week
MARC records found for those examples
Identifying makers found in those MARC records
The RDA entities that can be pulled from those MARC records and the elements that can be mapped for those entities, along with short forms of the mapping logic for those elements

Meeting discussion included:

Can a meeting be an aggregator as a collective agent? Maybe, but it's not clear that a meeting can be a creator; if it were a creator, it would be an aggregator. Sofia and Deborah can discuss asynchronously.
Deborah's talk aimed to look over the entities from last week using a simplified ER diagram, in an effort to find markers of aggregates.
For the talk, it's best to look at the video; here are the highlights:
- 27:22. Single expression/manifestation + single expression/work + author; Emma / Jane Austen.
  - absence of markers for aggregates informs us this is a single expression
  - also discussed:
    - expanded aggregate markers for the MARC 546 field (specifically, using the phrase "tete-beche")
    - using MARC 260 for the RDA manifestation's publication information -- or does it have to be creatorOfManifestation?
    - using MARC 006/06 for the RDA expression's contentTypeOfExpression
    - using MARC 008/35-37 for the RDA expression's languageOfExpression (if two languages are not represented in the 041, or if it can be determined there is a single expression in more than one language)
    - When deriving the subject of a work from the MARC 6XX, do we represent the source of the heading as part of the value of as part of the data provenance description?
    - using MARC 100 for the RDA work's authorPerson -- is that possible? Can we determine it is actually an authorPerson and not merely a creatorPerson? Proposed: yes, if the LDR/06=a "Language material" (i.e. text).
      - Maybe not; sometimes compilers are in the 100; sometimes artists are in the 100 for exhibition catalogs; HOWEVER, those examples are aggregates, not single expression manifestations. This may allow only a mapping to relatedPersonOfWork.
      - Proposed: indeed, we can, if the resource is a unitary WEMI stack and a single creator, in which case we can determine that the creator is either an author or an artist (if work has a visual characteristic)
        
        A lookup table could cover the main cases, like when author should be used instead of creator
        
        Warning: there may be complication with children's picture books with no words but cataloged as text! (That's not uncommon.)
    - Proposed: compile a list of the questionable aggregates; dictionaries, encylopedias, atlases, etc.
    - Special cases of ... aggregates?
      - Dictionaries
      - Directories
      - Gordon discusses this at 43:44, proposing a reliable pattern cannot be discerned for determining whether or not a "dictionary" or "directory" is an aggregate and what descriptive elements can be derived from a MARC record.
    - Proposed: RDA person names derived from the MARC 100 cannot be considered preferred names, but, rather, simply nameOfPerson
      - If lookups can be performed, the string for a person (but not for expression or manifestations, and only sometimes for works) can be searched in selected data stores, like authority files, for a URI that points to a description that likely includes a preferred name; in this way, we can avoid performing authority control for persons
Of note:
- how Deborah formulated authorized access points (discussion required still about AAPs):
  - Works: PToW + AP
  - Expressions: AAPfW + CtoE + LoE
    - The AAPfW need to be formulated before creating an AAPfE
  - Manifestations: TP + NoP + DoP + CT
  - Person: PNoP + DoB
- RDA does passes AAP SES's to the community; the SES can be revealed using data provenance (a topic for another day!)

535 Mapping (30)

spreadsheet
issue
sketch document containing some ideas on mapping and minting entities
Meeting discussion included:
- We need to determine if MARC 535 values are about the original or duplicate item, thereby minting an item (which, generally speaking, is minting an item using a MARC note field, which is often frowned upon as a rule); or are MARC 535 values noteOnManifestation and, as such, unstructured values.
- 📢 Group was polled: should we mint items MARC 535 values? Few votes for this. Should we just create manifestation notes? Almost everyone preferred notes.
  - As usual, the RDA noteOnManifestation value should begin with boilerplate as means to differentiate from other such notes. In this case, something like what's in the MARC record: "Location of Originals Note" or "Location of Duplicates Note" (depending on the indicator). However, we need to determine the boilerplate for the full field, combined subfields, or individual subfields.
- Also raised: would we also mint entities for the custodians?
- Note: in the MARC specification, the 535 examples all feature a $3! That means specific parts only are expected to be reproduced.
  - There's no point in minting for something we know very little about - or if the information is valid. The subfield $3 materials specified immediately means that it is not an exemplar of the (whole) manifestation being described.
- Perhaps what's most anticipated as the type of thing being described by a MARC 535: government reports and rare materials. So it shouldn't be a frequently-used field.
- How does minting an item and associating it with a custodian, possibly as another entity, serve users? Another related concern: how does having yet another manifestation note, among many such notes, serve the user?
  - We can anticipate complaints about the many notes; particularly those notes that contain entity-to-entity relationship information (but corresponding entities were not produced as needed to express the relationships).
- Another question: why isn't this a note on item? Because items are not specified in any meaningful way except in text as part of a note. Similar to stating "with illustrations" using noteOnManifestation (as illustrations refer to expressions).
  - 533 is used to create a manifestation description of the reproduction (not yet decided)
  - Perhaps this helps to clarify how the note is indeed a note on manifestation: if 533 can be used for a manifestation description (which we have not yet decided to do) with a 535 location, then the description of this manifestation is based on the original described in the 535 that is held somewhere else.

Action items

Put the 535 decision in the decisions index (Crystal)
Create next week's agenda dedicated to aggregates (Crystal)

December 6, 2023

See time zone conversion
Present: Benjamin Riesenberg, Crystal Yragui, Laura Akerman, Adam Schiff, Deborah Fritz, Laura Akerman, Adam Schiff, Ebe Kartus, Jian Ping Lee, Junghae Lee, Pengyan Sun, Sita Bhagwandin, Sofia Zapounidou
Notes: Benjamin Riesenberg
Time: Benjamin Riesenberg

Announcements

Cypress started onboarding last week, will work on mapping and start meeting with Theo about the transform in January

Aggregates

See Aggregate manifestations and options for describing their embodied expressions/works and related agents

Orientation to slides:
- In these slides, I used some items with interesting titles; we should probably touch on titles for aggregates later
- Might also be useful to find MARC for the examples in the slides, and test the transform on those records to see if it matches what I outline here
- I use the term expression/work for when I refer to both an RDA Work and RDA Expression
- I include notes in these slides to explain what I'm doing, but the notes don't include the relationships, those are in the diagram
- Bear in mind that the diagramming in slides linked above indicates how resources might be described newly--looking at the resource and creating relationships, what we will be able to do transforming from MARC may differ
Emma: I call it a "single-expression manifestation", some people call it a "non-aggregate manifestation"
I said there were three basic options earlier, in slide 4 I present three and a half options
- You wouldn't bother to describe the aggregating expression unless you suspect that there may be more than one aggregate manifestation of that same aggregating expression (paperback/hardback example applies here?)
Understanding FRBR: Collection with 13 aggregated (collected) expressions/works
- My decision here was to describe only the aggregating work, I think this is what people will look for, that there are some subject headings that only apply to the aggregating work
- Brief discussion of 'editor' in RDA, this is an expression-level relationship
Who is the aggregator!? Humans can look at a piece and come to a conclusion--"OK, this editor is the aggregator" but how could a machine do this?
- Unless there is a clear distinction such as 'aggregator' in $e (relator term) for example, doesn't seem feasible to assign aggregator relationship at the work level based on information in a manifestation such as "Edited by Jane Smith" etc.
- A role as aggregator may be made clear by reading an introduction, etc., but the group doesn't expect this relationship to be made clear in a way clear for machine-processing in MARC (note also that $e may also contain 'compiler' which is intended to mean 'aggregator', but now means something different in Official RDA!)
- From chat: "A typical example is proceedings. Editors are definitely NOT the aggregators (...) But in journals, editors are the aggregators."
QUESTION: Is an editor a contributor to the text?
- Editor is an expression-level role, if we know that the resource is text then they are editing text
- If we know that an aggregated expression was edited by someone, we can assign 'contributor person of text' (manifestation level) - 'contributor person to aggregate' is another, more generalized option, also at the manifestation level
- NOTE that it was decided to use element label 'contributor person to aggregate' not 'contributor person to manifestation' to make it clear that the element is for aggregate
- See also 'contributor person of text'
DISCUSSION OF MARC TAG 075 - category of entity
- PCC developing a vocab for use in the tag, based on list used by German Natn'l Library
A non-text: Speechless: The Instrumental Bruce Cockburn
If you know something is aggregate, is it invalid to ignore this and describe it as a non-aggregate?
- You could not mention all of the augmenting stuff (liner notes and the like), and end up with something that looks like a non-aggregate manifestation, but don't say that (in the non-text example) Cockburn is the aggregator! (did he put together the photos and liner notes!? we can still give him as 'related person of (aggregating) work')
- The only way to pick up a specific relationship to Cockburn is in a relationship to an aggregated work (a song)
English Prose: Many many aggregated works, but only nine authors
Two aggregate manifestations for Sense and Sensibility, Emma, and Persuation ...
- A different manifestation of the same content, but nothing about the relationship between the two
- - Another rendering, utilizing an aggregating expression...
Parallel expressions: Beckett - Waiting for Godot
- "NOTE no shortcut to link author of introduction to the aggregating work or aggregating expression" - Wait! What about related person of work?? Yes, OK, that makes sense.

Interesting 👀 from the chat

it would be so great if we got rid of 100 for records coded rda
It can be done. There is nothing in MARC to say that you need a main entry (1XX). Yes you will need to have the first indicator as 0 in the 245
I really like this idea, Ebe! I do not think that I could ever persuade our cataloguers, though...
(...) we are on the cusp of making that decision. We are still discussing it as it will have major ramifications for our National Bibliography. If we do end up going this way then that is how the cataloguers will need to catalogue. I see this decision as moving away for having nothing more than a card catalogue in the cloud

Action items

Next week: Continue aggregates and re-start 535 mapping discussions

November 29, 2023

See time zone conversion
Present: Benjamin Riesenberg, Crystal Yragui, Deborah Fritz, Laura Akerman, Adam Schiff, Ebe Kartus, Gordon Dunsire, Junghae Lee, Pengyan Sun, Sita Bhagwandin
Notes: Benjamin Riesenberg
Time: Crystal Yragui

Roles/Agenda Review

Announcements

Grant update
Penny and Crystal met twice and did a first pass on the 034 mapping. Penny is doing great!
Crystal is onboarding another student, Cypress Payne, today

Aggregates

See TransformingAggregates.20231128 from last week (work in progress, not for external sharing)
See TransformingAggregates.20231129 (work in progress, ask Deborah regarding sharing)

QUESTION: RDA guidance on aggregates seems fuzzy on whether an aggregating expression is needed -- is this needed to use the 'aggregates' element? It seems that an aggregate manifestation is needed, it seems that some work properties are needed... what is needed?
- Once you know you have an aggregate manifestation, you'll always describe this. Your next decision is whether to describe the aggregating work, and possibly the aggregating expression, but only aggregating expression if you need an 'aggregates' relationship to one of the aggregated expressions. You'd only need this relationship if the aggregating work (the exact same plan!) is in more than one manifestation! This would allow for linking the aggregating expressions to more than one manifestation -- an example is paperback/hardback.
- How are aggregated expressions related to aggregate manifestation? Standard relationship, 'expression manifested'... same as for relating aggregating expression to aggregate manifestation
QUESTION: When is it necessary to describe an aggregating work? Only describe when needed? For example, when the subject of another work?
- Not always.
- Three options:
  1. Aggregating work only - for example a collection of 100 poems
  2. Aggregating work and aggregated works that I think are useful (and aggregated expressions) - aggregated works might appear in another manifestation
  3. Only aggregated works - for example Emma with an introduction - I'll add supplementary content for the manifestation description - another use case here is a manifestation with a limited number of aggregated works - "nobody cares about the aggregating work, let's just describe the aggregated works"
- There are really only three decisions -- aggregating only, aggregated only, or both
- "It is simple once you know it"
Do we really need the aggregated expressions? Couldn't we just describe aggregated works?
- Well, you could skip it if you think there's only ever going to be one expression of this work...
- It's the same reason we describe expressions at all
☝ Request: Please review the lists for Conventional Collective Titles (CCT) - see additional details below
- Do you have any terms we could add to the lists; do you know of any other sources for these terms?
  - Seems to be missing some terms, for example 'posters'
  - Yes, please just add to the list
- Do you have any questions about the suitability of any of the terms in the lists?
Why start with collections? Attempting to eliminate everything that is an aggregate until you're left with only records which are for single expressions.
Another question:

What should AAPs for aggregating works be?

Title of work only + usual qualifiers

Title of work + Name of single creator of aggregated works as qualifier + other usual qualifiers?

Name of single creator of aggregated works + Title of work?

Name and CCT

Use authorized access point for work group instead

☝ Please also see lists of collection terms - see additional details below, more on collection terms:
- Is a 'bibliography' always an aggregate?
- Is a 'compendium' always an aggregate? 'Genealogy'? Concordance? Dictionary? Index? ...
LCGFT could also help find aggregates
- Not safe (with 6xx) - most could be assigned to individual expressions
- Some expressions - 'Literature' should only be assigned to collections, for example
- OK, so LCGFT should be combined with other markers to determine status as aggregate
Aside: Need to be careful about which subjects apply to the aggregating work vs. those which apply to aggregated works
- Good news is that all subjects could be applied to aggregating work
- But, subject headings that can be applied to aggregated are limited

Collection terms

Deborah seeking feedback on collection terms that indicate that a collection aggregate is being described.
Terms that are:
- Conventional collective titles in 240
  - non-musical collections ListTextsCCT.txt
  - musical collections ListMusicCCT.txt
  - possible collection terms in 245$b List245bCollectionTerms.txt or 246$c List245cCollectionTerms.txt
  - collection terms from LCGFT in 6XX$a List6XXaLCGFTterms.txt

Action items

November 22, 2023

See time zone conversion
Present: Crystal Yragui, Sofia Zapounidou, Adam Schiff, Sita Bhagwandin, Ebe Kartus, Gordon Dunsire, Laura Akerman, Pengyan Sun, Theo Gerontakos, Benjamin Riesenberg, Deborah Fritz
Notes: Benjamin Riesenberg
Time: Ebe Kartus

Roles/Agenda Review (5)

Announcements (5)

Interest from National Library of Belgium in the project, interest in whether we will be clustering WEMI entities
Theo meeting with NEH to determine whether the MARC2RDA application can go to the next phase; if the grant application goes to a further stage Theo will be asking for letters of support
New student employee, Penny Sun, will be working on this project

BSR Milestone Adjustments (10)

From Laura: "there are only about 60 tags (not counting tasks) that are not obsolete and not in the holdings 8xx range (except for 856 which we must map). If we decided to delay those 8xx holdings range tabs to another phase of the project, we’re getting closer. My mental math says there are 113 other BSR tags in various stages including completion. " 📢 Proposal: Move obsolete and 8XX range aside from 856 to another milestone to make "MVP" BSR mapping goal clearer and closer. Will also help with prioritizing/choosing new tags.

No objections, this will be carried out
We should discuss the deadline for phase 1, if we get a thumbs-up on our grant proposal let's discuss this

533 Mapping Update (15)

See issue, see MARC 533 spec

"If you can't determine whether something is published or not (...) the properties in RDA for publication statement and subproperties and so forth, have really been split into published or unpublished, so you have to pick one"
- Yes, there's a binary categorization which splits manifestations which are manufactured artisanally vs. ones manufactured industrially/mechanically; artisanal manifestations generally have only one item, industrial generally many items (of course there are edge cases)
- Likewise for reproductions, an artisanal reproduction is considered to be a 'copy' (not identical, not a reproduction)
- Is the definition of publishing, "making a mechanical reproduction"? Yes, drifting that way
- The means of production is a large influence on how manifestations are described
- Generally for conversion we may go with RDA properties which are associated with industrial/mechanical production
Marker in 040 may indicate whether something is print-on-demand or photocopy... this may be used for conditional mappings

Aggregates (60)

See Aggregates20231120 Google Drive Folder for today's discussion
See MARC2RDA aggregates discussion
See RDA-Maps Repository aggregates discussion

At Natn'l Library of Greece, we are thinking in terms of providing guidance for one scenario per type of aggregate
We are also planning to create a local field, don't know which yet, probably 339, to explicitly state what kind of aggregate it is
Note that for RDA/RDF we understand 'category of work' to be the element for use in stating the kind of aggregate
Gordon discussed the need for communities to identify practices for how to indicate a type of aggregate, RDA won't provide guidance on this
An idea is to use an element on the manifestation to provide the kind of aggregate, perhaps a more narrow element specifically for indicating kind of aggregate - 📝 note that the current thinking for using has category of work <RDA Terms parallel, augmentation, or collection aggregate> did not meet acceptance from the group
Thinking about options for kinds of aggregates:
- A collection aggregate can be augmented, for example, there's an awful lot of crossover
- So the 'options' currently published in guidance should guide and inform, but not be blindly followed
Basically--basically--3 options for describing aggregates:
- Aggregating work only
- Aggregated work(s) only
- BOTH aggregating and aggregated works
Thus pinning down specific options to follow is tricky!
Deborah presents on aggregates material - see resources and outline below
- A process for filtering out diachronic works and other out-of-scope records
- Discussed specific pattern matching for collection aggregates
Why not use 'work manifested'?
- It is my hope that you will be able to identify an an expression adequately, and describe an expression which is manifested, but if you can't you could fall back on the aggregated work only and use 'work manifested'
❓ What is the difference between an aggregator and a compiler? Which to relate to from the aggregating work?? 📢 Use aggregator, not compiler, to relate to the person who made the plan to aggregate expressions.
- Does 'edited by' mean that that person made the decisions to aggregate expressions of works? Maybe, not necessarily.

From the chat, on aggregating expressions

Don’t we HAVE to mint aggregating expression in order to use the aggregates relationship

Only when we want to use the aggregates relationship

It is a shortcut element, and is only useful when an aggregate manifestation is issued in multiple formats.

So then, we don't need an aggregating expression unless we are describing both the aggregating work and one or more aggregated works?

Correct - Deborah and I advise to forget the aggregating expression in the context of the transform (...) I meant you don't need the aggregating expression at all.

Resources on aggregates, provided outline for DF comments

Initial source file and processing steps* (*Deborah displayed a document detailing processing steps during the meeting, this document is not available at this time)
Pattern matching and results (in process, beginning with collection aggregates)
- .txt files with
  - Statistics
  - Pattern match logic
  - Full MARC records found
- Mapping possibilities
- Questions re:
  - pattern matching logic
  - mapping possibilities
How to track and manage the process
Lists of terms indicating aggregates (beginning with collection aggregates):
- see Term Lists folder
Methodology
Additions, problem terms
Sources of additional lists

November 8, 2023

See time zone conversion
Present: Benjamin Riesenberg, Theo Gerontakos, Junghae Lee, Crystal Yragui, Gordon Dunsire, Sita Bhagwandin, Jian Lee, Ebe Kartus, Laura Akerman
Notes: Theo Gerontakos
Time: Ebe Kartus

Roles/Agenda Review (5)

Announcements (5)

None!

Aggregates Feedback Request (Benjamin) (20 minutes)

See RDA-Maps Repository discussion comment
See Aggregates discussion comment (Points to RDA-MAPs repository discussion comment)

Meeting discussion included the following:

There has been significant response in discussion 180; take a look!
Benjamin and the Sinopia profile team are working on templates for describing aggregates.
Discussion 180 starts with 6 questions about their five options for describing the 3 types of aggregates.
The Sinopia Profile Team includes a coterie of template testers producing sample description sets, which they plan to continue.
They are dividing the 3 types of aggregates across the team fpor which they will produce sample description sets.
The description sets need to be reviewed.
Data will be put in the CaMS Sandbox GitHub repo.
Option D does not exist in Sinopia templates; they're ognoring it for now.
The information about aggregates is spread across multiple documents; Benjamin plans to update the documentation and, in the process, consolidate it.
Ebe forwarded the github discussion to their aggregates team; they're thinking about testing the way their doing aggregates against Benjamin's templates in Snopia. In any case, they;re finishing-up their policy on the treatment of aggregates in MARC.
National Library of New Zealand will be including the element extensionPlan on every single record they create; they'll do the same with modeOfIssuance. They'll also create a MARC tag to indicate the type of aggegate.
Is there an RDA element to decribe type of aggregate?
- In RDA terms, there is a term for each of the three. For example, the term for parallel aggregate.
- We use categoryOfManifestation for this, as discussed by RSC. But RSC does not want to provide any detail to categorizations of manifestations, including a controled vocabulary. Same for all "categories", of works, etc. As a result, there has been a discussion about "kinds." Like kinds of aggregate.
- RDA communities should feel free to elevate a kind to a category. The communities should feel free to use their preferred terms.
- This sort of effort would be a good test of the community resources area recently opened in the toolkit.
Work on 7XX analytics might be interesting here; specifically, how the template would model the similar data in original RDA produced by those templates compared to the output of our mapping.
- Maybe that could be on the template review team's radar. However, the mapping do need to be more worked-out for them to be used at this point.
So let's continue the discussion asynchronously.

7XX Work Party Report-Back (20 minutes)

See notes
What do we think of the relator term and code spreadsheet idea?
The team for this: Crystal, Jian, Ebe, Laura
Looked at 710 with a particular emphasis on how to map analtic entries (when 710 ind2=2).
Proposal: if there's a $1, then in RDA/LRM/RDF:
- mint an IRI for an aggregating Expression
- rdae:P20319 "aggregates"
- $1 value as IRI
Proposal: if there's $0 but no $1, then in RDA/LRM/RDF:
- mint an IRI for an aggregating Expression
- rdae:P20319 "aggregates"
- mint IRI for aggregated Expression
  - rdae:P20310 "has access point for expression"
  - mint IRI for Nomen
    - some property like "has identifier for nomen" = value of $0 as string
    - rdan:P80068 "has nomen string" = concates 710 subfield values as a string
  - rdae:P20231 "has work expressed"
  - mint IRI for that aggregated Work
    - rdaw:P10531 "has creator corporate body of work"
    - mint IRI for corporate body
      - rdaa:P50352 "has related nomen of corporate body"
      - mint IRI for nomen
        
        rdan:P80068 "has nomen string" = $a value as string
Those working on 1XX and 7XX mappings found number of conditions needed to accommodate $e and $4 is overwhelming. Literally thousands of conditions are required to specify the RDA element needed to represent a specific relator term or relationship designator.
- Instead of doing it line-by-line in the spreadsheet, how about we create a lookup table that maps relator terms and codes [as well as relationship designators] to RDA elements for whatever entity is referred to in the 1XX and 7XX fields.
  - In spreadsheet, enter, "see table."
- Ebe can create the prototype using:
  - RDA Registry [for RDA elements]
  - https://www.loc.gov/marc/relators/relaterm.html
  - [maybe something else?]
Reminder: rdae:P20319 "aggregates" is an expression-to-expression relation (domain is Expression, range is Expression). Expressions only aggregate expressions only.
In the model, consider the value of rdae:aggregates:
- 710_2 $a [CorpBodyAppellation] $t [Title] $1 [URI]
- [aggregatingExpression] aggregates [$1 value]
- That is, look at the [$1 value as the object of the triple].
- It needs to be an RDA Expression. Anything else would not be well-formed RDA.
- That isn't something we would ever describe.
  - However, probably in most cases, the thing referred to by the $1 value will not be an RDA entity.
- Does the $1 value dereference to trustworthy data?
- Is the data associated with the $1 value compatible with our data?
- The entry of a $1 IRI suggests it is a well-formed IRI. In our graph, that will be a value and that's all, we link. We don't absorb graphs into our graphs. The additional information is "out there." In terms of wht's out there, we have no control, all we can do is make statements.
- Summary:
  - so does the IRI refer the the entity required (an expression)? We are converting legacy data that is untrustworthy on many levels; here, errors will be inevitable; the MARC data is not RDA.
  - In all cases, at some stage in the processing, there needs to be a test: dereference the IRI to retrieve the rdf:type.
An IRI should be forever; if it dereferences ever, it should always. It should never be deleted or rendered invalid. If it evaluates to rdac:Expression of lrm:Expression, then it's good to go; otherwise it's invalid. * If invalid, IRI must be stringified; data consumers will see that and see that someone is using this identifier (no longer an IRI) to identify an expression. * There is not general agreement on this practice at a time when communities are not situated in an environment conducive to collaboration -- which is what's needed here.
Gordon says we can create a description set for an IRI minted at another institution. He says IRIs are not old-world identifiers where we have to go to the source of the IRI for more information.
- One way to negotiate this: have our description set with the common IRI dereference differently. Like maybe with a handshake service with the source of the IRI.
Are there any 700/710 with ind2=2 and $1?
- If so, what would that be? Wikidata items?
  - Wikidata entities are not RDA entities; most often, the Q number will refer to a work
Thinking about the Sinopia RDA templates: any time we want to link to any of the 4 resource entities, we have to create it. Nothing else out there is modeled as RDA. There's some linked data we can link to and fit it into the data somehow. But if we want them to be RDA entities, we have to describe those resources.
So the $1: if not an RDA entity, what is its relationship to the actual thing being aggregated? Is there any way to represent it in the RDA?
- BF entities not useful here either. And no mapping exists to this day, as BF remains unstable, not singularly owned and maintained, plus classes and entity boundaries are not stable, otherwise a reliable mapping would be created.
  - BF community itself is divided on what $1 means, especially in the context of works and expressions. That's the difference between SVDE and LC BIBFRAME.
- As for Wikidata: perhaps some effort should be put into registering RDA entities and properties in Wikidata.
  - After registration: analyses then statements: for example, RDA E is a sub-class of some Wikidata class.
  - There's a strong foundation for success here: CIDOC-CRM is influential in Wikidata, and CIDOC-CRM has a close relationship with LRM.
- Noteworthy: there are several instances of expressions in VIAF, sometimes with accompanyng work links.
  - derived from name-title authorities?
That seems like plenty of feedback on 7XX with analytic entries
However, concerning the table: Theo agrees it is a good idea.
- Laura commented in the 7XX Work Party notes that beside columns for the relator term and code, a column for the three types of agents (person, corp body, family).
- There is an alignment between the MARC relator codes and the unconstrained RDA properties in the registry. There is also a map between the unconstrained and constrained RDA elements. Both should be reasonably up to date. So run a 2 stage process.
  - We don't know what entities the relators refer to, so the mapping was between relators and unconstrained properties.

535 Mapping (40 minutes)

See spreadsheet
See field spec
See issue

Usually a MARC record describes, for the most part, a manifestation that the cataloging institution has; if there's a 533, that changes everything, because most of the fields in the record are no longer describing the thing the institution has, which is, say, a microform; instead, a record for the original was cloned and a note slapped-in about the microform. That means the manifestation we mint for the microform probably cannot make use of the 245 field, the 260, etc., but has to depend on the contents of the 533 field. It changes the mapping practice and is challenging.
In issue 207 regarding MARC 533, Laura added a link to to "LC-PCC PS for 1.11: Facsimiles and Reproductions, October 23, 2014," which provides some guidance on where, in the MARC record, to describe the original or the reproduction.
- It's going to be challenging to pick apart what MARC field describes what RDA entity.
- There are millions of MARC records (largely by vendors) that describe their specific reproduction using a MARC 533. Like think about microfilm sets alone.
- Mapping will require endless conditions and alternative mappings for each field when 533 is present
- Maybe there's a way to send some of the resulting RDA entities down a special path, like flag them somehow
- Or do we need special pipelines for the transform?
- Or maybe post-procssing will be our best bet
So there's a 533; we'll have 2 manifestations, one for the original and one for the reproduction; our challenge is to figure out what in the MARC record goes where in those manifestation (and possibly related entities) descriptions.
Proposed: "Naked IRI": a waystation between more important links.
- However, naked IRIs cannot exist: there has to be statement(s) saying someting about it
  - So: what entities are we talking about? What class(es)?
In our case we end up with 2 entities, one manifestation each for the reproduction and the original. Agreed.
What subfield do we attach with what entity?
- Description set describing each manifestation will be based on conditional relations between different 5XX elements.
Let's think of this as a two-stage process and that we're in an evolutionary process: we contribute IRIs now to an expanding interconnected global graph, including naked IRIs; later, as another stage, the graph expands
- Naked IRIs will get closed (someone will make statements using that IRI as the subject)
- we build things up, we make contributions, it expands as expected by the open world assumption, nothing is fixed
- part of the evolution: AI processes: it's likely to be routine to match 2-3 billion IRIs at the blink of an eye
- best to lose any fear of naked IRIs
We need an efficient way to map 533 (and, therefore, 535) without exploding our mappings.
- Sift the record for subfields describing the original with its IRI
- Any subfields relating to the item reproduced need to be part of the description for the reproduction
- We have to keep track of which is which in the processing
  - this goes all the way up to processing the 008
- Just get it in the spreadsheet and we can get it in the code
  - However, 533/535 will affect much more than we might have originally thought
    - Therefore conditions will have to appear in other spreadsheets, not just those for 533/535
- Propsed: some kind of variable (or function) to handle descriptions that involve reproductions.
  - Can't process everything at once; either that or postprocess incorrectly processed records
  - XSLT modes might be able to handle it
A lot of libraries are following the [provider-neutral guidelines (https://www.oclc.org/bibformats/en/specialcataloging.html) and describe a reproduction with a record almost exactly like the original except there's a 533 stating only that it's a reproduction without any information about the reproduction
- This is another complexity we'll have to account for
For 535 with ind1=1, mint some manifestations, then run a first pass at this mapping with descriptions minimal featuring the RDA element isReproductionOf; when ind1=2, use some other property (like has equivalent manifestation) to represent the relationship between the original and the reproduction
- if there's a 533, then the description in the MARC record is going to be a description of the original
- Choice:
  - do the above now, with minimal descriptions, and fine-tune it later (like add modes to the XSLT)
    - we'll look at some data, discuss it, then make adjustments
  - do the detailed work now, if we think it will save time
The reproduction reproduces an item; may require the use of "is reproduction of item of." Location is of the item. Item needs to be minted. Depending on indicator, associate location with either the holder of the original or the holder of a reproduction.
- There's a special case, in the MARC specification at field 533, for mixed materials (materials under archival and manuscript control). Something else to take into account.
At the meeting, the document "535 & 533 Sketches" was updated at this point.
phase 2: do we have to go back to the MARC data after we process it? (note taker's note: no, Theo misunderstood what Crystal meant by a two-phase approach; Crystal meant we would adjust the XSLT after we review the first pass, not re-use the original XSLT then process the MARC data a second time; Crystal's idea is good, and it maintains Theo's hope that we'd process the mARC and then nec=ver see it again).

Action items

7XX team will continue mapping the 7XX
Ebe will start a lookup table matching relator terms and relationship designators with RDA elements
Theo should also create what he thinks such a table would look like and compare to 7XX team's version

November 1, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: Theo, Crystal, Sita, Junghae, Laura, Jian, Ebe, Benjamin, Sofia
Notes: Benjamin
Time:

Roles/Agenda Review

Announcements

Work party tomorrow! Crystal will send a reminder

535 Mapping

See spreadsheet
See field spec
See issue

First question: Map indicator 0 and 3, or no?
- Wouldn't be too painful, right? Set out to map, see how it goes
- Some records may have indicator 3! Example, oral history cassette tapes still floating around
What does qualifier AM mean in spec?

Indicator 1 - Specifies additional information about custodian [USMARC only]
0 - Repository (AM) [OBSOLETE, 1984]
3 - Holder of oral tapes (AM) [OBSOLETE, 1984]
Values 0 and 3 were made obsolete when the scope of the field was redefined for the location of originals or duplicates that are housed in a repository different from the repository of the holder of the materials being described. Records created prior to that time may contain the name of the holder of described materials in this field as noted by indicator values 0 or 3.

'AM' = Archival and Manuscript material

❓ Starting questions:

Is 535 going to accompany a 534?
Will 535 and 534 always refer to the same thing??

💬 Discussion

I don't think there's any way to know if the 535 and 534 refer to the same thing... I think this needs to be a 'note on manifestation'
Are we talking about a Manifestation or an Item? When one says 'original' it seems one is talking about an Item... So, how do we turn this into an Item description? And if we can't do that, this is a note on the manifestation (about an Item)
535 indicators should tell us whether that field refers to an original or a duplicate... if it refers to the original, it follows that the 535 is talking about the same thing as 534...
We are using the WSU VE Sandbox MARC Field Search to look for 535 fields with first indicator = 2
Some ideas from Sofia in the chat:
- Indicator 1 534 Original Manifestation - rdam:P30460 "has holding" - Item - rdai:P40162 "has location of item"
- Indicator 2 Manifestation in the record - has holding - 2ndItem - has location of item - 535
Why would we ever need to use the element 'equivalent item' to relate two items which are exemplifications of the same manifestation?
🧠 'related item of manifestation' seems a safe choice for 535 mapping...
But then we would need to provide the location (535 is 'Location of (...)') so we wouldn't avoid needing to mint an item, in order to provide this location
[See draft transformation rules written by Crystal during the meeting]
[See also 534 Sketches]
Significant realization and point of consideration: Seems that mostly 535 is associated with 533! (And, later, "533 and 535 seem to have a problematic relationship")
❓ What if we looked at all of the 5xxs which (can) indicate relationships between manifestations, and looked at these as a whole? Our discussion today is contradicting previous 5xx decisions made...
- 530, 533, 534, 535, ... (and more?)
See also LC-PCC PS for 1.11: Facsimiles and Reproductions
Can we rely at all on form of item code 'r' for reproductions? Probably not for vendor records.
Let's look at the 533 next week

Action items

Look at related issues before next week - 533, 534, 535

October 25, 2023 8:00am - 9:30am PDT

See time zone conversion

Present: sita theo adam crystal jian gordon ebe junghae laura sophia

Notes: theo

Roles/Agenda Review

Agenda approved

Announcements

No announcements

534 Mapping

See spreadsheet
See field spec for 534 See Issue 534

Continued with examples
534$a is work information but there's noplace to move it in RDA/LRM/RDF
Our transform should consider using a function based on an algorithm for tuirning MARC fields into notes, where punctuation is processed consistently; currently this is done on a field-by-field basis
For serials and integrating resources (but not all diachronic works), due to the WEM lock, the original and the reproduction are different works.
- A good reason not to generate IRIs for the 534 field
- Plan: filter those out and create, in the description of the reproduction, only a noteOnManifestation for the 534 field.
- Plan: drop the data. We have no way of knowing what the reproduction reproduces: an article? A section? An ongoing section? Half an article? Part B of an editorial comment?
  - We don't know what serial is being referred to in $x
- 📢 Plan for serials, integrating resources and 534 fields containing $x different from the 022 field: create note only.
What are we going to do about whole/part relationships?
Given this: 534 ##$pOriginally published as a section of:$kNeology,$x0228-913X.
- We would erroneously generate a false manifestation.
  - We agreed to take that risk and clean-up downstream
- For all minted IRIs, there is a strong likelihood of creating a duplicate.
- We're moving legacy data into a linked data environment; ket's strive to create the best linked data we can.
$z ISBNs can contain extraneous information and are not ISBN values necessarily
If a $3 is present, then we do not have an equivalent manifestation!
- Maybe create a note only if there is a $3?
$8 we don't map
$6 is processed the same for all fields
📢 534 MAPPING IS COMPLETE!

Action items

Enter the 534 mapping into the mapping spreadsheet
Think about a field to map next week
- Crystal favors 255
- Any other suggestions can be emaled to Crystal

October 18, 2023 8:00am - 9:30am PDT

See time zone conversion

Present: Laura Sita Benjamin Theo Crystal Adam Gordon Jian Junghae Sofia Ebe

Notes: Theo

Time: Benjamin

Roles/Agenda Review (5)

Announcements (5)

Laura sent an email to Crystal regarding the 020 field. It would be good if Laura could re-send the message.
Benjamin's project is working on descriptions sets for aggregates in Sinopia (Stage).
- They would like more eyes on those.
- Is there a good place to share those so m2r people can view?
  - Of course they're in Sinopia, but elements are recorded using the opaque identifiers
  - Can also save them somewhere with labels
    📢 Save in the aggregates discussion, discussion 354

008 Review (10)

Wrapped up 008 review last week: do we need someone to go through and make sure we were consistent with ourselves throughout, or is this built into Theo's transformation-writing workflow?
📢 this is built into Theo's transformation-writing workflow!
Benjamin finished their work from last week (Thank you Benjamin!) 🥳

534 Mapping (70)

See spreadsheet
See field spec for 534

When a MARC 534 field is present, when, if ever, do we mint an IRI for an additional manifestation?
Remember the MARC record is for the reproduction; the 534 is a note on the original
If we create a manifestation for the original and the reproduction, there should be a lot of metadata in the MARC record shared by both manifestations.
For example, if there is a work or expression for the reproduction, those will be the same for the original.
- It is possible the two manifestations are two different expressions of the same work, but this is not a preferred use of the MARC fireld.
If we create a manifestation for the original, it's title will be 534$t if present, otherwise it will be 245$a.
- There are complications
- If reating a title statement for the original, usually the 245 only has a chance to suffice; the 534$t will not have other title information.
  - Could also find the elsewhere-created full MARC description of the original to determine the title statement
A lot of 534 values are values like a representation of "note on verso."
534 values will always be strings; any values transformed into linked data would benefit from significant reconciliation efforts. However, often there is simply not enough information about the original for reconciliation efforts.
Looking at the MARC specification, the purpose of the field seems pretty clear: we are describing a reproduction and the 534 describes the original. They will have the same expression, same layout, same order.
Whatever info is missing in 534 describing the original should be present elsewhere in the MARC record.
📢 The property in RDA intended to provide the same descriptive purpose as 534 is rdam:P30024, "equivalent manifestation."
- The definition of rdam:P30024 says the equivalent manifestations embody a common expression.
Minting an IRI for the original is sound linked data practice.
- Likely the original will be described in some other description set; thus, if minting IRIs for MARC 534 manifestations, we will necessarily be creating some duplicates. Also these duplicates will be related to newly-created works and expressions, causing additional duplication.
  - Mass reconciliation is a fact of life in the linked data world.
  - AI tools may come to the rescue for the mass effort.
An alternative to minting IRIs for the originals: create a note on manifestation for the reproduction.
More byproducts of transforming from string-based data stores to thing-based data stores:
- Mass reconciliation
- Deciding when to maintain the string values and when to mint an IRI
- decide whether or not we're performing single-entity cataloging.
At the meeting, some examples were worked-out and recorded in Issue 208
Because conformant RDA requires an appellation for each RDA entity, there are times we will need to record a stringified identifier. It could happen here, with originals.
- could be the direct value of an element that expects a value that is an identifier string; could be a nomenString for a newly minted nomen.
- However, in the case of 534, we anticipate another appellation will be present, so the stringified identifier may not be needed.
If we're not minting nomens for the reproduction's 245 field, then we should not mint nomens for the original's 534$t or 245.
The 534 $c presents some difficulty: is it publication, production, or distribution information?
- We anticipate 534 $c will almost always be publication information; if there's production/distribution information, it will be in addition to publication information.
What complications arise when a MARC 240 is present?
What complications arise when we're describing a reproduction of an aggregate manifestation?

Action items

As time permits, please review the descriptions sets for aggregates in Sinopia (Stage).

October 11, 2023 8:00am - 9:30am PDT

See time zone conversion

Present: Benjamin Riesenberg, Crystal Yragui, Adam Schiff, Laura Akerman, Ebe Kartus, Jian Ping Lee, Junghae Lee, Sita Bhagwandin, Sofia Zapounidou
Notes: Benjamin Riesenberg

Announcements

7XX work Party scheduled! Crystal, Laura, and Jian will be there. Others interested in 7XX mappings are welcome. Use this Zoom link to join. 9:30am - 11am Pacific time, Thursday November 2

008 Mapping review

See spreadsheet

What is the right element for mapping [Category of Material =] VISUAL MATERIALS, [Character Position Label =] Type of visual material, [Code Value Label =] Microscope slide?
- Is this a category of work or a category of manifestation??
- Viewing RDA Toolkit > Guidance > Entity boundaries > Work...
- Does a change in carrier type denote a new work? No
- We believe that 'has category of manifestation' is the correct element to use here
- See spreadsheet for Transformation Notes
IRI values for 'has carrier type' (P30001) vs. 'has category of manifestation' (P30335)
- Can an IRI from the RDA Carrier Type vocabulary be a value for the RDA/RDF property 'has category of manifestation'?
- For 'has carrier type', as a rule, we use IRIs from the RDA Carrier Type vocabulary*
- Do we always use UWLSWD MARC 008 IRIs for 'has category of manifestation'?
- *Ha! Here's a case: RDA Carrier Type overhead transparency does not match Visual Materials / MARC 008/33 / t-Transparency, so we will use UWLSWD MARC 008 IRI for 'has carrier type' in this case
- Thus it seems that generally we use RDA Carrier Type values for 'has carrier type' and UWLSWD MARC 008 values for 'has category of manifestation' but this rule is sometimes broken
Discussion of https://doi.org/10.6069/uwlswd.eje7-jq11#z
- Considering mapping VISUAL MATERIALS 008/34 - Technique - is this value useful?
- Property is 'has nature of content'
- Group members discussed the limitations in usefulness of the label for this resource, considering a display for users which might look like:

Nature of content: Other technique

Continuing discussion of https://doi.org/10.6069/uwlswd.eje7-jq11#z
- Group members considered changing the label for this resource to something like 'other than animation or live action' or 'moving images which are neither animation nor live action'
- Group members decided to output both the UWLSWD MARC 008 IRI https://doi.org/10.6069/uwlswd.eje7-jq11#z and an unstructured value "Moving image technique: neither animation nor live action."
008 mapping is finished!? BUT WAIT, STILL TO-DO is an overall consistency check...
- Benjamin will make some limited fixes based on email exchange with Theo and Crystal
- Group will ask Theo whether he would be able to report inconsistencies while coding the conversion
- We shall check back about finalizing the 008 mapping next week

534 Mapping

See spreadsheet

Looking at the field spec for 534, $b - 'Edition statement of original' threw us off - could the 534 indicate a new expression?
- Our working theory is that the 534 is a different manifestation of the same expression - the original which was used to create the new manifestation (a digital surrogate, for example), had an edition statement which the new manifestation may or may not include
Will we attempt to describe manifestations (mint IRIs for manifestations) described in a 534? What if we only have very minimal information??
- We might create very minimal manifestations that don't make any sense at all 😓
- Perhaps we might consider creating structured literal values for a property such as equivalent manifestation, and leave reconciliation with/creation of manifestation IRIs for a future time
- We might set some conditions which could trigger minting a manifestation if sufficient information exists

October 4, 2023 8:00am - 9:30am PDT

See time zone conversion

Present: junghae theo jian laura sita adam gordon sofia crystal ebe
Notes: theo
Time: ebe

Announcements

Postponing Cypress onboarding by a couple of weeks
Crystal is behind on stuff but hasn't forgotten about 7XX work party

020 identifier review

See comment from Laura
- Proposal: identifiers follow the pattern established in discussion 375.
- That is, value of has identifier for manifestation will not be a literal but an IRI identifying a Nomen instance.
- This will require a change in a number of mappings.
- Laura can identify which MARC fields contain identifiers and therefore will follow the pattern, then, in the spreadsheets, bring the corresponding rows into conformance with the pattern.
- 📢 AGREED: identifiers should be mapped following the pattern, establishing Nomen instances
- note: recording method for these identifiers is indeed "IRI."

008 Mapping review

See spreadsheet

008 mapping started at Row 1067
008 mapping ended at Row 1102, visual materials/type of visual material = q, model.
Benjamin did some spreadsheets and unearthed some issues. Were they resolved? Should the group know about/look at them?
- Yes.
- Benjamin asked if form of material value 's' for electronic should become, in RDA/RDF, the value https://doi.org/10.6069/uwlswd.dh5m-5y16# or http://rdaregistry.info/termList/RDAMediaType/1003
- 📢 ANSWER: http://rdaregistry.info/termList/RDAMediaType/1003
Kit
- typeOfVisualMaterial in this case maps to category of manifestation
- this value ("b") set the stage for the discussion that followed about other values; namely, are we describing a work or manifestation; if the latter, are we describing a carrier type?
- kit also raises the question of what can and cannot br FRBRized. Kit cannot.
- at any rate, kit is not a carrier type. Usually it includes a range of carrier types.
- the correct property to map to: categoryOfManifestation
Obsolete value Electronic videorecording was deemed unmappable.
"Motion picture" is not a description of a Work.
- media type = projection makes sense; we looked, but nothing else made sense. Carrier type was considered.
Microscope slide is not a work; a set of slides with a theme could be considered a work.

Pick a group mapping for next time

📢 FIELD CHOSEN: 534!

Action items

Crystal schedule 7XX work party with Laura and Jian(?)
Laura will work on mapping identifiers

September 27, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: benjamin theo crystal junghae laura sits sofia jian adam ebe
Notes: theo
Time: n/a

Announcements

UW MLIS student Cypress Payne will be joining the project October 11.
- She has been working on the RDA Sinopia profiles project
- undergraduate degree in computer science
- onboarding meeting with Crystal scheduled October 4th
- they will also consudt a "Directed Fieldwork" in cataloging

QUESTION

Laura asks if we should suspend 7XX mapping until after aggregates discsussion.
- No, Crystal says
- Crystal working on 700
- Laura working on 710
- Idea: a 7XX Work Party
  - Crystal and Laura can organize that

Meeting topics check-in

We are nearly done with the 008 mapping review.
Topics for discussion seem to be slowing down, maybe due to conference season.
What to discuss moving forward? Co-work on mappings next? More review?
Idea:
- October: let's do some group mapping
- November: aggregates
- December: data review
"Let's do some group mapping"
- beneficial to clarify how we conduct mapping work in this project. especially for newcomers
- maybe pick a new field to map, like when we did the 490 together
- Theo favors doing "reviews" rather than "fresh fields" in the interest of having mappings marked "done" for the transform.
- The group favors a mixture of both reviews and fresh
- Next week we'll decide on a field

THOUGHTS

Laura notes that Deborah is working on markers for aggregates; however, these and other aggregate-concerns are not reflected in the mapping.
- maybe we should have some indicators for further work?
  - this would include indicators of "aggregateness"
Theo reminds the group that a phase 2 work plan is in progress
- 2024 and 2025 timespan
- hopefuly some grant money will be awarded
Ebe notes that she is involved in efforts to catalog aggregates in a MARC environment
- the effort focuses on what data belongs in what field
- maybe we could use those guidelines, engineer something that could use that information in the mapping
- she will be happy to share that documentation
Laura reminds us that a problem persists mapping identifiers, like with the 010 field
- Zhuo was involved in this previously, but he has a new job now and probably won't be around much.
- after-meeting note: Laura added to discussion 365, "Creating identifiers" to continue the identifier discussion

008 Mapping review

See spreadsheet

Started off at row 1025, Visual Materials, position 23-27
ended at Row 1067, Visual Material, position 33
Discussion included:
- Crystal has question for Cate on Visual Materials 23-27
- we have a missing value for 008/23-27 in the 008 MARC/RDF vocabularies: "m" for script materials
  - post meeting note: David is working on this. There is a set of obsolete values that don't have a home vocabulary, llike "form of item," and "script material is one of them." David plans to add these in October.
- the distinction is not clear between direct electronic and online
  - before the meeting:
    - direct electronic (notation "q") is a media type (on a physical carrier)
    - online (notation "o") is a carrier type (on RDA list as a carrier type; an "onine resource")
  - after the meeting:
    - online = has carrier type (no change from present)
    - direct electronic = has carrier type (change all occurrences in spreadsheet)
    - electronic (notation "s"):
      - do not map twice, once to media, once to carrier; map as has media type.
  - noteworthy: in the beginning, there was only s; then o and q were added as more specific ... "categories"?
  - Sofia: electronic is media type; direct electronic says something about the carrier; electronic does not say something more, direct electronic does. Electronic says something generic about how we access the resource; the other two say something more about the carrier. Note we can have double mappings if we want; also, we can derive media type from the carrier.
- Adam points out OCLC may have entirely wiped-out "s" values
- What is a kit? Not a carrier type, but, rather, a category of manifestation.
Next time: continue 008 mapping, start at row 1067, Visual Materials, position 33, value "a", "Art original"

Action items

Crystal will ask Cate the questions she has for 008/23-27
Benjamin will edit all 008 mapping of electronic/direct electronic/online so that it is done consistently for all types of material.
Theo will investigate missing obsolete values in the MARC/RDF vocabularies.

September 20, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: Gordon Dunsire, Jian Ping Lee, Sita Bhagwandin, Junghae Lee, Ebe Kartus, Laura Akerman, Crystal Yragui, Benjamin Riesenberg

Announcements

Some discussion about SWIB 2023
Some discussion on BFWE 2023

008 mapping review

See spreadsheet

Did we get a reply to the question about 008/30-31 in issue 50?
- Yes, see replies from Cate and Deborah
- Interesting note from Gordon, paraphrased, that in the LRM context, only humans can create Works
On 'has duration': a duration is not an instance of an RDA Timespan
- Ended up using note on manifestation a couple of times because duration is an expression element, and there are some upcoming changes, I think...

Relevant aside

RDA is an integrating resource, it will change, mappings may change

September 13, 2023 : NO MEETING

September 6, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: Benjamin, Crystal, Ebe, Laura, Sita, Sofia
Time: Crystal
Notes: Crystal
Recording: Crystal (Junghae won't be here today!)

Announcements

No meeting next week: Crystal & Adam will be presenting at SWIB in Berlin
Missing space we noticed in the Toolkit and reported to RSC has been fixed, will appear in next Toolkit release. See issue
Sita and Sofia attended IFLA WLIC, and Sofia kindly shared her notes with us

008 mapping review

Spreadsheet
We noticed that we've been inconsistent in our mappings for punched paper tape and multimedia across different record formats. Sita suggested that someone go through the mappings at the end of the review to check for consistency, and bring inconsistencies back to the group for review. Others agreed.

There was some confusion about MUSIC 30-31 "Literary text for sound recordings"

Are we talking about supplementary material/aggregation aggregates? Or, is the literary text the "main work"? Or, is that unclear?
When is code "s" for "sounds" applicable? Work or expression?

For next time:

We left off at music character position 33. Is it category of work?

Action items

Crystal will follow up on MUSIC 008 30-31 in the 008 issue and ping Gordon, Adam, and Cate

August 30 : NO MEETING

August 23, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: sita theo jian junghae crystal benjamin adam laura
Time: not assigned
Notes:

Announcements

No meeting September 13
There's a new map in the RDA Registry, Map from RDA properties to MARC 21 Bibliographic encodings.

Inquiry into mapping MARC X00 fields

Combination of fields in their MARC order should be retained:
- 100 $a $b $c $d $g $q $u
No separate mapping for the three possible values of indicator 1 (entry element: forename, surname, family name).
- in linked data environments, usually will not differentiate inverted names vs. names in direct order
However, ind1=3 will signal special treatment.
- Do we use the class collective agent or family agent? (That is,the entity described in the X00 is an instance of what RDA class?)
  - Family agent is what we would use. We should not need the class collective agent, as it is over-broad.
In doing this, aren't we eliminating the possibility of "round-tripping," that is, MARC-to-RDA then RDA-back-to-MARC, without loss?
- Our task is to map MARC-to-RDA and not worry about round-tripping.

008 mapping review

Decisions recorded in (and discussion reflected in) 008 spreadsheet.
Started at row 742, mixed materials, position 23 form of item.
Ended at row 867, where will will pick up this mapping next time.

Action items

None specified.

August 16, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: Benjamin Riesenberg, Crystal Yragui, Adam Schiff, Laura Akerman, Ebe Kartus, Gordon Dunsire, Jian Lee, Junghae Lee, Sita Bhagwandin, Sofia Zapounidou, Theo Gerontakos
Time: Ebe Kartus
Notes: Benjamin Riesenberg

Announcements

Upcoming IFLA WLIC - Sofia and Sita will be in attendance
Upcoming SWIB 23 - Crystal and Adam will attendance
Discussion on RSC RDA-to-MARC mappings for 700, 710, 711 ...
Deborah out of meetings until November 13, but said she would respond to emails.

Aggregates wrap-up

Do we have enough to go on to continue mapping?

I think I can keep mapping and as questions come up we can answer them, but any problems that prevent work right now? (No comments, so, okay, we'll keep mapping)

Plan to start putting together documentation/requirements for transform in November?

Let the aggregates discussion 'rest' for now...that is, don't attempt to define specific requirements for the transform with regard to aggregates until Deborah is back in November
I think we may not have tackled the issue that the descriptions for aggregating and non-aggregating works are quite different
- For example, the mapping of 245 for agg. vs. non-agg., like looking for 700 12 for example to indicate aggregate
OK, but we want to flag aggregates vs. non-aggregates for transform purposes--that is, I believe the plan is to separate into sets of aggregates and non-aggregates and process those sets
- So we need markers for aggregates , like for example 7xx #2, 7xx $t, ...
- I'd like to avoid duplicating spreadsheets for agg. vs. non-agg! How to do this?? Or do we just need separate spreadsheets
We do have issue #383, wherein we have submitted some possible 'aggregate markers'
How would the transform team like to proceed?
- What about a column for 'possible marker of aggregate': yes/no, for specific fields/subfields?
I'm just thinking about an if statement that would apply to an entire record, something along the lines of:

if condition A or condition B or condition C:
	process using mapping-aggregate 
else:
	process using mapping-non-aggregate-mapping

OK, but I'm not sure what we would actually do for the 'else' here...
So perhaps all we can do now is look for markers of aggregates??

008 mapping review

See spreadsheet '008', we started at Maps > 33-34/Special format characteristics > k/Calendar

Is a calendar a form of work ('category of work')?
What about a puzzle??
- What is the content of a jigsaw of a map? What is the carrier??
- "If you assemble the jigsaw puzzle, you end up with a sheet, that's the carrier type"*
- If you assemble it, you also end up with a map ... (I believe the point here was that 'map' = content, not carrier)
- * The speaker later stated that the carrier type of puzzle should actually be 'object'...
Much discussion of aggregates
- "A carrier that contains two or more distinct expressions"
- OK what's an expression? Well an expression has content type, so usually different content types = different expressions, thus different (distinct) works
Maps > 33-34/Special format characteristics > n/Game
- "I agree that in some cases a game is a separate work, but from this legacy data we can't tell"
We realized that we've put off finding the OMR (soon to be UWLSWD) vocabulary value for [MARC form of item | RDA/RDF has carrier type] microfilm - Theo will add this to spreadsheets

Aggregates Discussion

August 9, 2023 8:00am - 9:30am PDT

See time zone conversion
**Present:adam theo deborah jian junghae sita crystal zhuo benjamin laura ebe gordon sofia Time: Theo
Notes: not applicable today

Announcements

Crystal out of town August 30 and September 13: no meetings those days
Deborah will take a leave from this project and return in November.
- will watch recordings, follow issues, and meeting notes.
- will be available via email.

Aggregates discussion included:

Deborah added to last week's slides
- additional slides have a green background and start at slide 20
Questions from last week on slide 27
Question 1 "If the aggregated content has a title, and a responsible person, could we still use the shortcut leaving out the Work / Expression or would it be unwise?"
- slide 21 addresses some of this.
- Catalogers have to choose what to describe and how
  - example of "how": could describe aggregating and aggregated separately.
- Slide 21 points out the aggregate markers in existing MARC
  - 700 ind2=2 states what goes with what
  - Slide 21 coice: just aggregate Manifestation triples
  - However, if there's no 700, there are fewer choices on what to do with the MARC
- How about MARC 505 ind2=0 (enhanced). Is that usable to establish the relationships? Maybe useful for the transform?
  - It's unstructured. Not authorized access points. Maybe use it for an unstructured title?
  - Certainly useful for a note.
  - Current systems overload the title index with 505 data
When do we absolutely need to know that an information resource is an aggregate?
- slide 22 raises this questions well.
- Metadata quality benefits from distinguishing the aggregating works.
- However can we not create adequate RDA with MARC if we can determine if it best describes an aggregating work?
  - Analytical entries allow us to describe the aggregated works
- There is, however, much more information in a MARC record; for example, the leader (LDR/06 Type of record, Expression information) and 008 (008/35-37, Expression information) may allow us to descrbe the Expression beyong the 700.
Question 2+3: "Your clue to contributor to aggregate was the role of the person, right? Editor?" + "Can we use the following relator term for aggregator? Editor of compilation [edc]."
- What did the editor do? Edit the collection or the text?
- Patterns can be discerned in the MARC data.
  - Deborah has experience with this: parsed 5M LC resords and distinguished records the were aggregates.
    - went a step further: looked for editors
      - when there's an editor, there's usually a collection aggregate.
    - However, not all editors of collection editd a collection
    - When there's an editor and a 505, it's likely a collection
    - When performing an editor search, 143,000 records that did in fact have an editor were not retrieved.
    - possible method: create spreadsheet from the MARC; assess the markers and eliminate resources not marked as aggregates; among the remaining, try to determine what if collection vs. parallel vs. augmenting.
      - Alternative: quick and dirty transformation described in slide 29.
Possible markers of aggregates
- parallel titles, MARC 245 field?
  - Not dependable (reason has to do something with the fact that parallel titles can be only one language)
  - How about 245 with an equal sign and two $a?
    - Also not dependable; Hebrew novels, for example, feature characters speaking English and Hebrew, two title pages (English and Hebrew), all in one work, meaning two $a in 245 with an equal sign -- but not an aggregate. Parallel titles are common in Hebrew literature where there's an English title at the end of the book.
- multiple language codes in MARC 041 field?
  - Not dependable; some single works can be written in multiple languages, like War and Peace
Is it useful to identify aggregating works without any relationship to the aggregated works?
- Maybe it could be enhanced later. For example, retrieve all aggregates and filter to display only those without relationships to aggregated W/E: maybe add info to those as it becomes available? But if we don't mark them, they'll just get lost and never better-described.
  - In other words, distinguish them for administrative purposes.
Probably best to start with facts and work outward
- End users do not need to know anything about aggregates and other features of the data model
- MARC data is not aware of aggregates
- If there is evidence in a MARC record that an aggregate is being described, there must be an aggregating work. This is the one thing we can be sure of.
  - Key information for the aggregating work is in the MARC 245 field.
  - Beyond this, it will be very difficult to determine aggregated expressions
    - Gordon thinks maybe 95% of aggregations will not have sufficient information for this
    - Deborah is investigating this
- In MARC we usually cannot determine what works or expressions are aggregated.
- To go beyond the aggregating work, we need solid evidence that a specific expression has been aggregated.
Recommendation: try not to extrapolate from the past into the future; what we will do in RDA is different fro what RDA we will get out of the MARC data of the past.
Quick and dirty transform method may be the most elegant.
- In short, that just determines the aggregate manifestation, describes the aggregating work, done. But Deborah thinks we can do better. Especially with augmentation aggregates.
  - Slide 23 shows an example of doing better, where we describe the aggregated W/E and not the aggregating work for an augmentation aggregate.
Can MARC be transformed in stages?
- Maybe. Like this (bouncing off slide 23):
  - First describe the aggregating work Emma.
  - Process further, machine can likely determine it is an augmentation aggregate
  - Generate aggregated work for Emma
  - Expression will be more difficult; MARC 7XX provides the best markers.
  - Maybe go deeper later and determine which aggregating works are parallel aggregates.
  - We should not forget about MARC indicators' role in analyzing aggregates.
  - The result will be conditional processing of sets of MARC records.
- Here's a sequence:
  - Is this an aggregate?
  - If yes, what is the aggregating work?
    - Although it doesn't need to be described, we can be sure it was described in the MARC data, and we should use that data
  - Break down into 3 categories
    - Augmentation; if simple and detectable, then generate aggregated work for augmented work
    - Collections; how do you isolate the expressions?
    - Parallels
  - Proceed with conditional processing of specific MARC tags
Consider this: maybe we can deal with 7XX fields completely outside the treatment of aggregates
We know:
- end users don't care about agregates as aggregates
- catalogers do, as they assign different relationships based on aggregating/aggregated, like, for example, between the agent and the resource. In the aggregating work (slide 23) Jane Austen is only a contributor to the aggregate.
  - The WEM structure will allow local systems to process them differently (collocation, etc.)
    - So? A name index is faster than processing the WEM structure.
    - The origins of FRBR: it's about consistency and its role in supporting user needs.
It's probably worth further processing to determine the aggregates
- extract what consistency exists in the legacy data
- knit legacy data as much as possible.
- There will always be requirement for human review. Let's make review smaller; better if machines can do more. Probably worth it in second pass.
Deborah has a list of questions; we'll approach those later; they are mostly about further splitting of the MARC data.
Slide 22
- static aggregating work (know from 504 field)
- 6 editions, all separate aggregating works (due to WE lock).
- cannot record edition in aggregating work description except as a note
- No representative expression element for designation of version
- Not uncommon: aggregating work with same title but different edition
- There is "has designation of version" (rdae:P20572) for describing expressions.
- Should we try to get that representative element added?
  - Was likely considered at same time as designation of version
  - Might be worth raising it, Gordon thinks
  - Might have something to do with versions of the bible (reason the version property was added at all)
Old FRBR approach: pre-FRBR thought edition must be expression level (different editions mean different content). FRBR points out majority of edition do not substantially change content; changes manifestation statements. So FRBR says it's man level data; however, if content changes substantially, it must be recorded at expression level (as other distinguishing characteristics of the expression).
- So there's ambiguity about what edition means.
- However edition for aggregation almost certainly means changes at the expression level
  - Any change at expression level also requires a new work (WE lock)
- So checking on with the technical group at RDA is worthwhile, Gordon believes.
Deborah question: If all aggregate manifestation is linked to aggregating work, then there will be many titles that are exactly the same with no distinguishing charateristics. So should we map all aggregating work AAPs as AAPs for groups?
- Gordon: no, no group involved here.
- Need a clear distinction in SES syntaxes for AAPs
  - SES for manifestation vs SES for work
  - Manifestation
    - primary AP is title then various creation details
  - Work
    - primary AP is creator of work followed by title
- Many aggregating works will not have an aggregator and so no name
  - Yes, no aggregator, no creator. primary AP becomes the title.
- How about date in AP? Will that help?
  - That's difficult; it transports manifestation data up to work level -- and there's no method for that.
  - But isn't date of aggregate manifestation often assumed to be the date of the work?
    - Gordon: I would prefer to distinguish using edition rather than date of publication. THat is, distinguish the aggregating works by inserting the edition in the AAP.
      - Deborah: However, remember, it cannot be part of the expression description, as there is no appropriate element
      - Here we get caught in something not relevant to our dicussion: the French problem with aggregates.
Picture books. Different author, different illustrator: it's an aggregate. But what is same person did both?
Graphic novels also: should we be thinking about them now too?
It's not modeled yet. It's being discussed. But neither RDA nor LRM comes close to resolving this.
- Writer of text gains supremecy in Anglo model; AAP for work features the writer, not the illustrator
- graphic novels turn that upside down, where illustrator is more important.
- There is no complete approach to this (generation of AAPs) in any cataloging standard.
- "Combination works." Like songs; music/lyrics. This is not resolved.
Deborah: but for mapping: find, say, "ill." in 300$b, or maybe a code in the 008; better to treat as aggregates; for example, the text may appear elsewhere with different pictures. Comic books and graphic novels are examples in the amalgamation terminology, which is in RDA, where they're so merged they can't be sepatrated.
Gordon notes: AAP discussion largely irrelevant. What identifies an aggregating work in linked open data is the IRI. So what's meaningful here is the generation of IRIs for aggregating works, not AAPs. AAPs are not required; stringified IRIs fulfill the conformance issue for RDA.
- That is, mint IRI then assert that this IRI has the same IRI stringified as an identifier. That fulfills the requirement.
- But of course an empty IRI is useless...
Deborah asked about Sofia's work on authority mapping. Are we going to mint every aggregating work and, when possible, aggregated expression, or is there any way to get NACO file transformed and in a triple store? If so, then matching processes can be used to retrieve values. Strings would be retrieved but care would need to be taken to get the full string from the authority data.
Laura asks, about slide 22, why even treat that resource as an aggregate? Why not ignore its aggregating nature?
- It's a collection that has been augmented; it cannot be treated as a single work except as aggregating work.
- MARC record seems tolack info for us to determine that, no?
- Editor in $z + 504, then there's sufficient info. Even without the editor, using the augmented and describing static aggregating work, you have done what's correct for this record.
- But for our transform, why not just treat it as a static single work?
  - Then you're describing an expression
  - The static work here is the aggregating work
  - If we apply the quick and dirty method:
    - we mint an iri for the aggregating work
    - we give title proper
    - then the expression that embodies the work
    - then the manifestation
    - We don't have to say anything about "aggregation."
    - The one thing we have to say is about static vs diachronic.
      - We're not going down the rabbit hole of diachronic works just yet
  - For aggregating works: there one or more expressions attached and each expression must have a work. Knowing that, just use the work shortcut.
- Deborah adds, it is important to know it is an aggregating vs. a single work because if it's an aggregating work, then expression elements (language, content type, etc) will be transformed as representative expression elements; if it is a single work, then they will be transformed as expression elements. That's the biggest distinction between the two approaches.
Crystal notes: we will take a break from aggregates until November.

August 2, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: Crystal, Deborah, Junghae, Adam, Benjamin, Laura, Sita, Gordon, Jian, Sofia, Theo, Zhuo, Ebe
Time: Crystal
Notes: Crystal

Aggregates Discussion (80 mins)

Aggregates presentation by Deborah (30 mins)

Presentation explains types of aggregates broadly, skipping a lot of detail for the sake of time
Slides : slides should be kept internal for now, as they are not ready to be shared outside this group.
Aggregate manifestations: At least 3 (but sometimes just two) expressions and their works. Cataloger has a choice of which expressions and works will be described. Only the aggregating expression? Just some or all of the aggregated expressions? All expressions?
- Sometimes just 2: Edge case: serial/series w/aggregating plan where single instances happen to only have one. Still counts as aggregate, along with aggregating expression/work.
Past terminology: comprehensive/analytical/hierarchical description
Categories of aggregate manifestations:
- Augmentation aggregate
- Collection aggregate
- Parallel aggregate
Special elements for aggregates which could help us identify: contributor agent to aggregate, supplementary content, illustrative content, accessibility content, aggregator agent, transformation of, authorized access point for work group. Contents notes go in note on manifestation, if $r or " / ". Representative expression elements of an aggregating work. Subject headings if they apply only to aggregating works
Describing aggregating works:
- Special guidance for description.
- WE-lock. Aggregating work and expression, can be embodied in more than one manifestation. Representative expression elements allow us to describe an expression in the work description set. Don't have to include/describe aggregating expressions for this reason.
- Manifestation: work manifested rather than expression.
- Special guidance on titles of aggregates. Collective titles.

Aggregates presentation by Gordon (20 mins): Aggregate shortcuts in RDA

Aggregate model
LRM shortcut cuts embodies between aggregate manifestation and aggregated expressions. Aggregating expression aggregates aggregated expressions. Not so useful for transform.
RDA primary shortcut. Aggregate manifestation --> aggregating work. Extremely useful for transform. Cuts out aggregating expression entirely.
Contents shortcuts/contributor shortcuts are useful
Combining 3/4 sets of shortcuts simplifies things a lot. Very useful especially when we haven't described aggregated expressions.

Aggregates discussion (30 mins)

According to new RDA, libraries must decide how to handle each type of aggregate. Library can decide case by case how much they will describe. Deborah made a slide on deciding which expressions and works to describe based on category of manifestation.
If the aggregated content has a title and responsible person, could we still use the shortcut leaving out the work/expression or would that be unsafe? And just leave the title in the aggregating manifestation description in a note or something?
Access points for "expressions" in original language set up to look like works. LC practice is not in compliance with RDA. Potential explanation for strange modeling choices in BIBFRAME, leaving out expressions? Creates collocation issues
LC decisions about aggregates won't be compatible with RDA
100 fields are often for aggregated works and expressions, not aggregating. How to map these, especially when $e is not present? What relator ought to be used for an aggregator? "editor of compilation"?
Using 245 + 1XX is not a good way to make access points for aggregates.
Best way forward is to reserve 1XX for aggregates only, or, better, get rid of the concept of a main entry
UNIMARC/INTER-MARC: entity-based MARC. BNF acting as liaison. Problems with aggregates modeling. Lots of initiatives going on for entity-based cataloging

July 26, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: Adam, Crystal, Theo, Deborah, Gordon, Laura, Sita, Sofia
Time:
Notes: Sofia

Announcements

If you haven't already, please get your bio to Crystal soon.
If you have more thoughts on the grant, please add them to discussion 431 today

Aggregates (20 minutes)

How can we identify aggregate manifestations in legacy records?
TG. How do we recognise? What are the elements we need to safely identify an aggregate manifestation?
DF. Important point. Adopt and use the aggregate report terminology which is different than the one we are used to. Many terms have been used about aggregates, and we must use common terminology.
- Regarding identification of aggregates
- identify static vs integrating publication plan
- How do we identify? Which elements are the essential ones?
- single unit or multiple units?
- possible workflow. Identify in MARC21 records that **seem **to describe:
  - successive integrating works, aka serials
  - integrating aggregating works
  - static aggregatting works
- If these works are identified in this order, then a search in the records must be done to identify which elements can be selected as characteristic ones for each case.
GD. Do not delve into successive aggregating works. Focus on static aggregating works. Identify how many expressions are in a record. That is what we are doing so far.
AS. Most staticw works are going to be aggregates
DF. Yes, granularity decisions come in. Which aggregated works/expressions to describe. What to choose. Look for patterns

Discussion will continue next week

008 mapping review (35 minutes)

See spreadsheet 008

Action items

Backburner

July 19, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: Theo Sita Benjamin Gordon Crystal Junghae Adam Deborah Laura Jian Ebe Zhuo
Time: Ebe
Notes:

Announcements

If you haven't already, please get your bio to Crystal soon.
Any further ideas/discussion on goals for a potential grant?
- meeting notes
  - Any thoughts? Enter into Github Discussion 431
  - We hope to consolidate thoughts after the next meeting; if you have any ideas, please record them in the next week.
  - Theo is hoping to "get to the next level" of the grant process sometime next week
Are weekly meetings still working for us? The 90 minute length? If so, Crystal can extend meetings through the fall.
- meeting notes:
  - No objections to once/week 90-minute meetings
  - Crystal will therefore schedule more once/week 90-minute meetings

Aggregates

Review discussion so far, set some goals for August discussions
Would the group be willing to put together a public-facing panel on aggregates?
meeting notes:
- DF may not be able to attend Aug 16 and 23; maybe move the aggregate discussion to Agu 2 and 9?
  - DF will know her availability next week.
- Our discussion is Github Discussion 354. Major topics include:
  - IRIs, especially for Works and Expressions
  - WE lock
  - IRI pollution: dupes, empty entities, etc.
  - Aggregating Works and Expressions
  - Types of aggregates: collection, augmentation, parallel
  - Identifying aggregates in MARC data
  - We had a lively meeting discussion about aggregates on 2022-06-01
  - Aggregates and the MARC 505 field
- Regarding our August discussion:
  - DF:
    1. explain what aggregates are, their peculiarities; get us all on the same page;
    2. then focus on how we can pull aggregates out of a MARC record
  - GD:
    1. compliment what DF has to say as described above;
    2. current situation and future;
    3. triage past practice; lack of post AACR2R discussion on aggregates up to the 3R Project;
    4. limitations of aggregates and the discontent with aggregates in our community
- regarding a panel:
  - maybe something we host, maybe at a conference?
  - should focus on aggregates, not the transformation of "aggregates" in MARC
  - there have been other panels/presentations/discussion about aggregates; they often get bogged-down in details
    - We should steer the discussion to avoid such minutiae
    - People are applying old AACR2-based thinking
  - Panel could support moving toward current and future practice that works better with RDA aggregates
    - One possible focus: how we can describe aggregates in MARC
  - If it would help, let's create a discussion or issue to discuss this forum; anyone can launch that.
resources that may be useful to review going into our aggregates discussion:

Dataset 2 Review wrap-up

comment 8

If we have more to say about 245, lets add it to issue 115

comment 9

"No Place" can be difficult to detect
we can catch the english string "not identified" but that will not catch everything
Are we sure we cannot identify a place with no specific locus?
- Maybe "Planet Earth" but that's not terribly useful
However we detect, it should go in a statement
- project documentation should identify this as a known problem: sometimes "no place" will be recorded as a place using rdam:P30088
Other values that could be used to detect: old ISBD "s.l." and old pre-AACR "n.p."
A list a values that signal "no place"could be compiled
- maybe use in postprocessing
  - is postprocessing the best idea? What about pre-processing the MARC; i.e. perform data clean-up on the MARC
So we can create a few conditions and output something imperfect with a disclaimer.
But the value of the field cannot be in a manifestation publication statement: that is reserved for transcribed values only.
- use publication statement
  - But publication statement is based on the place of publication and may not be appropriate
  - Actually publication statement is best seen as a legacy element. P30088 is not, and we apply different standards

comment 10

weird MARC but transform performed well here.

SPRING 2023 DATA REVIEW IS NOW COMPLETE!

008 mapping review

See spreadsheet 008

meeting notes:
- completed 008/29 (see spreadsheet)
- 008/30 is undefined
- 008/31: no OMR vocabulary for index.
  - Landed on P30137 has supplementary content as appropriate RDA property with an IRI value from id.loc.gov
- 008/32: no need to map these very old positions
- 008/33-34 special format characteristics
  - After some effort determined Photocopy, blue line print OBSOLETE = "blueline process"
- left off at 008/33-34 = c, Row 704

Action items

Continue preparation of Aggregate sessions
Continue drafting and discussing grant proposal

Backburner

Deep issues on the 245 field; they can be entered in issue 115

July 12, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: benjamin theo crystal gordon deborah sits jian adam junghae ebe
Time: benjamin
Notes: theo

Announcements (5 minutes)

Laura is presenting on our project for the LD4 conference a couple of hours after our meeting. Link

Grant groundwork (20 minutes)

Theo and Crystal are planning to apply for a grant. Part of that includes expanding documentation. Please get your bios in to Crystal when you have a chance!
Brainstorm: What do our hopes and dreams look like? If we could have funding for anything, what would it be?

Meeting Notes:

Hire full time workers!
expand documentation
- good practices for creating metadata, especially RDA data; share with others
- how to create more linked-data-friendly MARC; especially MARC destined to become RDA
  - this benefits catalogers still cataloging in MARC21: they can use our guidelines and BPs
Hire someone to be in charge of documentation
Help with conversion code
- make it more "intelligent"; for example, reduce quantity of notes.
Create something about data cleanup, pre-processing, data scrubbing. How to prepare data for the transform. [Documents and tools.]
Use this opportunity to clarify the aim of this project.
- transformation of MARC to RDA? Retro spective conversion of legacy MARC to RDA?
- Possibility: run entire LC backfile (5,000,000 records? into 400,000,000 RF statements?) of legacy MARC through transform and post (preferably to Wikidata) as a one-off process.
  - dump into Wikidata, once it's done it's done
    - Would ut be RDA/RDF in Wikidata?
    - Wikibase has a differrent data model
      - Would we map RDA to Wikibase?
      - Would we establish properties for all RDA properties in Wikidata?
    - Getting the data into Wikidata seems complicated
    - Once in Wikidata, data can be exported from Wikidata as RDA/RDF; it is not stored as RDA/RDF
    - our Wikibase instance might be of assistance here?
  - Others can edit as appropriate
  - We're still in a MARC21 environment; this represents a practical way forward
- Possibility: two prong approach:
  - convert all LC records
  - contribute to a total [hybrid] cataloging environment: how to use MARC21 effectively
- An aspiration: wherever we store, it shold be public; other libraries should be able to create/read/update (maybe not delete?); it would be useful if it were a living RDA data store
Problems
- Current tools may not be powerful enough to process Big Data (i.e. 5 million records)
- We have not yet determined what data we'll convert
- We do not yet know where to store/publish the output of the conversion
  - Sinopia was also considered
Crystal started a discussion about the grant hoping to discuss grant planning, that's discussion 431.

RDA Toolkit Paywall (15 minutes)

What would it take to make the RDA Toolkit an open resource? Is there interest in this group in trying? Is this a position we all share, or not so much?

Meeting Notes

Toolkit will never be free
- Although before 3R ALA made good money on the Toolkit, presently they strive to not lose money, perhaps making a small amount.
  - ALA would prefer ceasing the Toolkit rather than cover the full expense, which is enormous
- The Toolkit is already available on a sliding scale
- Consider the cost of running the Toolkit: international meetings need to be supported -- they are expensive; production cost is high; Toolkit requires an expensive CMS license; other commercial licenses must be purchased, like translation software; cost may be around 1 million per year.
- Additional problem: there are 3 different copyright holders from three different countries. This presents a formidable legal process.
- ALA staff themselves favor this as an open resource but, alas, it cannot be.
- Do note that the RDA Registry is open, and it is even re-usable for commercial use. The Registry accounts for about 60% of the Toolkit contents.
- We have some work trying to determine exactly what constitutes a copyright violation in RDA use and reproduction. RSC will likely be discussing this in Octiber, in conjunction with questions raised by creators of a German cataloging Wiki.
  - There is no RDA copyright police. The main thing that is prohibited is wholesale reproduction of RDA. That has happened on at least one occassion. There is great leniency, however; just re-wording the text sidesteps copyright violation.
- In some ways, opening RDA Toolkit could have undesireable results. Instances of RDA will get developed in site-specific instances, local changes will be introduced, and the RDA living standard will no longer be synched.

Dataset 2 Review

Dataset
Gordon's review comments ** comments on sample dataset 2 > number 7

comment 7

It's just a rotten bit of data.
Could be fixed
However $5 cannot be fixed every time there's an oddity

comment 8

Possible solution: 245 $a has only ISBD terminal punctuation, so all other punctuation can remain. $b is more complicated but that's not the issue here.
We could use some clarification on how to use the 245 for W, E and M.
- We know it must be title of M
- We know the W title is derived from the M, so wouldn't we also use the 245 for title (not preferred) of work?
- Problem: If M embodies more than one W or E, we don't know what the 245 title is a title of.
  - Possible work-around: in description of M, use the property "has work manifested."
- It's useful for the E to have a title also; that's derived from the W; 245 can be used for that as well. In addition, RDA also offers an option to derive the title of E from the title of M.
  - The M title is always an E title, it just may not be the E you're describing
- We did make a decision to not use the 245 for title of W as we cannot determine without doubt the W the title pertains to.
- We do know we should add all titles of M; that property is endlessly repeatable.
- Some inferencing rule may be possible, using "title of manifestation" as source of the W and E titles.
- Aggregations cause problems in titles, both in matching the works aggregated as well as determining the title of the aggregating work.
  - Right; however other optons include: 1 W, 1 M, 2 Es...
- We know: a M is being described. The rest is uncertain.
- Note: instances of M that embody only one work are in the minority.
- Can't we record the title of M then use the same s the title of some work, we know it is some work, mint an IRI, give it that title, and establish a relation between the W and M.
- Yes, title of M is the title of some work. If only one W embodies, it is that W's title. If it is an aggregation, then it is the title of the aggregating W.
- Works must have an appellation of W:
  - title of W
  - access point for W
  - identifier for W

Action items

We need to schedule our long-awaited discussion on aggregates
- We're getting bogged down at times because we have not cleared this up.
- At that session, Deborah could give a presentation
- This meeting will be rough and not suitable for publis consumption (i.e. do not record)
- NOTE: PCC's SCT has a 2 modules on aggregates.
  - What is IFLA/LRM: Module 10: Introduction to aggregates available at https://www.loc.gov/catworkshop/RDA2020/index.html
  - What is IFLA/LRM: Module 11: Modelling Aggregates available at https://www.loc.gov/catworkshop/RDA2020/index.html

Backburner

Next time finish comment 8 and 9 to complete the data review!

July 5, 2023 8:00am - 9:30am PDT

See time zone conversion
Present: Crystal Yragui, Laura Akerman, Theo Gerontakos, Sita Bhagwandin, Ebe Kartus, Gordon Dunsire, Deborah Fritz, Adam Schiff, Junghae Lee, Sofia Zapounidou
Time: Ebe Kartus
Notes: Benjamin Riesenberg

Announcements (5 minutes)

UW preparing to apply for a grant, might require some group discussion in future; grant would potentially extend project through 2026
- Are we close to any milestone(s) with mapping?
- No, but we are laying a lot of groundwork with our discussions, there is reason to think that the mapping work may speed up later

Dataset 2 Review (40 minutes)

Dataset
Gordon's review comments

Waiting for feedback from a music cataloger on the 382
Left off last time on the 490 - see comment number 6 for this
The "standard 'statement' transform" (per comment) is missing the ' ; ' before TR 30
What about the rule that indicates that is something is meant to be read twice that it is included twice? (From guidelines on normalized transcription)
- I don't think that applies to numbering
- But, doesn't this apply to any manifestation statement?
Do we need separate mappings for 490 with first indicator 0 versus 1?
When we mapped this before, I think we decided that the individual elements didn't add any value which wasn't already in the series statement
Problem when you have 'two repeats of the field' - how to pair or associate the right subfields? (Not clear to note-taker, due to lack of MARC knowledge, which 490 subfields are being referred to in this discussion, or even if we are talking about associating a 490 value with a value from another field [880?] entirely...)
Facilitator points out that we already have mappings for the 490
If the ISSN is not a subelement within the series statement, then there is a good argument for throwing it away
- It would be an identifier for the series work
- In the context of the element series statement I would treat the ISSN as other title information
Include 490 $x in mapping, or not??
In 'manifestation series statement' the ISSN is included, but ISSN is not included in 'series statement'
"If we can find a way to include an ISSN that's included in the 490, that's probably a good thing"
Let's look at more transformations of 490s
OK how would we handle this?
- Take all the text, remove the subfields, so input / yield:

490	1#$aLund studies in geography,$x1400-1144 ;$v101$aSer. B, Human geography,$x0076-1478 ;$v48

Lund studies in geography, 1400-1144 ; 101 Ser. B, Human geography, 0076-1478 ; 48

Note that a period is missing in the MARC after 'v101'
To summarize, follow Gordon's recommendation in comments > number 6
- Note that we are only using $a, $x, and $v
For next time we will pick back up at comments on sample dataset 2 > number 7

008 mapping review (35 minutes)

See spreadsheet 008

See mapping details in spreadsheet 008*
'Map bound as part of another work' - is this an aggregate?
- Bound-withs are collections, not aggregates
Found an inconsistency with mapping of 'unknown if item is government publication' between books, computer files, etc. vs. visual materials - this was corrected
Reminder - according to decisions index II.D.1 we prefer values from RDA Registry over those from other sources.
If something has large print, should we assume that it is a volume? No
- Upshot: Map to font size, but not to carrier type
Same with braille, get rid of mapping to carrier type

*Limited access

Backburner

Action items

Next time, continue with comments on test dataset 2 and with 008 review

2023 Meeting Minutes - uwlib-cams/MARC2RDA GitHub Wiki

December 20, 2023

Announcements (5)

Aggregates (85)

Presentation from Deborah

Action items

December 13, 2023

Announcements (5)

Meeting recordings (5)

Classification numbers (10)

Discussion

Aggregates (40)

Topics:

Meeting discussion included:

535 Mapping (30)

Action items

December 6, 2023

Announcements

Aggregates

Interesting 👀 from the chat

Action items

November 29, 2023

Roles/Agenda Review

Announcements

Aggregates

Collection terms

Action items

November 22, 2023

Roles/Agenda Review (5)

Announcements (5)

BSR Milestone Adjustments (10)

533 Mapping Update (15)

Aggregates (60)

From the chat, on aggregating expressions

Resources on aggregates, provided outline for DF comments

November 8, 2023

Roles/Agenda Review (5)

Announcements (5)

Aggregates Feedback Request (Benjamin) (20 minutes)

Meeting discussion included the following:

7XX Work Party Report-Back (20 minutes)

535 Mapping (40 minutes)

Action items

November 1, 2023 8:00am - 9:30am PDT

Roles/Agenda Review

Announcements

535 Mapping

❓ Starting questions:

💬 Discussion

Action items

October 25, 2023 8:00am - 9:30am PDT

Roles/Agenda Review

Announcements

534 Mapping

Action items

October 18, 2023 8:00am - 9:30am PDT

Roles/Agenda Review (5)

Announcements (5)

008 Review (10)

534 Mapping (70)

Action items

October 11, 2023 8:00am - 9:30am PDT

Announcements

008 Mapping review

534 Mapping

October 4, 2023 8:00am - 9:30am PDT

Announcements

020 identifier review

008 Mapping review

Pick a group mapping for next time

Action items

September 27, 2023 8:00am - 9:30am PDT

Announcements

QUESTION

Meeting topics check-in

THOUGHTS

008 Mapping review

Action items

September 20, 2023 8:00am - 9:30am PDT

Announcements