Audiovisual Identifiers & Metadata (Upcoming releases) - BB-Media-IT/Data-Hub GitHub Wiki

This document describes the schemas planned for the upcoming updates of the Audiovisual Identifiers & Metadata product, with an estimated release date of February 28, 2025.


This document details the schemas used to manage the Audiovisual Identifiers & Metadata product data. During the delivery process, a dedicated S3 Bucket is created for each client. Each bucket has a main folder named Sources, which contains files in JSONL format. Within this folder, a subfolder named latest stores the latest snapshot of the Metadata BB database.

Document Structure

This document is organized into the following sections:

  • Details of the S3 Buckets
  • Update frequency and scope
  • File Description

Details of the S3 Buckets

Each client receives an S3 Bucket where data is organized into specific folders to ensure efficient management and quick access. The folder names reflect the types of data they contain, facilitating the identification and specific processing of each data set:

Example

  • s3://bucket-client/Sources

This folder contains a latest subfolder, which is periodically updated with the latest available snapshot, ensuring that customers always have access to the latest information.

Update frequency and scope

  • Data in the S3 Bucket is updated daily.
  • The scope is defined according to the needs of each customer, with Metadata BB guaranteed as a minimum.

🚀 A demo dataset, updated weekly, is available in the bucket s3://bb-media-data/audiovisual-identifier-and-metadata/. Connect to it with the AWS CLI or any S3-compatible client, passing the endpoint https://nyc3.digitaloceanspaces.com through the --endpoint-url parameter.

Command example:

    aws s3 --endpoint-url https://nyc3.digitaloceanspaces.com cp s3://bb-media-data/audiovisual-identifier-and-metadata/ /Demo/BB-Media --recursive
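Once the files are downloaded, each one can be read line by line, because JSONL stores one JSON object per line. The following is a minimal Python sketch of such a reader. The helper name `iter_jsonl` is hypothetical, and no file name is assumed, since this document does not list the file names inside Sources/latest.

```python
import json
from pathlib import Path
from typing import Iterator


def iter_jsonl(path: Path) -> Iterator[dict]:
    """Yield one parsed record per non-empty line of a JSONL file."""
    with path.open(encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines between records
            try:
                yield json.loads(line)
            except json.JSONDecodeError as exc:
                # Report the file and line so a bad record is easy to locate.
                raise ValueError(f"{path}:{line_number}: invalid JSON") from exc
```

Reading lazily with a generator keeps memory usage flat even for large snapshot files; pass the path of any downloaded JSONL file to `iter_jsonl` and loop over the result.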

File Description

We provide a detailed description of the files contained in the Sources folder, explaining the structure and type of data handled by each one. If you wish to see the schemas in YAML, click here.

BBIDs

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| UID | string | The ID created by BB which identifies a group of external IDs. | f6343770f9212ebc396d74d21354ad33 |
| ExternalIds | array | - | View More In ExternalIds |

ExternalIds

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| UID | string | The ID created by BB which identifies a group of external IDs. | f6343770f9212ebc396d74d21354ad33 |
| Provider | string | Identifies the source of the ID. | eidr |
| ProviderType | string | Source type. | Data Base, Social Network, Provider |
| ID | string | The ID from the external provider. | 10.5240/8805-D5B9-3435-1B99-E538-L |
| Type | string | The type assigned by the external provider. | Movie |
| CreatedAt | string | The time when the ID was inserted in our database. | 2023-04-10T20:39:33Z |
| UpdatedAt | string | The time when the ID was last updated in our database. | 2024-04-21T04:59:04Z |
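A common use of the BBIDs file is resolving an external identifier to its BB UID. The sketch below builds a reverse lookup keyed by (Provider, ID). It assumes each BBIDs record is a JSON object with a UID and an ExternalIds array shaped like the two tables above; the sample record is assembled from the example values shown there (ProviderType is left out for brevity), and the function name `build_external_index` is hypothetical.

```python
from typing import Dict, Iterable, Tuple

# Sample record assembled from the example values in the tables above.
SAMPLE_BBID = {
    "UID": "f6343770f9212ebc396d74d21354ad33",
    "ExternalIds": [
        {
            "UID": "f6343770f9212ebc396d74d21354ad33",
            "Provider": "eidr",
            "ID": "10.5240/8805-D5B9-3435-1B99-E538-L",
            "Type": "Movie",
            "CreatedAt": "2023-04-10T20:39:33Z",
            "UpdatedAt": "2024-04-21T04:59:04Z",
        }
    ],
}


def build_external_index(records: Iterable[dict]) -> Dict[Tuple[str, str], str]:
    """Map (Provider, ID) pairs to the BB UID that groups them."""
    index: Dict[Tuple[str, str], str] = {}
    for record in records:
        # Assumption: ExternalIds may be missing or null for some records.
        for external in record.get("ExternalIds") or []:
            index[(external["Provider"], external["ID"])] = record["UID"]
    return index
```

With this index, `build_external_index([SAMPLE_BBID])[("eidr", "10.5240/8805-D5B9-3435-1B99-E538-L")]` returns the UID of the sample record.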

SourcesContents

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| UID | string | Universal BB ID. | f6343770f9212ebc396d74d21354ad33 |
| Title | string | Content title. | Star Wars: The Rise of Skywalker |
| OriginalTitle | string | Title in the original language. | Star Wars: The Rise of Skywalker |
| Year | integer | Year of release. | 2019 |
| Directors | array | List of directors. | View More In Crew |
| CreatedBy | array | List of creators. | View More In Crew |
| Cast | array | List of cast members. | View More In Crew |
| Akas | array | List of localized titles for different regions. | View More In Akas |
| PrimaryCountry | string | Primary production country. | "US" |
| Country | array | List of production countries. | ["US"] |
| PrimaryCompany | string | Primary production company. | "Lucasfilm Ltd." |
| Companies | array | List of companies involved in the production and distribution of this title. | View More In Companies |
| Synopsis | array | List of synopses and their languages. | View More In Synopsis |
| Images | array | URLs to different kinds of images. | View More In Images |
| Videos | array | Trailers or other promotional videos. | View More In Videos |
| OriginalLanguage | string | Original language. | "en" |
| Languages | array | Languages used in the title. | ["en"] |
| ReleaseYears | array | List of release years by territory. | View More In ReleaseYears |
| ReleaseDate | string | Global release date. | 2019-12-18 |
| Duration | integer | Runtime in minutes. | 142 |
| Genres | array | List of genres. | ["Adventure","Action","Science Fiction"] |
| Homepage | string | Website for the content. | https://www.starwars.com/films/star-wars-episode-ix-the-rise-of-skywalker |
| Keywords | array | List of keywords. | ["Space"] |
| Status | string | Content status. | null, Planned, Pilot, In Production, Returning, Canceled, Ended |
| Type | string | Content type. | Movie, TV Show, Episode |
| IsAdult | boolean | Flag indicating whether the title is for adult audiences. | true, false |
| Scripted | boolean | - | true, false |
| Seasons | array | - | View More In Seasons |
| Episode | integer | - | 1 |
| Season | integer | - | 1 |
| Rating | array | Age rating. | PG-13 |
| Hispanic | boolean | - | Hispanic Production, Hispanic Co-Production, No Hispanic |
| CreatedAt | string | Indicates when the record was created. | 2023-04-10T20:39:33Z |
| UpdatedAt | string | Indicates when the record was updated. | 2024-04-21T04:59:04Z |
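The Type column above can be turned into a lightweight sanity check when ingesting SourcesContents records. The following is a sketch only: it covers a subset of the scalar and array fields, treats null or absent values as acceptable (an assumption, suggested by the null value listed for Status), and uses the hypothetical names `EXPECTED_TYPES` and `type_mismatches`.

```python
from typing import Dict, List

# Expected Python types for a subset of SourcesContents fields,
# taken from the Type column of the table above.
EXPECTED_TYPES: Dict[str, type] = {
    "UID": str,
    "Title": str,
    "Year": int,
    "Country": list,
    "ReleaseDate": str,
    "Duration": int,
    "Genres": list,
    "IsAdult": bool,
}


def type_mismatches(record: dict) -> List[str]:
    """Return the names of fields whose value does not match the expected type."""
    problems: List[str] = []
    for field, expected in EXPECTED_TYPES.items():
        value = record.get(field)
        if value is None:
            continue  # assumption: any field may be null or absent
        # bool is a subclass of int in Python, so reject it explicitly for integers.
        if expected is int and isinstance(value, bool):
            problems.append(field)
        elif not isinstance(value, expected):
            problems.append(field)
    return problems
```

An empty list means every checked field has the documented type; a record with `"Year": "2019"` (a string instead of an integer) would be reported as `["Year"]`.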

People

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Name | string | - | J.J. Abrams |
| Role | string | - | Director |
| Character | string | - | - |
| UID | string | UID People Level | cr38000sddfs0sebc396d74d213343445d |
| Image | string | - | |

Akas

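| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Language | string | ISO 639-1 two-letter language code. | ko |
| Region | string | ISO 3166-1 alpha-2 country code. | KR |
| Title | string | - | 스타워즈-라이즈 오브 스카이워커 |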

Companies

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Country | string | ISO 3166-1 alpha-2 country code. | US |
| Name | string | - | Lucasfilm Ltd. |
| Type | string | Production or distribution company. | Production |

Synopsis

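| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Region | string | ISO 3166-1 alpha-2 country code. | JP |
| Language | string | ISO 639-1 two-letter language code. | ja |
| Description | string | - | かつて銀河に君臨していた祖父ダース・ベイダーに傾倒し... |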

Images

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Type | string | - | Poster, Backdrop, Thumbnail, Logo, null |
| Language | string | ISO 639-1 two-letter language code. | null |
| URL | string | Image URL. | |

Videos

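| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Provider | string | - | YouTube |
| Id | string | - | oln5JMXRT9U |
| Language | string | ISO 639-1 two-letter language code. | en |
| Type | string | - | Featurette |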

ReleaseYears

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Country | string | ISO 3166-1 alpha-2 country code. | US |
| Year | integer | - | 2019 |
| Type | string | - | Theatrical |

Seasons

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Number | integer | Season number. | 1 |
| Episodes | integer | Number of episodes. | 10 |
| Year | integer | - | 2023 |
| UID | string | UID Season Level | f6343770f9212ebc396d74d21334rty6 |
| Title | string | - | Seasons 1 |
| Synopsis | array | List of synopses and their languages. | View More In Synopsis |
| Images | array | URLs to different kinds of images. | View More In Images |
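Because each Seasons entry carries its own Episodes counter, the total episode count of a show can be derived from the nested array. This is a small sketch under the assumption that Seasons or Episodes may be null or missing for some records; the function name `total_episodes` is hypothetical.

```python
def total_episodes(content: dict) -> int:
    """Sum the Episodes counter across all seasons of a content record."""
    # Assumption: Seasons can be absent or null (for example, for movies).
    seasons = content.get("Seasons") or []
    # Assumption: a season with a null Episodes value counts as zero.
    return sum(season.get("Episodes") or 0 for season in seasons)
```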

Popularity

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| UID | string | Hash identifying the content or person. | f6343770f9212ebc396d74d21354ad33 |
| Popularity | number | Popularity score. | 68.87 |
| Type | string | - | Movie, TV Show, Episode, Person |
| CreatedAt | string | Indicates when the record was created in our database. | 2024-04-21T00:00:00Z |

Ratings

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| UID | string | Hash identifying the content or person. | f6343770f9212ebc396d74d21354ad33 |
| Type | string | - | Movie, TV Show, Episode, Person |
| Votes | integer | Number of votes. | 492415 |
| Score | number | Average score. | 6.4 |
| CreatedAt | string | Indicates when the record was created in our database. | 2024-04-21T00:00:00Z |
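Popularity and Ratings records both carry a UID and a CreatedAt timestamp. If a file holds more than one record per UID (an assumption; this document does not state whether it does), the most recent record can be selected by comparing CreatedAt. The sketch below parses the timestamp format shown in the examples; the function names are hypothetical.

```python
from datetime import datetime, timezone
from typing import Dict, Iterable


def parse_timestamp(value: str) -> datetime:
    """Parse timestamps such as 2024-04-21T00:00:00Z into aware UTC datetimes."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ")
    return parsed.replace(tzinfo=timezone.utc)


def latest_by_uid(records: Iterable[dict]) -> Dict[str, dict]:
    """Keep, for each UID, the record with the most recent CreatedAt."""
    latest: Dict[str, dict] = {}
    for record in records:
        uid = record["UID"]
        current = latest.get(uid)
        if current is None or (
            parse_timestamp(record["CreatedAt"]) > parse_timestamp(current["CreatedAt"])
        ):
            latest[uid] = record
    return latest
```

The resulting dictionary is keyed by UID, so a Ratings lookup and a Popularity lookup built this way can be joined directly on that key.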

People

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Name | string | - | John C. Reilly |
| UID | string | - | cr38000sddfs0sebc396d74d213343445d |
| Birthday | string | - | 1965-05-24 |
| Nationality | string | - | American |
| PlaceOfBirth | string | - | Chicago, Illinois, USA |
| Gender | string | - | Male, Female, null |
| Race | string | - | White or Caucasian (Non-Hispanic) |
| Active | boolean | - | true, false |
| Image | string | - | |
| Akas | array | List of localized names for different regions. | View More In People-Akas |

People-Akas

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| Language | string | ISO 639-1 two-letter language code. | bg |
| Name | string | - | Джон Си Райли |
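The People-Akas entries make it possible to display a person's name in a given language, falling back to the default Name when no localized variant exists. The following sketch assumes a People record nests its Akas array as described in the tables above; the sample record reuses the example values from those tables, and `localized_name` is a hypothetical helper.

```python
# Sample record assembled from the example values in the People tables.
SAMPLE_PERSON = {
    "Name": "John C. Reilly",
    "UID": "cr38000sddfs0sebc396d74d213343445d",
    "Akas": [{"Language": "bg", "Name": "Джон Си Райли"}],
}


def localized_name(person: dict, language: str) -> str:
    """Return the name for the given language code, or the default Name."""
    # Assumption: Akas may be absent or null when no localized names exist.
    for aka in person.get("Akas") or []:
        if aka.get("Language") == language:
            return aka["Name"]
    return person["Name"]
```

For the sample record, requesting "bg" returns the Bulgarian name, while any language without an entry returns "John C. Reilly".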