Content Tracker & Coming Soon - BB-Media-IT/Data-Hub GitHub Wiki

This document details the schemas used to manage the Content Tracker & Coming Soon product data. During the delivery process, a specific S3 Bucket is created for each client. Each bucket contains the main folders ContentTracker and ComingSoon, which hold files in JSONL format. Within these folders there is also a subfolder named latest.

Details of the S3 Buckets

Each client receives an S3 Bucket where data is organized into specific folders to ensure efficient management and quick access. The folder names reflect the types of data they contain, facilitating the identification and specific processing of each data set:

Example

  • s3://bucket-client/ContentTracker
  • s3://bucket-client/ComingSoon

Each folder contains a latest subfolder, which is periodically updated with the most recent available snapshot, so customers always have access to current information.

Update frequency and scope

  • Data in the S3 Bucket is updated weekly.

🚀 A weekly updated demo is available in the Bucket s3://bb-media-data/content-tracker-coming-soon/. Connect with the AWS CLI or any S3-compatible client, using the endpoint https://nyc3.digitaloceanspaces.com (in the AWS CLI, pass it with the --endpoint-url option).

Command example: aws s3 --endpoint-url https://nyc3.digitaloceanspaces.com cp s3://bb-media-data/content-tracker-coming-soon/ /Demo/BB-Media --recursive
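For scripted downloads, the same command can be assembled in Python. This is a minimal sketch, not an official client: the function name `build_demo_download_command` is hypothetical, the bucket and endpoint come from the command example, and the endpoint is passed with the AWS CLI's documented `--endpoint-url` option.

```python
import shlex

# Values taken from the command example in this document.
ENDPOINT_URL = "https://nyc3.digitaloceanspaces.com"
DEMO_BUCKET_URI = "s3://bb-media-data/content-tracker-coming-soon/"


def build_demo_download_command(local_dir: str) -> list[str]:
    """Return the AWS CLI argument list that copies the demo bucket to local_dir.

    Hypothetical helper for illustration; it only builds the command.
    """
    return [
        "aws", "s3",
        "--endpoint-url", ENDPOINT_URL,
        "cp", DEMO_BUCKET_URI, local_dir,
        "--recursive",
    ]


command = build_demo_download_command("/Demo/BB-Media")
print(shlex.join(command))

# To actually run it (requires the AWS CLI, credentials, and network access):
# import subprocess
# subprocess.run(command, check=True)
```

Building the command as a list rather than a single string avoids shell-quoting problems if the local directory contains spaces.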

File Description

This section describes the files contained in the ContentTracker and ComingSoon folders, explaining the structure and type of data each one holds. The schemas are also available in YAML.

ContentTracker

| Field | Type | Description | Example |
|---|---|---|---|
| BASED_ON_BOOK_VIDEOGAME | boolean | - | false true null |
| CHANNEL_OTT | array | - | ["Apple TV+"] |
| CONTENT_NAME | string | - | Jane |
| CONTENT_TYPE | string | - | Series Movie |
| COUNTRY_OF_PRODUCTION | array | - | ["CA"] |
| COUNTRY_OF_THE_ORIGINAL_CONTENT | array | - | null |
| DATE | string | - | 1712102400000 |
| DISTRIBUTION_COMPANY | array | - | ["Apple Inc"] |
| DISTRIBUTION_COMPANY_OF_THE_ORIGINAL_CONTENT | array | - | null |
| DISTRIBUTION_COUNTRY_IES | array | - | ["US"] |
| EPISODES | integer | - | null |
| GENRE | string | - | Kids & Family |
| ID_CONTENT | integer | - | 5311 |
| ID_IMDb | string | - | tt14014144 |
| ID_NEWS | integer | - | 44678 |
| LINK | string | - | https://tvlatina.tv/tvninos/segundo-ciclo-de-jane-llegara-a-apple-tv |
| MINUTES | integer | - | null |
| MULTIPLATFORM | string | - | null |
| NAME_OF_ORIGINAL_CONTENT | array | - | |
| PRODUCTION_COMPANY | array | - | ["Sinking Ship Entertainment", "Jane Goodall Institute"] |
| PRODUCTION_COMPANY_OF_THE_ORIGINAL_CONTENT | array | - | null |
| PRODUCTION_TYPE | string | - | null |
| REGION | array | - | ["North America"] |
| RELEASE_DATE | string | - | 2024-04-19 |
| SEASONS | array | - | null |
| STAGE | string | - | Released Distribution Production Sale Theatrical Revival |
| SUBGENRE | array | - | ["Animation"] |
| SYNOPSIS | string | - | Follows 10-year-old Jane Garcia, whose active... |
| TYPE_OF_AGREEMENT | string | - | null Individual season Last season |
| TYPE_OF_BUSINESS | array | - | null |
| TYPE_OF_PLATFORM | array | - | null |
| TYPE_OF_RELEASE | string | - | null All At Once Daily Week Over Week |
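As a rough illustration of consuming a ContentTracker file, the sketch below reads JSONL lines and converts the DATE value to a timestamp. It assumes that DATE holds milliseconds since the Unix epoch (suggested by the example value, but not stated in the schema) and that each line is one JSON object keyed by the field names above. The sample line is assembled from the example values and covers only a subset of fields. The helper names are hypothetical.

```python
import json
from datetime import datetime, timezone

# Sample line built from the example values in the ContentTracker table
# (only a subset of the fields, for brevity).
sample_jsonl = (
    '{"ID_CONTENT": 5311, "ID_NEWS": 44678, "CONTENT_NAME": "Jane", '
    '"CHANNEL_OTT": ["Apple TV+"], "DATE": "1712102400000", '
    '"RELEASE_DATE": "2024-04-19", "EPISODES": null}\n'
)


def parse_epoch_millis(value):
    """Convert an epoch-milliseconds value (string or int) to a UTC datetime.

    Assumption: DATE is expressed in milliseconds since the Unix epoch.
    """
    if value is None:
        return None
    return datetime.fromtimestamp(int(value) / 1000, tz=timezone.utc)


def read_records(lines):
    """Yield one dict per non-empty JSONL line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


records = list(read_records(sample_jsonl.splitlines()))
news_date = parse_epoch_millis(records[0]["DATE"])
print(records[0]["CONTENT_NAME"], news_date.isoformat())
# prints: Jane 2024-04-03T00:00:00+00:00
```

For a real file, pass an open file handle to `read_records` in place of the sample lines; nullable fields such as EPISODES arrive as `None`.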

RelevantNews

| Field | Type | Description | Example |
|---|---|---|---|
| COUNTRY | array | - | null |
| DATE | string | - | 1712102400000 |
| DESCRIPTION | string | - | 26/6: Disney+ se relanza en América Latina, integrando Star+... |
| ID_CONTENT | integer | - | 4354 |
| ID_NEWS | integer | - | 4540 |
| LINK | string | - | https://www.thedailytelevision.com/articulo/new-media/266-disney-se-relanza-en-america-latina-integrando-star-y-espn |
| NAME | array | - | ["Disney+"] |
| PARTNERED_TO | array | - | null |
| TERRITORY | array | - | ["Latin America"] |
| TYPE_OF_AGREEMENT | string | - | Acquisition Industry News New Channels New Companies New Platforms Partnership-Co-production null |

ComingSoon

| Field | Type | Description | Example |
|---|---|---|---|
| PlatformId | integer | ID for the platform. The ID refers to the platform across all available territories. | 651 |
| PlatformCode | string | Code identifying the platform and the territory. | us.netflix |
| PlatformName | string | Official name of the platform. | Netflix |
| PlatformCountry | string | Country ISO 3166-1 alpha-2 code. | US |
| Title | string | - | Marnie |
| AvailabilityDay | string | - | 2023-04-01T00:00:00 |
| URL | string | - | https://www.netflix.com/title/60000769 |
| Year | integer | Original release year. | null |
| Classification | string | Incoming title classification. Possible values: premiere (titles releasing this year); returning (new episodes or a new season of an existing series); library (older titles that become available on the platform). | null |
| UID | string | Hash identifying the movie or series universally. | bf93af1a28a2ae518684242c62026117 |
| ExternalIds | array | IDs from external databases that are mapped to the UID assigned to the movie or series. | See ExternalIds |
| Type | string | The content type as assigned by the platform. | Movie |
| ReleaseType | string | Indicates the cadence of episode releases. Possible values: All At Once (the complete season is made available at the same time); Daily (episodes are released daily); Weekly (episodes are released weekly); Weekly (Preview) (the first two episodes are released simultaneously, the rest on a weekly basis). | null |
| Seasons | array | Represents information related to seasons. | See Seasons |
| Source | string | News article URL that is the source for the data. | https://www.whats-on-netflix.com/coming-soon/whats-coming-to-netflix-in-april-2023-03-19/ |
| Duration | integer | The content runtime in minutes. | null |
| IsOriginal | boolean | Indicates whether the show is original to the platform. | false true null |
| Genres | array | - | ["Crime","Drama","Mystery"] |
| Themes | array | Similar to Genres, but describes the content of the show in more detail. | ["Suspense"] |
| ProductionCountries | array | - | ["US"] |
| ProductionCompanies | array | - | null |
| CreatedAt | datetime | - | 2024-03-06 11:04:45.835747 |
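The date fields and the Classification values in a ComingSoon record could be handled along the following lines. This is a sketch under assumptions: the function `parse_coming_soon` is hypothetical, the sample record uses only the example values from the table, and AvailabilityDay and CreatedAt are assumed to always follow the ISO-like formats shown in the examples.

```python
from datetime import datetime

# Classification values documented in the ComingSoon table, plus null.
KNOWN_CLASSIFICATIONS = {"premiere", "returning", "library", None}
DATE_FIELDS = ("AvailabilityDay", "CreatedAt")


def parse_coming_soon(record):
    """Return a copy of the record with date strings parsed to datetime.

    Assumption: both date fields use the formats shown in the table examples.
    Raises ValueError for a Classification outside the documented values.
    """
    parsed = dict(record)
    for field in DATE_FIELDS:
        if parsed.get(field) is not None:
            parsed[field] = datetime.fromisoformat(parsed[field])
    if parsed.get("Classification") not in KNOWN_CLASSIFICATIONS:
        raise ValueError(f"Unexpected Classification: {parsed['Classification']!r}")
    return parsed


# Sample record assembled from the example values (subset of fields).
sample = {
    "PlatformCode": "us.netflix",
    "Title": "Marnie",
    "AvailabilityDay": "2023-04-01T00:00:00",
    "Classification": None,
    "CreatedAt": "2024-03-06 11:04:45.835747",
}

parsed = parse_coming_soon(sample)
print(parsed["Title"], parsed["AvailabilityDay"].date(), parsed["CreatedAt"].year)
```

Rejecting unknown Classification values is a design choice for this sketch; a production loader might prefer to log and keep such records.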

ExternalIds

| Field | Type | Description | Example |
|---|---|---|---|
| Provider | string | Represents information related to the provider. | eidr |
| ID | string | Represents information related to the ID. | 10.5240/9B99-AD58-0BD0-0DB4-2B65-1 |
| Type | string | Type defined in the external database. It may not match the type defined in the platform. For example, a 'Movie' in a platform may be classified as an 'Episode' in an external database. | Movie |
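Looking up one provider's identifier in a record's ExternalIds array might look like the sketch below. It assumes ExternalIds is a list of objects with the Provider, ID, and Type keys described above; the helper name `external_id` is hypothetical, and the sample entry reuses the example values.

```python
def external_id(record, provider):
    """Return the ID for the given provider, or None if it is absent.

    Assumption: ExternalIds is a list of {"Provider", "ID", "Type"} objects
    and may itself be null.
    """
    for entry in record.get("ExternalIds") or []:
        if entry.get("Provider") == provider:
            return entry.get("ID")
    return None


# Sample record using the example values from the ExternalIds table.
record = {
    "UID": "bf93af1a28a2ae518684242c62026117",
    "ExternalIds": [
        {
            "Provider": "eidr",
            "ID": "10.5240/9B99-AD58-0BD0-0DB4-2B65-1",
            "Type": "Movie",
        }
    ],
}

print(external_id(record, "eidr"))
# prints: 10.5240/9B99-AD58-0BD0-0DB4-2B65-1
```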

Seasons

| Field | Type | Description | Example |
|---|---|---|---|
| Number | integer | Season number. | 1 |
| Episodes | integer | Number of episodes in the season. | 8 |
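A simple use of the Seasons array is totalling the episodes announced for a title. The sketch below assumes Seasons is a list of objects with the Number and Episodes keys described above, and that either the list or an individual Episodes value may be null. The helper name `total_episodes` is hypothetical and the second season in the sample is invented for illustration.

```python
def total_episodes(seasons):
    """Sum the Episodes values, treating null entries as zero."""
    return sum(season.get("Episodes") or 0 for season in seasons or [])


# First entry uses the table's example values; the second is hypothetical
# and shows a season whose episode count is not yet known.
seasons = [
    {"Number": 1, "Episodes": 8},
    {"Number": 2, "Episodes": None},
]

print(total_episodes(seasons))
# prints: 8
```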

Table Visualization & Data Relationships

We include tables to clearly visualize the relationships and key fields in each JSONL file and analyze how the various files interrelate to provide a complete view of the overall Content Tracker & Coming Soon product data model.

Content Tracker