Content Tracker & Coming Soon - BB-Media-IT/Data-Hub GitHub Wiki
This document details the schemas used to manage the Content Tracker & Coming Soon
product data. During the delivery process, a specific S3 Bucket is created for each client. In each of these buckets, we have a main folder containing files in JSONL format. This folder is named ContentTracker
and ComingSoon
. Additionally, within this folder, there is a subfolder named latest
.
Document Structure
This document is organized into the following sections:
Details of the S3 Buckets
Each client receives an S3 Bucket where data is organized into specific folders to ensure efficient management and quick access. The folder names reflect the types of data they contain, facilitating the identification and specific processing of each data set:
Example
s3://bucket-client/ContentTracker
s3://bucket-client/ComingSoon
This folder contains a latest
subfolder, which is periodically updated with the latest available snapshot, ensuring that customers always have access to the latest information.
Update frequency and scope
- The update of data in S3 Bucket is weekly.
🚀 You can get a weekly updated demo by connecting to the following Bucket s3://bb-media-data/content-tracker-coming-soon/
using AWS CLI or any software and the endpoint parameter --endpoint https://nyc3.digitaloceanspaces.com
.
Command example
aws s3 --endpoint https://nyc3.digitaloceanspaces.com cp s3://bb-media-data/content-tracker-coming-soon/ /Demo/BB-Media --recursive
File Description
We provide a detailed description of the files contained in the Content Tracker & Coming Soon
folder, explaining the structure and type of data handled by each one. If you wish to see the schemas in YAML, click here.
ContentTracker
Field | Type | Description | Example |
---|---|---|---|
BASED_ON_BOOK_VIDEOGAME | boolean | - | false true null |
CHANNEL_OTT | array | - | ["Apple TV+"] |
CONTENT_NAME | string | - | Jane |
CONTENT_TYPE | string | - | Series Movie |
COUNTRY_OF_PRODUCTION | array | - | ["CA"] |
COUNTRY_OF_THE_ORIGINAL_CONTENT | array | - | null |
DATE | string | - | 1712102400000 |
DISTRIBUTION_COMPANY | array | - | ["Apple Inc"] |
DISTRIBUTION_COMPANY_OF_THE_ORIGINAL_CONTENT | array | - | null |
DISTRIBUTION_COUNTRY_IES | array | - | ["US"] |
EPISODES | integer | - | null |
GENRE | string | - | Kids & Family |
ID_CONTENT | integer | - | 5311 |
ID_IMDb | string | - | tt14014144 |
ID_NEWS | integer | - | 44678 |
LINK | string | - | https://tvlatina.tv/tvninos/segundo-ciclo-de-jane-llegara-a-apple-tv |
MINUTES | integer | - | null |
MULTIPLATFORM | string | - | null |
NAME_OF_ORIGINAL_CONTENT | array | - | |
PRODUCTION_COMPANY | array | - | ["Sinking Ship Entertainment", "Jane Goodall Institute"] |
PRODUCTION_COMPANY_OF_THE_ORIGINAL_CONTENT | array | - | null |
PRODUCTION_TYPE | string | - | null |
REGION | array | - | ["North America"] |
RELEASE_DATE | string | - | 2024-04-19 |
SEASONS | array | - | null |
STAGE | string | - | Released Distribution Production Sale Theatrical Revival |
SUBGENRE | array | - | ["Animation"] |
SYNOPSIS | string | - | Follows 10-year-old Jane Garcia, whose active... |
TYPE_OF_AGREEMENT | string | - | null Individual season Last season |
TYPE_OF_BUSINESS | array | - | null |
TYPE_OF_PLATFORM | array | - | null |
TYPE_OF_RELEASE | string | - | null All At Once Daily Week Over Week |
RelevantNews
Field | Type | Description | Example |
---|---|---|---|
COUNTRY | array | - | nulll |
DATE | string | - | 1712102400000 |
DESCRIPTION | string | - | 26/6: Disney+ se relanza en América Latina, integrando Star+... |
ID_CONTENT | integer | - | 4354 |
ID_NEWS | integer | - | 4540 |
LINK | string | - | https://www.thedailytelevision.com/articulo/new-media/266-disney-se-relanza-en-america-latina-integrando-star-y-espn |
NAME | array | - | ["Disney+"] |
PARTNERED_TO | array | - | null |
TERRITORY | array | - | ["Latin America"] |
TYPE_OF_AGREEMENT | string | - | Acquisition Industry News New Channels New Companies New Platforms Partnership-Co-production null |
ComingSoon
Field | Type | Description | Example |
---|---|---|---|
PlatformId | integer | ID for the platform. The ID refers to the platform across all available territories. | 651 |
PlatformCode | string | Code identifying the platform and the territory. | us.netflix |
PlatformName | string | Official name of the platform. | Netflix |
PlatformCountry | string | Country ISO 3166-1 alpha-2 code. | US |
Title | string | - | Marnie |
AvailabilityDay | string | - | 2023-04-01T00:00:00 |
URL | string | - | https://www.netflix.com/title/60000769 |
Year | integer | Original release year. | null |
Classification | string | Incoming title classification. Possible values: premiere : Titles reasing this year. returning : New episodes or season for an existing series. library : Older titles that become available on the platform. |
null |
UID | string | Hash identifying the movie or series universally. | bf93af1a28a2ae518684242c62026117 |
ExternalIds | array | IDs from external databases that are mapped to the UID assigned to the movie or series. | View More In ExternalIds |
Type | string | The content type as it is assigned by the platform. | Movie |
ReleaseType | string | Indicates the cadence of episode releases. Possible values are: All At Once - the complete season is made available at the same time. Daily - episodes are released daily. Weekly - episodes are released weekly. Weekly (Preview) - the first two episodes are released simultaneously, the rest on a weekly basis. |
null |
Seasons | array | Este campo representa la información relacionada con seasons. | View More In Seasons |
Source | string | News article URL that is the source for the data. | https://www.whats-on-netflix.com/coming-soon/whats-coming-to-netflix-in-april-2023-03-19/" |
Duration | integer | The content runtime in minutes. | null |
IsOriginal | boolean | Indicates whether the show is original to the platform. | false true null |
Genres | array | - | ["Crime","Drama","Mystery"] |
Themes | array | Similar to Genres but describes with more detail the content of the show. |
["Suspense"] |
ProductionCountries | array | - | ["US"] |
ProductionCompanies | array | - | null |
CreatedAt | datatime | - | 2024-03-06 11:04:45.835747 |
ExternalIds
Field | Type | Description | Example |
---|---|---|---|
Provider | string | Este campo representa la información relacionada con provider. | eidr |
ID | string | Este campo representa la información relacionada con id. | 10.5240/9B99-AD58-0BD0-0DB4-2B65-1 |
Type | string | Type defined in the external database. It may not match to the type defined in the platform. For example, a 'Movie' in a platform may be classified as an 'Episode' in an external database. | Movie |
Seasons
Field | Type | Description | Example |
---|---|---|---|
Number | integer | Season number. | 1 |
Episodes | integer | Number of episodes in the season. | 8 |
Table Visualization & Data Relationships
We include tables to clearly visualize the relationships and key fields in each JSONL file and analyze how the various files interrelate to provide a complete view of the overall Content Tracker & Coming Soon
product data model.