film film id_criteria - freebase-schema/freebase GitHub Wiki

Identity Criterion for Film type

Features for establishing identity relationships

  • name_match: There is a name_match if the names of the films or aliases on both the sources are same. Some films may also have foreign translations for their names or may have been re-released with a different name. (For ex. “Mr Cat Poop” for As good as It Gets or “Six Naked Pigs” for The Full Monty).

  • initial_release_date_match: There is an initial_release_date_match if the film’s earliest release date (to the day level) listed in Freebase property /film/film/initial_release_date (Initial release date) matches with the date (to the day level) on the other source. Sometimes only the year may be available in which case if it matches, it is a match.

  • director_match: There is a director_match if at least one of the film directors listed in Freebase property /film/film/directed_by (Directed by) matches with the film directors on the other source.

  • performances_match: There is a performances_match if at least one of the starring actors listed in Freebase property /film/film/starring (Performances) matches with the actors on the other source.

  • executive_produced_by_match: There is an executive_produced_by_match if at least one of the executive film producers listed in Freebase property /film/film/executive_produced_by (Executive produced by) matches with the executive film producers on the other source.

  • produced_by_match: There is a produced_by_match if at least one of the film producers listed in Freebase property /film/film/produced_by (Produced by) matches with the film producers on the other source.

  • casting_director_match: There is a casting_director_match if at least one of the film casting directors listed in Freebase property /film/film/film_casting_director (Casting director) matches with the casting directors on the other source.

  • cinematographer_match: There is a cinematographer_match if at least one of the film cinematographers listed in Freebase property /film/film/cinematography (Cinematography) matches with the film cinematographers on the other source.

  • screenplay_by_match: There is a screenplay_by_match if at least one of the film writers listed in Freebase property /film/film/written_by (Screenplay by) matches with the film writers on the other source.

  • edited_by_match: There is an edited_by_match if at least one of the film editors listed in Freebase property /film/film/edited_by (Edited by) matches with the film editors on the other source.

  • production_companies_match: There is a production_companies_match if at least one of the film production companies listed in Freebase property /film/film/production_companies (Production companies) matches with the production companies on the other source.

  • featured_song_match: There is a featured_song_match if at least one of the film’s songs listed in Freebase property /film/film/featured_song (Featured song) matches with the songs on the other source.

  • music_by_match: There is a music_by_match if at least one of the film music contributors listed in Freebase property /film/film/music (Music by) matches with the persons on the other source.

  • awards_match: There is an awards_match if at least one of the awards (including country/association, category and years) or award nominations that the film received matches with the awards on the other source.

  • poster_scene_match: There is a poster_scene_match if the film poster or any other still shot from the film given in the freebase topic matches exactly with poster or shot on the other source.

  • story_by_match: There is a story_by_match if at least one of the film story contributors listed under Freebase property /film/film/story_by (Story by) matches with the story contributors on the other source.

General guidelines for establishing identity between two Films

Films should not be matched to other incompatible types such as books, or TV Programs. (Example 17)

Here are some typical patterns for determining identity of two films:

  1. name_match and music_by_match (Example 1)
  2. name_match and awards_match (Example 2)
  3. name_match and edited_by_match (Example 3)
  4. name_match and initial_release_date_match and screenplay_by_match (Example 4, Example 5)
  5. name_match and featured_song_match (Example 6)
  6. name_match and production_companies_match (Example 7)
  7. name_match and cinematographer_match (Example 8)
  8. name_match and casting_director_match (Example 9)
  9. name_match and produced_by_match (Example 10)
  10. name_match and executive_produced_by_match (Example 11)
  11. name_match and performances_match (Example 12)
  12. name_match and director_match (Example 13, Example 14)
  13. poster_scene_match (Example 15)
  14. name_match and initial_release_date_match and story_by_match (Example 16)

Examples





Pattern 1 Example 1: Permalink The name of the films and the name of the persons who gave music for the film, match on both the sources. Hence it is a match.





Pattern 2 Example 2: Permalink The name of the films and the nominations and awards received by the movie match on both the sources. Hence it is a match.





Pattern 3 Example 3: Permalink The name of the films and the movie editors match on both the sources. Hence it is a match.





Pattern 4 Example 4: Permalink The names of the films, initial release dates (years) and the names of the persons who wrote the screenplay match on both the sources. Hence it is a match.





Pattern 4 Example 5: Permalink1 Permalink2 The name of the film and the name of the person who wrote screenplay for film in freebase topic match with two films on IMDB. Since screenplays can be adapted for different movies with the same title, initial release year is also used in addition of the writer for the match. Hence the middle topic is a match with the Left Hand Side.





Pattern 5 Example 6: Permalink The name of the films and a song written for the film match on both the sources. Hence it is a match.





Pattern 6 Example 7: Permalink The name of the films and two of the production companies match on both the sources. Hence it is a match.





Pattern 7 Example 8: Permalink The name of the films and the cinematographers match on both the sources. Hence it is a match.





Pattern 8 Example 9: Permalink The name of the films and the casting directors match on both the sources. Hence it is a match.





Pattern 9 Example 10: Permalink The name of the films and their producers match on both the sources. Hence it is a match.





Pattern 10 Example 11: Permalink The name of the films and their executive producers match on both the sources. Hence it is a match.





Pattern 11 Example 12: Permalink The film names and some of the lead actors and actresses match on both the sources. Hence it is a match.





Pattern 12 Example 13: Permalink The film names and the directors match on both the sources. Hence it is a match.





Pattern 12 Example 14: In this example, there are 3 topics on the Right Hand Side (RHS) that have a matching name with the Left Hand Side (LHS) topic. But only the first one has a director match in addition to name match with the topic on LHS. Hence it is a match.





Pattern 13 Example 15: Permalink The posters of the film match on both the sources. Hence it is a match.





Pattern 14 Example 16: Permalink The film titles, the initial release dates and the story writers of the film match on both the sources. Hence it is a match.





Example 17: Permalink Here the topic titles match but Left Hand Side consists of a film topic while Right Hand Side has a book topic. Hence these incompatible topics should not be reconciled.

⚠️ **GitHub.com Fallback** ⚠️