HOW TO obtain a movie list from IMDB - marianonieves/mimica GitHub Wiki

In the folder /material you can find the files mentioned in this article.

1) Download the CSV files from the IMDB lists.

To get a CSV, open a list (e.g. http://www.imdb.com/list/QiGak8mMTIE/) and choose "Export this list". Then parse the files to obtain an array of IMDB codes and remove the duplicated elements.
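
The parsing and de-duplication can be sketched as follows. This is a minimal sketch, not the script used for the project: the function name extractImdbIds and the sample rows are hypothetical, and it assumes that every IMDB code in the export looks like "tt" followed by digits, so it does not depend on the exact column layout of the CSV.

```javascript
// Extract unique IMDB codes (e.g. "tt0211915") from exported CSV text.
// Assumption: every code matches the pattern "tt" + digits.
function extractImdbIds(csvText) {
  const matches = csvText.match(/\btt\d+\b/g) || [];
  // A Set drops duplicates while preserving first-seen order.
  return [...new Set(matches)];
}

// Hypothetical sample rows; the real export has its own columns.
const sampleCsv = [
  '"position","const","title"',
  '"1","tt0211915","Amelie"',
  '"2","tt0111161","The Shawshank Redemption"',
  '"3","tt0211915","Amelie"',
].join("\n");

console.log(extractImdbIds(sampleCsv)); // [ 'tt0211915', 'tt0111161' ]
```

To merge several exported files, concatenate their contents before calling the function; the Set removes codes that appear in more than one list.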

2) Get the data from IMDB using a PHP API that parses the IMDB information from the HTML template

http://web3o.blogspot.com.ar/2010/10/php-imdb-scraper-for-new-imdb-template.html

Using the PHP IMDb Scraper/API for new IMDb Template, I created a PHP web page that runs on localhost (I use XAMPP). A request such as http://localhost/imdb/?callback=responseReadyCallback&movieId=tt0211915 retrieves the information as responseReadyCallback(["tt0211915;Amelie;Le fabuleux destin d'Amélie Poulain;2001;Comedy-Romance;She'll change your life."])

The retrieved information is embedded as the argument of the JavaScript call "responseReadyCallback".
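
On the receiving side, the callback could look roughly like this. It is a sketch under assumptions: the field names (id, title, originalTitle, year, genres, tagline) are inferred from the single sample response above and are not documented by the PHP page, and parseRecord and received are hypothetical helpers.

```javascript
// Split one semicolon-delimited record into named fields.
// Assumption: field names and order are inferred from the sample response.
function parseRecord(record) {
  const [id, title, originalTitle, year, genres, ...rest] = record.split(";");
  return {
    id,
    title,
    originalTitle,
    year: Number(year),
    genres, // kept as the raw string, e.g. "Comedy-Romance"
    tagline: rest.join(";"), // re-join in case the tagline itself contains ";"
  };
}

// Collects every record delivered through the callback.
const received = [];

// Name must match the "callback" parameter sent in the request.
function responseReadyCallback(records) {
  for (const record of records) {
    received.push(parseRecord(record));
  }
}

responseReadyCallback([
  "tt0211915;Amelie;Le fabuleux destin d'Amélie Poulain;2001;Comedy-Romance;She'll change your life.",
]);
console.log(received[0].id, received[0].year); // tt0211915 2001
```

The genres are left unsplit on purpose: splitting on "-" would break a genre name that itself contains a hyphen.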

3) Make several requests to the PHP API, spaced out over time

Using http://www.hunlock.com/blogs/Howto_Dynamically_Insert_Javascript_And_CSS as a reference, I created an HTML page (IMDB_RESULTS.html) with the JavaScript callback mentioned above.

Every two seconds a new request is made and its response is shown in the results div, in a format such as: tt0211915;Amelie;Le fabuleux destin d'Amélie Poulain;2001;Comedy-Romance;She'll change your life.
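
The timed requests can be sketched like this. It is not the code of IMDB_RESULTS.html: buildRequestUrl, injectScript and scheduleRequests are hypothetical names, and the base URL and the parameter names are taken from the example request in step 2. The timer function is passed in as a parameter so the schedule can be checked without actually waiting.

```javascript
// Build the request URL for one movie.
// Assumption: base URL and parameter names follow the example in step 2.
function buildRequestUrl(movieId, callbackName = "responseReadyCallback") {
  const base = "http://localhost/imdb/";
  return (
    base +
    "?callback=" + encodeURIComponent(callbackName) +
    "&movieId=" + encodeURIComponent(movieId)
  );
}

// Browser only: load the URL by appending a <script> element, so the
// response runs as JavaScript and calls the callback.
function injectScript(url) {
  const script = document.createElement("script");
  script.src = url;
  document.head.appendChild(script);
}

// Call send(url) once per ID, spacing the calls intervalMs apart.
// The first request is scheduled with a delay of 0.
function scheduleRequests(movieIds, send, intervalMs = 2000, setTimer = setTimeout) {
  movieIds.forEach((id, index) => {
    setTimer(() => send(buildRequestUrl(id)), index * intervalMs);
  });
}

// In the page: scheduleRequests(["tt0211915", "tt0111161"], injectScript);
```

Spacing the requests keeps the local PHP page, and IMDB behind it, from receiving the whole list at once.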

4) Polish the data

  • replace every ' with ´
  • remove every " that appears inside a "" block
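
The two clean-up rules could be applied per field as in the sketch below. The second rule is interpreted here as: when a field is wrapped in double quotes, keep the outer pair and drop any double quotes between them. That reading is an assumption, and polishField is a hypothetical name.

```javascript
// Apply both clean-up rules to one field.
function polishField(field) {
  // Rule 1: replace every apostrophe with an acute accent.
  let result = field.replace(/'/g, "´");
  // Rule 2 (assumed reading): if the field is wrapped in double quotes,
  // keep the outer pair and remove any double quotes inside it.
  if (result.length >= 2 && result.startsWith('"') && result.endsWith('"')) {
    const inner = result.slice(1, -1).replace(/"/g, "");
    result = '"' + inner + '"';
  }
  return result;
}

console.log(polishField("She'll change your life.")); // She´ll change your life.
console.log(polishField('"The "best" film"'));        // "The best film"
```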

5) In the Air project, activate the module CODE TO ADD MORE CONTENT